Skip to content

HTML body not extracted #4

@ghost

Description

Bug description: when importing .msg files, the html body is not extracted.

Minimal replicable example

The attached email have an html body with encoding CP-1252 (west Europe)
ex_no_html_body.zip

In R, I did the following:

msg <- msgxtractr::read_msg("ex_no_html_body.msg")
str(msg)
List of 8
 $ headers         : NULL
 $ sender          : list()
 $ recipients      :List of 1
  ..$ :List of 3
  .. ..$ display_name : NULL
  .. ..$ address_type : NULL
  .. ..$ email_address: NULL
 $ subject         : NULL
 $ body            :List of 2
  ..$ text: chr " \r\nTest table\r\n \r\n1\r\n2\r\n3\r\n4\r\n5\r\n6\r\n7\r\n8\r\n9\r\n \r\n"
  ..$ html: NULL
 $ attachments     : list()
 $ display_envelope:List of 1
  ..$ display_to: chr "noone@none.com"
 $ times           :List of 3
  ..$ creation_time: NULL
  ..$ last_mod_time: NULL
 - attr(*, "class")= chr "msg"

The HTML element is empty. I checked if my message what not in RTF.

Any idea ?

By the way: thank you for this package !

Metadata

Metadata

Assignees

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions