Hello!
Some code intentionally removes <br> nodes when they are the last child of a node that also contains text. Here is a very simple example of how this can lead to incorrect results (this kind of markup comes from the badly formed HTML emails I receive):
<font size="+1">Vikings: Wolves of Midgard<br></font><font size="+1">Valkyria Chronicles<br>
<br>
World Of Warcraft Battlechest</font>
The expected output would be:
Vikings: Wolves of Midgard
Valkyria Chronicles
World Of Warcraft Battlechest
The actual output is:
Vikings: Wolves of MidgardValkyria Chronicles
World Of Warcraft Battlechest
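
To make the problem easier to reproduce outside the library, here is a minimal Python sketch of the behaviour described above. It is not the library's code: the class BrToNewline, the function to_text, and the drop_trailing_br flag are hypothetical names I made up, and collapsing consecutive newlines into one is my assumption so that the output matches the listings above.

```python
import re
from html.parser import HTMLParser


class BrToNewline(HTMLParser):
    """Toy HTML-to-text converter (hypothetical, for illustration only)."""

    def __init__(self, drop_trailing_br=False):
        super().__init__()
        # When True, mimic the reported behaviour: a <br> that is the
        # last child of its parent element is discarded.
        self.drop_trailing_br = drop_trailing_br
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "br":
            self.parts.append("\n")

    def handle_data(self, data):
        # Collapse whitespace; skip whitespace-only text nodes.
        text = " ".join(data.split())
        if text:
            self.parts.append(text)

    def handle_endtag(self, tag):
        # If the element closes right after a <br>, that <br> was its last child.
        if (self.drop_trailing_br and tag != "br"
                and self.parts and self.parts[-1] == "\n"):
            self.parts.pop()


def to_text(html, drop_trailing_br=False):
    parser = BrToNewline(drop_trailing_br)
    parser.feed(html)
    parser.close()
    # Assumption: runs of newlines collapse to a single newline.
    return re.sub(r"\n+", "\n", "".join(parser.parts)).strip()


SAMPLE = (
    '<font size="+1">Vikings: Wolves of Midgard<br></font>'
    '<font size="+1">Valkyria Chronicles<br>\n<br>\n'
    'World Of Warcraft Battlechest</font>'
)

print(to_text(SAMPLE))                         # every <br> kept
print(to_text(SAMPLE, drop_trailing_br=True))  # trailing <br> dropped
```

With drop_trailing_br=True the first two titles end up joined on one line, as in the actual output above. With the default they stay on separate lines, which is the result I would expect.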