It took me some time to figure this out, but there is a major bug with XML parsing when using PHP 5.2.6 and libxml2 (versions 2.7.0 to 2.7.2). Basically, when using the libxml library, it will strip out all the HTML entities so that all the "<" and "&" characters are stripped out. This renders the output as gibberish when using products like Live Writer and ScribeFire to upload content to sites like Word Press or Joomla. Other descriptions of issue (ARGH: I cannot post links do to the minimum 5 post limit):
Seach for php bug #45996
Typo3 bugtracker 00093359
The main problem is there doesn't seem to be any rush by either PHP or the libxml developers to fix this bug. There seems to be a little bit of "debate" on who needs to fix it from what I have read. At any rate, the issue is still present and no immediate patches in site. There seems to be two fixes at the moment:
1) Go back to the libxml versions less than 2.7.0 (which has potential issues), or
2) Recompile PHP5 using libexpat
Option two seems to be the best solution, but I am on Glowhost shared server (Advanced 5 Star) and do not have ability to recompile. Is option 2 something Glowhost would be able to do if I submitted a trouble ticket? I have waited many months for this bug to get resolved and I do not see anything in the PHP or libxml bug listings to indicate it will be.
Cheers,
Chris