I haven't looked at the output of OO Writer in a while, and it is probably pretty clean, but the go to library for munging HTML in Python that I have used many times over the years is BeautifulSoup. It deals with all sorts of bad HTML and makes it a lot more usable.
hth,
brett
--
Brett Morgan
Developer Programs Engineer, Blogger
-- On Tue, Jan 11, 2011 at 7:51 PM, arun kumar <arunkumar413@gmail.com> wrote:
"You will need to parse the html file generated from OpenOffice forIs there a method or function to parse the html generated from OpenOffice?
the core content that you require, and then upload that using your
client. Blogger doesn't take entire html files as Blog posts, instead
each post is a fragment of HTML."
--
You received this message because you are subscribed to the Google Groups "Blogger Developer Group" group.
To post to this group, send email to bloggerdev@googlegroups.com.
To unsubscribe from this group, send email to bloggerdev+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/bloggerdev?hl=en.
--
Brett Morgan
Developer Programs Engineer, Blogger
You received this message because you are subscribed to the Google Groups "Blogger Developer Group" group.
To post to this group, send email to bloggerdev@googlegroups.com.
To unsubscribe from this group, send email to bloggerdev+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/bloggerdev?hl=en.
No comments:
Post a Comment