Hi, we are using TagSoup to clean up HTML stored in a database column. The reason we hold that lump of text in a DB is because we have a custom XML tag in the...
... TagSoup doesn't know anything about XML vs. HTML; it just has tags, all of which are treated the same. ... It's the central point of TagSoup's design...
Thanks for the answer. I had looked at the PDF on the TagSoup website and read about the "Keep On Truckin" philosophy (and as an admirer of R. Crumb I quite...
Hi! I am trying to build the tagsoup source (v 1.2), to apply the Pushback buffer fix suggested in this thread: ...
Jaran Nilsen
jaran.nilsen@...
Oct 20, 2008 8:32 am
1191
... Alas, my grasp of Ant is extremely feeble. ... This suggests to me that you need to load some optional Ant task to support the XSLT actions. -- John Cowan...
It seems the ant version I installed through the Ubuntu repo was not good. I downloaded directly from the Ant home page and ran that version instead (with the...
Jaran Nilsen
jaran.nilsen@...
Oct 22, 2008 12:54 pm
1195
Hi All, I would like to find out how to convert an HTML file to XML format using TagSoup in Java 6. I have read through all the documents from http://tagsoup.info...
... TagSoup is a SAX parser, so you need to know how to use SAX parsers. You can find lots of documentation at the SAX home page, http://www.saxproject.org ....
Hi John, Thanks for your response to this question. SAX in general is very foreign to me, being a novice XML programmer. I am more comfortable with JDOM...
... The effect of this code is to read in ABC.html and to write ABC.xml, ... If you just change this line to use "org.ccil.cowan.tagsoup.Parser", you will be...
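The code John refers to is not preserved in this listing, so the following is only a minimal sketch of the general idea: a SAX parser feeds an identity transform, which serializes whatever events it receives as XML. The class name HtmlToXml and the helpers newReader and convert are made up for illustration. With no argument the sketch falls back to the JDK's built-in SAX parser, which only accepts well-formed input; passing "org.ccil.cowan.tagsoup.Parser" as the class name assumes the TagSoup jar is on the classpath.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.SAXParserFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.sax.SAXSource;
import javax.xml.transform.stream.StreamResult;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;

public class HtmlToXml {

    // Hypothetical helper: returns the SAX parser with the given class name,
    // or the JDK's default namespace-aware parser when className is null.
    static XMLReader newReader(String className) throws Exception {
        if (className == null) {
            SAXParserFactory factory = SAXParserFactory.newInstance();
            factory.setNamespaceAware(true);
            return factory.newSAXParser().getXMLReader();
        }
        // e.g. "org.ccil.cowan.tagsoup.Parser" (assumes TagSoup is on the classpath)
        return (XMLReader) Class.forName(className).getDeclaredConstructor().newInstance();
    }

    // Hypothetical helper: parses the markup with the given reader and
    // serializes the resulting SAX events as XML text.
    static String convert(XMLReader reader, String markup) throws Exception {
        Transformer identity = TransformerFactory.newInstance().newTransformer();
        identity.setOutputProperty(OutputKeys.METHOD, "xml");
        identity.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
        StringWriter out = new StringWriter();
        SAXSource source = new SAXSource(reader, new InputSource(new StringReader(markup)));
        identity.transform(source, new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // Optional first argument: the SAX parser class to use.
        String className = args.length > 0 ? args[0] : null;
        String markup = "<html><body><p>Hello</p></body></html>";
        System.out.println(convert(newReader(className), markup));
    }
}
```

Only the parser class changes between the two modes; the rest of the pipeline sees ordinary SAX events either way. To work with files rather than strings, the StringReader and StringWriter can be swapped for file streams.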
... I don't know this light_html2xml, so I can't comment on what problems there might be in it. ... That's not what's happening. TagSoup reads the HTML file...
Hi John, I am in the process of implementing your suggestion but need a bit more guidance with the following questions: (i) Downloaded both...
Hi All, I am having difficulty parsing a namespaced HTML document using Saxon and the TagSoup parser. The relevant contents of this document are as follows: ...
Hi All, I can confirm that the XPath using Saxon parser ("org.ccil.cowan.tagsoup.Parser") is working with default namespace. I made the mistake of assuming...
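The message above is cut off, so the exact mistake is not known. One common stumbling block with XPath 1.0 and a default namespace is that unprefixed names in an expression only match elements in no namespace, so the XHTML namespace (which, as far as I know, TagSoup assigns to elements by default) has to be bound to a prefix. The sketch below shows this with the JDK's own XPath API rather than Saxon; the class name XhtmlXPath, the helper evaluate, and the prefix "h" are made up for illustration.

```java
import java.io.StringReader;
import java.util.Collections;
import java.util.Iterator;
import javax.xml.XMLConstants;
import javax.xml.namespace.NamespaceContext;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

public class XhtmlXPath {

    static final String XHTML_NS = "http://www.w3.org/1999/xhtml";

    // Hypothetical helper: evaluates an XPath expression against the XML text,
    // with the illustrative prefix "h" bound to the XHTML namespace.
    static String evaluate(String expression, String xml) throws Exception {
        DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
        dbf.setNamespaceAware(true); // off by default; required for namespace matching
        Document doc = dbf.newDocumentBuilder().parse(new InputSource(new StringReader(xml)));

        XPath xpath = XPathFactory.newInstance().newXPath();
        xpath.setNamespaceContext(new NamespaceContext() {
            public String getNamespaceURI(String prefix) {
                return "h".equals(prefix) ? XHTML_NS : XMLConstants.NULL_NS_URI;
            }
            public String getPrefix(String uri) {
                return XHTML_NS.equals(uri) ? "h" : null;
            }
            public Iterator<String> getPrefixes(String uri) {
                return XHTML_NS.equals(uri)
                        ? Collections.singletonList("h").iterator()
                        : Collections.<String>emptyList().iterator();
            }
        });
        return xpath.evaluate(expression, doc);
    }

    public static void main(String[] args) throws Exception {
        String xml = "<html xmlns=\"" + XHTML_NS + "\"><body><p>Hello</p></body></html>";
        // Prefixed steps match the namespaced elements.
        System.out.println("prefixed:   [" + evaluate("/h:html/h:body/h:p", xml) + "]");
        // Unprefixed steps look for elements in no namespace and find nothing.
        System.out.println("unprefixed: [" + evaluate("/html/body/p", xml) + "]");
    }
}
```

The prefix used in the expression does not need to appear in the document; only the namespace URI has to match.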
Hello, I'm using Flying Saucer (https://xhtmlrenderer.dev.java.net/) in my application together with Tagsoup. Someone reported the following error and I'm not...
... It's very unusual: it means that someone is trying to invoke an abstract method, which normally is caught at compile time. A method was changed from...
Thanks for the quick answer John. I'll recompile it and see if it still throws the exception. Nicu ... compiled. ... html.stml. ... http://www.ccil.org/~cowan...
... I don't know. I built it as a library, and originally added the stand-alone application support for my own testing purposes, but I suspect that many...
I use tagsoup as one step in DeXSS. http://freshmeat.net/projects/dexss/ ... From: tagsoup-friends@yahoogroups.com [mailto:tagsoup-friends@yahoogroups.com] On...
I'm a library user. I used the standalone app for testing, but the projects where I've integrated TagSoup have always been as a library. Is there discussion...