Dear Friends of TagSoup! If you want to use TagSoup in XMLSpy as an external tool to convert your .html files into XHTML, this blog entry describes how you can...
Thread dump shows many threads stuck in the same method: at java/util/HashMap.get(HashMap.java:1596)[optimized] INFO | jvm 1 | 2008/07/23 02:53:30 | at...
Hello, I am using Saxon9.1.0.1j and tagsoup-1.2.jar with java version "1.6.0_05" Java(TM) SE Runtime Environment (build 1.6.0_05-b13) Java HotSpot(TM) Client...
... TagSoup's HTTP handler doesn't know anything about gzipped streams, so it assumes they are ordinary data. That could be fixed, I suppose, but I feel...
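One caller-side workaround for the gzip problem described above is to sniff the stream before handing it to any parser. The sketch below uses only JDK classes; the class `GzipSniffer` and its methods are hypothetical names invented for illustration, not part of TagSoup, and it assumes the caller opens the HTTP stream itself. It checks the two gzip magic bytes (0x1f 0x8b) and wraps the stream in `GZIPInputStream` only when they are present.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper, not part of TagSoup: unwraps gzip data before parsing.
public class GzipSniffer {

    // Returns a decompressing stream if the data starts with the gzip
    // magic bytes 0x1f 0x8b; otherwise returns the data unchanged.
    public static InputStream maybeGunzip(InputStream raw) throws IOException {
        BufferedInputStream in = new BufferedInputStream(raw);
        in.mark(2);               // remember the start so we can rewind
        int b1 = in.read();
        int b2 = in.read();
        in.reset();               // rewind: the consumer sees every byte
        if (b1 == 0x1f && b2 == 0x8b) {
            return new GZIPInputStream(in);
        }
        return in;
    }

    // Test helper: gzip a string as UTF-8 bytes.
    public static byte[] gzip(String text) {
        try {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            try (GZIPOutputStream gz = new GZIPOutputStream(buf)) {
                gz.write(text.getBytes(StandardCharsets.UTF_8));
            }
            return buf.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Test helper: read the (possibly gzipped) bytes back as a UTF-8 string.
    public static String readAll(byte[] data) {
        try (InputStream in = maybeGunzip(new ByteArrayInputStream(data))) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(readAll(gzip("<p>hello</p>")));
        System.out.println(readAll("<p>plain</p>".getBytes(StandardCharsets.UTF_8)));
    }
}
```

The stream returned by `maybeGunzip` could then be wrapped in an `org.xml.sax.InputSource` and passed to whatever SAX parser is in use.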
Hi, this post is really helpful, but how do you change the output encoding on TagSoup programmatically (in Java)? ... US-ASCII will ... references ... to UTF-8,...
Wow, thanks for the fast response, John. I have another issue though.. In the sources that I am parsing, there are things like &gt;, &quot;, &amp;, etc....
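The effect of the output encoding on non-ASCII characters can be illustrated with the JDK's own serializer. This is a stand-in, not TagSoup's API: the sketch below uses `javax.xml.transform` and `OutputKeys.ENCODING`, and the class name `OutputEncodingDemo` is hypothetical. With a US-ASCII target the serializer has to write characters outside ASCII as numeric character references; with UTF-8 it can write them directly.

```java
import java.io.ByteArrayOutputStream;
import java.io.StringReader;
import java.nio.charset.Charset;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical demo class using the JDK identity transformer, not TagSoup.
public class OutputEncodingDemo {

    // Re-serializes the given XML with the requested output encoding and
    // returns the resulting bytes decoded with that same encoding.
    public static String serialize(String xml, String encoding) {
        try {
            Transformer t = TransformerFactory.newInstance().newTransformer();
            t.setOutputProperty(OutputKeys.ENCODING, encoding);
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            t.transform(new StreamSource(new StringReader(xml)),
                        new StreamResult(bytes));
            return new String(bytes.toByteArray(), Charset.forName(encoding));
        } catch (TransformerException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String xml = "<p>caf\u00e9</p>";
        System.out.println(serialize(xml, "US-ASCII")); // e-acute as a reference
        System.out.println(serialize(xml, "UTF-8"));    // e-acute written directly
    }
}
```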
... Only by patching the code. I'm curious: why would you want to suppress entity decoding? -- John Cowan cowan@... http://ccil.org/~cowan If I have...
I'm using TagSoup to "extend" HTML, similar to the Facebook Platform's FBML <http://wiki.developers.facebook.com/index.php/FBML>. I am reading in "HTML",...
... It seems to me that TagSoup is probably doing way more than you want, if the HTML bits are supposed to remain untouched, anyway. -- You annoy me, Rattray!...
Hmm, any suggestions as to any libraries out there that would parse out the HTML without "doing way more than I want"? I also like TagSoup because it can...
... Hmm. Looking back, you said that "&lt;hello&gt;" came out "<hello>". That suggests that you are not correctly escaping on output. Are you using the...
Hi, I need to parse both XML and HTML in Saxon 9. I would like to use TagSoup to parse the HTML. Is there a way to use TagSoup to parse XML as well? I used...
... No, because there's no way of telling the difference between XML documents and HTML ones: "<p>Simple document</p>" could be either. Anyhow, TagSoup is...
Actually, I am not using an XMLWriter right now; I am just streaming the output out as a String as I parse through the DOM. In other words, when I come across an...
... Ouch. That is generally a bad thing to do (xml anti-pattern) ... Yes, and generally positive ones. :-D (i.e. it would be faster to use, not just safer) ...
... If your program doesn't have to work correctly, you can make it as fast as you want. But seriously, XMLWriter is pretty streamlined. -- Cash registers...
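The escaping point raised in this thread can be shown without TagSoup. The sketch below is an illustration under stated assumptions: it uses the JDK's `javax.xml.stream.XMLStreamWriter` as a stand-in for TagSoup's XMLWriter, and the class name `EscapeOnOutput` is hypothetical. A parser hands the application already-decoded text, so text that arrived as an escaped `<hello>` comes back as literal markup if it is concatenated into the output by hand, whereas a writer re-escapes it.

```java
import java.io.StringWriter;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamWriter;

// Hypothetical demo class; XMLStreamWriter stands in for TagSoup's XMLWriter.
public class EscapeOnOutput {

    // Hand-rolled output: the decoded text is pasted in as-is, so any
    // '<' or '&' in it becomes markup in the result.
    public static String naive(String decodedText) {
        return "<p>" + decodedText + "</p>";
    }

    // Writer-based output: writeCharacters() escapes markup characters.
    public static String viaWriter(String decodedText) {
        try {
            StringWriter out = new StringWriter();
            XMLStreamWriter w =
                XMLOutputFactory.newInstance().createXMLStreamWriter(out);
            w.writeStartElement("p");
            w.writeCharacters(decodedText);
            w.writeEndElement();
            w.close();
            return out.toString();
        } catch (XMLStreamException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String decoded = "<hello> & co"; // what the parser reports as text
        System.out.println(naive(decoded));     // contains a stray <hello> tag
        System.out.println(viaWriter(decoded)); // markup characters are escaped
    }
}
```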