Hello, I've downloaded TagSoup version 1.1.3 from the TagSoup homepage and am able to use it with Java 6. Now I've tried to use it with Java 5 (1.5.0_12),...
Ole Laurisch
ole.laurisch@...
Dec 12, 2007 4:29 pm
965
Hm... works for me on Java 1.5.0_14. Just to get the ball rolling, I'll ask the usual starter question (with no insult meant): are you sure you have...
For interest's sake, in our app we tell TagSoup to retain comments from the HTML input like this: XMLWriter xmlWriter; try { xmlWriter = new XMLWriter(new ...
Hi Mark, at first I wanted to answer "Hey, c'mon! Sure I have the tagsoup jar in my classpath", but then I double checked it and found out the following. All...
Ole Laurisch
ole.laurisch@...
Dec 13, 2007 7:45 am
969
I have found a web site [http://canada.com/] which uses '<?xml:namespace prefix = cwi />' in many of its pages, including its main page. The pages start with...
... Just a minor nitpick: ... It is actually not even well-formed XML: processing instructions cannot have a target that starts with 'xml' (case insensitive),...
... Right. Considered without regard to case: <?xml ...> is not well formed as a PI (the XML declaration is not a PI); <?xml:foo ...?> is XML well-formed, but...
... You are absolutely correct. :-) My mistake -- I did mix up rules for PI names and restrictions on reserved namespace prefixes (where anything starting with...
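The distinction drawn in this thread can be checked with a small sketch that uses only the SAX parser bundled with the JDK. The class name PiTargetCheck and the helper parses are made up for illustration, and the test input is a properly closed variant of the instruction quoted above (the original on canada.com ends in "/>" rather than "?>"). The assumption is that the default JDK parser is in use; other parsers may report these cases differently.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, not part of TagSoup or of the original thread.
public class PiTargetCheck {

    /** Returns true if the parser reads the whole document without a fatal error. */
    static boolean parses(String xml, boolean namespaceAware) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(namespaceAware);
        SAXParser parser = factory.newSAXParser();
        try {
            // DefaultHandler rethrows fatal errors as SAXParseException.
            parser.parse(new InputSource(new StringReader(xml)), new DefaultHandler());
            return true;
        } catch (SAXException e) {
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        // PI whose target contains a colon: a legal XML 1.0 name,
        // but not allowed once namespace processing is switched on.
        String colonTarget = "<?xml:namespace prefix=\"cwi\"?><html/>";
        // The reserved target "xml" used away from the start of the document.
        String reservedTarget = "<html><?xml version=\"1.0\"?></html>";

        System.out.println("colon target, namespaces off: " + parses(colonTarget, false));
        System.out.println("colon target, namespaces on:  " + parses(colonTarget, true));
        System.out.println("reserved target 'xml':        " + parses(reservedTarget, false));
    }
}
```

TagSoup itself is meant to cope with this kind of input, so the sketch only illustrates why a strict XML parser may refuse such pages depending on whether namespace processing is enabled.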
As a New Year's present to the TagSoup community (and to fulfill a pre-New-Year resolution of mine), I've completed development work on TagSoup 1.2. This is...
There are a great many changes, most of them fixes for long-standing bugs, in this release. Only the most important are listed here; for the rest, see the...
... Thanks. -- John Cowan cowan@... http://ccil.org/~cowan Female celebrity stalker, on a hot morning in Cairo: "Imagine, Colonel Lawrence, ninety-two...
Hello, My program takes a string from a database which contains something like: "<p>l’eau est froide</p>". the ’ entity is '. In my SAX parser, I...
... There is, but can you send me the input that provokes this crash? ... You need to build with Ant after installing Saxon, as noted on the source page. Just...
Hi, I want to use TagSoup for parsing HTML that contains some custom XML tags in the <head> section. As far as I understood the documentation, I have to add my...
... That's correct. What I don't know from your example is whether you guarantee that the XML parts are always going to be well-formed and valid internally,...
Wow, that was a lot of information, thank you! ... TagSoup ... I read that pdf file already, but I found nothing about the groups and their intentions in...
... You must be using some kind of output engine other than the supplied XMLWriter, I guess. I don't know what these are symptoms of. ... This is an excellent...
Hi everyone, First, thanks to John and others for your excellent work on TagSoup, it is one of those tools I find very useful, very often. I know this may be...
... Thank you! ... If the invalid structure is only in the HTML parts, and the RSS parts are basically well-structured, then TagSoup 1.2 should simply do the...
... Yep, I do realize I could do this. Since, like you mentioned, the RSS/Atom structure is quite extensible, I thought it'd be great to have a single pass ...
As I read the web page, setting the namespaces feature to false should result in elements in the default namespace. Instead, I get elements in the xhtml...