This is hopefully the last message you'll be seeing from Yahoo; the next should be a welcome message from Google. I've migrated everyone with their normal vs....
Another huge burst of spam. I haven't had nearly this much trouble on the Google Groups mailing lists I manage. Would there be any objections to my moving...
Hi, this might have been asked a couple of times already, but searching the forum did not really help. I have marked-up text, mixing HTML tags with a custom...
Hi, I noticed that after my XHTML is parsed, the empty link tags <link /> are converted to start and end tags, e.g. <link></link>. Does anyone know how I can...
Hello, I'm new to TagSoup and Groovy, and I'm trying to parse some HTML that is not well formed, I'm afraid (which is why I use TagSoup). But I have some strange behaviour that I...
Only a few of them got through my personal filters, and when I noticed the pattern, I removed and blocked the sending email and removed the messages from the...
I figured it out. It was a classpath problem. I'm using Eclipse plug-ins, so my plug-in was trying to use Xalan's SAX2DOM and the class loader was seeing two...
I have been using TagSoup for some time for various tasks and it does a great job. One application uses TagSoup to parse HTML from the clipboard. Recently...
Hi, I've encountered an issue using TagSoup and I wanted to clarify whether it is expected behaviour due to how I'm using it, or something else. The issue that...
I seem to have found another place where TagSoup gets in a bit of a huff. Perhaps there is a flag I can specify to make things better. The issue is this piece...
Hi, in a project where we use TagSoup to tidy some malformed XHTML code, we have found that if there is an odd number of quotes in the doctype declaration, TagSoup...
The documentation for XMLWriter says * <p>According to the XML Recommendation, <em>all</em> whitespace * in an XML document is potentially significant to an...
I have found an 1811-line XHTML file with unbalanced tags that causes TagSoup to go into a loop. Is there a procedure for reporting such problems? regards, tom...
Greetings. So, I'm using the tagSoup-1.2.jar file as a stand-alone program which I shell out to. What I'm trying to do here is convert in-the-wild HTML into...
I want to use TagSoup to process an HTML page (a malformed one), and I got it to work using the command-line -H flag. However, when I tried it in code, following...
With TSaxon, the -H switch allows one to process (ill-formed) HTML files when they are the source. What about when the source file is XML and you're trying to...
Hi all, What is the best way to unit test the parser methods like startElement(), endElement(), ... one at a time, and by starting from reading an XML file...
I feel like I've seen this discussed at some point in the past 5 years, but I can't remember or find the answer. If an HTML page has an ampersand in the text,...
Hey, thanks for some great software! I'm having some trouble manipulating HTML by parsing it with TagSoup into a DOM and then writing it out again. The main...
Here's a simplified example of the HTML I'm trying to parse: <p> <span id="data"> <p>important information</p> </span> </p> And here's what I get out of...
... I don't know. I built it as a library, and originally added the stand-alone application support for my own testing purposes, but I suspect that many...
Hello, I'm using Flying Saucer (https://xhtmlrenderer.dev.java.net/) in my application together with TagSoup. Someone reported the following error and I'm not...