Hi, following up on an old question... I need a tool like tagsoup for the BlogEd editor that I am writing. I need to be able to go through the following...
I have used TagSoup for just that. I found a few tricky things, one being that it seemed to incorrectly escape characters in JavaScript (turning < into &lt;)...
Super, mega-duper, cool, man! :-) You would not have any quick hints or small examples for how best to output html from the dom created by tag soup (problem...
... TagSoup doesn't export a DOM interface, but only a SAX one. You can use any of a variety of DOM or DOM-like APIs that can plug in SAX parsers: dom4j,...
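For readers who want to see the SAX-to-DOM plumbing, here is a minimal sketch using only the JAXP classes in the JDK. It feeds any `XMLReader` through the identity transformer into a `DOMResult`, then writes the tree back out with the `html` output method. The class and method names (`SaxToDom`, `toDom`, `toHtml`) are made up for illustration, and the demo uses the JDK's own SAX parser so that it runs without extra jars. TagSoup's `org.ccil.cowan.tagsoup.Parser` is an `XMLReader`, so it should be possible to pass one in instead, but that is not tested here.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.SAXParserFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMResult;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.sax.SAXSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;

class SaxToDom {
    // Run a SAX parser and collect its events into a DOM tree by way of
    // the JAXP identity transformer (hypothetical helper name).
    static Document toDom(XMLReader reader, InputSource input) throws Exception {
        Transformer identity = TransformerFactory.newInstance().newTransformer();
        DOMResult result = new DOMResult(); // the transformer creates the Document
        identity.transform(new SAXSource(reader, input), result);
        return (Document) result.getNode();
    }

    // Serialize a DOM node using the "html" output method rather than XML.
    static String toHtml(Node node) throws Exception {
        Transformer t = TransformerFactory.newInstance().newTransformer();
        t.setOutputProperty(OutputKeys.METHOD, "html");
        StringWriter out = new StringWriter();
        t.transform(new DOMSource(node), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // Stand-in parser: the JDK's SAX parser. A TagSoup Parser instance
        // could presumably be substituted for this XMLReader.
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        XMLReader reader = factory.newSAXParser().getXMLReader();

        String markup = "<html><body><p>Hello</p></body></html>";
        Document doc = toDom(reader, new InputSource(new StringReader(markup)));
        System.out.println(toHtml(doc));
    }
}
```

The same `toDom` helper works with dom4j- or JDOM-style builders replaced by plain W3C DOM; which API is more pleasant is a matter of taste.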
Thanks to everyone for the really great help [0]. I followed your suggestions and together with some good sources on the web [1] I put together the following...
There seems to be a problem finding elements by id if one creates a DOM using the tagsoup parser. The example program below illustrates this ... package...
My guess is that this could be related to there being no org.w3c.dom.traversal.DocumentTraversal classes in the default J2SE 1.4.2 from Sun, which uses the...
... That's definitely not the case: TagSoup reports id attributes on all elements as being of type ID. So the problem is downstream of TagSoup, and I don't...
... I very much doubt this. getElementsByTagName() does not depend on the traversal API. Much more likely you're running afoul of namespace issues here. ... ...
Elliotte Harold
elharo@...
Dec 5, 2004 10:29 am
190
... I recently discovered that almost everything to do with IDs is broken by design in DOM2. (Why does that not surprise me?) The practical impact is that it...
Elliotte Harold
elharo@...
Dec 5, 2004 10:35 am
192
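One possible workaround for the `getElementById` problem, sketched here under a loud assumption: it relies on DOM Level 3's `Element.setIdAttribute`, which is in the JDK from Java 5 on and was not available in the J2SE 1.4.2 setup discussed in this thread. The idea is to walk the tree after it is built and flag every `id` attribute as an ID by hand. The class and method names (`IdFixup`, `markIds`, `parse`) are invented for the example, and the document is parsed with the JDK's own builder rather than TagSoup.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

class IdFixup {
    // Recursively mark every attribute named "id" as an ID attribute so
    // that Document.getElementById can find the owning element.
    static void markIds(Node node) {
        if (node instanceof Element) {
            Element e = (Element) node;
            if (e.hasAttribute("id")) {
                e.setIdAttribute("id", true); // DOM Level 3 call
            }
        }
        for (Node c = node.getFirstChild(); c != null; c = c.getNextSibling()) {
            markIds(c);
        }
    }

    // Hypothetical helper: parse a string with the JDK's DOM builder.
    static Document parse(String xml) throws Exception {
        return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        Document doc = parse("<html><body><p id=\"intro\">Hi</p></body></html>");
        // Without a DTD declaring the attribute type, the lookup is
        // expected to come back empty at this point.
        System.out.println("before: " + doc.getElementById("intro"));
        markIds(doc);
        System.out.println("after: " + doc.getElementById("intro").getTagName());
    }
}
```

If DOM Level 3 is not available, a plain recursive search that compares `getAttribute("id")` against the wanted value is the fallback.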
Hi, Is it possible to completely dissociate the XHTML namespace from TagSoup's XML output? Using p.setFeature(Parser.namespacesFeature, false) is not removing...
hi! i have a problem when parsing html with tagsoup. the parsing itself is no problem, everything is fine except that sometimes tagsoup is inserting tags when...
... That's what it is meant to do. ... Basically because bare tbody elements aren't allowed inside form elements. I'm reworking the definition of form elements...
Hi, I have the html: <button name="buttonName">Button Here</button> without a surrounding "<form>" tag (it is only for appearance -- the work gets done by the...
circus_peanuts_are_yummy scripsit: (Good Ghu, how can you eat those things?!) ... That is by design. The HTML DTD says that buttons can't appear outside...
... Hehe. I used to love them when I was a kid--I don't think I've actually eaten one in 10 years, though (I don't think they can actually be digested :)). ...
How would one go about stripping the scriptlet code out of a JSP? Would it be easy to change the source of tagsoup to parse the "<%" and "%>" tags differently?...
Hi, Sorry for all the postings lately, but I'm trying to parse Struts/Tiles JSPs, which include various types of tags, like "iterate", "define", "notEqual",...
... By default, TagSoup assumes that an unknown tag is empty, because it can have no assurance that a close-tag will follow. This can be changed on the...
... desirable. ... What I meant was: Would it be relatively simple for me to change the source to parse this? I'm not very familiar with parsing, scanning,...
... Parser object. ... Yeah, I was trying to do this, cause I saw on your website that this was a parser feature, but I had no idea how to set it. ... ...
... Oh, sure. It's just a simple patch to src/definitions/html.stml, which is the XML definition of the state machine that does the scanning. ... +++...
Hi again, How would I alter the html.tssl file to make it so that <table>, <tbody>, and <tr> tags are not added to a file that contains only a <td>? I assumed...
I got it to work by detaching the elements from their parents, inserting them as children of the root ("html") element, and not messing with any of their...
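A rough sketch of the detach-and-reinsert approach described above, using plain W3C DOM from the JDK. It only covers the part that is spelled out in the message: each matching element is removed from its parent and appended directly under the root element. The names `Reparent` and `moveToRoot` are invented for the example, and the sample markup is an assumption about what the parsed tree might look like.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class Reparent {
    // Detach every element with the given tag name from its parent and
    // append it as a child of the root element. Returns how many moved.
    static int moveToRoot(Document doc, String tagName) {
        Element root = doc.getDocumentElement();
        NodeList live = doc.getElementsByTagName(tagName);

        // The NodeList is live, so take a snapshot before changing the tree.
        List<Element> snapshot = new ArrayList<>();
        for (int i = 0; i < live.getLength(); i++) {
            Element e = (Element) live.item(i);
            if (e != root) {
                snapshot.add(e);
            }
        }
        for (Element e : snapshot) {
            e.getParentNode().removeChild(e); // detach from the old parent
            root.appendChild(e);              // re-insert under the root
        }
        return snapshot.size();
    }

    public static void main(String[] args) throws Exception {
        String markup = "<html><body><table><tbody><tr>"
                + "<td>a</td><td>b</td></tr></tbody></table></body></html>";
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new InputSource(new StringReader(markup)));
        int moved = moveToRoot(doc, "td");
        System.out.println("moved " + moved + " td elements");
    }
}
```

The now-empty wrapper elements are left in place here; whether to remove them depends on what the output is used for.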
... Indeed. When I first started thinking about TagSoup, it became clear almost at once that the whole thing would have to be table-driven. There is actually...
Just wanted to say thanks for this great library. Ever since I found tagsoup and wrote a little toy app with it, I have felt like I have this huge hammer and...