There seems to be a problem finding elements by id if one creates a DOM using the tagsoup parser. The example program below illustrates this ... package...
My guess is that this could be related to there being no org.w3c.dom.traversal.DocumentTraversal classes in the default J2SE 1.4.2 from Sun, which uses the...
... That's definitely not the case: TagSoup reports id attributes on all elements as being of type ID. So the problem is downstream of TagSoup, and I don't...
... I very much doubt this. getElementsByTagName() does not depend on the traversal API. Much more likely you're running afoul of namespace issues here. ... ...
Elliotte Harold
elharo@...
Dec 5, 2004 10:29 am
190
... I recently discovered that almost everything to do with IDs is broken by design in DOM2. (Why does that not surprise me?) The practical impact is that it...
Elliotte Harold
elharo@...
Dec 5, 2004 10:35 am
192
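The id-lookup trouble discussed above can be sidestepped by walking the tree and comparing the "id" attribute directly, without relying on the attribute being typed as ID. The following is only a minimal sketch under stated assumptions: it uses the JDK's own DOM builder rather than a TagSoup-produced DOM, and the class name IdLookup, the helper findById, and the sample document are hypothetical names introduced for illustration.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;

// Hypothetical helper class, not part of TagSoup or the DOM API.
public class IdLookup {

    /** Depth-first search for the first element whose "id" attribute equals id. */
    public static Element findById(Node node, String id) {
        if (node instanceof Element) {
            Element e = (Element) node;
            // getAttribute returns "" when the attribute is absent, so equals is safe.
            if (id.equals(e.getAttribute("id"))) {
                return e;
            }
        }
        for (Node c = node.getFirstChild(); c != null; c = c.getNextSibling()) {
            Element hit = findById(c, id);
            if (hit != null) {
                return hit;
            }
        }
        return null;
    }

    /** Stand-in document (assumption): html > body > p with id="intro". */
    public static Document sampleDocument() {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element html = doc.createElement("html");
            Element body = doc.createElement("body");
            Element p = doc.createElement("p");
            p.setAttribute("id", "intro");
            doc.appendChild(html);
            html.appendChild(body);
            body.appendChild(p);
            return doc;
        } catch (ParserConfigurationException ex) {
            throw new IllegalStateException(ex);
        }
    }

    public static void main(String[] args) {
        Document doc = sampleDocument();
        // getElementById may return null here, because nothing marks "id" as an ID attribute.
        System.out.println("getElementById found: " + (doc.getElementById("intro") != null));
        System.out.println("findById found: " + findById(doc, "intro").getTagName());
    }
}
```

The tree walk is linear in the size of the document, so it suits occasional lookups. On DOM Level 3 implementations, calling Element.setIdAttribute("id", true) on each element is another option that lets getElementById work.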
Hi, Is it possible to completely disassociate the XHTML namespace from TagSoup's XML output? Using p.setFeature(Parser.namespacesFeature, false) is not removing...
Hi! I have a problem when parsing HTML with TagSoup. The parsing itself is no problem; everything is fine except that sometimes TagSoup is inserting tags when...
... That's what it is meant to do. ... Basically because bare tbody elements aren't allowed inside form elements. I'm reworking the definition of form elements...
Hi, I have the html: <button name="buttonName">Button Here</button> without a surrounding "<form>" tag (it is only for appearance -- the work gets done by the...
circus_peanuts_are_yummy scripsit: (Good Ghu, how can you eat those things?!) ... That is by design. The HTML DTD says that buttons can't appear outside...
... Hehe. I used to love them when I was a kid--I don't think I've actually eaten one in 10 years, though (I don't think they can actually be digested :)). ...
How would one go about stripping the scriptlet code out of a JSP? Would it be easy to change the source of tagsoup to parse the "<%" and "%>" tags differently?...
Hi, Sorry for all the postings lately, but I'm trying to parse Struts/Tiles JSPs, which include various types of tags, like "iterate", "define", "notEqual",...
... By default, TagSoup assumes that an unknown tag is empty, because it can have no assurance that a close-tag will follow. This can be changed on the...
... desirable. ... What I meant was: Would it be relatively simple for me to change the source to parse this? I'm not very familiar with parsing, scanning,...
... Parser object. ... Yeah, I was trying to do this, cause I saw on your website that this was a parser feature, but I had no idea how to set it. ... ...
... Oh, sure. It's just a simple patch to src/definitions/html.stml, which is the XML definition of the state machine that does the scanning. ... +++...
Hi again, How would I alter the html.tssl file to make it so that <table>, <tbody>, and <tr> tags are not added to a file that contains only a <td>? I assumed...
I got it to work by detaching the elements from their parents, inserting them as children of the root ("html") element, and not messing with any of their...
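The reparenting workaround described above (detach the elements, then insert them under the root element) might look roughly like this with the standard DOM API. This is a sketch under assumptions, not the poster's actual code: the class name Reparent, the method moveToRoot, and the sample table document are hypothetical, and a JDK-built DOM stands in for one produced by TagSoup.

```java
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper class, not part of TagSoup.
public class Reparent {

    /** Moves every element named tagName directly under the root; returns how many moved. */
    public static int moveToRoot(Document doc, String tagName) {
        Element root = doc.getDocumentElement();
        NodeList live = doc.getElementsByTagName(tagName);
        // Copy the live NodeList first so the moves cannot disturb the iteration.
        List<Element> snapshot = new ArrayList<>();
        for (int i = 0; i < live.getLength(); i++) {
            Element e = (Element) live.item(i);
            if (e != root) {            // the root cannot be appended to itself
                snapshot.add(e);
            }
        }
        for (Element e : snapshot) {
            // appendChild removes the node from its current parent before inserting it.
            root.appendChild(e);
        }
        return snapshot.size();
    }

    /** Stand-in document (assumption): html > body > table > tbody > tr > two td cells. */
    public static Document sampleDocument() {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element parent = doc.createElement("html");
            doc.appendChild(parent);
            for (String name : new String[] {"body", "table", "tbody", "tr"}) {
                Element child = doc.createElement(name);
                parent.appendChild(child);
                parent = child;
            }
            parent.appendChild(doc.createElement("td"));
            parent.appendChild(doc.createElement("td"));
            return doc;
        } catch (ParserConfigurationException ex) {
            throw new IllegalStateException(ex);
        }
    }

    public static void main(String[] args) {
        Document doc = sampleDocument();
        System.out.println("moved: " + moveToRoot(doc, "td"));
    }
}
```

The emptied table, tbody, and tr wrappers are left in place in this sketch; removing them afterwards would be a separate step.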
... Indeed. When I first started thinking about TagSoup, it became clear almost at once that the whole thing would have to be table-driven. There is actually...
Just wanted to say thanks for this great library. Ever since I found tagsoup and wrote a little toy app with it, I have felt like I have this huge hammer and...
I'm having difficulty disabling namespaces in the document I'm parsing, using 1.0rc1. I'm using the following code to parse a document and manipulate it with ...
Brian Lalor
blalor-k-yahoo.f6bdbf...
Jan 27, 2005 8:08 pm
213
Good morning, all. I've been using TagSoup for some time now for various tasks and absolutely love it. Like someone else said yesterday, it's such a great...
Brian Lalor
blalor-k-yahoo.f6bdbf...
Jan 28, 2005 12:20 pm
214
Is it possible to suppress the default attributes for elements that don't already have them set? For example, every <td/> tag now has colspan="1" and...
Brian Lalor
blalor-k-yahoo.f6bdbf...
Jan 28, 2005 1:34 pm
215
I have just released TagSoup 1.0rc2. This is not really a "release candidate", but rather a set of bug fixes to TagSoup 1.0rc1. There are still some known...
... This is not tuneable at runtime, but you can alter the src/definitions/html.stml file to do so. Just look for lines containing "attribute" elements with ...