Search the web
Sign In
New User? Sign Up
tagsoup-friends · Friends of TagSoup
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Show off your group to the world. Share a photo of your group with us.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 19 - 48 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
19
Currently TagSoup's behavior about entity references is as follows. If an entity is recognized by the schema, such as &nbsp;, it is turned into a single...
John Cowan
johnwcowan
Online Now Send Email
Feb 12, 2004
5:05 am
20
... I wouldn't do that, as for instance you do see people using for instance &eacute; in alt attributes. You could restrict that behaviour to href attributes...
Robin Berjon
robin.berjon@...
Send Email
Feb 12, 2004
10:52 am
21
JC: Clearly this can be fixed by being smart about not inserting ; when the entity reference is unknown. But I'm wondering if it wouldn't just be better to...
Danny Ayers
danny_ayers
Offline Send Email
Feb 12, 2004
11:32 am
22
... The SGML behavior could be something to consider. This is off the top of my head, and probably not exactly correct, but I believe an SGML parser that finds...
DuCharme, Bob (LNG-CHO)
philregion
Offline Send Email
Feb 12, 2004
2:23 pm
23
... I do that too. But unlike an SGML parser, I can't just cough and die in either of the two bad cases: unknown entity and missing semicolon. Too many HTML...
cowan@...
johnwcowan
Online Now Send Email
Feb 12, 2004
4:06 pm
24
I just got an off-list request to add support for HTML comments through the LexicalHandler interface. I wonder if anyone else thinks this feature is useful. ...
cowan@...
johnwcowan
Online Now Send Email
Feb 13, 2004
8:18 pm
25
Hi, We have a project for a national archive to translate data into standard formats for long term archiving. One of these formats is HTML. Whilst we will keep...
chris_bitmead
Offline Send Email
Feb 20, 2004
10:50 am
26
I 'll defer to John on the other questions... What does it mean "it does not convert presentation HTML to CSS"? I believe that means in cases like: ...
Danny Ayers
danny_ayers
Offline Send Email
Feb 20, 2004
11:32 am
27
... Forgive my ignorance, but is the latter valid xhtml? If so, why would anybody want to change it?...
Chris B.
chris_bitmead
Offline Send Email
Feb 20, 2004
11:40 am
28
... Forgive my ignorance, but is the latter valid xhtml? If so, why would anybody want to change it? <center> was deprecated in HTML 4.01, from which XHTML is...
Danny Ayers
danny_ayers
Offline Send Email
Feb 20, 2004
12:23 pm
29
... And do you know what JTidy and Neko do with this?...
Chris B.
chris_bitmead
Offline Send Email
Feb 20, 2004
12:31 pm
30
I'm not sure about JTidy, but the exe version of Tidy has an option - I just tried this : <center>text</center> Checking the "Output as XHTML" and "Replace...
Danny Ayers
danny_ayers
Offline Send Email
Feb 20, 2004
12:51 pm
31
Hmm, it still uses the xhtml-traditional dtd though doesn't it....
Chris B.
chris_bitmead
Offline Send Email
Feb 20, 2004
1:08 pm
32
... One possibility would be to use TagSoup as a prefilter for JTidy. The main danger is that TagSoup will mess up what JTidy would understand correctly,...
John Cowan
johnwcowan
Online Now Send Email
Feb 20, 2004
11:56 pm
33
... Correct. ... In fact, the HTML 4.01 DTD says that a center element can contain another one, so this is left alone, and two end-tags get added at the next ...
John Cowan
johnwcowan
Online Now Send Email
Feb 21, 2004
12:10 am
34
... That is what I was thinking. ... Do you have any reason to believe this could happen? ... So you're saying that.... ...
Chris B.
chris_bitmead
Offline Send Email
Feb 21, 2004
12:41 am
35
... Tidy can cope with mailformed start-tags better than TagSoup currently can; on occasion, TagSoup gets terminally confused about what's an attribute and...
John Cowan
johnwcowan
Online Now Send Email
Feb 21, 2004
1:15 am
36
... Ooh, yes please. Or if you don't have time tell me how to do it....
chris_bitmead
Offline Send Email
Feb 21, 2004
1:05 pm
37
... A quick and dirty hack is to add the following after line 343 of Parser.java (the call on theSchema.getElementType): if (type == null) return; As I say,...
John Cowan
johnwcowan
Online Now Send Email
Feb 21, 2004
7:38 pm
38
--On Saturday, February 21, 2004 11:40 AM +1100 "Chris B." <chris@...> wrote:r ... I've actually tried doing this (tagsoup as prefilter before jtidy)....
Reza Ferrydiansyah
rezaferry
Online Now Send Email
Feb 23, 2004
4:27 pm
39
FWIW I've found a really good use for TagSoup. I don't know if this is at all novel or what, but I'm writing an online tutorial on XQuery, and I'm using...
Howard Katz
howardk@...
Send Email
Feb 23, 2004
6:36 pm
40
... Fabulous! Can you mention, at least, what XQuery implementation you are using, and how you are persuading it to parse with TagSoup? -- John Cowan...
cowan@...
johnwcowan
Online Now Send Email
Feb 23, 2004
7:44 pm
41
Sure, I'm using my own engine (who else's?! :-) The exercise has also helped me uncover some new bugs in my implementation. All it took to persuade my engine...
Howard Katz
howardk@...
Send Email
Feb 23, 2004
8:57 pm
42
TagSoup maintains a stack of open elements, and knows which elements can be children of which. When a start-tag is found that can't be a child of the...
cowan@...
johnwcowan
Online Now Send Email
Feb 27, 2004
8:50 pm
43
I have just released TagSoup 0.9.2 and TSaxon 0.9.2. The changes to TagSoup: No longer inserts bogus ; after unknown entity reference without ; Consecutive...
cowan@...
johnwcowan
Online Now Send Email
Feb 27, 2004
9:55 pm
44
I've uploaded a test suite for TagSoup to the files area. (It's MIT-licensed, which should be compatible with everything else.) JUnit tests are included, as...
Joseph Walton
joe24906
Offline Send Email
Mar 6, 2004
5:34 pm
45
... Thank you! I'll move it to the TagSoup page on Monday, and probably incorporate it into the next release. ... Hmm, yes. I suppose it should generate...
John Cowan
johnwcowan
Online Now Send Email
Mar 7, 2004
9:27 am
46
Hi , I have tested tagsoup parser on http://www.yahoo.fr and i was really surpised on the events on the html content : in the first script tag content you can...
sebastien_bardoux
sebastien_ba...
Offline Send Email
Mar 9, 2004
5:47 pm
47
... I'd like to see these changes and possibly incorporate them into the next release. Can you send them to me, please? -- "While staying with the Asonu, I...
cowan@...
johnwcowan
Online Now Send Email
Mar 9, 2004
6:31 pm
48
Well, it's time for another public release of TagSoup, the SAX-compliant Java parser for nasty, ugly HTML. TagSoup 0.9.3 fixes most known bugs and provides ...
cowan@...
johnwcowan
Online Now Send Email
Mar 26, 2004
7:59 pm
Messages 19 - 48 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help