tagsoup-friends · Friends of TagSoup
Messages 1125 - 1163 of 1386
1125
... That's a known problem that has to do with tags opened in each of various cells of a table and never closed again. I will fix it in the next release. -- ...
John Cowan
johnwcowan
Jun 1, 2008
7:59 am
1128
Hello Group, I've been using TagSoup with some data for which I do not know the encoding ahead of time and playing around with auto detection of character...
Nitay Joffe
nitay@...
Jun 3, 2008
11:52 pm
1129
Hello, I hope that the intent of TagSoup is to parse ugly HTML into a DOM (XML) so that both are displayed the same way in a modern web browser. It means that...
Martin Zdila
m.zdila@...
Jun 9, 2008
7:54 am
1131
Hello, I found a page with the following structure: <html><head>...</head><noscript><body>...</body></noscript><frameset>...</frameset></html> The body was thrown out...
Martin Zdila
m.zdila@...
Jun 9, 2008
8:52 am
1132
... Yes and no. TagSoup does attempt to produce output similar to that of Web browsers, but only within the limits of its design model. It does not contain...
John Cowan
johnwcowan
Jun 9, 2008
2:42 pm
1133
... Thanks. I'll add this to the next release. ... When I get time and energy to work on it enough to release it. ... Not at present. ... It's just me, except...
John Cowan
johnwcowan
Jun 9, 2008
2:47 pm
1134
Hello ... What I need is a simple thing ;-) - let SAX generate the events: open table, open tr, open td, text "cell1", close td, open span, text "err1", close...
Martin Zdila
m.zdila@...
Jun 9, 2008
3:12 pm
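The event sequence asked for in message 1134 can be made concrete with a small logging handler. This is only a sketch: `SaxEventLog` is a hypothetical helper, the input string is an assumed reconstruction of the table example, and the JDK's own SAX parser is used on well-formed input. TagSoup's `org.ccil.cowan.tagsoup.Parser` is also an `XMLReader`, so a handler like this could be attached to it, but the events it reports may differ because TagSoup restructures the markup.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of TagSoup: records SAX events in arrival order.
final class SaxEventLog extends DefaultHandler {
    final List<String> events = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        events.add("open " + qName);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        events.add("close " + qName);
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // Whitespace-only text is skipped to keep the log readable.
        String text = new String(ch, start, length).trim();
        if (!text.isEmpty()) {
            events.add("text \"" + text + "\"");
        }
    }

    // Parses well-formed markup with the JDK's SAX parser and returns the event log.
    static List<String> trace(String xml) {
        SaxEventLog log = new SaxEventLog();
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), log);
        } catch (Exception e) {
            throw new IllegalStateException("could not parse input", e);
        }
        return log.events;
    }

    public static void main(String[] args) {
        // Assumed input: a span placed directly inside a tr, after the first cell.
        trace("<table><tr><td>cell1</td><span>err1</span></tr></table>")
                .forEach(System.out::println);
    }
}
```

Running the same handler against both parsers is one way to see exactly where the two event streams diverge.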
1135
... You want to modify html.tssl, not html.stml (which is about the lexer). The simplest change *for this specific problem* is probably to add <contains...
John Cowan
johnwcowan
Jun 9, 2008
5:29 pm
1136
... Like John said, TagSoup operates at a lower-level, "below" a dom. So what you can do is to use a tree model such as XOM, and do additional fixing _you_...
Tatu Saloranta
cowtowncoder
Jun 9, 2008
7:14 pm
1137
Hello Tatu thanks 4 the reaction ... I am actually using xerces to build DOM from TagSoup and xalan for XPath processing, transformation and serialization....
Martin Zdila
m.zdila@...
Jun 9, 2008
8:14 pm
1138
Hello John ... Thanks 4 the hint, I'll try it. ... OK, thanks. Please see more in mail for Tatu. BTW there is one evil in the pages that even TagSoup can't...
Martin Zdila
m.zdila@...
Jun 9, 2008
8:19 pm
1139
... True. ... Unfortunately you can't, in the general case, have one without the other. In order to know when to close tags, you need to know what elements are...
John Cowan
johnwcowan
Jun 9, 2008
11:02 pm
1140
... Yes, in cases when some tag is not closed properly. But in my table example all tags are closed properly, so it is valid XML :-). Best regards -- Martin...
Martin Zdila
m.zdila@...
Jun 10, 2008
6:05 am
1143
... In that case you are probably better off with an XML parser. Well-formedness is a global property of a document that TagSoup can't know about in advance. ...
John Cowan
johnwcowan
Jun 10, 2008
1:12 pm
1144
... Unfortunately I have a mix of such constructs (which I would like to keep) and invalid tag nesting (which I'd like to fix), so I can't use an XML parser :-(. ...
Martin Zdila
m.zdila@...
Jun 10, 2008
1:49 pm
1145
... In that case, I think that the private patch I mentioned, and as many other patches as you need, are your only option at present. -- There is / One art...
John Cowan
johnwcowan
Jun 10, 2008
1:54 pm
1149
Hello, currently TagSoup moves block elements out of SPAN, B, I, SUP, ... and other inline elements. This causes rendering problems in the following scenarios: ...
Martin Zdila
m.zdila@...
Jun 24, 2008
7:39 am
1150
If I am not mistaken, block-level elements, by definition, cannot be included from within inline elements. Therefore, it seems that tagsoup is correctly...
Roger Ly
furbyman1976
Jun 24, 2008
8:00 am
1151
Hello Roger ... This is true. But for my use case I need TagSoup to create a DOM which, after serialization to XHTML, would be rendered in major browsers in...
Martin Zdila
m.zdila@...
Jun 24, 2008
8:32 am
1152
Hello, I found a page containing two "type" attributes with different values: <input tabindex="2" type="Password" value="" name="passw" maxlength='25' ...
Martin Zdila
m.zdila@...
Jun 24, 2008
12:09 pm
1153
Dear Friends of TagSoup! If you want to use TagSoup in XMLSpy as an external Tool to convert your .html files into XHTML this blog entry describes how you can...
xmltools
Jul 6, 2008
9:47 am
1155
A thread dump shows many threads stuck in the same method: at java/util/HashMap.get(HashMap.java:1596)[optimized] INFO | jvm 1 | 2008/07/23 02:53:30 | at...
houle.benoit
Jul 24, 2008
4:05 pm
1156
By the way, it is causing CPU usage of up to 100%, which is quite a big issue here... Thanks again....
houle.benoit
Jul 24, 2008
6:55 pm
1157
Hello, I am using Saxon9.1.0.1j and tagsoup-1.2.jar with java version "1.6.0_05" Java(TM) SE Runtime Environment (build 1.6.0_05-b13) Java HotSpot(TM) Client...
iwanttokeepanon
Aug 7, 2008
2:48 pm
1158
One more note, I am using a proxy to access the website in the document() expression. If that matters. (?)...
iwanttokeepanon
Aug 7, 2008
2:52 pm
1159
... TagSoup's HTTP handler doesn't know anything about gzipped streams, so it assumes they are ordinary data. That could be fixed, I suppose, but I feel...
John Cowan
johnwcowan
Aug 7, 2008
4:12 pm
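One possible workaround for the gzip issue in message 1159 is to fetch the page yourself and decompress it before the parser sees it. The sketch below is an assumption about how that might be done, not TagSoup's own behavior: `GzipSniffer` and its methods are hypothetical names, and it checks the two-byte gzip magic number (0x1f 0x8b) rather than any HTTP header.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper, not part of TagSoup.
final class GzipSniffer {
    // Wraps the stream in a GZIPInputStream if it starts with the gzip magic
    // number; otherwise returns the (buffered) stream unchanged.
    static InputStream maybeGunzip(InputStream raw) throws IOException {
        BufferedInputStream in = new BufferedInputStream(raw);
        in.mark(2);
        int b1 = in.read();
        int b2 = in.read();
        in.reset();
        return (b1 == 0x1f && b2 == 0x8b) ? new GZIPInputStream(in) : in;
    }

    // Reads a response body to text, decompressing first when needed.
    // UTF-8 is assumed here purely for the demonstration.
    static String decode(byte[] body) {
        try (InputStream in = maybeGunzip(new ByteArrayInputStream(body))) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Test helper: gzip-compresses a string so the demo has input to work on.
    static byte[] gzip(String text) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(bytes)) {
            out.write(text.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        String html = "<html><body>hi</body></html>";
        System.out.println(decode(gzip(html)));                            // compressed input
        System.out.println(decode(html.getBytes(StandardCharsets.UTF_8))); // plain input
    }
}
```

The stream returned by `maybeGunzip` could then be wrapped in an `org.xml.sax.InputSource` and handed to whichever SAX reader is in use.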
1160
Hi, this post is really helpful, but how do you change the output encoding on TagSoup programmatically (in Java)? ... US-ASCII will ... references ... to UTF-8,...
idrism
Aug 15, 2008
5:35 pm
1161
... Pass a Writer to XMLWriter that has been set up to do the desired output encoding. -- John Cowan cowan@... http://ccil.org/~cowan I am he that...
John Cowan
johnwcowan
Aug 15, 2008
5:38 pm
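TagSoup's `XMLWriter` has a constructor that takes a `java.io.Writer`, so the output encoding is decided by how that writer is built. A minimal sketch using only the JDK follows; `EncodedWriterDemo` and `encode` are hypothetical names, and the `XMLWriter` step is mentioned only in a comment so that the example runs without the TagSoup jar.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of TagSoup.
final class EncodedWriterDemo {
    // Writes text through a Writer bound to the given charset and returns the bytes.
    static byte[] encode(String text, Charset charset) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (Writer out = new OutputStreamWriter(bytes, charset)) {
            // With TagSoup on the classpath, `out` is the object that would be
            // passed to the XMLWriter constructor instead of being written to directly.
            out.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        String text = "caf\u00e9";
        // The same characters produce different byte counts per encoding.
        System.out.println(encode(text, StandardCharsets.UTF_8).length);
        System.out.println(encode(text, StandardCharsets.ISO_8859_1).length);
    }
}
```

The charset given to `OutputStreamWriter` should match whatever encoding the serialized document declares, otherwise consumers may misread non-ASCII characters.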
1162
Wow, thanks for the fast response, John. I have another issue though.. In the sources that I am parsing, there are things like &gt;, &quot;, &amp;, etc....
idrism
Aug 15, 2008
5:55 pm
1163
... Only by patching the code. I'm curious: why would you want to suppress entity decoding? -- John Cowan cowan@... http://ccil.org/~cowan If I have...
John Cowan
johnwcowan
Aug 15, 2008
5:59 pm