tagsoup-friends · Friends of TagSoup

Messages 254 - 283 of 1386
254
Awesome tool, John! This library fixes 99% of my HTML parsing woes. I was trying to build a page analyzer tool that grabs the visible content of a webpage. Then I...
gimmemyhoney
Apr 6, 2005
3:40 pm
255
... That's a genuine bug: anything beginning "</script" is currently recognized as the end-of-script tag, because I don't (as I should) check for the final...
John Cowan
johnwcowan
Apr 6, 2005
9:24 pm
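
The check described in message 255 can be sketched as follows. This is only an illustration of the idea, not TagSoup's actual scanner code: the class and method names are hypothetical, and allowing whitespace before the final ">" is an assumption.

```java
public class ScriptEndTagCheck {
    // Hypothetical helper: returns true only if a complete "</script ... >"
    // end tag starts at offset i, i.e. the name is followed by optional
    // whitespace and then the final '>'.
    static boolean isScriptEndTag(String s, int i) {
        String name = "</script";
        // Case-insensitive comparison of the tag name at offset i.
        if (!s.regionMatches(true, i, name, 0, name.length())) {
            return false;
        }
        int j = i + name.length();
        // Skip whitespace between the name and '>' (an assumption of this sketch).
        while (j < s.length() && Character.isWhitespace(s.charAt(j))) {
            j++;
        }
        return j < s.length() && s.charAt(j) == '>';
    }

    public static void main(String[] args) {
        System.out.println(isScriptEndTag("</script>", 0));
        System.out.println(isScriptEndTag("</scripting>", 0));
    }
}
```

With a check like this, text such as "</scripting>" inside a script no longer ends the element, because the characters after the name are inspected before the tag is accepted.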
256
... woes. ... will ... recognized ... the final ">". ... Yup, you are right. I did more testing and realized that once it recognizes "</script", the next "</" will...
gimmemyhoney
Apr 8, 2005
2:15 pm
257
Hi! This HTML: ============================ <STYLE TYPE="text/css">.ExploreAreaCSS { FONT-SIZE: 12px; COLOR: #000000; FONT-FAMILY: Arial }.ExploreAreaBGColor...
François Beausoleil
francois_bea...
Apr 14, 2005
5:46 pm
258
This may or may not be a TagSoup problem, but perhaps someone here will know. I have a simple program that takes in HTML, transforms it with TagSoup into well...
Henry Story
hjsatdoc
Apr 26, 2005
11:28 am
259
Hey! Has anyone seen this, and can anyone give me a heads-up? ... -- François Beausoleil Solutions Technologiques Internationales Téléphone: (819) 566-5997...
François Beausoleil
francois_bea...
Apr 26, 2005
12:16 pm
260
... Sorry for not responding earlier. ... It's a bug. The routine for detecting the end of a script or style element isn't general enough yet. I'll try to...
John Cowan
johnwcowan
Apr 26, 2005
1:34 pm
261
... I think that your XSLT implementation is adding a META (in upper case) element in HTML output mode without noticing if one is already in the result tree or...
John Cowan
johnwcowan
Apr 26, 2005
2:13 pm
262
Yes, you are correct. My XSLT implementation is the default one provided by Sun's JDK. I found an explanation of the behavior here: ...
Henry Story
hjsatdoc
Apr 26, 2005
4:04 pm
263
... Thanks. ... I think you are doing the Right Thing, since that guarantees that the meta element specifies the correct encoding. Maybe someday TagSoup will...
John Cowan
johnwcowan
Apr 26, 2005
4:32 pm
264
John, I followed your advice and subscribed to the TagSoup list. As I was saying, many popular sites have lots of nested JavaScript in their HTML and TagSoup...
Luca Passani
luca_passani
May 8, 2005
10:54 am
265
... This definitely seems to be a result of the known problem with detecting end-tags in script and style elements. -- John Cowan www.ccil.org/~cowan...
John Cowan
johnwcowan
May 17, 2005
2:55 pm
266
... It sounds like you'd be better off with jchardet, a Java port of the Mozilla encoding guesser. Its result can be set into the InputSource object you pass...
John Cowan
johnwcowan
May 21, 2005
4:51 pm
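
As a rough sketch of the suggestion in message 266, a detected encoding can be set on the SAX InputSource before it is handed to the parser. The jchardet API itself is not shown here: the detectedCharset parameter stands in for whatever the guesser returns, and the class and method names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import org.xml.sax.InputSource;

public class EncodingHint {
    // Hypothetical helper: wraps raw page bytes in an InputSource and records
    // the charset name produced by an external encoding guesser. The guesser
    // call is assumed to happen elsewhere; pass null if nothing was detected.
    static InputSource withEncoding(byte[] html, String detectedCharset) {
        InputSource in = new InputSource(new ByteArrayInputStream(html));
        if (detectedCharset != null) {
            // setEncoding only matters for byte streams; it is ignored when a
            // character stream (Reader) is supplied instead.
            in.setEncoding(detectedCharset);
        }
        return in;
    }

    public static void main(String[] args) {
        byte[] page = "<html><body>caf\u00e9</body></html>".getBytes();
        InputSource in = withEncoding(page, "ISO-8859-1");
        System.out.println(in.getEncoding());
    }
}
```

The resulting InputSource is then passed to the parser's parse method in the usual SAX way.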
267
Thank you, John. I'll give it a try... Luca...
Luca Passani
luca_passani
May 21, 2005
7:55 pm
268
Hello, TagSoup + XOM here. I get an error somewhere deep in my XML manipulations that emerges as a ParsingException and the message "-1" :( Unfortunately, I...
Luca Passani
luca_passani
May 23, 2005
8:33 am
269
There seems to be an amazing number of pages out there with multiple body tags! I guess this comes from people doing includes of whole pages. It would be nice...
Luca Passani
luca_passani
May 23, 2005
8:49 am
270
... That's one source. An old bug in early versions of Netscape meant that background-color attributes in multiple body tags would be interpreted dynamically,...
John Cowan
johnwcowan
May 23, 2005
11:52 am
271
... I understand. I am parsing real (i.e. ugly) HTML using XOM's NodeFactory. What's the best strategy to remove those extra body tags? I tried using booleans...
Luca Passani
luca_passani
May 24, 2005
6:38 am
272
Luca, I wrote a SAX XMLFilter to filter body tags for my own purposes, and stuck it into CommandLine for testing. It goes between two lines of...
Klotz, Leigh
leighklotz
May 24, 2005
5:22 pm
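
The filter from message 272 is not reproduced in this listing, so the following is only a minimal sketch of one way a SAX XMLFilter could merge multiple body elements into one. The class name SingleBodyFilter and the helper filterToString are hypothetical, and the sketch assumes every body sits inside an html element. The demo uses the JDK's own SAX parser on well-formed input; with TagSoup, its parser would be passed as the parent reader instead.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;
import org.xml.sax.helpers.XMLFilterImpl;

public class SingleBodyFilter extends XMLFilterImpl {
    // Names of the first body start tag, replayed when the merged body closes.
    private String bodyUri, bodyLocal, bodyQName;
    private boolean bodyOpen;

    public SingleBodyFilter(XMLReader parent) { super(parent); }

    private static boolean named(String name, String local, String qName) {
        return name.equalsIgnoreCase(local) || name.equalsIgnoreCase(qName);
    }

    @Override
    public void startDocument() throws SAXException {
        bodyOpen = false;
        super.startDocument();
    }

    @Override
    public void startElement(String uri, String local, String qName, Attributes atts)
            throws SAXException {
        if (named("body", local, qName)) {
            if (bodyOpen) return;          // drop every body start tag after the first
            bodyOpen = true;
            bodyUri = uri; bodyLocal = local; bodyQName = qName;
        }
        super.startElement(uri, local, qName, atts);
    }

    @Override
    public void endElement(String uri, String local, String qName) throws SAXException {
        if (named("body", local, qName)) return;   // deferred until html closes
        if (bodyOpen && named("html", local, qName)) {
            super.endElement(bodyUri, bodyLocal, bodyQName);  // single merged close
            bodyOpen = false;
        }
        super.endElement(uri, local, qName);
    }

    // Demo helper: runs the filter and records the event stream as text.
    static String filterToString(String xml) {
        final StringBuilder out = new StringBuilder();
        try {
            XMLReader base = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            SingleBodyFilter filter = new SingleBodyFilter(base);
            filter.setContentHandler(new DefaultHandler() {
                @Override public void startElement(String u, String l, String q, Attributes a) {
                    out.append('<').append(q).append('>');
                }
                @Override public void endElement(String u, String l, String q) {
                    out.append("</").append(q).append('>');
                }
                @Override public void characters(char[] ch, int start, int len) {
                    out.append(ch, start, len);
                }
            });
            filter.parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(filterToString(
            "<html><body><p>a</p></body><body><p>b</p></body></html>"));
    }
}
```

Deferring the close until the html end tag means both nested and sequential body elements collapse into one, at the cost of also pulling any content that sat between two bodies into the merged element.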
273
... I found this common case (http://www.gazzetta.it, a very popular sports site in Italy): <script type="text/javascript" language="JavaScript1.2" ...
Luca Passani
luca_passani
May 28, 2005
8:17 am
274
... I am using XOM's NodeFactory to parse raw HTML. My problem is that I am using the body closing tag as the cue point to start collecting statistics about a...
Luca Passani
luca_passani
May 28, 2005
8:30 am
275
... XSLT is your friend; so is the full use of the XOM model. You are trying to strain the limits of a streaming API beyond what's reasonable. -- John Cowan...
John Cowan
johnwcowan
May 28, 2005
6:18 pm
276
Finally, a new release of TagSoup and TSaxon. Summary of changes: convert CR and CRLF to LF in comments and PIs; force empty elements to close immediately; match...
John Cowan
johnwcowan
May 28, 2005
9:00 pm
277
Good job! The new version seems to do a great job with JavaScript.... Luca...
Luca Passani
luca_passani
May 28, 2005
10:54 pm
278
Hello, I'm trying to replace the Aelfred parser I've been using for a simple XHTML web browser with the TagSoup parser, but I get a VerifyError ...
thunderbearshammer
thunderbears...
May 31, 2005
9:52 am
279
My mistake: I was using JDK 1.3; with 1.4 it works fine. Best regards, Thorbjørn Vynne...
thunderbearshammer
thunderbears...
May 31, 2005
11:09 am
280
Hello, I know I won't be able to compile under JDK 1.1.8 directly, but is the code (including the code generated from the XSLT transformations)...
thunderbearshammer
thunderbears...
May 31, 2005
11:10 am
281
... I use 1.4 to develop. I wonder if recompiling it with the 1.3 version of javac would make a difference. -- Only do what only you can do....
John Cowan
johnwcowan
May 31, 2005
1:12 pm
282
... Well, you'd have to go through and change references to HashMap into Hashtable, but that's all I can think of offhand. ... Thank you. -- At the end of the...
John Cowan
johnwcowan
May 31, 2005
1:14 pm
283
Perfect, I have it up and running under JDK 1.1.8. Thanks a lot :-) Best regards, Thorbjørn Vynne ... www.reutershealth.com ... www.ccil.org/~cowan...
thunderbearshammer
thunderbears...
May 31, 2005
3:25 pm