Search the web
Sign In
New User? Sign Up
tagsoup-friends · Friends of TagSoup
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Hear how Yahoo! Groups has changed the lives of others. Take me there.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 1259 - 1355 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
1259
I want to use Tagsoup to process a html page (a malformed one) and i got it to work using the comand line -H flag. However when i tried it in code, following...
magmaruless
Offline Send Email
Mar 3, 2009
12:11 am
1260
Ok i worked around it =). I went to the page: http://home.ccil.org/~cowan/XML/tagsoup/tsaxon/StyleSheet.java And used the same method: ...
magmaruless
Offline Send Email
Mar 3, 2009
12:32 am
1261
As a followup: I ended up having to pass the output from tagSoup v1.2 into a build of htmlTidy in order to get it to parse in TinyXML for certain html samples...
kiru42
Offline Send Email
Mar 3, 2009
3:38 am
1262
... Looks like TinyXML is not a conforming XML parser, if it doesn't understand character references. To get UTF-8 output without entities, though, just...
John Cowan
johnwcowan
Online Now Send Email
Mar 4, 2009
7:19 pm
1263
... Erm, I hate to be slightly rude, but haven't we had the conversation about the command line problems re: output encodings and win32? I started this whole...
kiru42
Offline Send Email
Mar 4, 2009
8:33 pm
1264
Hello there, I'm getting a java.lang.OutOfMemoryError after 400 xslt transformations with Saxon, using tagsoup as the parser. Detailed exception:...
magmaruless
Offline Send Email
Mar 7, 2009
12:34 pm
1265
The documentation for XMLWriter says * <p>According to the XML Recommendation, <em>all</em> whitespace * in an XML document is potentially significant to an...
Klotz, Leigh
leighklotz
Offline Send Email
Mar 11, 2009
6:24 pm
1266
... If you look at the Infoset, you'll see that whitespace outside the root element is generally considered nonsignificant, despite the letter of the XML Rec....
John Cowan
johnwcowan
Online Now Send Email
Mar 11, 2009
7:23 pm
1267
... [mailto:tagsoup-friends@yahoogroups.com] On Behalf Of John Cowan ... TagSoup 1.2 ... whitespace ... root ... question. John, Thank you for your quick...
Klotz, Leigh
leighklotz
Offline Send Email
Mar 11, 2009
8:41 pm
1268
... Sorry, quite right. Since I don't use Windows, I have no idea why the output encoding is broken (if that's really what's happening). Can someone using...
John Cowan
johnwcowan
Online Now Send Email
Mar 11, 2009
9:25 pm
1269
... I can't replicate this: What I get is simply: <?xml version="1.0" standalone="yes"?> <html...
John Cowan
johnwcowan
Online Now Send Email
Mar 11, 2009
9:32 pm
1270
I have found a 1811 line xhtml file with unbalanced tags that causes Tagsoup to go into a loop. Is there a procedure for reporting such problems? regards, tom...
Tom Van Vleck
thvv
Offline Send Email
Mar 12, 2009
8:31 pm
1271
... Sure. Send it to me. -- A mosquito cried out in his pain, John Cowan "A chemist has poisoned my brain!"...
John Cowan
johnwcowan
Online Now Send Email
Mar 12, 2009
8:48 pm
1272
... xmlns="http://www.w3.org/1999/xhtml"><body><b>hello</b><i>there</i> ... elementLevel == 1, ... thus ... I misspoke in the quoted text at the top of this...
Klotz, Leigh
leighklotz
Offline Send Email
Mar 12, 2009
9:49 pm
1273
... Yes, you're right. I've never paid attention to this before. ... Aha. ... I agree: line 632 should just be flushed. -- Clear? Huh! Why a four-year-old...
John Cowan
johnwcowan
Online Now Send Email
Mar 12, 2009
10:33 pm
1274
Thanks! And to correct another typo for the record, I'm sending fragments, not fragmenents, which I guess is a back-formation from documenents. Leigh. ... ...
Klotz, Leigh
leighklotz
Offline Send Email
Mar 12, 2009
10:37 pm
1275
Hi, In a proyect where we use Tagsoup to tidy some malformed xhtml code have found that if there is an odd number of quotes on the doctype declaration tagsoup...
Miguel Garcia
miguel.garcia@...
Send Email
Mar 18, 2009
10:58 am
1276
... The real problem is that TagSoup thinks the system-id begins with a quote and ends with a quote, but doesn't realize that it's zero-length. The obvious...
John Cowan
johnwcowan
Online Now Send Email
Mar 18, 2009
8:45 pm
1277
I seem to have found another place where TagSoup gets in a bit of a huff. Perhaps there is a flag I can specify to make things better. The issue is this piece...
Michael Giles
michael_a_giles
Online Now Send Email
Mar 26, 2009
10:10 pm
1279
Hi, I've encountered an issue using TagSoup and I wanted to clarify whether it is expected behaviour due to how I'm using it, or something else. The issue that...
James Abley
taboozizi
Offline Send Email
Apr 28, 2009
10:08 am
1280
... I can't duplicate this problem with TagSoup 1.2. It turns into a <br clear="none"></br>, because there's a default attribute value in the HTML 4.0 DTD,...
John Cowan
johnwcowan
Online Now Send Email
Apr 28, 2009
9:46 pm
1281
... Sorry, that's absolutely right. A later step in my XML pipeline is removing that element. Apologies for the noise. Cheers, James...
James Abley
taboozizi
Offline Send Email
Apr 29, 2009
9:05 pm
1286
I have been using TagSoup for some time for various tasks and it does a great job. One application uses TagSoup to parse HTML from the clipboard. Recently...
Leslie Software
lesliesoftware
Offline Send Email
May 29, 2009
10:41 am
1287
I figured it out. It was a class path problem. I'm using eclipse plug-ins so my plug-in as trying to use Xalan's SAX2DOM and the class loader was seeing two...
Leslie Software
lesliesoftware
Offline Send Email
May 29, 2009
8:18 pm
1320
Only a few of them got through my personal filters, and when I noticed the pattern, I removed and blocked the sending email and removed the messages from the...
John Cowan
johnwcowan
Online Now Send Email
Jun 11, 2009
6:28 pm
1321
Hello, I'm new to TagSoup and Groovy, and trying to parse some html, not well formed I'm afraid (why I use TagSoup). But I have some strange behaviour that I...
jeremiebousquet
Offline Send Email
Jun 21, 2009
7:48 am
1322
I'll let others with better HTML knowledge confirm but I have vague recollections that in theory, b tags are supposed to always be inside other tags, e.g. p...
Paul King
pking_asert
Offline Send Email
Jun 21, 2009
9:09 am
1323
... What's happening here is that TagSoup's model of HTML doesn't believe that a B element can have a P element inside it. B elements are part of the inline...
John Cowan
johnwcowan
Online Now Send Email
Jun 21, 2009
5:58 pm
1324
... It's quite clear now, thanks for your answer ! I managed to workaround by changing all "<b>" to "<bold>" (quite ugly but it works), but maybe just removing...
jeremiebousquet
Offline Send Email
Jun 23, 2009
6:09 pm
1355
Another huge burst of spam. I haven't had nearly this much trouble on the Google Groups mailing lists I manage. Would there be any objections to my moving...
John Cowan
johnwcowan
Online Now Send Email
Sep 2, 2009
10:14 pm
Messages 1259 - 1355 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help