tagsoup-friends · Friends of TagSoup
Messages 1247 - 1276 of 1386
1247
Hi, I am having an issue. TagSoup seems to convert "&sigma;" to "?". Can I somehow configure it to leave it as is? Thanks, Samir...
samirss
Feb 9, 2009
5:49 pm
1248
... No. TagSoup interprets entity references on input, but does not regenerate them on output. But if you set the output encoding to something other than...
John Cowan
johnwcowan
Feb 9, 2009
6:52 pm
1249
I feel like I've seen this discussed at some point in the past 5 years, but I can't remember or find the answer. If an HTML page has an ampersand in the text,...
Michael Giles
michael_a_giles
Feb 13, 2009
6:45 pm
1250
... Yes, it should be handled (and returned as a raw &, to be escaped on output as &amp;). ... @#$*, I thought I got rid of that class of bug. Apparently the...
John Cowan
johnwcowan
Feb 13, 2009
7:21 pm
1251
Hi all, What is the best way to unit test the parser methods like startElement(), endElement(), ... one at a time, and by starting from reading an XML file...
ciel_et_espace
Feb 16, 2009
8:09 am
1252
... You got me there. Parsing is inherently a tightly coupled group of behaviors, since everything depends on building up a rather complex and varying state. ...
John Cowan
johnwcowan
Feb 16, 2009
8:26 am
1253
... Almost by definition unit testing doesn't read files. Passing your own arguments is the right way to *unit* test. That said, it is important to test with...
Elliotte Harold
elharo@...
Feb 16, 2009
2:42 pm
1254
Thank you for your answer. Your proposal tends to indicate that we need to go for an intrusive solution in which we modify the real code to throw exceptions...
ciel_et_espace
Feb 16, 2009
7:38 pm
1255
Yes I agree and that is what I am doing for the time being. I don't read files but I get my test input from unit test strings. BR, CP. ... from ... own ... ...
ciel_et_espace
Feb 16, 2009
7:40 pm
1256
... Without more context, I simply can't say. -- John Cowan cowan@... http://ccil.org/~cowan The penguin geeks is happy / As under the waves they lark ...
John Cowan
johnwcowan
Feb 16, 2009
8:04 pm
1257
With TSaxon the -H switch allows one to process (ill formed) HTML files when they are the source. What about when the source file is XML and you're trying to...
neville88
Feb 24, 2009
8:58 pm
1258
... I don't know any way to do that. The -H switch is just shorthand for the Saxon switch '-x org.ccil.cowan.tagsoup.Parser', and that affects both the main...
John Cowan
johnwcowan
Feb 24, 2009
10:11 pm
1259
I want to use TagSoup to process an HTML page (a malformed one) and I got it to work using the command line -H flag. However, when I tried it in code, following...
magmaruless
Mar 3, 2009
12:11 am
1260
OK, I worked around it =). I went to the page: http://home.ccil.org/~cowan/XML/tagsoup/tsaxon/StyleSheet.java and used the same method: ...
magmaruless
Mar 3, 2009
12:32 am
1261
As a follow-up: I ended up having to pass the output from TagSoup v1.2 into a build of htmlTidy in order to get it to parse in TinyXML for certain HTML samples...
kiru42
Mar 3, 2009
3:38 am
1262
... Looks like TinyXML is not a conforming XML parser, if it doesn't understand character references. To get UTF-8 output without entities, though, just...
John Cowan
johnwcowan
Mar 4, 2009
7:19 pm
1263
... Erm, I hate to be slightly rude, but haven't we had the conversation about the command line problems re: output encodings and win32? I started this whole...
kiru42
Mar 4, 2009
8:33 pm
1264
Hello there, I'm getting a java.lang.OutOfMemoryError after 400 XSLT transformations with Saxon, using TagSoup as the parser. Detailed exception:...
magmaruless
Mar 7, 2009
12:34 pm
1265
The documentation for XMLWriter says * <p>According to the XML Recommendation, <em>all</em> whitespace * in an XML document is potentially significant to an...
Klotz, Leigh
leighklotz
Mar 11, 2009
6:24 pm
1266
... If you look at the Infoset, you'll see that whitespace outside the root element is generally considered nonsignificant, despite the letter of the XML Rec....
John Cowan
johnwcowan
Mar 11, 2009
7:23 pm
1267
... [mailto:tagsoup-friends@yahoogroups.com] On Behalf Of John Cowan ... TagSoup 1.2 ... whitespace ... root ... question. John, Thank you for your quick...
Klotz, Leigh
leighklotz
Mar 11, 2009
8:41 pm
1268
... Sorry, quite right. Since I don't use Windows, I have no idea why the output encoding is broken (if that's really what's happening). Can someone using...
John Cowan
johnwcowan
Mar 11, 2009
9:25 pm
1269
... I can't replicate this: What I get is simply: <?xml version="1.0" standalone="yes"?> <html...
John Cowan
johnwcowan
Mar 11, 2009
9:32 pm
1270
I have found an 1811-line XHTML file with unbalanced tags that causes TagSoup to go into a loop. Is there a procedure for reporting such problems? Regards, Tom...
Tom Van Vleck
thvv
Mar 12, 2009
8:31 pm
1271
... Sure. Send it to me. -- A mosquito cried out in his pain, John Cowan "A chemist has poisoned my brain!"...
John Cowan
johnwcowan
Mar 12, 2009
8:48 pm
1272
... xmlns="http://www.w3.org/1999/xhtml"><body><b>hello</b><i>there</i> ... elementLevel == 1, ... thus ... I misspoke in the quoted text at the top of this...
Klotz, Leigh
leighklotz
Mar 12, 2009
9:49 pm
1273
... Yes, you're right. I've never paid attention to this before. ... Aha. ... I agree: line 632 should just be flushed. -- Clear? Huh! Why a four-year-old...
John Cowan
johnwcowan
Mar 12, 2009
10:33 pm
1274
Thanks! And to correct another typo for the record, I'm sending fragments, not fragmenents, which I guess is a back-formation from documenents. Leigh. ... ...
Klotz, Leigh
leighklotz
Mar 12, 2009
10:37 pm
1275
Hi, In a project where we use TagSoup to tidy some malformed XHTML code, we have found that if there is an odd number of quotes in the DOCTYPE declaration, TagSoup...
Miguel Garcia
miguel.garcia@...
Mar 18, 2009
10:58 am
1276
... The real problem is that TagSoup thinks the system-id begins with a quote and ends with a quote, but doesn't realize that it's zero-length. The obvious...
John Cowan
johnwcowan
Mar 18, 2009
8:45 pm

Copyright © 2009 Yahoo! Inc. All rights reserved.