Search the web
Sign In
New User? Sign Up
tagsoup-friends · Friends of TagSoup
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Show off your group to the world. Share a photo of your group with us.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 589 - 629 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
589
Hi there! We are using TagSoup for our Web crawler, and we found for the page at http://www.borngayprocon.org/ TagSoup consider <!-[if IE]> as a comment, and ...
Eugeny N Dzhurinsky
bofh@...
Send Email
Dec 7, 2006
9:10 am
590
Hi! I have recently come across TagSoup and want to see whether I can use it instead of JTidy. I need t be able to clean up HTML documents in a wide range of ...
Jaran Nilsen
jaranmann
Offline Send Email
Dec 7, 2006
12:37 pm
591
... That is because TagSoup does not know which characters can be safely written to which encodings, so it plays safe and uses character references for all...
John Cowan
johnwcowan
Online Now Send Email
Dec 7, 2006
1:22 pm
592
I brought up conditional IE comments a while back. I showed using some pathological examples of IE conditionals that it's impossible to proper SAX events if...
Klotz, Leigh
leighklotz
Offline Send Email
Dec 12, 2006
6:54 pm
593
... Quite so. But there is a bug involving comments that lack the second minus sign: <!-foo--> causes TagSoup to malfunction. -- John Cowan cowan@......
John Cowan
johnwcowan
Online Now Send Email
Dec 12, 2006
8:56 pm
595
... Is the already-posted patch the recommended work-around? Is there a better work-around? Thanks, -c...
chconnor
Offline Send Email
Dec 15, 2006
7:30 am
596
Hello! We faced another problem - when parsing a HTML document, which contains the link like <a href="something.php?param=value&cap=anothervalue">, the &cap is...
Eugeny N Dzhurinsky
bofh@...
Send Email
Dec 15, 2006
8:15 am
597
... I haven't had a chance to evaluate it yet. -- Well, I have news for our current leaders John Cowan and the leaders of tomorrow: the Bill of...
John Cowan
johnwcowan
Online Now Send Email
Dec 15, 2006
1:46 pm
599
... Because people do in fact often leave the final semicolon off. Browsers know which attributes contain URIs and apparently don't expand entity references in...
John Cowan
johnwcowan
Online Now Send Email
Dec 15, 2006
3:31 pm
600
... You're probably aware of this, but it's slightly more complicated than that (when isn't it?) because: 1. The semicolon may be omitted under certain...
Nick Fitzsimons
nick@...
Send Email
Dec 15, 2006
4:16 pm
610
In command line mode with TagSoup I see this: $ java -jar tagsoup-1.0.1.jar vm.html src: vm.html <?xml version="1.0" standalone="yes"?> <html...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
3:30 pm
611
... The default output of TagSoup is XML, not HTML. If you want HTML, use the --html option; you can also turn off the XML declaration separately. ... There...
John Cowan
johnwcowan
Online Now Send Email
Jan 10, 2007
3:58 pm
612
I'm trying to build TagSoup from source in Eclipse. Eclipse complains about a missing HTMLSchema class. Is this the known problem with building under 1.5, or...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
4:06 pm
613
... That's a generated class. You need to do "ant prepare-parser", which will fail on stock 1.5, so you need to fall back to 1.4 or install a working XSLT...
John Cowan
johnwcowan
Online Now Send Email
Jan 10, 2007
4:19 pm
614
... You're right. I don't think I'd ever noticed that attribute before. Still I'm not sure it should be in the output for a couple of reasons: 1. The attribute...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
4:20 pm
615
... I think I've set ant up to use 1.4 but Ant is still giving me messages about "java.lang.ClassNotFoundException: ...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
4:42 pm
616
... I have no clue. Try upgrading Xalan in the endorsed directory instead. -- Business before pleasure, if not too bloomering long before. --Nicholas van Rijn ...
John Cowan
johnwcowan
Online Now Send Email
Jan 10, 2007
4:50 pm
617
... I'm also getting some Ant deprecation messages. I'm using Ant 1.6.5. Which version of Ant are you using? -- Elliotte Rusty Harold...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
5:06 pm
618
... You've convinced me. I've removed it from the trunk, and you can do so too by just pulling the line with /name='version'/ out of src/definitions/html.tssl...
John Cowan
johnwcowan
Online Now Send Email
Jan 10, 2007
6:12 pm
619
... "Apache Ant version 1.6.5 compiled on June 2 2005" I don't see any such warnings. -- "Well, I'm back." --Sam John Cowan <cowan@...>...
John Cowan
johnwcowan
Online Now Send Email
Jan 10, 2007
6:16 pm
620
... Possibly the problems are related then. Here's what I see: ~/projects/tagsoup-1.0.1$ ant compile Buildfile: build.xml [available] DEPRECATED - <available>...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
6:52 pm
621
FWIW, Ant 1.7 was able to compile TagSoup though it still gave deprecation messages: This was under Java 1.5 by the way. Is the problem with the buggy 1.5 ...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
11:21 pm
622
... Once the code is generated, it should compile fine and run fine. The XSLT is used only to generate the tables in the classes in src/templates. -- There...
John Cowan
johnwcowan
Online Now Send Email
Jan 10, 2007
11:52 pm
623
... So what's the symptom of the bug that prevents the code from being compiled under 1.5? At first glance it seems to have compiled OK for me, but I may be...
Elliotte Harold
elharo@...
Send Email
Jan 10, 2007
11:57 pm
624
... The inability to do the XSLT build due to the bad version of Xalan distributed with stock 1.5: $ ant Buildfile: build.xml init: prepare: [mkdir] Created...
John Cowan
johnwcowan
Online Now Send Email
Jan 11, 2007
12:19 am
625
I'm a novice to Java and XML, and I would like to use TagSoup with JAXP, since JAXP supports XPath 2.0. It is my understanding that DOM parsers use SAX parsers...
mateiasi2
Offline Send Email
Jan 13, 2007
4:11 am
626
... Unfortunately I have never used JAXP. Some parsers expose both SAX and DOM, notably Xerces; I don't know which packages, if any, allow pluggable SAX. I...
John Cowan
johnwcowan
Online Now Send Email
Jan 13, 2007
5:22 am
627
... Yes. If you must use xpath 2.0, that makes sense. Otherwise alternatives (like XOM [xpath 1.0 using Jaxen] + TagSoup) are (IMO) a superior choice. ... This...
Tatu Saloranta
cowtowncoder
Offline Send Email
Jan 13, 2007
6:00 pm
628
... TagSoup does not have JAXP interfaces, but I very much invite a patch from anyone who has written one. I don't have time at the minute to figure out how...
John Cowan
johnwcowan
Online Now Send Email
Jan 13, 2007
6:18 pm
629
... Ok, how about I'll give it a try. I have written JAXP wrappers before, shouldn't be too hard. -+ Tatu +- ...
Tatu Saloranta
cowtowncoder
Offline Send Email
Jan 13, 2007
7:02 pm
Messages 589 - 629 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help