tagsoup-friends · Friends of TagSoup

Messages

Messages 326 - 361 of 1386
326
I have the following code (it's in ColdFusion but should be easily understandable): <cfset URL = CreateObject('java', 'java.net.URL') /> <cfset...
jcollins987
Sep 4, 2005
6:07 pm
328
Hello, just wondering: which Java character is the entity "&nbsp;" transformed to? I'm doing HTML to text conversion using TagSoup and I need to handle ...
Benoit Houle
oz_benoit_houle
Sep 7, 2005
6:43 pm
329
... To U+00A0. -- Verbogeny is one of the pleasurettes John Cowan <cowan@...> of a creatific thinkerizer. http://www.reutershealth.com --...
John Cowan
johnwcowan
Sep 7, 2005
6:52 pm
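For the HTML-to-text case asked about above, here is a minimal Java sketch of what U+00A0 means in practice. The names `NbspText` and `toPlainSpaces` are hypothetical and not part of TagSoup; the only assumption is that the parsed text arrives as a Java `String`.

```java
// Hypothetical helper, not part of TagSoup: deals with U+00A0 (NO-BREAK SPACE),
// the character that the &nbsp; entity maps to.
final class NbspText {
    static final char NBSP = '\u00A0';

    // Replace every no-break space with an ordinary space (U+0020).
    static String toPlainSpaces(String text) {
        return text.replace(NBSP, ' ');
    }

    public static void main(String[] args) {
        String parsed = "10" + NBSP + "km";
        // Character.isWhitespace excludes no-break spaces, so code that relies
        // on it will not treat U+00A0 as a separator.
        System.out.println(Character.isWhitespace(NBSP));
        // Character.isSpaceChar follows the Unicode space-separator category instead.
        System.out.println(Character.isSpaceChar(NBSP));
        System.out.println(toPlainSpaces(parsed));
    }
}
```

Whether to convert U+00A0 to a plain space or keep it depends on whether the text output should preserve the non-breaking behaviour.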
330
Here is the bug fix for some of the missing setOutputProperties. It adds OMIT-XML-DECLARATION and METHOD=html. This fix deprecates setHTMLMode in favor of...
Klotz, Leigh
leighklotz
Sep 7, 2005
6:55 pm
331
The attached bug fix removes (most) dependencies on JDK 1.2 collections and adds --help to the command line parser. This patch is independent from the...
Klotz, Leigh
leighklotz
Sep 7, 2005
7:18 pm
333
I have an issue using TagSoup. Never mind the content, focus on the tags and entities. With this input: <p><b>Monica :</b> &lt;laughs&gt; Oh yeah. </p> ...
steph_1k1
Sep 23, 2005
12:49 pm
334
Internet Explorer has a feature known as conditional comments, which have embedded markup that is parsed by IE but treated as a comment by other browsers. In...
Klotz, Leigh
leighklotz
Sep 23, 2005
6:26 pm
335
... I don't know what you mean by "gives me", because I don't know if you're using TagSoup as a SAX parser or as a stand-alone application. If as a...
John.Cowan
johnwcowan
Sep 26, 2005
4:06 pm
336
Hey all, First of all, let me say that I ran across TagSoup earlier this week and TSaxon today, and they rock! I'm trying to put together a simple scraper to...
furbyman1976
Sep 29, 2005
7:32 am
338
... The trouble is that "&part" is a legitimate HTML entity reference to the Unicode character U+2202, PARTIAL DIFFERENTIAL. Since TagSoup does not know that...
John.Cowan
johnwcowan
Oct 13, 2005
8:10 pm
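The truncated reply above does not show the input that triggered the problem. As an illustration only, assuming it was a URL query string such as `page?id=1&part=2` written with a bare `&`, the sketch below escapes ampersands before the text is placed in HTML, so that `&part` cannot be read as an entity reference. `AmpEscape` and `escapeAmpersands` are hypothetical names; nothing here is TagSoup API.

```java
// Hypothetical helper, not part of TagSoup.
final class AmpEscape {
    // U+2202 is the character named by the HTML entity "part".
    static final int PART = 0x2202;

    // Escape every '&' as "&amp;". Meant for raw text that has not been
    // escaped yet; applying it twice would double-escape.
    static String escapeAmpersands(String raw) {
        return raw.replace("&", "&amp;");
    }

    public static void main(String[] args) {
        // Print the Unicode name of U+2202 as known to the running JDK.
        System.out.println(Character.getName(PART));
        // The assumed example URL, made safe for use inside an HTML attribute.
        System.out.println(escapeAmpersands("page?id=1&part=2"));
    }
}
```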
339
... I'm going to reluctantly reject this, for three reasons: 1) I don't want to write a full conditional-comments interpreter; 2) The documentation shows that...
John.Cowan
johnwcowan
Oct 13, 2005
9:02 pm
340
Hi, I tested TagSoup with some web pages from the wild and found it very slow for some pages. For example I obtained 20 seconds for ...
Riadh Elloumi
riadh_elloum...
Oct 19, 2005
7:29 pm
341
Hi again, I apologize for this idiotic question. After checking, the delay was introduced in _my code_ by a DNS lookup delay. For information, this DNS lookup...
Riadh Elloumi
riadh_elloum...
Oct 19, 2005
9:36 pm
342
... Speed comparisons aren't meaningful on different machines. Try downloading the page first using wget or curl, and then run TagSoup against it locally. ...
John.Cowan
johnwcowan
Oct 20, 2005
1:16 am
343
Interesting usage. I developed a set of XSS filters as SAX2 filters on top of TagSoup. Do you think that the Poesia project would be interested in XSS filters...
Klotz, Leigh
leighklotz
Oct 21, 2005
12:10 am
344
... Hi Leigh, XSS filtering is a good idea, but the main purpose of Poesia is porn filtering. As we are in alpha development, security filtering is not our ...
Riadh Elloumi
riadh_elloum...
Oct 21, 2005
7:44 pm
345
For the last couple of days I have tried to access the TagSoup website at the following addresses without any luck. http://mercury.ccil.org/~cowan/XML/tagsoup/...
laust_ladefoged
Oct 31, 2005
8:44 pm
346
The server is back online. Best regards, Laust....
laust_ladefoged
Oct 31, 2005
10:42 pm
348
Hi all, I'm trying to use TagSoup to process a template HTML file into valid XML so that I can output an XSLT file; because XSLT will be used to manipulate the...
john_zenesis
Nov 21, 2005
2:35 pm
349
... [snip] ... You don't make clear whether you are using TagSoup from the command line or as a SAX parser library. If from the command line, use the --any ...
John.Cowan
johnwcowan
Nov 21, 2005
5:24 pm
350
Hello, Did you solve this problem? Jan...
hans2hank
Nov 30, 2005
7:11 pm
351
Hello Everybody, I am just a Java greenhorn ;-) I am not able to build tagsoup-1.0rc3 source from the build.xml. I get exactly the same problem as described in...
hans2hank
Nov 30, 2005
7:25 pm
352
... It's a problem with XSLT, not with TagSoup. ... -- John Cowan <jcowan@...> http://www.reutershealth.com I amar prestar aen, han mathon ne...
John.Cowan
johnwcowan
Dec 1, 2005
4:52 am
353
Hello John, Thanks for the clue, but I still do not know what to do. Do I have to install another version of xalan? Where would you start if you were in my place? ...
hans2hank
Dec 1, 2005
9:50 am
354
... that's me again ;-) I have solved the problem. It really seems to be a bug in Java 1.5. I have installed 1.4 and it worked perfectly. Many regards Jan ... ...
hans2hank
Dec 1, 2005
11:51 am
355
... Indeed. I should probably put this on the Web page, now that people will be using 5.0 out of the box a lot more. AFAIK, TagSoup works fine under 5.0 once...
John.Cowan
johnwcowan
Dec 1, 2005
1:45 pm
356
... That is exactly what I have done. It works pretty fine! ... That is a very good idea. It is the very first impression which one is confronted with. If it...
hans2hank
Dec 2, 2005
8:24 am
359
Hello all, I'm just getting started using the TagSoup library to parse some content. What I'm discovering is that for many HTML documents I get a parsing error...
Rob Konigsberg
rikonigsberg
Feb 8, 2006
7:45 am
360
... That can't be a TagSoup error, as TagSoup never reports any errors (except low-level IOExceptions when a file cannot be read or something of the sort)....
John Cowan
johnwcowan
Feb 8, 2006
8:12 am
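The error text is not shown in the truncated post, so the following is only a guess at one common cause: the HTML being handed to a strict XML parser rather than to TagSoup. This is a minimal sketch using the JDK's built-in JAXP SAX parser; `StrictParseDemo` and `isWellFormed` are hypothetical names.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo: a strict XML parser rejects ordinary HTML that is not
// well-formed XML, which is where a "parsing error" would come from.
final class StrictParseDemo {
    // Returns true if the JDK's default SAX parser accepts the markup.
    static boolean isWellFormed(String markup) {
        try {
            SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(markup)), new DefaultHandler());
            return true;
        } catch (SAXParseException e) {
            // The strict parser reported a well-formedness error.
            return false;
        } catch (Exception e) {
            // Configuration or I/O problems are not what this demo is about.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Typical HTML: <br> is never closed, so </p> is a mismatched end tag.
        System.out.println(isWellFormed("<p>one<br>two</p>"));
        // The same content written as well-formed XML.
        System.out.println(isWellFormed("<p>one<br/>two</p>"));
    }
}
```

If a check like this fails on the same documents, the exception is probably coming from whichever parser class the application actually instantiated, so that is worth checking first.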
361
John, thanks for the prompt reply. I may have been mistaken. I'll go back to the drawing board and see what I find. ... -- Robert Konigsberg ...
Rob Konigsberg
rikonigsberg
Feb 8, 2006
8:29 am