Search the web
Sign In
New User? Sign Up
tagsoup-friends · Friends of TagSoup
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Hear how Yahoo! Groups has changed the lives of others. Take me there.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 307 - 345 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
307
My html is as following: <pre>@misc{ granville-positive,author = "Andrew Granville", title = "On Positive Integers <=x With Prime Factors <=t log x", url =...
liliangpu
Offline Send Email
Jul 4, 2005
9:39 am
312
hi i am using tadsoup for converting html to xhtml. for this i got jar file of tagsoup. and usind command: java -jar tagsoup-1.0rc3.jar --files foo.html but it...
Arpit Patel
arpit_patel1982
Offline Send Email
Jul 15, 2005
9:19 am
313
hi i am using tadsoup for converting html to xhtml. for this i got jar file of tagsoup. and usind command: java -jar tagsoup-1.0rc3.jar --files foo.html but it...
Arpit Patel
arpit_patel1982
Offline Send Email
Jul 15, 2005
9:25 am
314
... I can only think that this has something to do with the JAVM you are using or the version of Java. Can you send back the results of "java -version"? -- ...
John.Cowan
johnwcowan
Online Now Send Email
Jul 15, 2005
8:25 pm
315
Dear tagsoup friends, I am a contributer in the Poesia project (www.poesia-filter.org), which is an Internet content filter for kids. I am using tagsoup for...
Riadh Elloumi
riadh_elloum...
Offline Send Email
Jul 17, 2005
4:28 pm
317
somewhere sufficently wild. this is relevant to the bugs you're trying to fix with the latest update ("This release cleans up long-standing problems with...
Garry Hill
garry@...
Send Email
Jul 21, 2005
1:22 am
319
Hi all, I found this in a html page from the wild: <A HREF="http://i2as.idregie.com/c.php? s=396&w=468&h=60"> Ok, that's quite brutish, but tagsoup fixes this...
Riadh Elloumi
riadh_elloum...
Offline Send Email
Jul 27, 2005
9:00 pm
320
hey tag-soup is not able to resolve my problem so i am using it........ u ppl try jericho html parcer this is well documented tagsoup hasn't proper...
Arpit Patel
arpit_patel1982
Offline Send Email
Jul 28, 2005
5:58 am
321
... TagSoup is not aware of which attributes are supposed to contain URIs, so it just does minimal SGML/XML fixup, namely converting line-ends to spaces. -- ...
John.Cowan
johnwcowan
Online Now Send Email
Jul 28, 2005
4:30 pm
322
... Well, no one product can do everything. Jericho (thanks for the reference) is about examining and perhaps modifying the HTML at the lowest level. TagSoup...
John Cowan
johnwcowan
Online Now Send Email
Jul 29, 2005
12:41 pm
323
hi johan my problem is been resolved by jericho html parcer actually i was using tag-soup for it. but jericho is well documented and it do parcing at basic...
Arpit Patel
arpit_patel1982
Offline Send Email
Jul 29, 2005
12:53 pm
324
... TagSoup conforms to the behavior of SAX parsers, and requires no programmer-level documentation of its own except in the properties and features that can...
John.Cowan
johnwcowan
Online Now Send Email
Jul 29, 2005
6:33 pm
325
I'm a bit new to Java and TagSoup. I think I've got things set up to Tag-Soup-ize some files, but I'm not sure how to feed them through. Any pointers? ...
THUFIR HAWAT
hawatdotthufir
Offline Send Email
Aug 13, 2005
7:42 am
326
I have the foloowing code (it's in Coldfusion but should be easily understandable): <cfset URL = CreateObject('java', 'java.net.URL') /> <cfset...
jcollins987
Offline Send Email
Sep 4, 2005
6:07 pm
328
Hello, just wondering the following entity "&nbsp;" is transform to which JAVA caracter? I'm doing HTML to text conversion using TagSoup and I need to handle ...
Benoit Houle
oz_benoit_houle
Offline Send Email
Sep 7, 2005
6:43 pm
329
... To U+00A0. -- Verbogeny is one of the pleasurettes John Cowan <cowan@...> of a creatific thinkerizer. http://www.reutershealth.com --...
John Cowan
johnwcowan
Online Now Send Email
Sep 7, 2005
6:52 pm
330
Here is the bug fix for some of the missing setOutputProperties. It adds OMIT-XML-DECLARATION and METHOD=html. This fix deprecates setHTMLMode in favor of...
Klotz, Leigh
leighklotz
Offline Send Email
Sep 7, 2005
6:55 pm
331
The attached bug fix removes (most) dependencies on JDK 1.2 collections and adds --help to the command line parser. This patch is independent from the...
Klotz, Leigh
leighklotz
Offline Send Email
Sep 7, 2005
7:18 pm
333
I have an issue using Tagsoup. Nevermind the content, focus on the tags and entities, With this input: <p><b>Monica :</b> &lt;laughs&gt; Oh yeah. </p> ...
steph_1k1
Offline Send Email
Sep 23, 2005
12:49 pm
334
Internet Explorer has a feature known as conditional comments, which have embedded markup that is parsed by IE but treated as comment by other browsers. In...
Klotz, Leigh
leighklotz
Offline Send Email
Sep 23, 2005
6:26 pm
335
... I don't know what you mean by "gives me", because I don't know if you're using TagSoup as a SAX parser or as a stand-alone application. If as a...
John.Cowan
johnwcowan
Online Now Send Email
Sep 26, 2005
4:06 pm
336
Hey all, First off all, let me say that I ran across TagSoup earlier this week and TSaxon today, and they rock! I'm trying to put together a simple scraper to...
furbyman1976
Offline Send Email
Sep 29, 2005
7:32 am
338
... The trouble is that "&part" is a legitimate HTML entity reference to the Unicode character U+2202, PARTIAL DIFFERENTIAL. Since TagSoup does not know that...
John.Cowan
johnwcowan
Online Now Send Email
Oct 13, 2005
8:10 pm
339
... I'm going to reluctantly reject this, for three reasons: 1) I don't want to write a full conditional-comments interpreter; 2) The documentation shows that...
John.Cowan
johnwcowan
Online Now Send Email
Oct 13, 2005
9:02 pm
340
Hi, I tested tagsoup with some web pages from the wild and found it very slow for some pages. For example I obtained 20 seconds for ...
Riadh Elloumi
riadh_elloum...
Offline Send Email
Oct 19, 2005
7:29 pm
341
Hi again, I apologize for this idiot question. After checking, the delay was introduced in _my code_ by a DNS lookup delay. For information, this DNS lookup...
Riadh Elloumi
riadh_elloum...
Offline Send Email
Oct 19, 2005
9:36 pm
342
... Speed comparisons aren't meaningful on different machines. Try downloading the page first using wget or curl, and then run TagSoup against it locally. ...
John.Cowan
johnwcowan
Online Now Send Email
Oct 20, 2005
1:16 am
343
Interesting usage. I developed a set of XSS filters as SAX2 filters on top of TagSoup. Do you think that the Poesia project would be interested in XSS filters...
Klotz, Leigh
leighklotz
Offline Send Email
Oct 21, 2005
12:10 am
344
... Hi Leigh, XSS filtering is a good idea, but the main purpose of Poesia is porn filtering. As we are in alpha development, security filtering is not our ...
Riadh Elloumi
riadh_elloum...
Offline Send Email
Oct 21, 2005
7:44 pm
345
For the last couple of days I have tried to access the Tagsoup website at the following addresses without any luck. http://mercury.ccil.org/~cowan/XML/tagsoup/...
laust_ladefoged
Offline Send Email
Oct 31, 2005
8:44 pm
Messages 307 - 345 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help