Search the web
Sign In
New User? Sign Up
tagsoup-friends · Friends of TagSoup
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Want your group to be featured on the Yahoo! Groups website? Add a group photo to Flickr.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 139 - 170 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
139
I need some help from any one of the friends.i am doing a project in screen scraping i am able to scrap the screen and then parse it. but i am unable to...
anduvan xavier
xavi_chennai
Offline Send Email
Sep 2, 2004
5:19 am
140
Hi John, ... In my opinion, either is preferable to breaking the form in half :-) That's certainly not what a typical browser would do, and it breaks the...
chris@...
cmw128
Offline Send Email
Sep 2, 2004
9:27 am
141
Hi, We’ve come across a problem with a greater than sign in quoted attributes. If another attribute follows, this attribute is being excluded from the rest...
sytse@...
Send Email
Sep 3, 2004
4:10 pm
142
... This is another "damned if I do, damned if I don't" situation. There's a *lot* of HTML out there with unterminated quotes, in which case the attribute...
John Cowan
johnwcowan
Online Now Send Email
Sep 3, 2004
6:05 pm
143
Hmm, it seems bad if tagsoup mis-parses *good* html. I just did a bit of experimenting and both Mozilla and IE seem to parse this situation strictly - i.e....
Chris
chris_bitmead
Offline Send Email
Sep 4, 2004
1:06 am
144
... This argument is compelling. I'll change this in the next release. Until then, you can patch the behavior out by removing lines 52 and 153, respectively: ...
John Cowan
johnwcowan
Online Now Send Email
Sep 4, 2004
8:50 pm
145
... It probably wasn't a good idea. To the extent possible you should figure out what sanity checks IE does on its input. However IE interprets the input is...
Chris
chris_bitmead
Offline Send Email
Sep 5, 2004
9:07 am
146
Hi, I'm having a little trouble with TagSoup parsing JavaScript. The thing is for instance, that if there is a <script>document.write('<textarea'>) that...
sytse@...
Send Email
Sep 7, 2004
4:37 pm
147
... I can't reproduce this problem. What version of TagSoup are you using? -- John Cowan <jcowan@...> http://www.reutershealth.com "But no...
John Cowan
johnwcowan
Online Now Send Email
Sep 7, 2004
7:41 pm
149
... Yes, I'm sorry. I was still using the previous version of TagSoup because I made some changes to the previous one. But the problem still stands, the test...
sytse@...
Send Email
Sep 8, 2004
4:32 pm
150
... This problem results from a bad fix I made back in 0.9.4. The next release will restore the 0.9.3 code. -- Real FORTRAN programmers can program FORTRAN...
John Cowan
johnwcowan
Online Now Send Email
Sep 8, 2004
9:50 pm
151
Version 0.10.2 fixes some long-standing bugs in the areas of entity references within attribute values, well-formed names for elements and processing...
John Cowan
johnwcowan
Online Now Send Email
Sep 8, 2004
10:37 pm
152
Very nice! Did you still have any ideas on the form problem, are you thinking on changing that in a coming release? Cheers, Sytse ... This mail sent through...
sytse@...
Send Email
Sep 9, 2004
10:39 am
153
... Thank you. ... I'm still thinking about it, but I'm not ready to commit to adding it to the TODO list. -- "Kill Gorgûn! Kill orc-folk! John...
John Cowan
johnwcowan
Online Now Send Email
Sep 9, 2004
12:13 pm
154
I've put together a (new, improved - not much) Java version [1] of the naive generic XML cleaner I did for Python. As Norm Walsh put it : " I have new...
Danny Ayers
danny_ayers
Offline Send Email
Sep 10, 2004
7:25 am
155
Hi, We've had a look at the form problem, basically focussing on the following problem: When you have an input like ...
sytse@...
Send Email
Sep 10, 2004
5:27 pm
156
I'm have the following: "\rsome\r\n\tfreeform\ncontent" It's coming out as: "\rsome\r\n freeform\ncontent" at the other end. Notice the TAB character was...
francois_beausoleil
francois_bea...
Offline Send Email
Sep 17, 2004
3:46 am
157
... Not only is that not to be expected. It is to be expected that the \r and \r\n would both be changed to to \n (assuming those are not literal backslashes)....
Elliotte Harold
elharo@...
Send Email
Sep 17, 2004
12:07 pm
158
Hello, I downloaded the jar file and run it with java -Dfiles=true -jar tagsoup-0.10.2.jar test where "test" is the html file and got following error: ...
odrewien
Offline Send Email
Sep 22, 2004
10:10 am
159
... What version of Java ? You need Java 1.4 or better to have the org.sax and javax.xml packages automatically available. Else, you'll have to provide an...
François Beausoleil
francois_bea...
Offline Send Email
Sep 22, 2004
12:43 pm
160
The only code example I've found for calling tagsoup from a Java program is here: http://www.hackdiary.com/archives/000041.html I can successfully process an...
mrbobdaly
Offline Send Email
Sep 27, 2004
9:15 pm
161
Hi, I have a little problem with a document.write in TagSoup. With the following input: document.write('</scr + ipt>'); in the output </scr + ipt> will be...
sytse@...
Send Email
Sep 28, 2004
2:56 pm
163
... I use Tagsoup in a number of little scripts. Here's how I do it with JDOM: import org.ccil.cowan.tagsoup.Parser; import org.jdom.input.SAXHandler; import...
Brian Lalor
blalor-k-yahoo.f6bdbf...
Send Email
Oct 1, 2004
12:08 am
164
TagSoup 1.0rc1 (release candidate 1 of version 1.0) has been released. Please hammer on this and let me know what you find. Thanks. It's in the usual place:...
John Cowan
johnwcowan
Online Now Send Email
Oct 8, 2004
9:09 pm
165
Nux has matured, and this is to announce the availability of the nux-1.0a2 release. Nux (http://dsd.lbl.gov/nux) is a small, straightforward, and surprisingly...
Wolfgang Hoschek
whoschek@...
Send Email
Oct 12, 2004
6:50 pm
166
Wolfgang, As I warned you earlier, there is a serious license conflict here. This product includes code directly copied and pasted from XOM, which is fine, but...
Elliotte Harold
elharo@...
Send Email
Oct 12, 2004
10:30 pm
167
... One API comment. The DocumentWrapper getUnparsedEntity entity method is declared thusly: public String[] getUnparsedEntity(String name) You say this...
Elliotte Harold
elharo@...
Send Email
Oct 12, 2004
10:32 pm
168
Eliotte, Lots of fuzz about a little patch adding one constructor argument to XSLTransform! As you are well aware of, the XOM code you refer to retains your ...
Wolfgang Hoschek
whoschek@...
Send Email
Oct 12, 2004
11:29 pm
169
... No, I indicated that I would consider it after I get XML 1.0 out the door. I made no promises that I would do it. Possibly it will get in. Possibly it...
Elliotte Harold
elharo@...
Send Email
Oct 13, 2004
12:33 am
170
I've uploaded nux-1.0a3 to address Elliotte's concerns wrt. XOM: Changelog: • Separated the patched class nu.xom.xslt.XSLTransform (LGPL licensed, copyright...
Wolfgang Hoschek
whoschek@...
Send Email
Oct 13, 2004
1:26 am
Messages 139 - 170 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help