Search the web
Sign In
New User? Sign Up
tagsoup-friends · Friends of TagSoup
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Want your group to be featured on the Yahoo! Groups website? Add a group photo to Flickr.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 378 - 407 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
378
Can u plz help me that where html entities are trannsformed in the source code. Actually i don't want to tranform xml entities(&amp; , &lt; etc) in the...
Anurag Singh
as_vns_007
Offline Send Email
Mar 1, 2006
3:49 pm
379
... All SAX parsers convert entities to characters when returning values to the caller. It's up to the caller, if it intends to produce output as XML, to...
John Cowan
johnwcowan
Online Now Send Email
Mar 1, 2006
4:00 pm
380
Thanks John. I m able to do it with XMLWriter or Regex . But when i m removing "<entity name='amp' codepoint='0026'/>" from definitions.html.tssl then it...
Anurag Singh
as_vns_007
Offline Send Email
Mar 2, 2006
6:23 am
381
... When TagSoup sees an entity that's not in its tables, it returns the entity reference as text (what else is there to do, given the Keep On Truckin'...
John Cowan
johnwcowan
Online Now Send Email
Mar 2, 2006
1:16 pm
382
Hi, I'm an experienced Java programmer but I've never written a SAX application from scratch and I'm not sure how it should all hang together. I'm trying to...
Alex Worden
alexworden
Offline Send Email
Mar 2, 2006
7:32 pm
383
... Spiders aren't exactly trivial code, but thanks for thinking of TagSoup. You might want to reuse someone else's spider and then use TagSoup to postprocess...
John Cowan
johnwcowan
Online Now Send Email
Mar 2, 2006
7:41 pm
384
If you run tag soup on "&c" you get out "&amp;c&#65535;" If you run it on "<b>&c</b>" or "&c " you get out "<b>&amp;c</b>" Leigh....
Klotz, Leigh
leighklotz
Offline Send Email
Mar 3, 2006
12:45 am
385
... It's mishandling EOF somewhere. I hope to be able to do some more work on TagSoup soon. I've recently changed jobs, which has been stressful. Let me take...
John Cowan
johnwcowan
Online Now Send Email
Mar 3, 2006
3:07 am
386
hi, in order to evaluate tagsoup, I downloaded it and tried it with "java -jar tagsoup-1.0rc3 --files --html ..." on some html pages saved with 'save page...
Pierre Bru
pbru_2001
Offline Send Email
Mar 4, 2006
10:32 pm
387
... Can you send me the particular HTML page that failed? -- Even a refrigerator can conform to the XML John Cowan Infoset, as long as it has a door...
John Cowan
johnwcowan
Online Now Send Email
Mar 5, 2006
7:19 am
388
hi, I wanted to rebuild tagsoup to be able to watch it work in the debugger, but the compiler complains about HTMLSchema and HTMLScanner. I looked in the...
Pierre Bru
pbru_2001
Offline Send Email
Mar 5, 2006
5:32 pm
389
... Install Ant and say "ant" from the root TagSoup directory; that will drop the current TagSoup .jar in the dist/lib directory. Make sure you are building...
John Cowan
johnwcowan
Online Now Send Email
Mar 5, 2006
6:31 pm
390
... well... I'm using eclipse 3.1 I suppose there is an ant build file somewehe. i will try to figure out how to setup eclipse to use it. thanx. Pierre....
Pierre Bru
pbru_2001
Offline Send Email
Mar 5, 2006
7:21 pm
391
... that's funny. when you have eclipse, you do not even have javac ;) I asked eclipse to build TagSoup with the ant build file, and because of the above...
Pierre Bru
pbru_2001
Offline Send Email
Mar 5, 2006
8:02 pm
392
Hello, I was wondering when Tagsoup 1.0 would be released? I see that rc3 has been out a while. Also, is there a bugtracker for this project? I'd like to see...
mk2337
Offline Send Email
Mar 6, 2006
5:18 pm
393
... I've changed jobs and been swamped. I hope to get a chance to work on it soon. ... CHANGES in the 1.0rc3 distribution is your friend. -- John Cowan...
John Cowan
johnwcowan
Online Now Send Email
Mar 6, 2006
5:28 pm
394
... Johon, did you received the .zip I sent ? Pierre....
Pierre Bru
pbru_2001
Offline Send Email
Mar 7, 2006
8:36 am
395
... Yes. ... -- Possession is said to be nine points of the law, John Cowan but that's not saying how many points the law might have. cowan@... --Thomas...
John Cowan
johnwcowan
Online Now Send Email
Mar 7, 2006
1:08 pm
396
YES! TagSoup does black magic! :) I have the following html source file in which a <th> is incorrectly close by a </td> ... <html> <head></head> <body> ...
Bru, Pierre
pbru_2001
Offline Send Email
Mar 8, 2006
12:05 am
397
Hi, I use tagsoup to parse html and build it into XOM document. After I call document.toXML(), The original doctype declaration is gone. I notice that in the...
Shih Chang
shihc
Offline Send Email
Mar 8, 2006
12:18 am
398
... TagSoup simply ignores any DOCTYPE declaration that may be present. Parsing them correctly is too complicated for me. -- John Cowan cowan@......
John Cowan
johnwcowan
Online Now Send Email
Mar 8, 2006
3:08 am
399
hi, what is F_NOFORCE used for ? what does it imply on an element ? TIA, Pierre....
Pierre Bru
pbru_2001
Offline Send Email
Mar 8, 2006
1:44 pm
400
... if I change <element name='noframes' type='element'> <memberOf group='M_BLOCK'/> <memberOf group='M_HTML'/> <memberOf group='M_FRAME'/> <contains...
Bru, Pierre
pbru_2001
Offline Send Email
Mar 8, 2006
5:10 pm
401
... <![ %HTML.Frameset; [ <!ENTITY % noframes.content "(BODY) -(NOFRAMES)"> ]]> <!ENTITY % noframes.content "(%flow;)*"> <!ELEMENT NOFRAMES - -...
Bru, Pierre
pbru_2001
Offline Send Email
Mar 10, 2006
9:44 am
402
Hi, TagSoup add default attribute values on some (all?) elements: for example, <td> become <td rowspan="1" colspan="1"> is it possible, without modifying the...
Pierre Bru
pbru_2001
Offline Send Email
Mar 12, 2006
10:01 pm
403
... You need to remove the relevant lines from html.tssl and rebuild. -- John Cowan cowan@... www.ap.org www.ccil.org/~cowan Nobody expects the...
John Cowan
johnwcowan
Online Now Send Email
Mar 13, 2006
4:06 am
404
... Essentially it means that the element will not be closed implicitly when its parent element is closed. Instead, it will be popped off the stack and saved,...
John Cowan
johnwcowan
Online Now Send Email
Mar 13, 2006
11:17 pm
405
I'm trying to write a utility that will pull a webpage with a table on it and then extract the information from within that table. I was thinking that instead...
graham.reeds
Offline Send Email
Mar 16, 2006
12:53 am
406
TagSoup replace your SAX parser. but a SAX parser does not create a DOM tree, it just trigger events, events which give you the opportunity to create a DOM...
Bru, Pierre
pbru_2001
Offline Send Email
Mar 16, 2006
2:15 am
407
... That's right, TagSoup is a SAX parser, so it does not need to be used with a different SAX parser. You then need to have a DOM or DOM-like object model ...
John Cowan
johnwcowan
Online Now Send Email
Mar 16, 2006
4:11 am
Messages 378 - 407 of 1386   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help