LingPipe
Message #61 of 796
Re: Problem


Hi,

Try escaping the XML special characters in your input, since it appears
LingPipe is treating your document as XML. To do this, convert every
'&' character to '&amp;' and every '<' character to '&lt;'. Replace the
'&' characters first; otherwise the '&' in each new '&lt;' gets escaped
again. I must admit I'm new to LingPipe, so I don't know whether you can
tell it to treat the incoming document as plain text instead of XML.
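
A minimal sketch of that escaping step in Java, in case it helps. The class
name XmlEscaper and the method escape are made up for illustration and are
not part of LingPipe. It walks the text once, character by character, so the
order of the replacements no longer matters. It assumes you run it on the raw
text before inserting your own tags, since it would escape those tags too.

```java
// Hypothetical helper, not a LingPipe API: escapes '&' and '<' in plain text
// so that an XML parser no longer reads them as markup.
final class XmlEscaper {
    private XmlEscaper() {}

    /** Returns a copy of text with '&' and '<' replaced by entity references. */
    static String escape(String text) {
        StringBuilder out = new StringBuilder(text.length() + 16);
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '&':
                    out.append("&amp;");
                    break;
                case '<':
                    out.append("&lt;");
                    break;
                default:
                    out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The phrase from the error report.
        System.out.println(escape("Morrel & Son")); // prints: Morrel &amp; Son
    }
}
```

Because each input character is looked at exactly once, an '&' that the
method itself writes out is never escaped a second time.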


--- In LingPipe@yahoogroups.com, "gargnavendu" <gargnavendu@y...>
wrote:
> Hi,
>
> I downloaded 'The Count of Monte Cristo' from www.gutenberg.org. I
> inserted appropriate tags and used LingPipe to process it. Here is the
> error I got.
>
> [Fatal Error] crsto12.txt:414:54: The entity name must immediately
> follow the '&' in the entity reference.
> Exception processing
> file=data\raw\crsto12.txtorg.xml.sax.SAXParseException: The
> entity name must immediately follow the '&' in the entity
> reference.
>
> Basically, it is trying to parse 'Morrel & Son'.
>
> regards,
>
> Navendu






Tue Jan 18, 2005 4:56 am

harpman62
gargnavendu, Mar 30, 2004 1:48 am
Hi, I downloaded 'The Count of Monte Cristo' from www.gutenberg.org. I inserted appropriate tags and used lingpipe process it. Here is the error i got. [Fatal...

reckb, Apr 3, 2004 2:27 pm
... Navendu, The input format has to be in valid xml. You need to get an xml validator--generally findable on the web for free and use that. To check your...

Kepler (harpman62), Jan 18, 2005 4:57 am
Hi, Try escaping all XML types of characters coming in since it appears LingPipe is trying to treat your document as XML. The way to do this is to convert all...

Bob Carpenter (colloquialdo...), Jan 18, 2005 5:54 pm
There are several parts to this answer. 1. Well-formed XML ... You need to replace instances of '&', '<', '>' or '"' with entity references "&amp;", "&lt;",...