Hi,
Try escaping all XML types of characters coming in since it appears
LingPipe is trying to treat your document as XML. The way to do this
is to convert all '<' characters to '<' and all '&' characters
to '&'. I must admit I'm new to LingPipe so I don't know if you
can tell it to treat the incoming document as text instead of XML.
--- In
LingPipe@yahoogroups.com, "gargnavendu" <gargnavendu@y...>
wrote:
> Hi,
>
> I downloaded 'The Count of Monte Cristo' from www.gutenberg.org. I
> inserted appropriate tags and used lingpipe process it. Here is the
> error i got.
>
> [Fatal Error] crsto12.txt:414:54: The entity name must immediately
> follow the '&
> ' in the entity reference.
> Exception processing
> file=data\raw\crsto12.txtorg.xml.sax.SAXParseException: The
> entity name must immediately follow the '&' in the entity
reference.
>
> Basically, it is trying parse 'Morrel & Son'
>
> regards,
>
> Navendu