Hi, I modified the simpleCoref to locate the male and female nouns: he, him, his, she, her, hers; ran it over the muc6 formal set; and used the muc software...
... Thanks. ... That makes sense, but it still only does mail pronouns. And couldn't the prefix Word boundary be brought out? And why the optional * on \\W? ...
Hi, I'm struggeling with moving from 2.2.1 to 2.3.0. I'm getting an array index out of bounds exception message of -1, when trying to the following code. The...
Hello all, Heng from NYU wanted go over cross document coreference as done by us (actually not part of the distribution) so I thought I would go ahead and make...
Breck, As there an ocean between us, coming over for the night isn't an option. But I'd very much like to follow the event! Is there any chance that you can ...
All, I wasn't really planning on much of a formal presentation. Not sure what it would achieve to record conversations around a monitor. The eve will force me...
Hi LP, One (or more) interesting problems that I wold like to solve, but I am not sure if LingPipe can help there. My domain are personal names, just like John...
... am not sure if LingPipe can help there. ... extent, company names. ... name (e.g. in some ZIP region). Cool problem. Did something like this a while back...
Hi Breck, Thanks for sparing here, If your model is this sensitive to noise we are going to have a problem in general. I am guessing that Bob will chime in and...
... I'm truly sorry about the big shift. I hope it's for the better after the transition, but I've been holding my breath waiting for messages like yours to...
A number of years back, during the launch of a monthly print publication (yes, that far back), I found the purchase of a set of CDs (yes, that far back)...
Hello, please forgive me it this is a silly question. Is it possible to evaluate the output of other (3rd-party) taggers, e.g. files annotated MUC-style with...
Florian Laws
florian@...
Aug 15, 2006 7:30 pm
343
... There's nothing built in, and it's not even clear how to do it from the doc. So I took this opportunity to add another section to the NE tutorial:...
Hmm. I guess Yahoo doesn't let me do attachments with mailing lists. Oh well. The last message had the files inlined (sorry about spacing), and here's the...
Up to now the cheapest point of entry for commercial use of LingPipe is $20,000–that is keeping us out of startups in garages..... I blogged about a new...
Hi, Having returned from vacation and almost dug my way out of my mail box, here are a couple of answers to your gracious ... The stats were computed using the...
Can these classes use getters and setters? I'm trying to script these into an output, but it doesn't work because it doesn't follow the JavaBean notation. ...
We have decided to add a new page to the web site listing LingPipe users. See http://alias-i.com/lingpipe/web/customers.html We cover commercial/paid...
Mention and MentionChain ... What exactly do you need in the way of get/set functionality? Our mention implementations are immutable, so there can't be...
I have a classification task where a body of text needs to be trained either as "favorable" or "not favorable". The vast majority (around 85%) of texts will...
... If you have a lot of data, you can reduce training load one of two ways. First, you can sample from the data (better than just taking the first 10%). If...
The Named Entity Tutorial has an example program "TrainConll2002.java" where the class "Conll2002ChunkTagParser" is used. I did not see the APIs of the class...
I am new to LingPipe and trying to understand the various classes in " com.aliasi.corpus.parsers" package. I am developing an application that requires...
... I like MUC-style XML-formatted data because you don't lose the information about whitespaces. Whitespace information is only used in the rescoring...
... Assuming this is the right reference: http://wiki.apache.org/jakarta-lucene/SpellChecker then there are substantial differences between what we're doing...
Hi Bob, Just tiny addition, n-grams in Lucene SpellC are used to restrict search, but not as a Boolen filters, rather as "words" in standard lucene TF/IDF...