LingPipe Version 2.1 Released New York, 3 October 2005 LingPipe 2.1.0 is out. This release is backward compatible with 2.0 (and in almost all cases, with 1.0...
Greetings. I was trying to run experiments with a very big model file (56.8MB) generated by a 69.9MB iob file. The training was OK (I had to increase the...
... Hi Breck, Thanks for getting back to me. Indeed, I should have provided more details. So here we go: I am using the NE detection model, using your command...
... This could be due to a lot of things. If you'd like me to try to debug it for you, could you put the BIO training file and the test file up somewhere on a...
... Hi Bob, I can put it on my webpage, but unless you prefer to do it this way, let's see if it can be solved without doings so. ... Thanks for the...
... Longer's more helpful for debugging. Thanks. ... It's a problem when the decoder can't find any viable paths. I'm going to rewrite the decoder so it can...
Found it. The problem isn't the size of the model per se, but rather the high-skew (see description below) of a transition in the model interacting with...
... Yes, that's it probably! I think the way I create my training data results in skewed distributions, apart from large models. Thanks for sorting this out, ...
Final report on null returns from named-entity decoding. It does turn out to be the case that all possible continuations are being pruned. I fixed that, but...
Hi all, I've found recently about LinpPipe. I implemented a naive Bayes classifier that follows the approach from Mitchell (Machine Learning) and try it on the...
... Thanks for writing to the list -- this is a great case study. The bottom line answer to question two is that LingPipe's classifier is better because of...
Hi all, First I want to thank Bob for his detailed and competent answer. It really has been useful. However I want to make some clarifications: 1. I do not...
... That's an unusual approach, but mathematically sound. (In fact, more sound than many attempts to deal with unknown tokens.) ... This can skew the result...
LingPipe 2.1.1 is out (see the Download Page). This release patches all bugs of which we are aware. Major bug fixes include: (a) some MEDLINE handler...
... Most of the tools out there, like LingPipe, can be trained on arbitrary data. The reason we don't distribute models is that neither the CoNLL licenses nor...
Hello, My name is Dina Conti and I am student in computer science. We have got as homework to implement a naïve base classifier for text classification. We...
Hello everyone! First of all, thank you for making Ling-Pipe available! I have one question related to the coreference module: how difficult would be to...
... Your welcome. It is a pleasure to see it being used. ... That depends on your skills/resources so it is hard to really say. Writing a tutorial for this...
... That's my fault. I somehow distributed a version without the null constructor. I've attached the latest version as a tarball, because it's only 37KK. You...
Hello, ... one ... compile it ... call ... I just poked around Brian's code and see what he was talking about. Judging from the Javadocs for 2.1.1, it's still...
... Just to remind everyone of why this came up without repeating two long messages, the advantage of this generalization is that it allows the background ...
... There are actually a whole bunch of demos in the distribution and online that make use of the entity extraction API: To train, score and run models from...
I meant to use lingpipe to handle Chinese English Learners Corpus (http://www.clal.org.cn/corpus/EngSearchEngine.aspx). The results were very disappointing....
Hello, I used the classification functionality to detect sentiment as described in LingPipe tutorials. I haven't tried this yet, but just like I can train and...
Hi Bob, ... What does n-best mean in this context? Does it simply mean that the library will be able to return N possible entities ordered by their ...