Hi all, I've found recently about LinpPipe. I implemented a naive Bayes classifier that follows the approach from Mitchell (Machine Learning) and try it on the...
... Thanks for writing to the list -- this is a great case study. The bottom line answer to question two is that LingPipe's classifier is better because of...
Hi all, First I want to thank Bob for his detailed and competent answer. It really has been useful. However I want to make some clarifications: 1. I do not...
... That's an unusual approach, but mathematically sound. (In fact, more sound than many attempts to deal with unknown tokens.) ... This can skew the result...
LingPipe 2.1.1 is out (see the Download Page). This release patches all bugs of which we are aware. Major bug fixes include: (a) some MEDLINE handler...
... Most of the tools out there, like LingPipe, can be trained on arbitrary data. The reason we don't distribute models is that neither the CoNLL licenses nor...
Hello, My name is Dina Conti and I am student in computer science. We have got as homework to implement a naïve base classifier for text classification. We...
Hello everyone! First of all, thank you for making Ling-Pipe available! I have one question related to the coreference module: how difficult would be to...
... Your welcome. It is a pleasure to see it being used. ... That depends on your skills/resources so it is hard to really say. Writing a tutorial for this...
... That's my fault. I somehow distributed a version without the null constructor. I've attached the latest version as a tarball, because it's only 37KK. You...
Hello, ... one ... compile it ... call ... I just poked around Brian's code and see what he was talking about. Judging from the Javadocs for 2.1.1, it's still...
... Just to remind everyone of why this came up without repeating two long messages, the advantage of this generalization is that it allows the background ...
... There are actually a whole bunch of demos in the distribution and online that make use of the entity extraction API: To train, score and run models from...
I meant to use lingpipe to handle Chinese English Learners Corpus (http://www.clal.org.cn/corpus/EngSearchEngine.aspx). The results were very disappointing....
Hello, I used the classification functionality to detect sentiment as described in LingPipe tutorials. I haven't tried this yet, but just like I can train and...
Hi Bob, ... What does n-best mean in this context? Does it simply mean that the library will be able to return N possible entities ordered by their ...
Indeed. You can also do it to spot topics. We're using it to disambiguate genes by looking at the context. I'm going to write a tutorial in the next month for...
... I hope this answer isn't more confusing than the first go round. Sorry for any smoke it causes in your brain. That's what I'm calling confidence-based...
Can anyone explain to me benefit of using a character-based NGram tokenizer for the DynamicLMClassifier's language model over a word-based one? I notice that...
... In almost all of our large-scale experiments, character-based models have outperformed token-based classifiers. And they're easier to use, which is why...
Bob- Thank you for the clear and thoughtful response. I suppose it was just intuitive to me that a word-based language model would be much better at...
... I spent a few years in the speech recognition world, and the answer is that most large-vocabulary unconstrained dictation systems use word-level language...
Does any of the classification algorithms implemented in LingPipe allows for multilabel categories (that is allows overlaping categories)? Regards! Eduard ......
... Not directly. The best we can offer out of the box is running a whole bunch of one-vs-all classifiers. This is fine if the classes are independent. If the...
I have just come across lingpipe and still understanding it's great potential. It would be extremely useful for one of the projects I am working on. I wanted...
We don't have any models for French in particular, but there should be no character set issues if you want to process French. If you want to detect named...
Please let us know what you think about assuming JDK 1.5 for LingPipe 3.0 and beyond. Here's a quick roadmap: Sun JDK 1.6 In ... Sun just released the 1.6 JDK...