Mike just found (and patched) a bug in our scored precision-recall evaluation implementation. It was using a TreeSet with a score-based comparator instead of a...
I have a training set of 45K documents for 1000 categories and don't want users of my application to have to wait while the classifier is trained every time...
Thanks for the bug report: ... ... It should indeed be "length + start". I patched it for the next release. This is what I get for starting the API with...
Dear all, could somebody help me with the code below? I want to extract the author list from a Medline citation. Thanks in advance for any suggestions. Cheers,...
Dear All, I've just started learning LingPipe for my project. My project focuses on sentiment analysis. Now I want to try using a new set of movie reviews from...
We received a bug report that the SVD demo program in $LINGPIPE/demos/tutorial/svd/src/Lsi.java throws a null pointer exception as of LingPipe 3.8. I...
We just released a patch for LingPipe 3.8. As usual, it's available from the home page: http://alias-i.com/lingpipe There are only two changes: 1. Bug...
I'm interested in sentiment analysis and have started a few detection tests. Is it possible to do sentiment analysis comparatively? For example: "Java is better...
An anonymous bug reporter indicated that the model $LINGPIPE/demos/models/langid-leipzig.classifier was missing. I didn't realize it was linked from the tutorial...
... The answer is probably "yes". In fact, our evals show what the effects of various thresholds are on the one-versus-all subproblems if you look at the scored...
I accidentally left an old description in the documentation on the LingPipe home page. We have not in fact deprecated util.FeatureExtractor as we said on the...
LingPipe 3.8.0 is now available from the site: http://alias-i.com/lingpipe This is a big release — the details are available on the home page linked above. ...
Hi, I was using a custom LineParser to parse a file, when I ran into the following problem: ## Exception in thread "main" java.lang.NullPointerException at...
Hi, when I run the Chinese word segmentation demo with the Peking University corpus, an OutOfMemoryError occurs as follows: run-pk: [java] CHINESE TOKENS [java]...
Hi Bob, I have added the fix as you said, but it doesn't work for me, and the same exception occurs. I have no idea why. I dug into the code and found...
Hi Bob, I am trying to use NaiveBayesClassifier to classify text documents. Here is my code: String regex = "[a-zA-Z]+"; RegExTokenizerFactory tf = new...
Hello, I am trying out the Sentiment Analysis tutorial (PolarityHierarchical.java). I want to dump the classifier to a file so that I do not have to run through...
Hi, I am new to LingPipe. I read some tutorials on the LingPipe website, but many NLP terms frightened me. The classification samples included in the...
A user reported a problem with HMM decoding on a specific hardware/software configuration: Athlon64/Ubuntu 8.1 64-bit/JDK 1.6 An exception is thrown in the...
Hello, I was looking at NGramBoundaryLM and wondering why exactly those begin and end characters are needed and when you'd use NGramBoundaryLM instead of...
LingPipe 3.7.0 will generate a warning when compiling all jars because of an issue with non-ASCII chars in one of our Java files. It works with Windows...
LingPipe 3.7.0 is now available from: http://alias-i.com/lingpipe The only significant change is an update to the MEDLINE DTDs used by the MEDLINE parser....
Hi Bob, I have a question on model quality. I used the ChineseToken sample to generate a words-zh-as.CompiledSpellChecker model, which is 78,303 KB in size. I...
... It sure does. Thanks for the detailed bug report. The culprit is the following file: $LINGPIPE/src/com/aliasi/spell/CompiledSpellChecker.java The method...
I recently downloaded LingPipe 3.6.0 and tried the ChineseToken tutorial sample. I always get a NullPointerException. My environment is: - Windows XP - Java...
As of 3.0, the chunking interface completely changed so it's no longer backward compatible with 2.x code. The last version of LingPipe to support the...
Hello, I'm drawing a blank. In the context of spell checking, is the edit distance used between the user-entered term and the suggested term, or the reverse? I...