... I'd say the main advantage of the character LM approach is that it can learn from parts of tokens as well as across tokens, which means it's far less...
Hi Lena, A solution could be to embed your LingPipe learner / classifier inside a framework like UIMA or GATE. That would allow you to run any number of processors...
... LingPipe was designed so that it'd be easy to integrate into larger integration frameworks like UIMA or GATE. LingPipe's models are all serializable and...
... Didn't you already publish a UIMA wrapper for at least the chunkers? I have successfully used the LingPipe NER components with UIMA 1.3 Regards, Florian...
Florian Laws
florian@...
Jun 4, 2007 2:20 pm
436
Hello all, I wonder if I can use LingPipe for clustering sentences using their syntactic parse trees as features. If yes, how can I accomplish this? Best...
... The hard part's (1) generating the syntactic parse trees, and (2) deciding how to map them into features (e.g. Collins's kernel-based approach) and then...
Hi All, I read in the LingPipe documentation that it can be used for Japanese text segmentation. My text is encoded in UTF-8. The segmentation works well...
... How do you want to segment the text? Into words? Into sentences? If you want to segment into words, you'll need training data that has spaces between the...
LINGPIPE 3.1.0 RELEASE ====================== Minor Release ============= The latest release of LingPipe is LingPipe 3.1.0. This minor release replaces...
Hi, I'm playing with LingPipe's DynamicLMClassifier and I'm trying to categorize text into 3 categories: good, neutral & bad. The underlying algorithm for...
Great question and one that comes up again and again in different contexts in machine learning and statistics. ... That is a very very skewed training set. And...
Bob, As always, a great, clear and understandable explanation! The training set is very skewed and there is nothing I can do about that, but I didn't know that...
Bob, I'm wondering about your sentence: "Typically, you can set that threshold using held out data." What does that mean? The building of the model and the ...
... In terms of n-gram sizes, I meant you might set the neutral category's n-gram size lower than the other category's. That'll tend to decrease the neutral...
Hi all, I asked in an email last week whether it is possible to exploit the potential of LingPipe for Japanese segmentation. I haven't got any answer!...
Hello Bob, Thank you very much for your previous answer. I would deeply appreciate your suggestion in the following question. I am going to use LingPipe...
We're still having problems with Sun's 1.6 JDK. Now, the problem is a failing unit test for the TfIdfClassifier. It passes on Windows (32 bit and 64 bit) and...
LingPipe 3.1.1 is out. It patches some bugs reported for 3.1.0, but does not introduce any new functionality. Bugs fixed include the TF/IDF classifier bug, a...
Hi All, I am trying to use LingPipe to classify news into positive and negative groups and got some pretty good results. Now I want to compare the result with...
... I'll repeat my ongoing suggestion to try two classifiers: positive vs. neutral and negative vs. neutral. LingPipe assumes the categories are exhaustive...
I missed responding to Lena Tenenboim's question on the group a while back, so here goes. ... What to do is going to depend heavily on whether the categories...
Thanks, Bob, for the very detailed explanation. I will give it a try following your advice. I do have another question. I know LingPipe can do Chinese word...
... The short answer is "yes" -- classifiers will work in multiple languages. The simplest and most portable approach is to use language model classifiers...
We're soft-releasing our new GUI tool for annotating corpora with named entities or other chunks. It's set up a little differently than other such tools with ...
Hi, I'm very new to data mining and am trying to use the clustering capabilities of LingPipe. I have read through the tutorial many times, but I'm still...
Hi, I am working on a web project that allows users to train a classifier in the browser. Basically, users can train a classifier by entering text and...
... That's a great project. We've been looking for an application for this for a while, so I'd be curious what context you're applying it in. If you have a...
Hi, Is it possible to find and sort collocations not based on... "Collocations are phrases which are seen together more than you would expect given an...