Hi , I am trying to write a class to predict a sequence of tokens,like "AABX2, BVGD8, GGSSD, .... " and so on. I managed to use the NGramBoundaryLM class to...
When you say "predict", what exactly do you want to do? If you want to compute the probability of char array cs2 given that you've seen char array cs1 relative...
I also should've added that if you wnat to predict a sequence of tokens as the subject indicated, then you want to use a TokenizedLM. It's no more set up to...
Hi Bob, thank you the answer. By "predicting" I mean guessing the next letter that would come after a given sequence, without being part of that sequence (...
... This'll work, but it's not very algorithmically efficient. LingPipe doesn't provide an efficient way to guess the next letter in the basic language ...
My corpus of train is composed of smartphone apps reviews and it will be updated every week, so that my classifier can get smarter and smarter. I want to keep...
From the javadoc http://alias-i.com/lingpipe/docs/api/com/aliasi/classify/TradNaiveBayesClassifier.html A naive Bayes classifier may be compiled. In order to...
But can i use any other classifier which allows me to update my model with the new reviews? Or before compilation, can i keep the intermediate dynamic model,...
You can use plain old Java serialization to save a dynamic model. Later, you can reconstitute it using Java's object reader, then continue to train it. ...
About TradNaiveBayesClassifier, we can get the probability of the specified token with probToken, but for a n-gram classification how can i get the probability...
DynamicLMClassifier is an abstract class, but you can use TokenizedLM.tokenProbability(). See the doc for more info: ...
1460
Subhabrata Banerjee
subhabangalore@...
May 6, 2013 9:14 pm
Dear Moderator, I was trying to install, run and use Lingpipe. I had Eclipse IDE installed so I called the whole library in Eclipse and I am being able to run...
There is a somewhat out of date description of how to use eclipse with LingPipe here: http://alias-i.com/lingpipe/demos/tutorial/eclipse/read-me.html You are...
1462
Subhabrata Banerjee
subhabangalore@...
May 6, 2013 9:56 pm
Hi Breck, Nice to get your mail. I could port it in Eclipse on my MS-Windows7. I ran the Build All. It ran as you mentioned there are lot of warnings, etc....
Hello LingPipers, Just FYI I have ported most of the LingPipe tutorial examples to scala here: https://github.com/sujitpal/scalingpipe I forked from this...
Hi Subhabrata, I am an Eclipse user so I understand that when you hit CTRL+F11 you are trying to run Chunking.java. In Java, in order to "run" a class under...
Sujit, That looks great. I downloaded it and had a look. I don't know much about Scala but it looks interesting. What is your primary motivation for using...
Thanks Breck. Regarding motivation, its a bit silly really, the first reason is I've been in Scala learning mode since late last year, so to increase...
1468
Subhabrata Banerjee
subhabangalore@...
May 13, 2013 7:45 pm
Hi Sujit, Thanks for your kind suggestion. I could run the demos. Generally I could give my input and get the results.I think I can give files,too. My job is ...
Hi Sujit, I tried to run ant, it went fine. Then I was trying to build the CRF module and I picked up ...
1471
Subhabrata Banerjee
subhabangalore@...
May 15, 2013 4:46 pm
Dear Group, I tried to run the demo and it ran fine. I was using the CRF. In the " http://alias-i.com/lingpipe/demos/tutorial/crf/read-me.html", I could run ...
Subhabrata You need basic help with Java which is beyond the scope of this group. I suggest you find an experience Java developer to help you. How to train ...
Hi Subhabrata, I am probably not the right person to answer this question, but one thing you could attempt is to perhaps extend the TinyPosCorpus to build a...
... You might want to check out our citationEntities sandbox project, which has tools for tag-a-little, learn-a-little named-entity corpus creation. We've...
hi,
I am new to LDA and need some advices in implementing the DOC_WORDS matrix in LdaFixed.java.
I am trying to extract topics from a set of tweets and...
... The doc's here: http://alias-i.com/lingpipe/docs/api/com/aliasi/cluster/LatentDirichletAllocation.html The array int[][] is a ragged array of the word...