Hi I am fairly new to LingPipe and NLP as a whole, so it could very well be a stupid question. I am trying to extract concepts from sentences inside a document...
... As always, we suggest you start with some examples. That is, do the task by hand yourself, so we can get an idea of what you mean by "extract concepts". ...
Bob, Thanks for the reply and my apologies for the vagueness that you refer to. Here is what I meant: For ex. "Martha Stewart was involved in insider trading...
Amlan Chatterjee wrote ... OK, I kind of see where this is going, but I don't know what a "predicate graph" is and I'm not sure I see any generalizations to be...
Dear Lingpipe community members, I'm considering developing an algorithm for text compression using part-of-speech analysis as my senior project (which started...
... First of all, you need to understand the information-theoretic notion of coding. There's a direct link between language modeling and compression: if you...
Hello- I'm a bit confused by the need to provide a compilable weighted edit distance while training a spell checker. I can't figure out where and how it is...
... The short answer is that it determines the channel probabilities in the noisy channel model; that is, the model of what brainos and typos people are likely...
Thanks Bob. I do understand what the edit distance does 'at runtime', i.e. with a CompiledSpellChecker. My question is how the edit distance is used during...
... Sorry for off-topic response. I was wondering why you were asking about edit distance. The only reason edit distance is part of the training process is so...
... No, it doesn't. Not that we have anything against C or Python or C#, we just haven't had the resources. It'd be easy to do at a simple API level (e.g....
We're pleased to announce the release of LingPipe 3.3.0. As usual, you can find the whole release and download info at: http://www.alias-i.com/lingpipe ...
Hello, I have a few questions related to tuning spell checking. 1. I am always confused about the weights and costs. Is bigger (larger negative number) making...
... Lots of scalars and negations here. Penalty looks one way, probability the other; absolute value goes one way and value the other. Just remember that the...
Dear Mr. Bob Carpenter, I posted since few weeks the fellowing email (see above) where I asked you for possible use of Lingpipe for sentence or string...
Dear Mr. Bob Carpenter, I posted since few weeks the fellowing email (see below) where I asked you for possible use of Lingpipe for sentence or string...
I'm afraid the short answer is no -- I haven't written the tutorial for string comparison and don't currently have any code samples to send you that aren't...
Hi - I'm working on Language Id and was trying to download the Leipzig Corpora from http://corpora.informatik.uni-leipzig.de/download.html. To the surprise...
... I just tried it out and had no problem (i.e. it "works on my machine", where my machine is running Firefox 2.0.0.12 on Win x64 over a local ethernet...
Hello All, I have been looking over the demo for named entity recognizer. I did read and understood about Rule-Based Named Entity Detection and Exact ...
The best place to find info on building your own data and training a NE recognizer is in our "citationEntities" sandbox project. Instructions are at: ...
So I have been playing with the citationEntities, and I run the gui tool to do the manual annotation. It did form some files like: <?xml version="1.0"...
Hello, When I am training a named entity annotator and If I have input like: "INTEL CORE 2 QUAD Q6700 2.66 GHZ 8 MB CACHE 1066 MHZ LGA775 Processor" intel ->...
... You should look at the doc in citationsEntities/readme.html The entities you annotate inside of is available on the command line. ... You can specify the...
... Product identification "in the wild" is one of the harder named entity tasks. If you are tagging uniform data like a catalog or single website then it may...
Hi, I recently use LingPipe for my research. I have texts from wine menus, and I use both rule-based and dictionary-based name entity recognizer for them. My...