I will be at HLT/NAACL and will plan to stop by, but I don't have a laptop. If I bring some text files on a flash drive, would that work? -- Mary D. Taffet ...
Mary D. Taffet
mdtaffet@...
Jun 3, 2006 5:53 pm
293
... The most important thing to have (for ANYONE planning on having us help you get started) is data that you plan to process plus some idea of what you would...
I'm getting ready to upload medline baseline citations and annotations into a mysql database as suggested in the tutorial. I am using JDBC to implement a...
... Bob just re-ran results so I'll let him chime in with the numbers if he has them for bio entity recognition. ... Medline has lots of abstracts about things...
Here's the output from running the MEDLINE download script (as reported by an anonymous user): [java] Logged in to FTP Server. [java] Checking/Downloading...
Sorry this mail's getting so long, but it's a complex topic! ... Timing will vary based on the models you're using and how you configure the decoders. With...
I just got my lease and the directory name is: ftp.nlm.nih.gov/nlmdata/.medleasebaseline/gz Maybe they changed it recently? I can download from an ftp client...
I've finally tracked down my problem. We're not plugged in. Our IP address changed, so NLM would let us log in, but not see anything. That should be fixed...
I'm working on training the NE chunker to extract medical terms like diseases. The train method in the NETrain class takes an array of tokens and the...
... LingPipe expects the tags in BIO encoding (begin entity tag, continuing in-tag, and not in entity). It won't be able to do anything with DISEASE and...
Thanks Bob. So what should I specify as tags for each of the tokens in my example? Can you send me a sample training file? Thanks, -Ravi ... From: Bob...
Sorry -- I must not have been clear enough about versioning in the answer. The problem that we're about to fix with the 2.3 release is that there are two...
Bob, I used B-DISEASE, I-DISEASE and O as tags as specified in the ChunkerHandlerAdapter class. Still it is not working. -Ravi ________________________________...
Thanks Bob. I'm using 2.2.1. I'll switch to CharLmHmm chunker. When are you expecting to release 2.3? -Ravi ... From: Bob Carpenter [mailto:carp@...] ...
Dear Bob and Breck, We are using LingPipe for our project on news video indexing (http://satoh-lab.ex.nii.ac.jp/users/ledduy/Demo/). In this project, we have a...
We're happy to announce the release of LingPipe 2.3. As usual, it's available from the web site: http://www.alias-i.com/lingpipe This is a big release in...
I used the TrainGeneTag.java to train on a sample of medline abstracts. Instead of GeneTagParser I used the Muc6ChunkParser as my data is in that format. It...
I'll respond inline. ... That's certainly one way to go. The other approach to try would be to train a spelling corrector on properly cased data and then...
... First of all, the above line won't compile, because readObject returns an Object. You should be casting to Chunker, NBestChunker, or ConfidenceChunker,...
... Could you send an exact copy of what you're trying to do? From the above description, I have a feeling you're probably compiling the CompiledHmmCharLm ...
Never mind about the ant stuff. I see now. Thanks again. ... -- Christopher M. Topinka NLM-BHIRT Predoctoral Fellow in Computer Science University of Missouri...
Hello: I am just starting on a new project that needs to classify auction results from the auction title and am looking for a little direction. My initial...
What is the practical limitations of categories? I have a project coming up that will need to filter items into millions of categories. The 'classify' example...
... This is a very hard text classification problem. Unless the terms are highly discriminative, there won't be enough data to sway the balance one way or...
... That depends on your memory, the size of the models, number of models and required latency. :-) ... More seriously, though, you can expect about 1-2M...
... Thanks for pointing this out. I've patched Strings.decimalFormat(...) to return a sequence of "?"s instead of throwing an error in the case of overflow. ...
In our quest to get people comfortable working with LingPipe we are having our first Hobby Night in our Williamsburg Brooklyn office--the space is a very nice...
I've not yet implemented anything to save a trie dictionary. It "implements" the method in the Java API sense, so your code compiles. But as documented in ...