Here's the detail, from the home page announcing the new release: Maintenance Release The latest release of LingPipe is LingPipe 2.3.1. This is a ...
... Great mail! This is what makes having your source out there so much fun (not to mention so useful). I'm cross-posting this back to our mailing list. I...
... Spot on. It was a very tough call, but our main problem in apps tends to be space, not speed. Each double is 8 bytes, and there are two per internal node...
I'm using LingPipe in my research and would like to cite it in a peer-reviewed poster I am presenting. How can I do this? Thanks, Chris -- Christopher M....
... A quick look at the Chicago Manual of Style http://www.libs.uga.edu/ref/chicago.html reveals the following strange exemplary form: University of Georgia,...
Hi, I'm just starting out with LingPipe and NLP. I'm hoping that someone can point me in a general direction on how to extend LingPipe's Named Entity...
... Dan: Your first guess was on target. Given that we didn't have any tutorial material on our regular expression named entity chunker, I took this...
Hello all, I'm currently using ne-en-news-muc6.AbstractCharLmRescoringChunker on some test corpi. I've noticed that when I'm passing various news articles,...
... Actually, it doesn't respect them at all. Named entity recognition is solely done on a token-by-token level. ... The right way to do what you want to do...
Over the weekends, we began having DNS problems with alias-i.com. We haven't gotten any alias-i.com email, and the LingPipe site is temporarily down. Our...
hi! i am trying to do coreference resolution in spanish texts with lingpipe. I have a small training corpus with PERSON/LOCATION/PRONOUN annotated data. I...
... Here is a snippet that chunks text based on paragraph features which then has to be inverted to find valid text for sentence detection. This is fairly...
Hi, I've been searching through Google, the LingPipe API, and the LingPipe Yahoo group archives to no avail. I'm trying to create a TokenizedLM trained not on...
... I'm afraid you're right -- it requires some delving. I've just made the changes for 2.3.2, which we're releasing later next week. I have the Google disks...
I just put LingPipe 2.4.0 up on our web servers. It's a fairly minor change in terms of code, but the upgrades are not 100% backward compatible. I ran tests...
Hi all, this is my first post, I'm a new LingPipe user, but very impressed so far. Kudos on an excellent piece of software! As an early exercise, I'm trying to...
... May I ask what the basis for the classification is? ... OK -- you've got the right intuition here. At a high level, our text classifiers (and everyone ...
Hi Bob, ... Yup: legit blog urls vs. spam-blog urls. I ran across a paper describing classification by training exclusively on tokenized versions of the URLs. ...
David, Do you have a link to the paper you are referring to? I could use this to enhance spammer detection in Simpy (see sig). Thanks, Otis -- Simpy --...
... Exactly right. My bad for not having the reference available. I wasn't trying to be mysterious, I didn't understand it would be of general interest here....
Bob, as a followup to your comments below, I've been looking at the javadoc for ScoredPrecisionRecallEvaluation. I see that it provides "an evaluation of...
... Nope -- that's the right definition. The "operating characteristic" is implicit -- it's just a ranked evaluation. It basically tells you what the...
LingPipe 2.4.1 Released ======================= This is a patch release replacing LingPipe 2.4.0. It patches all bugs that have been reported to us; thanks to...
Hi! I'm working on a system to try to automate the analysis of customer satisfaction based on a database of their e-mail correspondences. So far we've had good...
... We've had requests to do other scalar classifiers, like reading level classification (on a grade-in-school scale). This is a general problem in statistical...