As a consultant working in the TM space (also), I am often asked how Text Mining differs from what Google (intranet) search does. (In fact,...
286
lowprofilestartup
lowprofilest...
Jul 7, 2008 7:38 pm
Hi Folks! My startup's looking for a few world-class gurus in search, NLP, etc., for a critical project starting right away. Any help finding the right folks...
287
laura.eisenmann@...
laura.eisenmann
Jul 7, 2008 8:04 pm
I will be out of the office starting 07/04/2008 and will not return until 07/14/2008. I am on vacation. I will check email occasionally, but may not be able...
288
Vadim Berman
vb_dutch
Jul 7, 2008 10:48 pm
Hi lowprofilestartup (sorry, no names here), I am running a small consultancy in Australia specializing in NLP, IR and MT (as in machine translation). Here's...
289
William Hayes
william_s_hayes
Jul 8, 2008 11:13 am
1) Text search (TS) returns documents instead of targeted information 2) Google search doesn't understand entities (people, businesses, proteins, diseases,...
290
Seth Grimes
sethgrimes
Jul 8, 2008 12:08 pm
William, thanks for the response. Original poster, perhaps there were no discussions on this perspective, of search supplanting text mining, at the recent Text...
291
comahony
Jul 8, 2008 11:41 pm
Indeed. One way to think about it is... - Search interfaces and algorithms are good at finding a needle in a haystack. They are good when you want to find a...
292
canchi121
Jul 9, 2008 6:35 am
Thanks for your responses. I would like to believe the distinction offered between TS and TM is probably what textbooks in the late nineties had to say; if you...
293
William Hayes
william_s_hayes
Jul 9, 2008 9:43 am
Thanks Conor for your response. That is a quick and easy visualization - will make a better elevator speech :)...
294
Seth Grimes
sethgrimes
Jul 9, 2008 11:48 am
But... one of the arguments for text mining is that it's the technology that can help you find A (as opposed to THE) needle in a haystack, that is, when you...
295
justice.chikomba@...
Jul 9, 2008 11:53 am
I kindly request that you stop sending me TextAnalytics mail. ________________________________ From: TextAnalytics@yahoogroups.com ...
296
Seth Grimes
sethgrimes
Jul 9, 2008 12:41 pm
Folks who want to leave the list should send a message to TextAnalytics-unsubscribe@yahoogroups.com ... -- Seth Grimes Alta Plana Corp, analytical computing...
297
danastarprofessionals...
danastarprof...
Jul 11, 2008 12:09 am
DANASTAR Professional Services is looking for a full time developer to implement search tools and to build an Enterprise Search front end for one of our...
298
Seth Grimes
sethgrimes
Jul 11, 2008 8:36 pm
I've been looking into speech analytics and audio mining. The following are what I've come up with by way of leading vendors, and then I have a second list of...
299
Seth Grimes
sethgrimes
Jul 16, 2008 1:00 pm
... Business Intelligence Network: The Vision for BI and Beyond. The Business Intelligence Network Newsletter...
300
Greg Holmberg
gcholmberg
Aug 5, 2008 1:21 am
In "The Problem with Unstructured Data" (DMReview, 2003, http://www.dmreview.com/issues/20030201/6287-1.html ) Robert Blumberg and Shaku Atre state: Merrill...
301
Ravi Shankar
ravishankar10c
Aug 5, 2008 4:57 am
I share the same feeling as Greg. For all you know, even if the report does exist, there is a good chance that the number was arrived at based on a survey of...
302
Greg Holmberg
gcholmberg
Aug 5, 2008 6:10 pm
I emailed the author of this article, and here's what he said: From: "Robert Blumberg" <rblumberg at att dot net> To: "'Greg Holmberg'" Subject: RE: The...
303
William Hayes
william_s_hayes
Aug 5, 2008 6:36 pm
Hi Greg, I remember searching for this as well and wound up using the Gartner reference but never found a rigorous study supporting the claim. I assumed at...
304
Seth Grimes
sethgrimes
Aug 5, 2008 6:39 pm
At the time of the DM Review article, the citation was already at least 18 months old. Here's an August 2001 reference: ...
305
Charles Patridge
charles_patr...
Aug 5, 2008 6:42 pm
To ALL, I, too, have quoted and used this figure of 80-85% of corporate info as being unstructured data. On a given number of clients (8), I was able to...
306
Seth Grimes
sethgrimes
Aug 5, 2008 6:46 pm
There's a novel by Umberto Eco called Baudolino. Eco is a semiotician and linguist. The story meanders a lot without seeming to go anywhere except ...
307
Curt Monash
camonash
Aug 5, 2008 6:50 pm
... Good finds on the history, Seth. And I agree with your point on the meaninglessness of the figure. I'm not sure I'd agree on structured vs. "unstructured"...
308
eisai@...
eisaijmf
Aug 5, 2008 6:52 pm
Folks, There was a claim like this made in the November 16, 1998 Merrill Lynch report on Enterprise Information Portals. The report is a famous one that kicked...
309
Seth Grimes
sethgrimes
Aug 5, 2008 7:01 pm
I found the 1998 M-L report... I think. It contains many references to "unstructured" info etc. and says "some estimates run as high as 80%." I'm going to blog...
310
Seth Grimes
sethgrimes
Aug 5, 2008 7:12 pm
Sorry, I didn't include the link to the report. It's http://emarkets.grm.hia.no/gem/Topic7/eip_ind.pdf . Seth ... -- Seth Grimes Alta Plana Corp, analytical...
311
rrifaieh
Aug 5, 2008 11:16 pm
When Predictive Analytics Goes to the Cloud: In an article published in the Communications of the ACM [1] in July 2008, Brian Hayes argues that the longstanding era...
312
Dorai Thodla
dorait
Aug 5, 2008 11:47 pm
I heard (can't recall where) that there is more data held in spreadsheets and documents on individual computers than in central databases and servers. Some of...
313
mdehaaff
Aug 7, 2008 4:20 am
I have not been able to find the ML report either; however, Philip Russom from TDWI wrote a great report, "BI Search and Text Analytics", that contains similar...
314
Seth Grimes
sethgrimes
Aug 7, 2008 6:30 pm
I thought list members would find this blog article interesting. I'm pasting in just the first paragraph because I can't paste in the hyperlinks -- ...