The International Macintosh Users Group presents:
Recent Research in Cross-language Document Search
Group: International Macintosh Users Group (IMUG)
(A Forum for Multilingual / Multiscript Computing)
Date: October 21, 2004, 7-9 p.m.
Speaker: Fredric Gey, Ph.D. (University of California, Berkeley)
Topic: Recent Research in Cross-language Document Search
Location: Apple Computer, Apple Campus, 1 Infinite Loop, Cupertino
Take Saratoga/Sunnyvale exit off 280, turn South into
Cupertino, turn left onto Mariani Avenue, left into
Infinite Loop.
Admission: $4, free for IMUG members
Contact: Roger Sherman, (650) 859-5981
roger [dot] sherman [at] sri [dot] com
Cross-language document search research has been underway for more than
10 years now and while much progress has been made, certain research
challenges remain. This talk will review recent research in
Cross-language information retrieval, including the 2004 evaluation
workshops: NTCIR for Asian language retrieval in Japan
(
http://research.nii.ac.jp/ntcir-ws4/index.html) and CLEF for European
language retrieval (
http://clef.iei.pi.cnr.it:2002/), as well as the
U.S. DARPA “Hindi Surprise Language Exercise” of 2003. Topics to be
covered include:
- Language-specific processing (stemming, segmentation, stop-words)
- Word decompounding for German
- Translation disambiguation for bilingual dictionaries
- Parallel corpora induced lexicons
- Web corpora usage for out-of-vocabulary translation
- Special retrieval tasks (patent retrieval, cross-language question
answering)
- Geographic information retrieval
- Challenges of less-commonly taught languages
- The road ahead in cross-language information retrieval research
Dr. Fredric Gey has been doing research in cross-language information
retrieval since 1998. He and his associates have participated in every
cross-language information retrieval evaluation in the United States,
Japan and Europe. Currently he is working on retrieval (including
geographic information retrieval) of Russian language corpora and other
digital objects. Dr. Gey co-chaired the English-Arabic retrieval
evaluation track at the TREC conferences in 2001 and 2002. He
co-chaired a workshop on “Cross-language Information Retrieval Research:
The Road Ahead” at the ACM SIGIR-2002 conference in Finland. He is
co-author of the entry on “Multilingual Information Retrieval” in the
Encyclopedia of Library and Information Science and co-editor of a
forthcoming special issue on Cross-Language Information Retrieval of the
Information Processing and Management Journal.
-------------------------------------------------------------------------
IMUG has its own site on the World Wide Web:
http://www.imug.org.
Check it out! It's currently not up-to-date, but we're working on
fixing that.
For a map of our meeting location go to:
http://www.imug.org/events.html
and click on the map link.
We also post our meeting announcements and handouts at Yahoo! Groups:
http://www.yahoogroups.com, under the group name "imugi18n"
(IMUG-i-eighteen-n).
-------------------------------------------------------------------------
To be added to the IMUG mailing list, please email to:
imugi18n-subscribe@yahoogroups.com