Search the web
Sign In
New User? Sign Up
webir · Web IR & IE
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Message search is now enhanced, find messages faster. Take it for a spin.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 1 - 31 of 2606   Newest  |  < Newer  |  Older >  |  Oldest
Messages: Show Message Summaries   (Group by Topic) Sort by Date v  
#31 From: Margaret Graham <margaret.graham@...>
Date: Thu Jan 14, 1999 10:59 am
Subject: Challenge of Image Retrieval - call for participation
margaret.graham@...
Send Email Send Email
 
A New Year reminder! (and apologies to those on multiple lists)


       THE CHALLENGE OF IMAGE RETRIEVAL

CIR 99 - Second UK Conference on Image Retrieval
          25 & 26 February 1999

Venue:
    Forte Posthouse Hotel,
    Newcastle upon Tyne,
    United Kingdom

The conference aims to bring together researchers and practitioners
in the fast-growing area of image retrieval, to exchange information and
gain some idea of the significance of developments in related
disciplines.

Full details of the conference and the online booking form are
available at the web site: http://www.unn.ac.uk/iidr/conference.html

The Conference will be held in the Forte Posthouse Hotel, Newcastle
upon Tyne. All sessions will be held in the Tyne Suite. Lunches on
both days and the Conference Dinner will also be held in the Hotel.

Bed and breakfast accommodation has been arranged for delegates in
the Forte Posthouse Hotel at a special price.  Delegates are responsible
for booking their own accommodation. (see web site for details of the
rates and information on alternative hotels).

PROGRAMME DETAILS FOLLOW:

    Thursday 25 February 1999:

10.00 Registration and coffee
10.20 Welcome and Introductions: Professor David Harper,
                    Robert Gordon University

10.30 Content-Based Image Retrieval technology: can it satisfy real
                    information needs? Presentations and discussion based on a
                    JISC funded report on the state of the art of CBIR
technology,
                    led by Dr John Eakins and Margaret Graham,
                    Institute for Image Data Research, University of Northumbria
                   at Newcastle.

12.00          Lunch, system demonstrations and poster viewing

14.00 Submitted papers, session 1 - CBIR applications:
                   Chair: Martin Nail, British Library Research and Innovation
Centre

                   "The Informedia digital video library system at the Open
                   University" R van der Zwan et al, The Open University, Milton
                   Keynes

                   "Content-based retrieval for European digital libraries" C L
Bird et
                   al, IBM UK Laboratories, Hursley Park,

                   "Building systems to block pornography" Y Chan et al,
University
                  of East Anglia, Norwich

15.30 Tea/coffee, system demonstrations and poster viewing

16.00 Submitted papers, session 2 - CBIR techniques:
                   Chair: Joemon Jose, University of Glasgow

                  "Colour indexing across illumination" G D Finlayson and G Y
Tian,
                  University of Derby

                 "Robustness of shape similarity retrieval under affine
transformation"
                 S Abbassi and F Mokhtarian, University of Surrey

                "Trademark image retrieval using multiple features" S Alwis and J
                 Austin, University of York

17.30 System demonstrations and poster viewing

Evening:  Conference Dinner


     Friday 26 February 1999:

9.00 Keynote Address: Dr Michael Swain, Cambridge Research Laboratory,
                  Compaq Computer Corporation: "Image searching on the Web:
experiences
                  and practice"
                  Chair: Dr. John Eakins, University of Northumbria at
                  Newcastle

10.00 Coffee, system demonstrations and poster viewing

10.30 Submitted papers, session 3 - New approaches to image data
                    management:
                   Chair: Dr. Mark Dunlop, University of Glasgow

                  "Superordinate representation of scenes from power spectrum
shapes" A
                  Oliva et al, Institut National Polytechnique de Grenoble,
France

                  "Image browsing and navigation using hierarchical
classification" T S
                   Lai et al, University of Sunderland

                  "Evaluation of automatic shot boundary detection on a large
video test
                   suite" C O' Toole et al, City University, Dublin, Ireland

                  "A flexible architecture for content and concept-based
multimedia
                   information exploration" SM Dobie et al, University of
Southampton

12.00 Lunch, system demonstrations and poster viewing

14.30 Theme: Emerging standards for image retrieval.
                   Chair: Dr. Peter Enser, University of Brighton

                   Speakers:
                    (Speaker to be announced), Technical Advisory Service for
Images;
                    "Importance of standards".
                    Michael Day; UK Office for Library  Networking;
                   "Metadata for images"
                    Dr. Edward Hartley, University of Lancaster;
                   "Impact of MPEG-7"

                     (includes tea/coffee at 15.30)

16.30 Close of Conference

The Conference is sponsored by:
The British Computer Society Information Retrieval Specialist Group
The Institute of Information Scientists
The Robert Gordon University
The University of Northumbria at Newcastle
The British Library Research and Innovation Centre

and supported by:
The British Machine Vision Association and the Institution of
Electrical Engineers

Full Conference rate = stlg 200 + VAT
One Day registration = stlg 110 + VAT
There is 10% discount for members of the professional groups
sponsoring/supporting the Conference, and for students.
(N.B.  Accommodation is additional)

All correspondence concerning registration at CIR99 is to be addressed
to Cath Frost:

Email: cath.frost@...
Telephone: +44 (0)191 227 4646
Fax: +44 (0)191 227 4637

Cath Frost
Institute for Image Data Research
University of Northumbria at Newcastle
Newcastle
NE1 8ST

---------------------------------------------------------


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Margaret Graham
Research and Development Manager
Institute for Image Data Research
University of Northumbria at Newcastle
Newcastle upon Tyne
NE1 8ST

Tel: +44 (0)191 227 4646
Fax: +44 (0)191 227 4637
email: margaret.graham@...
Website: http://www.unn.ac.uk/iidr/
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#30 From: Einat Amitay <einat@...>
Date: Thu Jan 14, 1999 1:05 am
Subject: CFP: Machine Learning for Intelligent Information Access
einat@...
Send Email Send Email
 
-------------------------------------------------------------------

                 Machine Learning for Intelligent Information Access
                 ---------------------------------------------------


                                 Workshop for the
                         MACHINE LEARNING AND APPLICATIONS
             Advanced Course on Artificial Intelligence 1999 (ACAI-99)
                                  5-16 July 1999

                 Mediterranean Agronomic Institute of Chania,
                                 Crete, Greece
                 (http://www.iit.demokritos.gr/skel/eetn/acai99)



                       *** C A L L   F O R    P A P E R S ***


Description
---------------
As the space of electronically stored information continues to expand
across computer networks, the need for intelligent access to information
within multi-dimensional, partially structured and noisy spaces becomes
imperative. Examples of such spaces are newswire feeds, the World Wide
Web, digital libraries and enterprise-wide information repositories.
There has been a growing interest in augmenting or replacing traditional
information filtering and retrieval approaches with machine learning
techniques in order to build systems that can scale to the intrinsic
complexity of the task.

The goal of this two-day workshop is to focus on new work in the
application of machine learning techniques to intelligent information
access problems, such as filtering of relevant information (articles,
books, electronic pages) over information streams, text topic
categorization, enterprise-wide dissemination of information and
electronic information markets. In particular, research topics relevant
to the workshop include:

      · Text classification
      · Adaptive filtering
      · Query expansion
      · Clustering of documents
      · Scalability over very large datasets
      · Ensembling and multi-strategy learning
      · Knowledge representation for information retrieval
      · Architectures and protocols for electronic information markets

The workshop will include an overview of relevant aspects of machine
learning and information retrieval, and paper presentations discussing
the use of machine learning in various information access tasks.


Time and Place
-------------------
The workshop will take place in the afternoon sessions of July 7th and
8th 1999 as part of ACAI-99. For the workshop participants who will not
register for the whole ACAI-99 there will be a fee of 100 ECU.


Submission Details
------------------------
The workshop will consist of invited talks, presentation and discussion
sessions and hands-on demonstrations. Researchers from machine learning,
information retrieval, digital libraries and distributed artificial
intelligence who are interested in presenting their work should send a
short paper (5-8 pages) describing work in progress or completed work.
Persons desiring to participate should submit position papers (up to two
pages). We would like to encourage submissions of video demonstrations,
and working systems that can be used for hands-on demonstration. Papers
should be e-mailed in PostScript form to karakoul@...; they must
include: author's name(s), affiliation, complete mailing address, phone
number, fax number and  email address. The accepted papers will appear
in a technical report that will be prepared in collaboration with the
ACAI-99 organizing committee.


Important Dates
--------------------
Paper submissions:                         February 26, 1999
Notification of acceptance/rejection:      March 31st, 1999
Final version of the papers:               April 19th, 1999.


Organizing Committee
----------------------------
Jamie Callan, University of Massachusetts, USA (callan@...)
Claudio Carpineto, Fondazione Ugo Bordoni, Italy (carpinet@...)
Innes Ferguson, Active On-line Ltd., UK (innes@...)
Klaus Fischer, DFKI GmbH, Germany (Klaus.Fischer@...)
Grigoris Karakoulas, CIBC & University of Toronto, Canada (co-chair)
Matthias Klusch, Technical University of Chemnitz, Germany
(klusch@...)
Ramon Lopez de Mantaras, IIIA, Spain (mantaras@...)
Jorg Muller, John Wiley & Sons, UK (jpm@...)
Giovanni Semeraro, University of Bari, Italy (co-chair)
(semeraro@...)
Stuart Soltysiak, BT Labs, UK (soltysis@...)


Contact Person
-------------------
Grigoris Karakoulas
Global Analytics
Canadian Imperial Bank of Commerce
161 Bay St., 11th Floor, BCE Place
P.O. Box 500
Toronto, Ontario, Canada M5J 2S8
Phone: (416)594-7709
Fax: (416)594-8528
E-mail: karakoul@...






------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#29 From: Einat Amitay <einat@...>
Date: Thu Jan 14, 1999 12:48 am
Subject: [Fwd: TREC-8 Web Tracks: Proposed 2 gB collection]
einat@...
Send Email Send Email
 
--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com
Jamie,
	 Thanks for the input.  At this point, we can only answer the
first question, but maybe we (or another participant) can come up with
answers to the other two later on.

Dave

> 1.  What is the density of relevant documents for TREC-7 topics?
>

Nick Craswell tells me that the answer is as follows:

 	 Judgments   Coll        Density of relevant docs
 	 T7          VLC2        6482/18571671 = 0.03%
 	 T7          WT2g       3105/247491 = 1.25%
For comparison,
 	 T7   	    Ad Hoc 4674/528155 = 0.89%

Of course, there may be considerable variation from one topic to another.
For another set of experiments, we have been carrying out some additional
judgments (using the same judges and the same topics) on the WT2g colection.
We haven't done all topics yet, but the last time we looked the relevant
count had risen quite a bit:

 	 T7 + new    WT2g        4825/247491 = 1.95%



> 2.  What is the average number of links within a single site?
>     It looks like you have about 250 pages per site -- how
>     interconnected are they?
>
> 3.  What is the average number of links from one site out to another
>     in the corpus?  I'm wondering whether each site is an isolated
>     island, referring only to itself and sites outside of the corpus,
>     or whether the sites in the corpus are interconnected to some
>     extent.
>
> If some of these numbers are too difficult to come by, we can obviously
> do without them, but they would be nice to know.
>
> 							 Jamie
>


--
David Hawking
CSIRO Mathematics and Information Sciences
GPO Box 664
Canberra ACT 2601
Australia
email: David.Hawking@...   phone: 61-2-6216 7060  fax: 61-2-6216 7111
http://cs.anu.edu.au/people/David.Hawking

#28 From: Einat Amitay <einat@...>
Date: Thu Jan 14, 1999 12:48 am
Subject: [Fwd: TREC-8 Web Tracks: Proposed 2 gB collection]
einat@...
Send Email Send Email
 
--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com
Dave,

Your proposal looks like a good starting point.  I do have a couple
of questions, though.

1.  What is the density of relevant documents for TREC-7 topics?

2.  What is the average number of links within a single site?
     It looks like you have about 250 pages per site -- how
     interconnected are they?

3.  What is the average number of links from one site out to another
     in the corpus?  I'm wondering whether each site is an isolated
     island, referring only to itself and sites outside of the corpus,
     or whether the sites in the corpus are interconnected to some
     extent.

If some of these numbers are too difficult to come by, we can obviously
do without them, but they would be nice to know.

								 Jamie

#27 From: Einat Amitay <einat@...>
Date: Thu Jan 14, 1999 12:47 am
Subject: [Fwd: TREC-8 Web Tracks: Proposed 2 gB collection]
einat@...
Send Email Send Email
 
--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com
Dear Fellow Webbers,

	 One of the immediate questions for the small task of the TREC-8
web track is to decide which subset of the 100 gigabyte VLC2 to use.
Remember that we plan to use TREC-8 Adhoc topics in the task so we need
to have some confidence that there will be documents relevant to those
topics among the subset.  Consequently, a uniform 2% sample is unlikely
to be suitable.

	 NIST have agreed to make use of the 2 gigabyte subset collection
during the process of ad hoc topic generation.  (The assessors will develop
the topics on the ad hoc data but then also check on the hit rate in the
Web data.  NIST are unwilling to depart from their topic generation procedure
but are willing to take the Web data hit rate into account in selecting
topics.)


ACSys Proposal
--------------

We have defined a 2.1 gigabyte sample of the VLC2 containing about
250,000 documents (only about half as many as in usual ad hoc because
web documents are twice as long). We started by identifying all the
distinct hosts represented in the 100 gigabyte collection.  Then we
counted how many relevant documents were found in the VLC tasks (using
TREC-7 ad hoc topics) and ranked the hosts in order of decreasing
relevant document density.  Finally, we collected together all the
documents from the top-ranked hosts until we got a little over 2
gigabytes of data.  The number of hosts represented is 956.

This collection has the advantages that:

1. the much higher density of relevant documents on TREC-7 topics
    than the average for the VLC2, is likely to lead to a similarly
    higher density for TREC-8 topics.

2. because all available documents from each host are included,
    the proportion of dead links in this 2 gigabyte sample is likely to
    be much less than for a randomly chosen sample of the same size.
    (We still plan to provide connectivity information for the
    whole 100 gigabyte set.)


Feedback
--------

If you have alternative proposals for selecting the subset, please mail
them to this list (webtrax@...).   If no better alternative
emerges in the next few weeks, then we'll accept our own proposal and send
the data to NIST for topic definition purposes.


Dave

--
David Hawking
CSIRO Mathematics and Information Sciences
GPO Box 664
Canberra ACT 2601
Australia
email: David.Hawking@...   phone: 61-2-6216 7060  fax: 61-2-6216 7111
http://cs.anu.edu.au/people/David.Hawking

#26 From: Einat Amitay <einat@...>
Date: Tue Jan 12, 1999 5:04 am
Subject: Ubilab Summer Internship
einat@...
Send Email Send Email
 
Internships in Summer 1999
Ubilab, UBS AG
Zurich, Switzerland

Ubilab, the IT research and innovation center of UBS, one of the world's
leading financial services group, is inviting highly-qualified graduate
students to apply for summer research positions. Ubilab pursues a number
of attractive and competitive research projects, often in collaboration
with other research institutions in Switzerland and abroad. In addition,
Ubilab serves the bank as an effective IT consultant on both the
strategic and tactical levels. Internships are available in the
following areas:

   Software Engineering (object-oriented application development)
   Information Systems (database and information retrieval systems)
   Data Mining (extensible frameworks for enterprise data analysis)
   Human-Computer Interaction (visualization and speech technologies)
   Security (secure financial systems and secure cooperation)

We are looking for students with excellent problem-solving abilities, a
strong interest in one of the above-mentioned fields, and experience in
object-oriented software development. Minimum internship duration is 4
months, although exceptions are possible. Ubilab is centrally located in
the downtown Zurich area and employs some twenty researchers of various
nationalities. Please see our World
Wide Web page at http://www.ubs.com/ubilab for more information about
Ubilab and UBS.

If you are interested in a summer internship, please send a resume (by
email, fax, or postal mail) stating your academic history and research
interests. Please also provide at least two letters of recommendation
from supervisors and/or faculty members who are familiar with your
current activities. The deadline for application materials to reach us
is January 24, 1999.

Contact:
-------
Dr. Jussi Myllymaki
UBS AG, Ubilab
Postfach
CH-8098 Zurich
Switzerland
Tel. +41-1-236-4416
Fax. +41-1-236-4671
Email: Jussi.Myllymaeki@...

--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#25 From: Einat Amitay <einat@...>
Date: Tue Jan 5, 1999 6:39 am
Subject: CFP: Web Track in TREC-8
einat@...
Send Email Send Email
 
This message was written by Ellen Voorhees from NIST:



                     ===============================
                     CALL FOR PARTICIPATION:
                     new Web track in TREC-8
                     ===============================

The Text REtrieval Conference (TREC) workshop series encourages research
in information retrieval from large text applications by providing a
large test collection, uniform scoring procedures, and a forum for
organizations interested in comparing their results.  Now in its eighth
year, the conference has become the major experimental effort in the
field.

Each TREC conference contains a variety of tasks, called "tracks", that
allow interested participants to focus on particular sub-problems of
information retrieval.  TREC-8 will have a new Web track in which
the document collection will consist of approximately 2GB of World Wide
Web data and participants will be encouraged to investigate whether
links can be used to enhance retrieval.  Additional tasks may use a
100GB snapshot of the web.  You are invited to participate in TREC-8.

The complete call for participation for TREC-8 can be found on the TREC
web site, http://trec.nist.gov . The web site also contains more
information about the TREC program, including proceedings from previous
conferences.   The Overview paper in each proceedings gives a complete
description of the tasks performed in that year. The proceedings for
TREC-7, held in November 1998, are not yet published.

Note that detailed specifications of most TREC-8 tracks, including the
Web track, are still in process and are not yet posted on the web site.
Contacts for each track are given in the full call for participation.
Applications for participation are due at NIST by February 1, 1999.

Ellen Voorhees
TREC project manager
NIST




------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#24 From: Einat Amitay <einat@...>
Date: Mon Jan 4, 1999 3:28 am
Subject: SIGIR'98 Hypertext IR Workshop on-line + Summary
einat@...
Send Email Send Email
 
Hi again,

This report was written by Eric Brown and Alan Smeaton following the SIGIR '98
Hypertext IR workshop. The URL for the .ps file is:

http://lorca.compapp.dcu.ie/SIGIR98-wshop/SIGIR-Forum-summary.ps

and the abstract:

--------------------------

"The notion of searching a hypertext corpus has been around for some time, and
is
an especially important topic given the growth of the World Wide Web and the
general dissatisfaction users have with the tools currently available for
finding
information on the Web. In response to this, a workshop was held as part of
SIGIR'98 on 'Hypertext Information Retrieval for the Web' and this document
presents a brief summary of the papers presented at the workshop, along with a
set
of themes identified as a result of group discussion and some conclusions on
where
to go next." (Brown & Smeaton, to appear in SIGIR Forum).

---------------------------

Some of the workshop papers can be found online:
http://lorca.compapp.dcu.ie/SIGIR98-wshop/program.html




--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#23 From: Einat Amitay <einat@...>
Date: Mon Jan 4, 1999 1:35 am
Subject: TREC-8 Web Trax
einat@...
Send Email Send Email
 
(
This is a message from David Hawking, CSIRO, Australia. I have asked him to mail
this list with any other announcements about the Web Trax
+:o)
einat
)

David Hawking wrote:

> Dear Potential TREC Web-Trackers,
>
> - The mailing list for discussion of TREC-8 web tracks is:
> webtrax@...  (note also webtrax-request@...)

>
> - The web page describing the current state of play with the tracks is at
> http://pastime.anu.edu.au/TAR/webtrax.html

>
> -  A paper describing the frozen-web-snapshot/web-tracks and inviting
> participation from the Web community has been submitted to WWW8.
>
> -  We have a tentative definition of an appropriate 2 gigabyte subset of the
data
> for use in the small-data task.  It is much more likely to contain
TREC-relevant
> documents than would a random selection of the same size.  We are conducting
> various tests and will advise when we are happy with it.
>
> -  Please let me know if you can think of web-oriented mailing lists on which
we
> could profitably advertise the web tracks

>
>
> --
> David Hawking
> CSIRO Mathematics and Information Sciences
> GPO Box 664
> Canberra ACT 2601
> Australia
> email: David.Hawking@...   phone: 61-2-6216 7060  fax: 61-2-6216
7111
> http://cs.anu.edu.au/people/David.Hawking

--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
eGroup home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#22 From: foster@...
Date: Fri Dec 18, 1998 7:14 pm
Subject: CFP: KDD-99
foster@...
Send Email Send Email
 
[Hi Web-IR.  We would be very interested in receiving good research
papers on mining the web/text mining/etc.  We also solicit proposals
for workshops, panels, tutorials, demos, etc.  Hope to see some of you
in San Diego...  -- foster]



			    Call for Papers

	 KDD-99: The ACM SIGKDD Fifth International Conference
		 on Knowledge Discovery and Data Mining

		 August 15-18, 1999, San Diego, CA, USA
	     http://research.microsoft.com/datamine/kdd99/

			     Sponsored by:
	   Association for Computing Machinery (ACM) - SIGKDD
			    Co-sponsored by:
		    AAAI, ACM SIGMOD, and ACM SIGART




The continuing rapid growth of on-line data and the widespread use of
databases necessitate the development of techniques for extracting
useful knowledge and for facilitating database access. The challenge
of extracting knowledge from data is of common interest to several
fields, including statistics, databases, pattern recognition, machine
learning, data visualization, optimization, and high-performance
computing. KDD-99 will focus on techniques, applications, and
experiences, bringing together researchers and practitioners.

Starting this year, the KDD series will represent the annual
conferences of the newly formed SIGKDD--the ACM Special Interest Group
on Knowledge Discovery and Data Mining.

			        --------
			        Calendar
			        --------

	      Electronic abstracts due:  March 1, 1999
		       Submissions due:  March 5, 1999

  Notification of acceptance/rejection:  May 17, 1999
	       Camera-ready copies due:  June 14, 1999


Submission Guidelines: Please see the KDD-99 web site for detailed
instructions (http://research.microsoft.com/datamine/KDD99).  All
submissions (including research papers, panels, demos, tutorials and
industrial track submissions) must be received by March 5, 1998.
Prospective authors are encouraged to submit research papers on any
topics of relevance to knowledge discovery and data mining.  In
addition to fundamental research, we solicit papers fostering
cross-fertilization and interdisciplinary integration, as well as
papers that describe significant experiences and implementation
lessons. Topics of interest include, but are not limited to:

KDD Techniques             Human Interaction and the KDD Process
--- ----------            ----- ----------- --- --- --- -------
New KDD algorithms         Data and knowledge visualization
Mining the Web             Evaluating knowledge and potential discoveries
Text/multimedia            Interactive exploration
Data cleaning/noisy data   Visualizing large, high-dimensional data
Incremental algorithms
High-dimensional data
Background knowledge                 Mining Enterprise Databases
                                       ------ ---------- ---------
Implementation and Applications      Scalable algorithms
-------------- --- ------------      Unification of mining with querying
Implementation & use of KDD systems  Database architectures for KDD
Vertical applications                Database primitives for KDD
Case studies: success/failure        Integration: mining/warehousing/OLAP
Benchmarks


KDD-99 Organization
--- -- ------------
Program Committee Co-Chairs: Surajit Chaudhuri (SurajitC@...)
                              David Madigan (madigan@...)

General Chair: Usama Fayyad (Fayyad@...)

Awards: G. Piatetsky-Shapiro (gps@...)
Demos/Exhibits: Ismail Parsa (iparsa@...)
Industrial/Applications Track: Jim Gray (gray@...)
                                Ronny Kohavi (ronnyk@...)
Local Arrangements: Jenny Zhang (jgz@...)
Panels: Padhraic Smyth (smyth@...)
Proceedings: Kyuseok Shim (shim@...)
Publicity: Foster Provost (provost@...)
Sponsorship: Ramasamy Uthurusamy (samy@...)
Tutorials: Jiawei Han (han@...)
Workshops: Rakesh Agrawal (ragrawal@...)

------------------------------------------------------------------------
E-group home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#21 From: Einat Amitay <einat@...>
Date: Fri Dec 18, 1998 3:46 am
Subject: DL99 - extended submission deadline
einat@...
Send Email Send Email
 
Digital Libraries '99
The Fourth ACM Conference on Digital Libraries

==========================================================
D E A D L I N E   H A S   B E E N   E X T E N D E D   T O

                   F E B R U A R Y  8
==========================================================

The program chair has recently extended the submission deadline
for DL'99 in response to requests from many authors.  A revised
call for papers is attached below for your convenience.

Jim French, Publicity Chair
---------------------------------------------------------------

Digital Libraries '99
The Fourth ACM Conference on
Digital Libraries
Sponsored by ACM SIGIR and SIGWEB

University of California, Berkeley
August 11-14, 1999

Call For Participation
http://fox.cs.vt.edu/DL99

Introduction

The Fourth ACM Digital Libraries Conference (DL '99) will be held in
Berkeley, California, USA on August 11-14, 1999. The conference hotel
will
be the Radisson Hotel, Berkeley Marina. Conference sessions will be on
the
campus of the University of California at Berkeley and at the conference

hotel. ACM SIGIR '99 follows immediately after at the same location, to
facilitate interchange between the DL and Information Retrieval
communities.

ACM DL is the major international forum on digital libraries, with an
expanded program for the presentation of new research results, the
discussion of policy issues, and for the demonstration of new systems
and
techniques. Computer scientists, librarians, information scientists,
archivists and others in academia, government, and industry - from
around
the globe - who are leaders in the digital library area will present and

attend.  The conference attracts a broad range of professionals
including
theoreticians, collection developers, publishers, researchers,
educators,
policy makers, practicioners, developers, and designers of systems,
interfaces, and related applications.

Topics

DL '99 seeks original contributions in the broad field of digital
libraries.  Topics of particular interest include but are not limited
to:
* Algorithms: categorization, clustering, filtering, learning, protocols

* Applications: data mining, education, visualization
* Architecture: agents, bus, distributed, federated
* Art, humanities, museums: collection, conversion, markup
* Economic, legal and social: authentication, authorization,
   intellectual property rights, publishing
* HCI: design, devices, interfaces, logs, usability
* Hypertext/multimedia: authoring, linking, presenting
* Information science: information seeking, services
* Interoperability: multilingual/multicultural, standards, WWW
* Metadata: adaptations, supporting software/systems
* Policy: equity, funding, identifiers, international collaboration
* Sustainability: archiving, organizational issues, preservation
* Theory: formalisms, metrics, models, security

Submission Requirements

Submissions to DL'99 may be research papers, policy papers, system
reports,
or may be proposals for posters, demonstrations, panels, tutorials, or
workshops. All paper, poster and demonstration submissions should be
formatted documents, including appropriate bibliographies. All
submissions
will be reviewed and must be in English. Details specific to each type
of
submission are given below.

Papers

Papers must be submitted electronically as explained in online
Submission
Instructions (http://fox.cs.vt.edu/DL99/submit.htm). They must be
original
contributions (that is, not previously published, and not currently
being
considered for publication elsewhere).

Full papers of at most 5,000 words should be submitted with an abstract
of
not more than 150 words on the first page. Use Times Roman, 11 point,
2-column format just like that required for all three previous
proceedings.
There will be three categories of full papers:
* Research papers: Submissions are invited for reports of significant
   research results on all aspects of digital libraries. Such reports
   should include a substantial evaluative or validation component.
* Policy papers: Submissions are invited for discussions of significant
   policy issues related to the design, implementation, operation,
   economics, use, and other issues regarding digital libraries.
* System papers: Submissions are invited for reports on the design,
   implementation, operation, and evaluation of operational and
   prototype digital library systems. The emphasis in such
   submissions will be on reporting the experience of implementation
   of the systems and of their use.

Posters

Poster proposals should be submitted to the Program Chair. Submisssions
are
invited that fit into any of the paper categories. Posters are
especially
well suited to graphic and interactive presentation, or to report on
work
still in progress after the deadline for regular papers.

Demonstrations

Demonstration proposals should be submitted to the Program Chair.
Submissions are invited for formal demonstrations of digital library
systems and components. Demonstrations may be live or on video, but must

include a written description of the system and its unique
characteristics
and contributions. At the review stage, video submissions can be either
high quality MPEG files or NTSC VHS tapes.

Panels

Panel proposals should be submitted to the Program Chair. Submissions
are
invited for panel presentations dealing with significant, controversial
and
timely issues. Panel sessions will either be 60 or
90 minutes in length, and must be chaired by an experienced moderator
Contact information about the moderator, names and affiliations of
panelists, a prose justification, and a detailed topical outline must be

supplied.

Tutorials

Tutorial proposals should be submitted to the Program Chair. A
biographical
sketch for the presenter(s), including details on relevant prior
experience, as well as a description of the target audience and suitable

learning objectives, must accompany a detailed topical outline. Both 1/2

and full-day tutorials, covering basic, intermediate, and advanced
topics,
will be offered.

Workshops

Workshop proposals should be submitted to the Program Chair. The aim is
to
bring together a small group of people involved in a specific problem
area
of digital libraries, to advance the state-of-the-art and to encourage
collaboration in that area. Submissions must include contact and
biographical information on the organizers including prior experience,
expected audience, planned format, objectives, and a detailed topical
outline.

Reviewing Process

At least 3 members of the Program Committee will be asked to review each

paper submission. Chairs for Posters, Demonstrations,
Panels, Tutorials, and Workshops will coordinate review of those
submissions. There will be one Program Committee meeting in Virginia to
make final selections.

Important Dates and Deadlines (all dates 1999)

February 8    Paper submissions due
February 15   Tutorial, demonstration, panel, and workshop proposals due

April 9       Notification of acceptance of papers
April 23      Poster submissions due
May 8         Final copy due for all contributions
August 11-14  DL Conference

Contact Information

General Chair:
    Neil Rowe
    rowe@...
    Department of Computer Science
    Spanagel 514, Code CS/Rp
    Naval Postgraduate School
    Monterey, CA 93943 USA
    +1-831-656-2462
    +1-831-656-2814 FAX
Program Chair:
    Edward A. Fox
    fox@...
    Department of Computer Science
    660 McBryde Hall
    Virginia Tech
    Blacksburg, VA 24061-0106 USA
    +1-540-231-5113
    +1-540-231-6075 FAX

--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat



------------------------------------------------------------------------
E-group home: http://www.eGroups.com/list/webir
Free Web-based e-mail groups by eGroups.com

#20 From: Einat Amitay <einat@...>
Date: Mon Dec 14, 1998 12:51 am
Subject: CFP: Web-Based Information Visualization
einat@...
Send Email Send Email
 
======================================================================
                       CALL FOR PAPERS: WebVis '99
======================================================================


                       International Workshop on

                  Web-Based Information Visualization

         http://www.informatik.uni-konstanz.de/swe/WebVis99.html




                        in conjunction with the

                    10th International Conference on
           Database and Expert Systems Applications (DEXA'99)

               Florence, Italy, August 30 - September 3, 1999

   Workshop proceedings to be published by IEEE Computer Society Press


Information visualization combines aspects of scientific visualization,
human-computer interaction, data mining, imaging and graphics. It
focuses on information which is often abstract. This means that many
interesting classes of information have no natural and obvious physical
metaphors for representing information and to understand which
analytical tasks they support.
The largest information space is perhaps the World Wide Web, which
contains millions of pages. Information visualization in this domain
enables users to get information quickly, put it in a meaningful shape,
and to make decisions in a short time. Web-based information
visualization describes visualization applications that use the Web as
an information source, a delivery mechanism for visualization, or both.

The aim of this workshop is to bring together researchers and
practitioners who are working in key technology areas of Information
Visualization in order to discuss recent research findings and address
complementary research and development issues. Of particular interest
are papers describing different visualization techniques to make use of
the information available in the net or how Web-techniques can be used
to visualize information

Topics of interest include but are not limited to:

    - Web as new medium for visualization
    - Web-based visualization techniques (e.g., providing interactive
      views of environmental data on the WWW)
    - Typical taxonomies of information visualization (e.g., data, tasks)

    - Web-based development tools for visualization (e.g., VRML, Java)
    - Web-based visualization for data analysis (e.g., structured or
      unstructured data)
    - Visualization examples:
      * hierarchical information (e.g., web structures),
      * networks (Web-based network traffic monitoring tool)
      * content-based document clustering (e.g., clustering documents
        into an information space based on their content)
      * visualization of  Web search results
      * information space metaphors
    - Database management and information retrieval oriented enabling
      technologies supporting information visualization on the Web
    - Enabling technologies supporting the integration of Web-based
      information visualization and Web-based multimedia
    - Information visualization supporting Web technologies (e.g., how
      information visualization can make the Web easier to use)
    - Usage and user-centered Web-based information visualization (e.g.,
      how to make information visualization more useful)

The papers must clearly show how the technical solutions described
contribute to the area of web-based information visualization.

IMPORTANT DATES

        Submission deadline: .................. March 30, 1999
        Notification of acceptance: ............May 1, 1999
        Camera-ready copies: ...................June 1, 1999
        Workshop:...............................September 1 or 2, 1999

SUBMISSION DETAILS

Authors are invited to submit research contributions representing
original, unpublished work. Submitted papers will be evaluated based on
originality, significance, technical soundness, and clarity of
presentation. All papers will be refereed by at least three members of
the program committee. Accepted papers will be published by IEEE
Computer Society Press as proceedings of the DEXA'99 workshops. All
submitted papers should not be longer than 10 pages (5000 words). The
IEEE author guidelines will be sent to each author after the acceptance
of the paper.

Electronic Submission

ONLY EMAIL SUBMISSIONS of full and abstract papers in Postscript or PDF
format will be accepted.
Please send also an abstract (no more than 250 words in ASCII text) of
the paper via email including the title of the paper, the names of the
author(s) and keywords. The title page must include the name and email
address of the contact author.

Please submit your paper via e-mail to: Harald.Reiterer@...


ORGANIZING COMMITTEE

Chair:

        Harald Reiterer
        University of Constance
        Campus P.O. Box D 73, D-78457 Constance, Germany
        Tel: +49-7531-88-3704
        Fax: +49-7531-88-2048
        E-mail: Harald.Reiterer@...

Co-Chairs:

        Wolfgang Pree, University of Constance, Germany
        Horst Hoertner, Ars Electronica Center, Austria

PROGRAM COMMITTEE

        Keith Andrews, Graz University of Technology, Austria
        Rolf Daessler, FH (Polytechnic Univ.) Potsdam, Germany
        Stephen G. Eick, Bell-Labs, U.S.A.
        Miguel Feldens, University of Caxias do Sul, Brazil
        Nahum Gershon, The MITRE Corp., U.S.A.
        Matthias Hemmje, GMD Darmstadt, Germany
        Horst Hoertner, Ars Electronica Center, Austria
        Takayuki Dan Kimura, Washington University St. Louis, U.S.A.
        Thomas M. Mann, University of Constance, Germany
        Wolfgang Pree, University of Constance, Germany
        Harald Reiterer, University of Constance, Germany (Chair)
        Peter Stucki, University of Zurich, Switzerland
        Stefan Schiffer, University of Linz, Austria
        Marvin Zelkowitz, University of Maryland, U.S.A.

--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#19 From: William Chang <wchang@...>
Date: Thu Dec 10, 1998 7:54 pm
Subject: Re: Good news and a request
wchang@...
Send Email Send Email
 
Hi Einat!  --wm

William Chang is the visionary and architect of Infoseek's search engine
and now the vice president of strategy at Infoseek/GO Network.  Previously,
he developed a method for discovering protein evolutionary relationships and
helped map the genome of fission yeast for cancer research.  His other
technical interests include programming language design and string matching.
wchang@...
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#18 From: Ross Wilkinson <ross.wilkinson@...>
Date: Wed Dec 9, 1998 1:45 am
Subject: Re: Good news and a request
ross.wilkinson@...
Send Email Send Email
 
Hi Einat,

For the web page:

Ross Wilkinson is the leader of the Text-based Information Management Group
at CSIRO  (http://www.cmis.csiro.au/tim/) in Melbourne and particularly
interested in customized information delivery via the Web.

...ross

-----Original Message-----
From: Einat Amitay [mailto:einat@...]
Sent: Wed, 9 December 1998 12:18
To: Web IR & IE
Subject: [webir] Good news and a request


Hi all,

In the last four days we have tripled our numbers and we now have
representatives from many commercial research institutes and from
academic centres all around the globe.

The reason for writing this email is that I would like to ask you to
make your name and research interests known to others on the "Web IR &
IE" web site:

http://www.mri.mq.edu.au/~einat/web_ir/
http://www.mri.mq.edu.au/~einat/web_ir/people.html ("People" section)

Some of you are already listed there because you've mailed me your
preferences but most of you are absent from the list.


The other page which is relevant is the "Available Systems" page:

http://www.mri.mq.edu.au/~einat/web_ir/software.html

Even if you are not able to put your system online - it is required that
at least some publications and research description will be available on
the web. Have a look and mail me if you think you can contribute.

And - of course - any other addition is welcome!

Have a nice holiday
+:o)
einat


--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat


------------------------------------------------------------------------
Sponsored by NextCard Internet Visa.
http://ads.egroups.com/click/134/1



Free Web-based e-mail groups -- http://www.eGroups.com
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#17 From: Einat Amitay <einat@...>
Date: Wed Dec 9, 1998 1:18 am
Subject: Good news and a request
einat@...
Send Email Send Email
 
Hi all,

In the last four days we have tripled our numbers and we now have
representatives from many commercial research institutes and from
academic centres all around the globe.

The reason for writing this email is that I would like to ask you to
make your name and research interests known to others on the "Web IR &
IE" web site:

http://www.mri.mq.edu.au/~einat/web_ir/
http://www.mri.mq.edu.au/~einat/web_ir/people.html ("People" section)

Some of you are already listed there because you've mailed me your
preferences but most of you are absent from the list.


The other page which is relevant is the "Available Systems" page:

http://www.mri.mq.edu.au/~einat/web_ir/software.html

Even if you are not able to put your system online - it is required that
at least some publications and research description will be available on
the web. Have a look and mail me if you think you can contribute.

And - of course - any other addition is welcome!

Have a nice holiday
+:o)
einat


--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#16 From: fabio@...
Date: Mon Dec 7, 1998 11:53 am
Subject: WORKSHOP ON LOGICAL AND UNCERTAINTY MODELS FOR INFORMATION SYSTEMS
fabio@...
Send Email Send Email
 
WORKSHOP ON LOGICAL AND UNCERTAINTY MODELS
                  FOR INFORMATION SYSTEMS

      University College London (UCL), London, England

                    5-6 July 1999

The purpose of the workshop is to promote discussion and interaction
among members of the Information Systems community with research
interests in logical and uncertainty models for the treatment of
semi-structured and unstructured information. We are particularly
interested in experiences dealing with unstructured or poorly
structured information, since we believe that a very large part of the
information that will be available in future will be of this nature.


The workshop aims at being an international forum for the presentation
of both theoretical and applicative results. Papers describing
application experiences are particularly encouraged.

WORKSHOP CHAIRS:

       Fabio Crestani - University of Glasgow, Scotland
       Mounia Lalmas - Queen Mary & Westfield College, England

PROGRAM COMMITTEE:

       Peter Bruza - Queensland University of Technology, Australia
       Theo Huibers - DOXiS, The Netherlands
       Carlo Meghini - IEI-CNR, Pisa, Italy
       Adrian Muller - IBM, Germany
       Jian-Yun Nie - University of Montreal, Canada
       Iadh Ounis - IMAG, Grenoble, France
       Gabriella Pasi - ITIM-CNR, Milan, Italy

TOPICS OF INTEREST:

Papers are solicited dealing with, but not limited to, the following areas:

       Information Retrieval
       Information Filtering
       Multimedia Indexing and Retrieval
       Hypermedia
       Digital Libraries

where information is modelled and/or managed using any of, but not
limited to, the following approaches:

   Probabilistic Theory  Non-standard Logics  Default Reasoning
   Fuzzy Methods         Non-monotonic Logics Knowledge Acquisition
   Theory of Evidence    Meta Logics          Knowledge Representation
   Belief Networks       Situation Theory     Machine Learning
   Possibility Theory    Multivalued Logics   Inductive Methods
   Rough Sets            Description Logics   Abductive Methods
   Approximate Reasoning Belief Revision      Relevance Theory


SUBMISSION OF PAPERS:

Authors are invited to submit original papers of at most 10 pages by
e-mail to one of the email addresses below, using "LUMIS99 Submission"
as the subject line. Please submit the paper in postscript.

There is no particular format for the submission, but the cover page
should include title, authors, and the coordinates of the
corresponding author. Authors should also indicate in the first page
which of the thematic areas best describes the content of the paper
(if none is appropriate, please give a set of keywords that best
describe the topic of the paper).

To be considered, submissions must be received no later than February
12. Authors will be notified of the acceptance of their papers by March 12.

All accepted contributions will be published in a public report of the
Queen Mary & Westfield College (to be confirmed). The format
guidelines for the final paper version will be announced later. Final
camera-ready copies of accepted papers will be due by May 1st 1999.

A number of selected papers, whose final version will have to be
received by April 12, will be published in the ECSQARU'99 conference
proceedings.

CORRESPONDENCE:

Direct correspondence, inquiries and submissions relating to this
workshop should be addressed to:

       Fabio Crestani, Computing Science Department,
       University of Glasgow,
       Glasgow G12 8QQ, Scotland.
       Email: fabio@...

       Mounia Lalmas (until January 1999), Informatik VI,
       University of Dortmund,
       Dortmund D-44 227, Germany.
       Email: mounia@...

WORKSHOP HOMEPAGE:

  http://www.dcs.gla.ac.uk/lumis99

IMPORTANT DATES:

  Submission of papers        12 February 1999
  Notification of acceptance  12 March 1999
  Final submissions           12 April 1999 (for selected papers)
  Final submissions           1 May 1999 (for accepted papers)
  Workshop                    5-6 July 1999


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#15 From: Monica Landoni <monica@...>
Date: Fri Nov 27, 1998 1:57 pm
Subject: IRSG99- CFP and Financial Support
monica@...
Send Email Send Email
 
Hello everybody,
sorry if you receive more than one copy of this message.
________________________________________________________

This is a reminder of the deadline (18 December 1998) for submitting papers
to the 21st Colloquium on Information Retrieval, which is going to be held
19-20 April 1999, Glasgow, Scotland

Please visit  the Colloquium's web site  at:
http://irsg.eu.org/colloq/ for updated information on registration and
financial support.


The Annual Colloquium on IR Research provides an opportunity for both new
and established researchers to present papers describing work in progress
or final results. Submissions from students are particularly welcome and
these can be considered for the Best Student Paper Award. Papers are
invited on any topic related to Information Retrieval including:

- Evaluation of IR systems
- Networked IR
- IR and the Web
- Hypermedia/Multimedia indexing and retrieval
- Natural language processing for IR
- Logic and IR
- User interfaces for IR
- IR in library systems
- Voice processing and retrieval
- Database and IR integration
- Data mining and information extraction
- Commercial applications of IR systems
- Image processing and retrieval
- Knowledge-based IR

SUBMISSIONS

Authors are invited to submit a camera-ready copy of their paper, in
English, to be received no later than 18 December 1998. Papers should
contain at most 7500 words and should be double-spaced. The submission
should include two copies of the paper: one anonymous copy for refereeing
and one full copy for publication in the draft proceedings. The first page
must contain the title of the paper and an abstract of not more than 100
words. Please indicate if the paper is to be considered for the Best
Student Paper Award. This Award requires that the first and primary author
be a full-time student at time of submission.

Papers will be refereed and, if accepted, will be published in the draft
proceedings which will be circulated to all delegates for use during the
Colloquium. Authors will then have until 31st May 1999 to revise their
paper for the formal Colloquium Proceedings in the light of referee's
comments and feedback from delegates. Guidelines for the submission of
papers for the formal proceedings will be circulated to the authors on
acceptance for the Colloquium.

FINANCIAL SUPPORT

CEPIS have kindly given us funding to help a large number of researchers to
attend the IRSG '99 Colloquium, please visit the Colloquium web site at
http://irsg.eu.org/colloq/ to find out more!!!!


CONTACTS

If you have any queries or problems concerning submitting a paper, please
contact the programme chair:

Dr Monica Landoni
Department of Information Science,
University of Strathclyde
26 Richmond Street, Glasgow G1 1X, U.K.
Tel: +44 (0) 141 548 4949  Fax: +44 (0) 141 553 1393
E-mail: bcs-irsg99@...


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#14 From: Matthew Chalmers <matthew@...>
Date: Thu Nov 19, 1998 3:55 am
Subject: Re: And another one...
matthew@...
Send Email Send Email
 
>   > What kind of systems would we develop if we *began* with an
>   > assumption of a dynamic collection, and kept some other (?!)
>   > variables constant so as to make the experimental setup tractable?
>
> What other variables?
>
> In IR, we have:
>
>   - documents
>   - queries
>   - users
>   - systems
>
> If you want to examine the effect of one of these varying, you must
> keep the others constant, right?  So if you change the documents, you
> can't change the systems.  But you're doing all this to learn more, so
> you can change the system, to learn whether you improved it.

You don't have to change the system to learn more about it.

You can watch how use changes: learning effects, experience, building up
a community of use and practice, how interaction with other tools
changes e.g. the effects of other desktop components such as mail tools,
web browsers, visualisation tools, workflow systems and so on.

You might examine style and topic specificity effects. For example, how
good are our systems for retrieving technical documents compared to
poetry, for example. (Pretty different, I expect!) Multilingual corpora,
effects of having many diagrams and figures in a document versus plain
text, of having different mixtures of sound, still images, movies and
data files (e.g. the Web). The variation in linguistic style between
social and natural sciences, short newsfeed articles versus long books,
ephemeral publications versus persistent ones, etc.

There are no doubt many other possibilities for useful study that
involve a varying set of documents. I was just wondering what that might
offer us, and if we might gain a different (better?) understanding of
web activity than with a static collection.

--Matthew

--
Matthew Chalmers
Meme Media Lab, Hokkaido University, Sapporo 060-8628, Japan
Tel: +81 11 706 7255 Fax: +81 11 706 7808
http://ca.meme.hokudai.ac.jp
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#13 From: Kai.Grossjohann@...
Date: Tue Nov 17, 1998 9:03 am
Subject: Re: And another one...
Kai.Grossjohann@...
Send Email Send Email
 
Matthew Chalmers <matthew@...> writes:

   > What kind of systems would we develop if we *began* with an
   > assumption of a dynamic collection, and kept some other (?!)
   > variables constant so as to make the experimental setup tractable?

What other variables?

In IR, we have:

   - documents
   - queries
   - users
   - systems

If you want to examine the effect of one of these varying, you must
keep the others constant, right?  So if you change the documents, you
can't change the systems.  But you're doing all this to learn more, so
you can change the system, to learn whether you improved it.

I see a contradiction here...

But maybe I'm approaching this from the wrong angle altogether.

kai
--
Life is hard and then you die.
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#12 From: dayne+@...
Date: Tue Nov 17, 1998 3:02 pm
Subject: Re: Data, please
dayne+@...
Send Email Send Email
 
Hugo,

The 20-newsgroup data set might suit your purposes.  It's a collection
of 20,000 Usenet messages, 1000 from each of 20 different newsgroups.
The game is to predict which newsgroup a message belongs to.  If
interested, you can get the data set at
http://www.cs.cmu.edu/~textlearning.

Dayne

> The never-ending quest for expert-evaluated data...
>
>  Anyone knows where I may find a corpus of typical E-MAIL or
>  NEWS-GROUP messages that have been already classified by some
>  expert into classes (be it folders, FAQ questions, topics...).
>
>  I know there is no such thing as "typical e-mail", what I mean
>  is that we are looking for REAL e-mails, not article abstracts :)
>
>  I know this is not strictly WWW but interdisciplinarity
>  is so fashionable this days...
>
>  bye, thanks
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#11 From: Hugo Zaragoza <Hugo.Zaragoza@...>
Date: Tue Nov 17, 1998 2:24 pm
Subject: Data, please
Hugo.Zaragoza@...
Send Email Send Email
 
The never-ending quest for expert-evaluated data...

  Anyone knows where I may find a corpus of typical E-MAIL or
  NEWS-GROUP messages that have been already classified by some
  expert into classes (be it folders, FAQ questions, topics...).

  I know there is no such thing as "typical e-mail", what I mean
  is that we are looking for REAL e-mails, not article abstracts :)

  I know this is not strictly WWW but interdisciplinarity
  is so fashionable this days...

  bye, thanks




--
Hugo Zaragoza 	 __o              Hugo.Zaragoza@...
Neural Networks Group  _`\<,_             Tel: (33) 1 44.27.74.91
LIP6, U. Paris 6      (_)/ (_)            Fax: (33) 1 44.27.70.00
Paris (France)                http://www-poleia.lip6.fr/~zaragoza
-----------------------------------------------------------------
   LIP6 / UPMC, Boite 169 / 4, pl.Jussieu / 75252 Paris Cedex 05
-----------------------------------------------------------------
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#9 From: Matthew Chalmers <matthew@...>
Date: Mon Nov 16, 1998 10:50 am
Subject: Re: And another one...
matthew@...
Send Email Send Email
 
Hi Ross...

About your 2gig web subcollection. Good stuff.

An issue to perhaps discuss here is: does a static collection take away
too much of an essential or typical characteristic of the web i.e. its
dynamism. Sure, it makes experimentation easier, but are we in danger of
making our systems fit an atypical, over-controlled environment?

This is all part of controlled experimentation, of course: control some
variables so that you can vary and examine the effect of others. But
what have you lost and what have you gained?

What kind of systems would we develop if we *began* with an assumption
of a dynamic collection, and kept some other (?!) variables constant so
as to make the experimental setup tractable?

Regards,

--Matthew

--
Matthew Chalmers
Meme Media Lab, Hokkaido University, Sapporo 060-8628, Japan
Tel: +81 11 706 7255 Fax: +81 11 706 7808
http://ca.meme.hokudai.ac.jp
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#8 From: Keith Instone <instone@...>
Date: Mon Nov 16, 1998 3:12 pm
Subject: Re: A question
instone@...
Send Email Send Email
 
>I have a question that might sound easy but I can't find all the
>possible answers:
>How do you query a search engine for "links pointing to this URL"?
>
>I've found how to do this on Google, Hotbot and AltaVista. I have a
>partial answer for Infoseek and no answer for the rest....

I only know how to do it on InfoSeek, HotBot and AltaVista myself.

See my web site, http://usableweb.com/ for MANY examples.

Like the description for the Yale C/AIM style guide:

http://usableweb.com/items/caimguide.html

At the top of the page are links to those 3 search engines to query for
pages that point to it. I have tried to reverse-engineer the simplest
possible links (that is, throw away a lot of query fields that do not
really matter in this case).

(About every 6 months one of these 3 changes things so I have to
reverse-engineer all over again.)

Syntax:

http://www.infoseek.com/Titles?qt=link:<URL>
http://www.altavista.com/cgi-bin/query?q=link:<URL>
http://www.search.hotbot.com/?SM=url&MT=<URL>

I would welcome info on how to do this for other search engines. Or, if
someone could maintain a page on how to do this on the various search
engines, that would be great!

Keith



Keith Instone   instone@...
Usable Web      http://usableweb.com/
PO Box 7411     BowlingGreen OH 43402
                 +1 419 823-3319 -1036


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#7 From: Ross Wilkinson <ross.wilkinson@...>
Date: Fri Nov 13, 1998 8:10 am
Subject: Re: And another one...
ross.wilkinson@...
Send Email Send Email
 
We should have a 2gigabyte sub-collection of the Web captured statically for
use in TREC shortly - dave hawking of CSIRO is co-ordinating the work.
Experiments on this data - which does not change so is reproducable should
be big enough for many experiments.

...ross

-----------
Ross Wilkinson  Ross.Wilkinson@...
CSIRO, Division of Mathematical and Information Science
723 Swanston St,  Ph: +61 3 9282 2610
Carlton VIC 3053  Fax: +61 3 9282 2600

> -----Original Message-----
> From: Einat Amitay [SMTP:einat@...]
> Sent: Friday, 13 November 1998 12:55
> To: Web IR & IE
> Subject: [webir] And another one...
>
> Just to shake you all up a bit.... (We have more than enough people by
> now).
>
> I have a question which pops every time I read an article about web
> research - How much is enough to prove your point? Is one query and a
> couple of graphs a representative answer which could be generalized to
> answer the scale of the web? I couldn't find any standard and I can't
> think of what might be reasonable (considering many of us don't have the
> disk space that is required to perform something really big).
>
> Any suggestions? Have you seen any reference to this problem before? Do
> you have any examples?
>
> +:o)
> einat
>
> --
> Einat Amitay
> einat@...
> http://www.mri.mq.edu.au/~einat
>
>
> ------------------------------------------------------------------------
> For the absolute lowest price on Computer Hardware visit:
> http://ads.egroups.com/click/56/0/bottomdollar
>
>
> Free Web-based e-mail groups -- http://www.eGroups.com
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#6 From: William Chang <wchang@...>
Date: Fri Nov 13, 1998 4:44 am
Subject: Re: And another one...
wchang@...
Send Email Send Email
 
A methodological question... thank you.  Perhaps, the web (or the larger
internet) needs to be segmented out and studied both within and across
segments so we can start to understand some of the key qualitative and
quantitative differences including variance of basic parameters.  Here
at Infoseek we have done some fairly thorough query analysis and can claim
perhaps the beginnings of a practical understanding of sampling and
convergence.  Unfortunately we are too busy productizing our research to
write papers -- except those written by former interns.

What questions are you trying to answer?

-- William

> Just to shake you all up a bit.... (We have more than enough people by
> now).
>
> I have a question which pops every time I read an article about web
> research - How much is enough to prove your point? Is one query and a
> couple of graphs a representative answer which could be generalized to
> answer the scale of the web? I couldn't find any standard and I can't
> think of what might be reasonable (considering many of us don't have the
> disk space that is required to perform something really big).
>
> Any suggestions? Have you seen any reference to this problem before? Do
> you have any examples?
>
> +:o)
> einat
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#5 From: Matthew Chalmers <matthew@...>
Date: Fri Nov 13, 1998 3:46 am
Subject: Re: And another one...
matthew@...
Send Email Send Email
 
Einat Amitay wrote:
> I have a question which pops every time I read an article about web
> research - How much is enough to prove your point? Is one query and a
> couple of graphs a representative answer which could be generalized to
> answer the scale of the web? I couldn't find any standard and I can't
> think of what might be reasonable (considering many of us don't have the
> disk space that is required to perform something really big).

You are making an argument, which may be expressed by whatever you need
to convince people. One query (or even zero queries) might be enough to
show an inconsistency or to make your point. It depends on your point.
In other situations you might need a large amount of statistical data to
prove your point, involving thousands of queries, responses, people,
whatever.

Your question seems to assume that a kind of objective measure of
convincingness exists, but I think it's a subjective and contextually
dependent issue.

--Matthew

--
Matthew Chalmers
Meme Media Lab, Hokkaido University, Sapporo 060-8628, Japan
Tel: +81 11 706 7255 Fax: +81 11 706 7808
http://ca.meme.hokudai.ac.jp
------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#4 From: Einat Amitay <einat@...>
Date: Fri Nov 13, 1998 1:54 am
Subject: And another one...
einat@...
Send Email Send Email
 
Just to shake you all up a bit.... (We have more than enough people by
now).

I have a question which pops every time I read an article about web
research - How much is enough to prove your point? Is one query and a
couple of graphs a representative answer which could be generalized to
answer the scale of the web? I couldn't find any standard and I can't
think of what might be reasonable (considering many of us don't have the
disk space that is required to perform something really big).

Any suggestions? Have you seen any reference to this problem before? Do
you have any examples?

+:o)
einat

--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#3 From: Einat Amitay <einat@...>
Date: Fri Nov 13, 1998 1:43 am
Subject: A question
einat@...
Send Email Send Email
 
Hi,

I have a question that might sound easy but I can't find all the
possible answers:
How do you query a search engine for "links pointing to this URL"?

I've found how to do this on Google, Hotbot and AltaVista. I have a
partial answer for Infoseek and no answer for the rest....

Help! Anyone?

And if it can be done on your search engine - I'd appreciate this
information very much.

I need this information for the tool I've built which "sits" on search
engines that provide this information. At the moment it works on the
above mentioned three (and possibly on the forth too) - but I'd like it
to be able to sit on as many engines as possible. I have some
interesting results and I'd like to see if the results are engine
dependent (i.e. depend on the way the engine index is built), or that
they are consistent regardless of the search engine used.

Thanks
+:o)
einat

--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat


------------------------------------------------------------------------
Free Web-based e-mail groups -- http://www.eGroups.com

#2 From: Steve Lawrence <lawrence@...>
Date: Wed Oct 28, 1998 5:27 am
Subject: ------ Scientist/RA position in learning/information retrieval -------
lawrence@...
Send Email Send Email
 
------ Scientist/RA position in learning/information retrieval -------

The NEC Research Institute has an immediate opening for a
Scientist/Research Associate in the area of learning/information
retrieval. NEC Research Institute is a basic research laboratory
located in Princeton, NJ, a short drive from Princeton University.
For more information, see http://www.neci.nj.nec.com/

This research focuses on basic issues in learning and information
retrieval and dissemination, with an emphasis on the World Wide Web. A
sample of recent research publications includes:

S. Lawrence, C.L. Giles, Searching the World Wide Web, Science, 280,
p. 98. 1998.

S. Lawrence, C.L. Giles, Context and Page Analysis for Improved Web
Search, IEEE Internet Computing, Volume 2, Number 4, pp. 38-46, 1998.

C.L. Giles, K. Bollacker, S. Lawrence, CiteSeer: An Automatic Citation
Indexing System, The 3rd ACM Conference on Digital Libraries, pp. 89-98,
1998 [shortlisted for best paper award].

These and other papers are available from:
http://www.neci.nj.nec.com/homepages/lawrence/

See also:
http://www.neci.nj.nec.com/homepages/lawrence/citeseer.html
http://www.neci.nj.nec.com/homepages/lawrence/websize.html

Candidates must have experience in research and be able to effectively
communicate research results through publications. Successful
candidates will have experience with information retrieval and machine
learning. Proficiency with Unix/Linux and the software implementation
of algorithms is a must. Tasks will also involve code maintenance,
modification and enhancement as required by the research program.

The Institute provides an outstanding research environment with many
recognized experts and excellent resources plus a competitive salary
and a strong emphasis on open publication of results. The successful
candidate will combine a genuine desire to excel in research with the
above attributes.

Interested applicants should send their resumes with names of
references by email, mail or fax to:

Dr. Steve Lawrence
Computer Science
NEC Research Institute
4 Independence Way
Princeton NJ 08540

Phone: (609) 951 2676
Fax: (609) 951 2482
lawrence@...
http://www.neci.nj.nec.com/homepages/lawrence/

The position will remain open until filled. Applicants must show
documentation of eligibility for employment. NEC is an equal
opportunity employer. EOE

--
Steve Lawrence - http://www.neci.nj.nec.com/homepages/lawrence/
______________________________________________________________________
No tricks, no gimmicks - just a great intro rate for Internet users!
NextCard Internet VISA -- Apply online now!
http://ads.egroups.com/click/63/1/nextcard



Subscribe, unsubscribe, opt for a daily digest, or start a new e-group
at http://www.eGroups.com -- Free Web-based e-mail groups.

#1 From: Einat Amitay <einat@...>
Date: Wed Oct 28, 1998 1:42 am
Subject: Hi all and welcome
einat@...
Send Email Send Email
 
Hi all,

I'm happy to say that there are about 30 of us now and this is a nice
number to start a mailing list. Please let other people know about this
list and make sure you explain what it is all about...~:o)

This mailing list is focusing on topics related to IR and IE ***on the
web***, for example:

* discussing problems with traditional IR and IE when applied to the web

* sharing data collections and "how-to" information
* posting CFP's and other announcements
* introducing new groups and research interests
* defining unique web IR and IE problems
* making new research connections

And whatever you think might be of interest to other researchers in the
field. This list is intended to be multidisciplinary and any
contribution is welcome!

If you feel like introducing yourself to the list and tell us what your
research is about, you are invited to do so.  If you'd like to be on the
"People" list on the Web IR & IE site
(http://www.mri.mq.edu.au/~einat/web_ir/) please let me know.

Thanks for joining the list and I hope that it will add another
dimension to our research.

~:o)
einat


--
Einat Amitay
einat@...
http://www.mri.mq.edu.au/~einat


------------------------------------------------------------------------
Subscribe, unsubscribe, opt for a daily digest, or start a new e-group
at http://www.eGroups.com -- Free Web-based e-mail groups.

Messages 1 - 31 of 2606   Newest  |  < Newer  |  Older >  |  Oldest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help