Hi-
We are trying to identify a field in NewsML that can be used to
identify a set of related stories consistently. This set of stories
can be about, let's say, Iraq, or about an event that occurs on a single
day, like "Plane Crash". We would like to always identify the latest
news article about a particular "thread". A thread may last for a
short time, like a day, or much longer, say months.
There are a few options available to us right now. I would like input
on the various options.
I am including a list of the fields (from the NewsML 1.2 spec) that
we are looking at below. Of these options, NameLabel appears to make
the most sense, as it is in the category of items that are "Informal
Identifiers" and is not expected to contain any markup.
It would be good to know which mechanism the major wires use, as
that would help us narrow down our options. I am including the
description of each element from the NewsML spec for your convenience.
1. /NewsML/NewsItem/NewsComponent/NewsComponent/NewsLines/SlugLine
[The SlugLine element provides a string of text, possibly embellished
by hyperlinks and/or formatting, used to display a NewsItem's slug
line. (Note that the meaning of the term "slug line", and the uses to
which it is put, are a matter for individual providers to define
within their own workflow and business practice.)]
2. /NewsML/NewsItem/Identification/NameLabel
[The NameLabel element contains a string used by human users as a name
to help identify a NewsItem. Its form is determined by the provider.
It might be identical to the textual content of the SlugLine element,
for example, but even if this is so, the system should not process the
NameLabel as a slugline. Nothing can be assumed about the nature of
the string within NameLabel beyond the fact that it can help to
identify the NewsItem to humans.]
In addition, NameLabel is in the category of items that are identified
as "Informal Identifiers". [In addition to the formal identification
mechanisms described above, NewsML provides a series of Label elements
that can be used by human users to identify NewsItems. As far as the
NewsML system is concerned, these are arbitrary strings, and cannot be
relied upon to provide a robust identification mechanism. Their sole
purpose is to provide a convenient way for humans to identify a
particular NewsItem in informal exchanges and communications, or as
part of a user interface.]
3. /NewsML/NewsItem/Identification/NewsIdentifier/NewsItemId
[The NewsItemId is an identifier for the NewsItem. The combination of
NewsItemId and DateId must be unique among NewsItems that emanate from
the same provider. Within these constraints, the NewsItemId can take
any form the provider wishes. It may take the form of a name for the
NewsItem that will be meaningful to humans, but this is not a requirement.]
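To make the question concrete, here is a rough sketch of how we might pick the latest item per thread if NameLabel were used as the thread key. It is only an illustration under several assumptions: the sample feed, its label values, and the function name latest_per_thread are made up, the sample is abbreviated and would not validate against the DTD, and "latest" is approximated by comparing DateId values as strings, which only works if DateId is a fixed-width date such as CCYYMMDD.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily abbreviated feed. Only the elements discussed in
# this message are present; a real NewsML 1.2 document carries much more.
SAMPLE = """<NewsML>
  <NewsItem>
    <Identification>
      <NewsIdentifier>
        <DateId>20030310</DateId>
        <NewsItemId>story-001</NewsItemId>
      </NewsIdentifier>
      <NameLabel>PLANE-CRASH</NameLabel>
    </Identification>
  </NewsItem>
  <NewsItem>
    <Identification>
      <NewsIdentifier>
        <DateId>20030310</DateId>
        <NewsItemId>story-002</NewsItemId>
      </NewsIdentifier>
      <NameLabel>IRAQ</NameLabel>
    </Identification>
  </NewsItem>
  <NewsItem>
    <Identification>
      <NewsIdentifier>
        <DateId>20030311</DateId>
        <NewsItemId>story-003</NewsItemId>
      </NewsIdentifier>
      <NameLabel>PLANE-CRASH</NameLabel>
    </Identification>
  </NewsItem>
</NewsML>"""


def latest_per_thread(xml_text):
    """Map each NameLabel to the (DateId, NewsItemId) of its newest item.

    Assumption: DateId strings sort chronologically (fixed-width dates).
    Items without a NameLabel are skipped.
    """
    root = ET.fromstring(xml_text)
    latest = {}
    for item in root.findall("NewsItem"):
        ident = item.find("Identification")
        if ident is None:
            continue
        label = ident.findtext("NameLabel")
        if label is None or not label.strip():
            continue
        key = label.strip()
        date_id = ident.findtext("NewsIdentifier/DateId", default="")
        item_id = ident.findtext("NewsIdentifier/NewsItemId", default="")
        # Keep the item with the greatest DateId seen so far for this label.
        if key not in latest or date_id > latest[key][0]:
            latest[key] = (date_id, item_id)
    return latest
```

Calling latest_per_thread(SAMPLE) groups the three sample items into two threads. The sketch also shows the weakness of this option: it depends entirely on the provider keeping the NameLabel string identical across every story in a thread, which the spec does not guarantee.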
Thanks,
-Masood