openreader-format · OpenReader Publication Working Group
"Book Indexing" markup in XML documents ?
Message #365 of 375
Re: [openreader-format] "Book Indexing" markup in XML documents ?

Peter Ring wrote:

> Please also include out-of-band indexterm entries. I've had a lot
> of utility from DocBook indexterm's zone attribute:
>
> http://www.docbook.org/tdg/en/html/indexterm.html#d0e96507
>
> The general idea is that you should be able to annotate content
> with indexterms without changing the content. Quite often, the
> index authoring is completely separate from the content authoring.
>
> Also be sure to include attributes or elements that allow hinting
> about the purpose of the indexterm, e.g. subject index vs. author index.

Again, interesting info.

The feedback I've been getting, from here and from Jon Jermey and
David Ream (who is one of the top experts in the area of XML and
indexing) is that the issue of embedded indexing is fairly complex as
one begins to peel away the layers. It does seem premature to
implement anything in OpenReader, at least in version 1.0, for
embedded indexing.

Interestingly, no one has apparently put together a comprehensive,
generic XML vocabulary for embedded indexes (one that could be included
in any XML document with proper namespacing). So this leaves room for
some sort of standardization in the area. Maybe this could be done in
OASIS with sponsorship by ASI (asindexing.org)? (I'll be talking with
David Ream next week for his feedback on this proposal.)

Anyway, here's my preliminary list of candidate requirements the
embedded indexing vocabulary should meet:

1) handle multiple indexes in the same publication (that is, each
embedded index term is applied to one or more indexes when they are
generated/compiled by the reading system.)

2) define the range or scope of the index item (and it may have to
cross the natural hierarchy of the XML document -- Lee? :^) )

3) handle hierarchical terms (many indexes have 2 and even more
levels)

4) support "see", "see also", etc. (cross-referencing)

5) Peter's suggestion of "out-of-band" indexing. (Seems to imply using
XPointer to define the target and the associated scope/range.)

6) support "sort as" information (to tell the reading system how to
order the terms when the index is generated/compiled -- necessary
since some terms may be in other character sets than the primary
character set.)
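
To make the list above a bit more concrete, here is a rough Python sketch
of how a reading system might compile indexes from such markup. Everything
in it is an assumption made up for illustration: the namespace URI, the
idx:term / idx:primary / idx:secondary / idx:see element names, and the
index, target and sort-as attributes do not come from OpenReader, DocBook
or TEI. It covers requirements 1, 3, 4 and 6, plus a simple id-based form
of out-of-band entries (5). It does not attempt ranges (2) or XPointer.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical namespace; all element and attribute names below are invented.
IDX_NS = "http://example.org/ns/embedded-index"

SAMPLE = f"""\
<book xmlns:idx="{IDX_NS}">
  <chapter id="ch1">
    <p id="p1">Text about Zürich.
      <idx:term index="subject" sort-as="Zurich">
        <idx:primary>Zürich</idx:primary>
        <idx:secondary>history</idx:secondary>
      </idx:term>
    </p>
    <p id="p2">Text by an author.</p>
  </chapter>
  <!-- out-of-band entries: they point at content by id instead of sitting in it -->
  <idx:terms>
    <idx:term index="author" target="p2">
      <idx:primary>Ring, Peter</idx:primary>
    </idx:term>
    <idx:term index="subject">
      <idx:primary>Zurich, city of</idx:primary>
      <idx:see>Zürich</idx:see>
    </idx:term>
  </idx:terms>
</book>
"""


def q(name):
    """Return the ElementTree-qualified name for an element in IDX_NS."""
    return f"{{{IDX_NS}}}{name}"


def compile_indexes(xml_text):
    """Collect index terms and return {index name: sorted list of entries}."""
    root = ET.fromstring(xml_text)
    # ElementTree has no parent pointers, so build a child -> parent map.
    parent = {child: par for par in root.iter() for child in par}

    def nearest_id(el):
        # Walk upwards until an element carrying an id attribute is found.
        while el is not None:
            if el.get("id"):
                return el.get("id")
            el = parent.get(el)
        return None

    indexes = defaultdict(dict)
    for term in root.iter(q("term")):
        # Requirement 3: a heading is a tuple of one or two levels.
        heading = tuple(
            (term.findtext(q(level)) or "").strip()
            for level in ("primary", "secondary")
            if term.find(q(level)) is not None
        )
        if not heading:
            continue
        # Requirement 6: an explicit sort-as value overrides the heading text.
        sort_key = (term.get("sort-as") or heading[0]).casefold()
        # Requirement 5: an explicit target wins over the enclosing element.
        locator = term.get("target") or nearest_id(parent.get(term))
        # Requirement 1: one term may feed several indexes (space-separated).
        for name in term.get("index", "subject").split():
            entry = indexes[name].setdefault(
                heading, {"sort": sort_key, "locators": [], "see": []}
            )
            if locator:
                entry["locators"].append(locator)
            # Requirement 4: "see" cross-references are kept with the entry.
            entry["see"].extend(s.text.strip() for s in term.findall(q("see")))
    return {
        name: sorted(entries.items(), key=lambda kv: (kv[1]["sort"], kv[0]))
        for name, entries in indexes.items()
    }


if __name__ == "__main__":
    for name, entries in compile_indexes(SAMPLE).items():
        print(name)
        for heading, info in entries:
            print("  ", " / ".join(heading), info["locators"], info["see"])
```

The locators here are just element ids. A real vocabulary would need
something richer to express ranges that cross the element hierarchy.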


Anything else?

And anyone here interested in being involved with standards work
for embedded indexing should anything get off the ground? (If so,
contact me in private.)

Jon Noring






Fri Oct 6, 2006 10:05 pm

jon_noring

Everyone, As part of the effort to develop a set of special OpenReader namespace elements which may be applied to any OR supported content document vocabulary,...
Jon Noring
jon_noring
Oct 5, 2006
1:29 am

Hi Jon, ... This looks rather like DocBook indexing markup. Cheers, Michael -- Print XML with Prince! http://www.princexml.com...
Michael Day
mikeday@...
Oct 5, 2006
1:58 am

... Hmmm, maybe, I'll have to redig into DocBook. The section I looked at with regards to DocBook indexing did not appear to embed indexing information within...
Jon Noring
jon_noring
Oct 5, 2006
2:58 am

... You place <indexterm> elements through the document, then at the end place an empty <index/> element which is where the actual index will be generated. ...
Michael Day
mikeday@...
Oct 5, 2006
4:54 am

... Cool! That clarifies it. It also shows I was close, although the indexing terms in DocBook are part of content, while in my example, they were attribute...
Jon Noring
jon_noring
Oct 5, 2006
5:25 am

Some limitations of the brief sample Jon provided and IIRC of DocBook (but lord knows, I am no DocBook expert): * Only provides for 1 index. * Definitionally...
Syd Bauman
syd_bauman
Oct 5, 2006
7:13 pm

... Thanks! From my initial study, it appears TEI is a little more powerful at representing embedded indexing information than is DocBook. In my chats with a...
Jon Noring
jon_noring
Oct 5, 2006
9:25 pm

Please also include out-of-band indexterm entries. I've had a lot of utility from DocBook indexterm's zone attribute: ...
Peter Ring
peter17ring
Oct 6, 2006
8:32 am

... Again, interesting info. The feedback I've been getting, from here and from Jon Jermey and David Ream (who is one of the top experts in the area of XML and...
Jon Noring
jon_noring
Oct 6, 2006
10:13 pm

Indexing is a hugely important type of metadata, in my opinion. I suppose that the context in which we are carrying out this discussion is mostly in regard...
rickbarry@...
rickbarry1
Oct 8, 2006
5:09 am

... Rick's message reply is an outstanding summary of several issues related to indexing. What I found of most interest is looking at author-supplied indexing,...
Jon Noring
jon_noring
Oct 16, 2006
5:36 pm