Search the web
Sign In
New User? Sign Up
www-talk
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Real people. Real stories. See how Yahoo! Groups impacts members worldwide.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Transparent Content Negotiation & Search Engine Indexation.   Message List  
Reply | Forward Message #11305 of 11330 |
Re: Transparent Content Negotiation & Search Engine Indexation.



On Fri, 5 Jan 2001, Vincent-Olivier Arsenault wrote:

> Hi all,
>
> This message has been posted to the w3's list and to the technical support
> for different major search engines. Please reply to the list (I will commit
> the messages send only to me).
>
> Simply put, let say that http://www.name.com is a pointer to 8 different
> document (4 locales: en-us, en-ca, fr-ca, es-us. 2 document type: html, wml).
>
> There is no translation, the content for the locales is COMPLETELY unrelated.
>
> There is a link to switch to the 3 other locales on all the pages (both wml
> and html).
>
> The transparent content negotiation is based on the accept* and the
> user-agent HTTP headers.
>
> If there is no match in the available permutations (locales x dtd), or if
> there is nothing specified, the default locale is en-us and the default DTD
> is html.
>
> Here are my questions:
>
> Q1: Does the indexing robots make use of the content negotiation parameters
> (http headers)? If so, in what way?
>
> Q2: Does the search engines return result URIs from documents that were
> obtained with a robot using content negotiation parameters (http headers)
> that match those of the user?
> ie: Would a spanish user get http://www.name.com (not
> http://www.name.com/index.es.html, the link to the spanish section from
> other locales) for keywords on the spanish version?

I am not a robot author but I did work on content negotiation standards.

For Q1, my guess here is that robots do not currently use much of the
content negotiation parameters, if at all. Negotiation can be done at the
server in many ways, and in general there is no reliable way in which a
robot could make sense of all cases. The transparent content negotiation
specification does have a mechanism (the Alternates response header) which
would allow a robot to learn a lot about the structure of the negotiated
content, but servers will not always produce this header. If negotiation
is done with mod_negotiation in newer version of Apache, the robot *can*
get the Alternates header if it wants to, I don't know if any current
robots use this facility. Mod_negotiation does not currently support
user-agent header based negotiation so for many advanced cases of
negotiation I expect there is still a lot of hand-coding in servers,
without the option of getting an alternates header. All in all my answer
to Q2 is: the engine will probably return the link to the spanish section,
not the http://www.name.com link.

>
> thanks,
>
> vincent

Koen.





Tue Jan 9, 2001 10:51 am

koen@...
Send Email Send Email

Forward
Message #11305 of 11330 |
Expand Messages Author Sort by Date

Hi all, This message has been posted to the w3's list and to the technical support for different major search engines. Please reply to the list (I will commit ...
Vincent-Olivier Arsen...
vincent@...
Send Email
Jan 5, 2001
9:28 pm

... I am not a robot author but I did work on content negotiation standards. For Q1, my guess here is that robots do not currently use much of the content...
Koen Holtman
koen@...
Send Email
Jan 9, 2001
10:58 am
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help