Search the web
Sign In
New User? Sign Up
xml-dev
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Message search is now enhanced, find messages faster. Take it for a spin.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Parser Behaviour (serious)   Message List  
Reply | Forward Message #19374 of 26751 |
Re: Parser Behaviour (serious)

For XHTML only, if you change the suffix to 'htm' you will not get this
behavior. In the XHTML WG we were a little worried that when XHTML became
popular as XML the W3C site would not be able to cope with all the calls to
the DTD!

But i agree with you, this is a problem, when one is of line one either has
to comment out the DOCTYPE declaration or not.

The MSXML parser incidently has the option to switch of expansion of
external entities, and i believe this option is available in the latest
releases of IE5 ( but I havnt tried this.)

I believe that the correct solution is to have all parsers give this option
to the client.

The other solution would require a change to XML. IOW allow the resolution
URL to be optional, and let just a public Declaration be used. Conforming
parsers could then be instructed that they should only validate if the URI
is given.

Frank

Frank
----- Original Message -----
From: Peter Murray-Rust <peter@...>
To: <xml-dev@...>
Sent: Sunday, April 02, 2000 5:22 AM
Subject: Parser Behaviour (serious)


> <Note>This message is serious<smiley/></Note>
>
> I have been preparing a large amount of XHTML (for our VirtualXML
activity)
> and using Dave Raggett's excellent tidy program (with option -asxml) to
> produce XHTML files of the sort:
>
> <?xml version="1.0"?>
> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
> "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
> <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <title>Test page</title>
> </head>
> <body>
> <p>A test</p>
> </body>
> </html>
>
> These files work fine as HTML, and are conforming XML 1.0, but when I try
> to parse them on my laptop using either AElfred or Xerces I get:
>
> java.net.UnknownHostException: www.w3.org
>
> What's wrong? Ah! The parser is trying to resolve the URL for the DTD and
> since I'm offline (connections cost money over here) it can't. So the file
> I have created can only be processed as XML if:
> (a) I am connected online
> (b) the W3C maintain *** for all time *** a means of dereferencing either
> the FPI or the URL
>
> I can't believe this is what the community wants. It fooled me, and I've
> been working with XML for some time.
>
> I still believe that undefined parser behaviour is going to be a major
> deterrent to may people who want to take up XML. I have posted on this
> before. I am going to keep on about it. The most common reaction I seem to
> have so far is "Well that's how XML behaves - it's *your* problem to
decide
> how to process XML". This isn't good enough. In the current case I simply
> want to switch off the parser's attempt to resolve the DTD. I would
> appreciate something like:
>
> "Parser failed to resolve external SYSTEM identifier in DOCTYPE:
> http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd
> To disable DTD look-up use -nosysid option"
>
> So, for about the third time (and it took 3 times to get SAX1.0 off the
> ground, what are we going to do about specifying parser behaviour? I have
> shown in public how the failure to process external entities breaks
systems.
>
> Until we resolve this question (and probably several others), XML 1.0 is
> broken as an interoperable "standard"
>
> P.
>
> [No criticism is aimed at Dave Raggett, who has written a splendid tool,
or
> the W3C who actually have a real DTD mounted at the URL mentioned. Nor to
> the authors of the parsers who have done their best to provide a default
> behaviour, and in the absence of any guidance have required their parsers
> to access an external DTD (very reasonably).
>
>
>
>
>
***************************************************************************
> This is xml-dev, the mailing list for XML developers.
> To unsubscribe, mailto:majordomo@...&BODY=unsubscribe%20xml-dev
> List archives are available at http://xml.org/archives/xml-dev/
>
***************************************************************************


***************************************************************************
This is xml-dev, the mailing list for XML developers.
To unsubscribe, mailto:majordomo@...&BODY=unsubscribe%20xml-dev
List archives are available at http://xml.org/archives/xml-dev/
***************************************************************************



Sun Apr 2, 2000 4:55 pm

bckman@...
Send Email Send Email

Forward
Message #19374 of 26751 |
Expand Messages Author Sort by Date

Hi Steve, ... validation ... turn ... Steve said: I don't think they are working in validation mode, but my understanding is that they must fetch the DTD in...
Didier PH Martin
martind@...
Send Email
Apr 2, 2000
5:22 pm

For XHTML only, if you change the suffix to 'htm' you will not get this behavior. In the XHTML WG we were a little worried that when XHTML became popular as...
Frank Boumphrey
bckman@...
Send Email
Apr 2, 2000
5:20 pm

... It's unclear to me whether the standalone="yes" on the XML Declaration is supposed to be a signal to the processor that "it's ok *not* to go fetch the DTD...
Steve Muench
smuench@...
Send Email
Apr 2, 2000
5:54 pm

... Wrong. Non-validating processors can also include the external DTD subset (it's optional for them, not forbidden). All the best, David -- David Megginson...
David Megginson
david@...
Send Email
Apr 2, 2000
6:25 pm

Hi David, ... validation ... turn ... David said: Wrong. Non-validating processors can also include the external DTD subset (it's optional for them, not...
Didier PH Martin
martind@...
Send Email
Apr 2, 2000
7:22 pm

... More likely, because it's (a) looking for default attribute values, or (b) trying to distinguish whitespace in element content. AElfred does both of...
David Megginson
david@...
Send Email
Apr 2, 2000
7:41 pm

Hi David, David said: The safest approach seems to be to distribute XML documents normalized (all external general entities expanded and all default attribute ...
Didier PH Martin
martind@...
Send Email
Apr 2, 2000
10:49 pm

[Note: "serious" in my title was originally meant to make sure it wasn't an April fool a la ZML, but it is clear that this really *is* a serious issue.] ... My...
Peter Murray-Rust
peter@...
Send Email
Apr 12, 2000
1:54 am

... Would this be a good use for a feature manifest, listing those items which *must* be done for an XML document to be 'successfully' parsed? Obvious stuff...
Clark C. Evans
cce@...
Send Email
Apr 12, 2000
2:15 am

... I'm not sure if it will help enough, but some of us on SML-DEV have been looking at these problems from a different direction. We presented this 'Common...
Simon St.Laurent
simonstl@...
Send Email
Apr 12, 2000
2:26 am

Hi all, I still wonder why Xerces developers didn't pitched in and tried to answer this question? If I have not mistaken, what Peter Murray's problem is, he...
gopi
gopi@...
Send Email
Apr 12, 2000
6:59 am

... Peter: I'm afraid you'll find that an appreciation for the value of defining conformance is not universally shared by the drivers of the XML bus. As...
Bob Kline
bkline@...
Send Email
Apr 12, 2000
2:37 am

... Ah, but XML-DEV does not suffer from bus-drivers. It is like a swarm of communal animals which takes many directions according to many driving forces. So I...
Peter Murray-Rust
peter@...
Send Email
Apr 12, 2000
9:37 am

... Very likely so, but it won't be a clearly defined standard (with unambiguous criteria which can be used to identify conforming implementations)....
Bob Kline
bkline@...
Send Email
Apr 12, 2000
10:47 am

... That's exactly what it's supposed to mean. ... Nope, you got it. -T. *************************************************************************** This is...
Tim Bray
tbray@...
Send Email
Apr 2, 2000
6:48 pm

... I think it's simpler than you make it out to be. You have to decide whether, for what you need to do, you need the DTD or not. If you need the DTD, then...
Tim Bray
tbray@...
Send Email
Apr 2, 2000
6:59 pm

... Public identfiers don't help here -- Consider: <!DOCTYPE boy PUBLIC "anything you like" "http://www.xml.com/doctypes/boy.dtd" ... You are still required to...
Liam R. E. Quin
liamquin@...
Send Email
Apr 2, 2000
7:36 pm

... I agree with this, though it is non-trivial to maintain a local copy. *I* don't produce the files - tidy/DaveR does, and links them to a remote URL. I...
Peter Murray-Rust
peter@...
Send Email
Apr 3, 2000
12:37 am

... I think that Frank is talking specifically about MSIE. Peter was ... Changing the extension will not help here. All the best, David -- David Megginson...
David Megginson
david@...
Send Email
Apr 2, 2000
11:05 pm

... Weren't you just a little bit worried that if someone hacked the w3c site and, for example, made an optional attribute required, that it would break the...
Simon Wilson
Simon.Wilson@...
Send Email
Apr 12, 2000
1:56 am

... This is actually deeper. What happens if 3 organizations agree on a given DTD. They all refer to that DTD, having it hosted by one of the companies. And...
Clark C. Evans
cce@...
Send Email
Apr 12, 2000
2:18 am

... Aargghh! My hot button again. There's a perfectly good answer to this -- dereference the PUBLIC identifer in some local catalogue. Unfortunately, XML...
John Aldridge
john.aldridge@...
Send Email
Apr 12, 2000
2:55 am

... It is not clear that rules could ever be written. The current syntax exposes (a) whether a DTD is available and (b) whether it will change the infoset....
Tim Bray
tbray@...
Send Email
Apr 12, 2000
3:27 am

... If this is true it spells the death of interoperability for information components such as MathML, SVG and CML (chemical markup language). What Tim is...
Peter Murray-Rust
peter@...
Send Email
Apr 12, 2000
9:45 am

... This is nice. This might be going too far... but why not have a complete set of features; describing every assumption all the way down to what us SML-DEV...
Clark C. Evans
cce@...
Send Email
Apr 12, 2000
1:42 pm

... Nice, but practically I don't see W3C doing ANYTHING to make it easier to use or extend the life of DTD's, so it would have to be in a separate processing...
Frank Boumphrey
bckman@...
Send Email
Apr 12, 2000
5:52 pm

... More than that, there's this bit from the W3C Recommendation _Associating ... Simon St.Laurent XML Elements of Style / XML: A Primer, 2nd Ed. Building XML...
Simon St.Laurent
simonstl@...
Send Email
Apr 12, 2000
6:23 pm

... future ... Yes, but we are not asking the W3C to recommend anything. SAX is not recommended by any XML-specific body as far as I know, although we have ...
Peter Murray-Rust
peter@...
Send Email
Apr 12, 2000
7:33 pm

... From: "Simon St.Laurent" <simonstl@...> To: "Frank Boumphrey" <bckman@...>; "Clark C. Evans" <cce@...>; "Peter Murray-Rust"...
Michael Champion
Mike.Champion@...
Send Email
Apr 12, 2000
9:00 pm

... This is not true. To quote from Section 4.2.2 of XML 1.0: In addition to a system identifier, an external identifier may include a public identifier. An...
Paul Grosso
pgrosso@...
Send Email
Apr 12, 2000
3:32 am
 First  |  |  Last 
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help