Hello Mark, The fields making up a tag are defined in the grammar so I am following your suggestion and having some of the `lexical analysis' performed by the...
11953
Mark Lentczner
mark_in_mtn_...
May 1, 2004 4:09 pm
... Never prematurely optimize, I always say. If your examples are at all indicative, I'd be surprised if there was any significant timing differences between...
11954
Steve Taplin
steve_taplin
May 1, 2004 5:07 pm
Thanks Mark/John. charVocab was the problem. The Antlr doc adequately explains why this is necessary. I thought I'd tried this. I'll hang my head in shame......
11955
John D. Mitchell
johnm-antlr@...
May 1, 2004 5:43 pm
... [...] ... Given your examples, isn't that more like: COMMAND ( PARAMETER_NAME "(" parameter_data ")" )* "\r92;n" ... Are the types of the "parameter data"...
11956
atripp54321
May 1, 2004 6:37 pm
[Forgive the lateness of this response, I only catch up on the ANTLR group every week or two] I'm working on a C to Java translator...check out...
11957
Terence Parr
parrt2000
May 1, 2004 6:38 pm
Hi. Check out charVocabulary. Terence ... -- Professor Comp. Sci., University of San Francisco Creator, ANTLR Parser Generator, http://www.antlr.org ...
11958
Terence Parr
parrt2000
May 1, 2004 6:39 pm
Anybody object? I'm seeing this issue come up too many times. So, if you don't specify, then charVocabulary is set for you to ascii. Ter -- Professor Comp....
11959
lgcraymer
May 1, 2004 8:11 pm
Ter-- How about taking the next step? That is, support charVocabulary = "ASCII"; and charVocabulary = "unicode"; and support warnings where unicode is not...
11960
Terence Parr
parrt2000
May 1, 2004 8:22 pm
... That begs the question, "why not a table with a mapping"? People could then say Korean etc... Somebody proposed doing this properly for letters and...
11961
Brian L. Smith
antlrbrian
May 1, 2004 8:29 pm
... I would prefer 0..65535, since that is what I use for all my grammars. Having a smaller vocab (e.g. ASCII) is an optimization done to make the generated...
11962
Oliver Zeigermann
djcordhose
May 1, 2004 8:30 pm
Why not using ISO-8859-1 aka ISO latin-1? It is an extension of ASCII having all western european characters from 128..255. Oliver...
11963
Oliver Zeigermann
djcordhose
May 1, 2004 8:31 pm
... How would that look like? UTF-8? UTF-16? Something else? Oliver...
11964
lgcraymer
May 1, 2004 9:00 pm
Oliver-- Ok, so maybe I should have said charVocabulary = "UTF-8"; and UTF-16. The point is more that named character sets have an advantage in that error...
11965
william_surowiec
william_suro...
May 1, 2004 9:01 pm
I am having conceptual problems in approaching a task on which I believe StringTemplate (ST hereafter) would be excellent. I will be creating an "xml-ish" view...
11966
Terence Parr
parrt2000
May 1, 2004 9:16 pm
... Hi Bill, Does the hierarchical menu example in the paper help at all? Essentially, you would ask for person.courses which returns a list of courses via...
11967
Mike Lischke
lischke1
May 1, 2004 9:17 pm
... Me voting strongly for this option. Mike -- www.soft-gems.net...
11968
Brian L. Smith
antlrbrian
May 1, 2004 9:21 pm
... UTF-8 decoding is done before the lexer even sees the characters, at least in Java, so UTF-8 wouldn't make sense. Since ANTLR 2.x uses the "char" type for...
11969
Mike Lischke
lischke1
May 1, 2004 9:25 pm
... Not as default IMO. But taking that path for an explicit charVocab would be a nice addition. Let the general case be just Unicode and require the grammar...
11970
Mike Lischke
lischke1
May 1, 2004 9:28 pm
... This describes the transformation format not the available character range. UTF-8 can well be UTF-32 after decoding. I would also leave alone surrogates...
11971
Mike Lischke
lischke1
May 1, 2004 9:40 pm
... Don't use the transformation format identifiers as vocabulary names. This similar as if you would say "base64" instead of ASCII. These formats do not...
11972
Oliver Zeigermann
djcordhose
May 1, 2004 9:48 pm
I always seem to be the one causing confusion. Let me try to make my point clear: I understand ANTLR operates on characters, not ony bytes, is that right? So,...
11973
Oliver Zeigermann
djcordhose
May 1, 2004 9:51 pm
... Right. Sorry, I mixed this up... ... Now you seem to mix something up. Both UTF-16 and UTF-32 are character encodings as well, just as UTF-8. All of them...
11974
Mike Lischke
lischke1
May 1, 2004 10:17 pm
... Sure, but how is the internal representation? Actually, it is UTF-16. So although it is a transformation format it is also the actual character...
11975
Terence Parr
parrt2000
May 1, 2004 10:46 pm
Guys, Chapman Flack (was at Purdue might still be) gave me copious notes about the right thing to do. here is an interesting section from his notes: 2....
11976
Boulanger Jean-Louis
jlboulanger2002
May 1, 2004 10:52 pm
Hello, In a structured JAVA application, I use ANTLR. And I want say that the JAVA generated code, from grammar, is part of a package . In my grammar, I...
11977
Terence Parr
parrt2000
May 1, 2004 11:53 pm
... Salut Jean-Louis, I suggest using "header {...}" instead. :) Terence -- Professor Comp. Sci., University of San Francisco Creator, ANTLR Parser Generator,...
11978
matthew ford
Matthew.Ford@...
May 2, 2004 1:37 am
The detailed docs at www.webstringtemplates.com might be useful as they go into more detail about accessing beans within beans and also display the data as an...
11979
Brad Schick
schickb
May 2, 2004 6:27 am
I am working on a procedural language grammar that allows one statement per line. I specify line endings with the rule below. // The RCURLY lookahead will...
11980
Mike Lischke
lischke1
May 2, 2004 8:21 am
Hi Terence, ... This is what I had in mind too. Since I'm a beginnner with Java I didn't know how far the Unicode integration already is. ... That would be the...
11981
Ric Klaren
izniegoed
May 2, 2004 3:17 pm
... ASCII or the range 3-254 (extended ascii was it?) I'm not sure how many of the reserved values 0-3 are used still in backends and/or the analyzer.... ...