... Wouldn't the ease of such a task stand in proportion to how different the source language's grammar is from English? Seeing how lousy Google Translate...
Can one use utf-8 characters in the comments in a Perl script without invoking use utf8? I can and do use \x{1234} in the code, but can't very well use it in...
Benct Philip Jonsson
bpj@...
Dec 1, 2008 11:58 am
156366
UTF-8 is safe in comments and POD unless you do something silly like encode the comment-ending newline in multibyte form. But why don't you want to use the...
Mark J. Reed
markjreed@...
Dec 1, 2008 12:06 pm
156367
2008/11/26 Gary Shannon <fiziwig@...> ... I am pretty sure *lots* of people have given it lots of thought already, but I seriously doubt anything really...
David Vercauteren
njenfalgar@...
Dec 1, 2008 4:15 pm
156368
There's a long checklist of things that a language should be able to do deal with, and for some reason I'm thinking that there were example sentences as part...
Larry Sulky
larrysulky@...
Dec 1, 2008 5:29 pm
156369
... Machine translation seems to be one of those areas where, when first approached, it looks like it would not be terribly difficult. Then, a little deeper...
Gary Shannon
fiziwig@...
Dec 1, 2008 5:46 pm
156370
... The unmarked word order for English would be, of course, "The smallest boy jumped up." What is your take on preserving a marked word order when translating...
Arthaey Angosii
arthaey@...
Dec 1, 2008 6:53 pm
156371
Section four now available. 178 sentences: http://fiziwig.com/gsfa_4.txt --gary...
Gary Shannon
fiziwig@...
Dec 1, 2008 7:59 pm
156372
... It would depend on what the default and alternate available word orders are in the target language, and what emphasis or topicalization or whatever seems...
Jim Henry
jimhenry1973@...
Dec 1, 2008 8:13 pm
156373
... This thing, perhaps? http://lingweb.eva.mpg.de/fieldtools/linguaQ.html Alex...
Alex Fink
000024@...
Dec 1, 2008 9:13 pm
156374
That's it! tankau, yw Alex w...
Larry Sulky
larrysulky@...
Dec 1, 2008 10:10 pm
156375
... Naming plants is definitely a challenge. The classification doesn't help much, since there are a few families with large number of species (e.g. grains,...
Herman Miller
hmiller@...
Dec 2, 2008 4:27 am
156376
... I thought it could cause problems....
Benct Philip Jonsson
bpj@...
Dec 2, 2008 12:14 pm
156377
AFAIK, best practice for Perl 5 is to use UTF-8-encoded source with the "use utf8" pragma. At some point in the future, if not already, perl5 will assume...
Mark J. Reed
markjreed@...
Dec 2, 2008 3:37 pm
156378
E.g.: http://web.archive.org/web/20030422005721/http://www.vaior.org/ Vaior.org is no longer in service & that's the only email I see for him. Anyone know...
Sai Emrys
sai@...
Dec 2, 2008 3:53 pm
156379
... annis@biostat. wisc.edu / http://www.biostat.wisc.edu/~annis/ is my best guess. He doesn't list Vaior, but this Wm. S. Annis is also a former language...
Lars Mathiesen
thorinn@...
Dec 2, 2008 6:59 pm
156380
... I think your information is influenced by old news; the behaviour changed (IIRC) between 5.6 and 5.8. The modern meaning should cause fewer problems than...
Philip Newton
philip.newton@...
Dec 2, 2008 8:51 pm
156381
... Section Five now available. 108 Sentences: http://fiziwig.com/gsfa_5.txt I also came across a book called "Daily Drills for Better English", from 1930,...
Gary Shannon
fiziwig@...
Dec 2, 2008 9:06 pm
156382
... Maybe because in UTF-8 IT'S NOT A BOM! Unless BOM stands for "Bloody OxyMoron" in that case. I know, I know, etymology is not destiny, and lots of...
I'm not sure how directly relevant it is, but regarding case, and its...what word do I want...reenvisioning, shall we say, in a conlang, I strongly recommend...
David J. Peterson
dedalvs@...
Dec 2, 2008 9:42 pm
156385
... Moreover it seems to have nasty effects, e.g. it is AFAIK **not** ignored if something searches for something else at the beginning of the file (uh, did...
Benct Philip Jonsson
bpj@...
Dec 3, 2008 8:02 am
156386
Something I recently read in Mikael Parkvall's _Limits of Language_ (great linguisticsy resource, btw) gave me an idea for a re-envisioning of a certain script...
Alex Fink
000024@...
Dec 3, 2008 8:15 am
156387
... I just found out a good situation to use the \x{} syntax: when the string consists only of (a) Combining Diacritical Mark(s). Properly these should be...
Benct Philip Jonsson
bpj@...
Dec 3, 2008 8:37 am
156388
Is it supposed to be a lot of digits with diacritics? It looks lake an evil cousin of Paleotype if you ask me!...
Benct Philip Jonsson
bpj@...
Dec 3, 2008 8:43 am
156389
On Wed, 03 Dec 2008 03:37:37 -0500, Benct Philip Jonsson <bpj@...> ... You can always '$foo =~ s/\A \s* (.*?) \s* \z/$1/smx' right before printing (or...
Paul Bennett
paul.w.bennett@...
Dec 3, 2008 11:05 am
156390
I'm not sure I understand this new product correctly, but will this "fix" the whole display problem on the web if the font creator buys the product, or does...
David J. Peterson
dedalvs@...
Dec 3, 2008 12:23 pm
156391
... By the looks of it, the purchaser will be the website owner, who can then use the product on their website to convert text to the desired font on the fly...
Rik Roots
rik@...
Dec 3, 2008 1:24 pm
156392
2008/12/3 David J. Peterson <dedalvs@...> ... The font designer doesn't have to do anything. The web page designer has to use the product, and the...