edict-jmdict · The JMdict/EDICT Group

Group Information

  • Members: 140
  • Category: Other
  • Founded: Jul 18, 2006
  • Language: English

Messages

Topics Messages Latest Post

In a couple of previous postings the question of MySQL vs other databases for the jmdict project has been raised but not commented upon. While experimenting...
21 Oct 11, 2006
2:11 am

Jim Breen
breen_jim

Hi everyone, I'm wondering if anyone knew of any existing tools out there that might be able to tell me, given the string of a word or a bunch of words, which...
5 Oct 10, 2006
2:25 pm

Hans-Joerg Bibiko
hansjoergbibiko

This message is about Human beings, Democracy, UNHCR, Refugees, The Iraqis, Islam, Kurds, Human rights, Respect, Money, Donations, Angelina Jolie, Pavarotti,...
2 Oct 5, 2006
11:02 am

Jim Breen
breen_jim

I guess it's to be expected that if one's name and email address can be preloaded into the amendment form, people want it in the new entry form too. So I blew...
4 Oct 5, 2006
6:40 am

Jim Breen
breen_jim

I've been trying to make some sense of the "variant" field in kanjidic2.xml and I have some questions about the current structure. == Same kun-yomi...
3 Oct 4, 2006
5:39 am

Ben Bullock
benkasminbul...

G'day All, Rene & I have been discussing the list of words (available on a WWW page or two) that are considered non-PC in various contexts. For...
40 Oct 3, 2006
7:35 am

René Malenfant
reneneedsser...

The papers from last year's Web_as_Corpus workshop are now available online (only 15 months after the event.) See: http://wackybook.sslmit.unibo.it/ Some good...
1 Oct 2, 2006
1:13 am

Jim Breen
breen_jim

All, The purpose of this is not to start a flamewar of which is better over that, etc., but to gauge what tool would be best for the job. What follows here is...
34 Sep 28, 2006
11:43 pm

Jim Breen
breen_jim

Mornin' I just noticed that the German meanings seem to have fallen out of Kanjidic2. % grep 'm_lang="de"' kanjidic2.xml | wc -l 0 Did I miss an e-mail about...
3 Sep 28, 2006
11:22 pm

Jim Breen
breen_jim

I propose adding information on touon, kan'on, goon and kan'youon to kanjidic. The current format looks like this: <reading...
11 Sep 28, 2006
11:15 pm

Jim Breen
breen_jim

Dear all, I don't know whether this index would be useful for KanjiDIC. Lawrence Howell and Hikaru Morimoto set up a database with etymological data of 1940 ...
3 Sep 28, 2006
2:23 pm

indrek pehk
indoraq

Hi all, I am new here, and am really interested in the JMDICT project! I am currently interested in converting the XML file to a database, as many of you as I...
2 Sep 28, 2006
3:19 am

Jim Breen
breen_jim

Here is a possible schema for the jmdict project which I offer for discussion. The attached zip file contains: README.txt -- This file. schema.png -- Schema...
29 Sep 28, 2006
2:51 am

Jim Breen
breen_jim

Hi all, I've made some changes to the edict management tool demo - I call it Benedict incidentally. This doesn't include anything from Stuart's schema yet...
3 Sep 27, 2006
1:01 am

Jim Breen
breen_jim

Well, with a little bit of playing around in Access I have determined that there are 1262 words* in Edict that - Have two or more different senses given - Are...
1 Sep 26, 2006
5:03 pm

Paul Blay
blay_paul

In the entry for "あっと言う間に", the third <keb> element should be "あっとゆう間に", not "あっとゆう間". There are also several...
4 Sep 20, 2006
10:41 pm

Jim Breen
breen_jim

Jim. You have an entry in your wishlist: ENAMDICT gps coordinates for place names Stuart McGraw Dream on 8-)} If they were available, they could be added...
15 Sep 20, 2006
9:52 am

wmaton

I think this counts as a "wish list" idea but ... In Edict (/JMdict) there are a lot of (exp) type entries and compound words where the headword is actually...
4 Sep 19, 2006
6:38 am

Jim Breen
breen_jim

I was trying to work out what "<variant>" meant in the context of kanjidic2.xml, so I picked a random test point using the classic Nelson: [碓] is a...
6 Sep 19, 2006
5:30 am

Jim Breen
breen_jim

Those of you who watch the daily diff files may have noticed a steady trickle of 外来語 entries being merged. I have been working through a long list...
32 Sep 18, 2006
9:02 am

phil_ronan

I have put the "sens" (<misc>) tag into operation, and changed the markings on the PC entries Rene has flagged. They look OK as far as I can see. Adding such a...
1 Sep 18, 2006
1:17 am

Jim Breen
breen_jim

It looks like this needs its own thread. I already mentioned the problem with "阿吽" (="Om"/"Aun") -- the ISO 639 code for Sanskrit is "sa", not "sanskr" ...
4 Sep 17, 2006
5:59 pm

phil_ronan

One thing that a lot of people don't know is that the school grades for kanji also include kanji readings. In the field r_status for ja_kun and ja_on readings...
7 Sep 17, 2006
12:40 pm

Ben Bullock
benkasminbul...

As many on the list will know, I have been trying to move to a stage where the dictionary files can be edited online without everything going past me. I have...
76 Sep 17, 2006
11:48 am

Jim Breen
breen_jim

In the process of parsing jmdict for loading into a database, I gathered some information on the use of "keywords" (xml entities and fixed strings used in...
5 Sep 17, 2006
10:09 am

Jim Breen
breen_jim

Since all of the "misclass" q_codes in kanjidic2 are actually skip codes, I suggest making <q_code qc_type="misclass">PP2-3-14</q_code> into something like ...
2 Sep 16, 2006
2:19 pm

Jim Breen
breen_jim

At the moment there is stuff like this in kanjidic2.xml: <literal>込</literal> (that is "komu" from moushikomu in case the encodings go wrong). ...
2 Sep 16, 2006
12:39 am

Jim Breen
breen_jim

... It certainly is a pointless argument, nobody involved has the slightest say in what words end up being spoken by the citizens of Japan. It also has...
1 Sep 15, 2006
9:28 pm

Paul Blay
blay_paul

Two changes to the kanjidic2.xml file today: - the hangul readings have been revised. Francis Bond and Kyonghee Paik checked them, and found some mapping...
3 Sep 15, 2006
9:04 am

Jim Breen
breen_jim

Some of the hangul in kanjidic2.xml are repeated twice for the same character. There seem to be two romanizations in "korean_r" but the same hangul. I thought...
9 Sep 12, 2006
8:00 am

Ben Bullock
benkasminbul...

Copyright 2010 Yahoo! Inc. All rights reserved.