Hi Friedel,
Good to hear from you.
We have had a very good relationship with Nemeth Laszlo the creator of hunspel. He has responded to many of our queries with useful suggestions. However, it happens that the problem of the order of diacritics was not on our high on the priority list of our problems as we were finding ways ways around it. In agreement with you, I think it will be good to have the Unicode normalization feature to Hunspel.
Tunde
> Subject: RE: [A12n-Collab] Re:Tech support for Yoruba orthography
> From: friedel@...
> To: a12n-collaboration@...
> Date: Tue, 30 Dec 2008 07:03:21 +0200
>
> On Sa, 2008-12-27 at 11:05 +0100, Tunde Adegbola wrote:
> > This may remain a problem for some time, as long as legacy ('Tokunbo')
> > computer systems remain in circulation. There are still lots of
> > computer systems of various ages in circulation in Nigeria.
> > Furthermore, I have been encountering another flavor of the same
> > problem in my efforts in developing a Yoruba spell checker. Even when
> > the glyphs look right, different sequences of application of the
> > diacritics are seen by the computer as different spellings.
>
> Hallo Tunde
>
> Did you perhaps request from the hunspell developer to implement support
> for unicode normalisation? That way it wouldn't matter which order or
> way they were typed. I'm sure it would be a useful feature for many
> languages. Unicode normalisation means that a character sequence is
> processed and the output is in a predictable format (in terms of
> precomposed vs combined characters). That way the development of the
> spell checker might be able to ignore this issue and have it handled by
> the underlying spell checking technology.
>
> In South Africa we are not really hitting this problem, since we have
> code points in unicode for all our characters.
>
> Friedel
>
> --
> Recently on my blog:
> http://translate.org.za/blogs/friedel/en/content/re-bringing-all-translation-management-tools-together
>
Invite your mail contacts to join your friends list with Windows Live Spaces. It's easy! Try it!
Good to hear from you.
We have had a very good relationship with Nemeth Laszlo the creator of hunspel. He has responded to many of our queries with useful suggestions. However, it happens that the problem of the order of diacritics was not on our high on the priority list of our problems as we were finding ways ways around it. In agreement with you, I think it will be good to have the Unicode normalization feature to Hunspel.
Tunde
-----------------------------------------------------------------------------------------------
Tunde Adegbola (Ph.D.)
Executive Director
African Languages Technology Initiative
(Alt-I ... Inserting African issues into the agenda of the knowledge age)
11 Oluyole Way, New Bodija Ibadan, Nigeria.
+234 8034019398
------------------------------------------------------------------------------------------------
> Subject: RE: [A12n-Collab] Re:Tech support for Yoruba orthography
> From: friedel@...
> To: a12n-collaboration@...
> Date: Tue, 30 Dec 2008 07:03:21 +0200
>
> On Sa, 2008-12-27 at 11:05 +0100, Tunde Adegbola wrote:
> > This may remain a problem for some time, as long as legacy ('Tokunbo')
> > computer systems remain in circulation. There are still lots of
> > computer systems of various ages in circulation in Nigeria.
> > Furthermore, I have been encountering another flavor of the same
> > problem in my efforts in developing a Yoruba spell checker. Even when
> > the glyphs look right, different sequences of application of the
> > diacritics are seen by the computer as different spellings.
>
> Hallo Tunde
>
> Did you perhaps request from the hunspell developer to implement support
> for unicode normalisation? That way it wouldn't matter which order or
> way they were typed. I'm sure it would be a useful feature for many
> languages. Unicode normalisation means that a character sequence is
> processed and the output is in a predictable format (in terms of
> precomposed vs combined characters). That way the development of the
> spell checker might be able to ignore this issue and have it handled by
> the underlying spell checking technology.
>
> In South Africa we are not really hitting this problem, since we have
> code points in unicode for all our characters.
>
> Friedel
>
> --
> Recently on my blog:
> http://translate.org.za/blogs/friedel/en/content/re-bringing-all-translation-management-tools-together
>
Invite your mail contacts to join your friends list with Windows Live Spaces. It's easy! Try it!