Search the web
Sign In
New User? Sign Up
catmt
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Message search is now enhanced, find messages faster. Take it for a spin.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Some BLEU rates   Message List  
Reply | Forward Message #1184 of 1200 |
Re: Some BLEU rates

--- In catmt@yahoogroups.com, "Samuel Murray (Home)" <leuce@...> wrote:
>
> Samuel Murray (Home) wrote:
>
> > Bacchus BG wrote:
>
> >> I look for BLEU rates of the following language pairs:
> >> English-Spanish and Spanish- English for "Google translate" and
> >> "Bablefish".
>
> > You could do similar analysis by aligning the reference translations
> > into TMs, then reverse the TMs, and then performing CAT
translation on
> > the machine translation target text. This will not be a BLEU
analysis
> > but it will be a non-programmer's solution to evaluating machine
> > translation in a way which is remotely similar to BLEU (since it
would
> > result in a percentage count).
>
> Okay, I just did it, and here are the results.
>
> Samuel
>
> ===================================
>
> = A very simple statistical analysis of two MT systems =
>
> == Method ==
>
> 1. Get an exact translation of an English text, to be used as human
> translated reference text. I used Marina Soldati's sample translations
> at her ProZ.com profile at http://www.proz.com/profile/108251 (used
> without permission).
> 2. Align the reference translation and turn it into a TM.
> 3. Reverse the TM (i.e. switch source and target languages).
> 4. Get Google and Babelfish to translate the source text (note that
> Babelfish does not indicate paragraph breaks).
> 5. Use a CAT tool such as Wordfast to translate the MT translations
> back, using the reversed TM from the reference translation, and note
the
> sentence fuzzy match percentages (remember to use a fresh reversed TM
> for each test, as the CAT tool may adjust the TM during the first test).
>
> == Statistical result (numbers) ==
>
> === Google ===
>
> Raw match percentages: 0, 76, 69, 83, 0, 0, 66, 0, 64, 0, 0, 73, 0, 0,
> 0, 100, 80, 0, 88, 0, 0, 0, 0, 0, 0, 62, 0, 0, 0, 64, 0, 0
> Total of match percentages: 825
> Median (all): 0
> Average (all): 25
> Median (non-zero only): 73
> Average (non-zero only): 75
>
> === Babelfish ===
>
> Raw match percentages: 0, 74, 65, 60, 0, 0, 64, 0, 0, 0, 0, 0, 0, 0, 0,
> 62, 0, 0, 84, 0, 69, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 68
> Total of match percentages: 564
> Median (all): 0
> Average (all): 17
> Median (non-zero only): 66
> Average (non-zero only): 68
>
> == Comparative result of numbers ==
>
> === Matches above 60% ===
>
> Google gives 11 matches
> Babelfish gives 8 matches
>
> === One-on-one comparison ===
>
> These are the sentences for which both MT translations showed a 60%
> match or higher with the reference text.
>
> Google : Babelfish
> 76 : 74
> 69 : 65
> 83 : 60
> 66 : 64
> 80 : 62
> 88 : 84
>
> ==
>
> Samuel
>



Thank you very much Samuel. I'm quite content with that statistic
given in percentages.
It was pretty good response.




Mon Jul 14, 2008 1:06 pm

bacchusbg
Offline Offline
Send Email Send Email

Forward
Message #1184 of 1200 |
Expand Messages Author Sort by Date

Hello, I look for BLEU rates of the following language pairs: English-Spanish and Spanish- English for "Google translate" and "Bablefish", I hope someone will...
Bacchus BG
bacchusbg
Offline Send Email
Jul 11, 2008
3:54 pm

... It would depend on the reference texts, wouldn't it? The questions is, where can you find exact translations in EN-ES and ES-EN for free? The Wikipedia...
Samuel Murray (Home)
lacialacia
Offline Send Email
Jul 14, 2008
12:45 pm

... Okay, I just did it, and here are the results. Samuel =================================== = A very simple statistical analysis of two MT systems = ==...
Samuel Murray (Home)
lacialacia
Offline Send Email
Jul 14, 2008
12:45 pm

... translation on ... analysis ... would ... the ... Thank you very much Samuel. I'm quite content with that statistic given in percentages. It was pretty...
bacchusbg
Offline Send Email
Jul 29, 2008
5:20 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 14, 2008
12:45 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm

Will someone please - enlighten me as to whatever BLEU means in this context? - do something about our friend mr. Álvaro Diaz? ...
Danilo Nogueira
danilo_tradutor
Offline Send Email
Jul 29, 2008
5:46 pm

... http://en.wikipedia.org/wiki/Bilingual_evaluation_understudy "The central idea behind the metric is that the closer a machine translation is to a...
Samuel Murray (Home)
lacialacia
Offline Send Email
Aug 5, 2008
10:27 am

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help