Search the web
Sign In
New User? Sign Up
catmt
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Want your group to be featured on the Yahoo! Groups website? Add a group photo to Flickr.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Some BLEU rates   Message List  
Reply | Forward Message #1178 of 1200 |
Re: [catmt] Some BLEU rates

Samuel Murray (Home) wrote:

> Bacchus BG wrote:

>> I look for BLEU rates of the following language pairs:
>> English-Spanish and Spanish- English for "Google translate" and
>> "Bablefish".

> You could do similar analysis by aligning the reference translations
> into TMs, then reverse the TMs, and then performing CAT translation on
> the machine translation target text. This will not be a BLEU analysis
> but it will be a non-programmer's solution to evaluating machine
> translation in a way which is remotely similar to BLEU (since it would
> result in a percentage count).

Okay, I just did it, and here are the results.

Samuel

===================================

= A very simple statistical analysis of two MT systems =

== Method ==

1. Get an exact translation of an English text, to be used as human
translated reference text. I used Marina Soldati's sample translations
at her ProZ.com profile at http://www.proz.com/profile/108251 (used
without permission).
2. Align the reference translation and turn it into a TM.
3. Reverse the TM (i.e. switch source and target languages).
4. Get Google and Babelfish to translate the source text (note that
Babelfish does not indicate paragraph breaks).
5. Use a CAT tool such as Wordfast to translate the MT translations
back, using the reversed TM from the reference translation, and note the
sentence fuzzy match percentages (remember to use a fresh reversed TM
for each test, as the CAT tool may adjust the TM during the first test).

== Statistical result (numbers) ==

=== Google ===

Raw match percentages: 0, 76, 69, 83, 0, 0, 66, 0, 64, 0, 0, 73, 0, 0,
0, 100, 80, 0, 88, 0, 0, 0, 0, 0, 0, 62, 0, 0, 0, 64, 0, 0
Total of match percentages: 825
Median (all): 0
Average (all): 25
Median (non-zero only): 73
Average (non-zero only): 75

=== Babelfish ===

Raw match percentages: 0, 74, 65, 60, 0, 0, 64, 0, 0, 0, 0, 0, 0, 0, 0,
62, 0, 0, 84, 0, 69, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 68
Total of match percentages: 564
Median (all): 0
Average (all): 17
Median (non-zero only): 66
Average (non-zero only): 68

== Comparative result of numbers ==

=== Matches above 60% ===

Google gives 11 matches
Babelfish gives 8 matches

=== One-on-one comparison ===

These are the sentences for which both MT translations showed a 60%
match or higher with the reference text.

Google : Babelfish
76 : 74
69 : 65
83 : 60
66 : 64
80 : 62
88 : 84

==

Samuel



Sat Jul 12, 2008 8:47 am

lacialacia
Offline Offline
Send Email Send Email

Forward
Message #1178 of 1200 |
Expand Messages Author Sort by Date

Hello, I look for BLEU rates of the following language pairs: English-Spanish and Spanish- English for "Google translate" and "Bablefish", I hope someone will...
Bacchus BG
bacchusbg
Offline Send Email
Jul 11, 2008
3:54 pm

... It would depend on the reference texts, wouldn't it? The questions is, where can you find exact translations in EN-ES and ES-EN for free? The Wikipedia...
Samuel Murray (Home)
lacialacia
Offline Send Email
Jul 14, 2008
12:45 pm

... Okay, I just did it, and here are the results. Samuel =================================== = A very simple statistical analysis of two MT systems = ==...
Samuel Murray (Home)
lacialacia
Offline Send Email
Jul 14, 2008
12:45 pm

... translation on ... analysis ... would ... the ... Thank you very much Samuel. I'm quite content with that statistic given in percentages. It was pretty...
bacchusbg
Offline Send Email
Jul 29, 2008
5:20 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 14, 2008
12:45 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm

Will someone please - enlighten me as to whatever BLEU means in this context? - do something about our friend mr. Álvaro Diaz? ...
Danilo Nogueira
danilo_tradutor
Offline Send Email
Jul 29, 2008
5:46 pm

... http://en.wikipedia.org/wiki/Bilingual_evaluation_understudy "The central idea behind the metric is that the closer a machine translation is to a...
Samuel Murray (Home)
lacialacia
Offline Send Email
Aug 5, 2008
10:27 am

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm

Muchas gracias por su mensaje. Del 07/07/08 al 18/07/08 no estaré localizable en la oficina. En casos urgentes pueden ponerse en contacto con Mercedes Miguel...
alvaro.diaz@...
h4ize
Offline Send Email
Jul 29, 2008
5:20 pm
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help