Ustav formalni a aplikovane lingvistiky
Vas srdecne zve na
Seminar formalni lingvistiky
vedeny prof. E. Hajicovou
Seminar se kona v pondeli od 13:30
v budove MFF UK, Malostranske nam. 25,
4. patro, mistnost S1 (c. 428)
12. 11. 2007
Petr Kaderka, Martin Havlik a Nino Peterek (UJC AV CR, UFAL MFF UK)
Korpus Dialog
Abstrakt:
Korpus DIALOG je specialni korpus mluvene cestiny: tvori jej videonahravky a
prepisy televiznich diskusnich poradu (politickych debat, interview, talkshow
aj.). Neverejna pracovni podoba korpusu obsahuje vice nez 2 miliony textovych
slov. V prednasce pojedname o historii korpusu, jeho soucasne podobe a take o
vyhledech do budoucna. Predstavime rovnez prvni verejne pristupnou cast korpusu,
nazvanou DIALOG 0.1 (http://ujc.dialogy.cz).
19. 11. 2007
Seminar se nekona
26. 11. 2007
Sarka Zikanova (UFAL MFF UK)
Moznosti anotace diskurzu v Prazskem zavislostnim korpusu na zaklade
anotace v Penn Discourse Treebank
Abstrakt:
Prednaska se venuje zakladnimu seznameni se zpusobem anotace v Penn
Discourse Treebank a vyhodnoceni, ktere z jeho informaci jsou jiz
zachyceny v Prazskem zavislostnim korpusu na tektogramaticke rovine a
ktere dalsi informace by bylo pripadne vhodne do prazske anotace diskurzu
zavest.
3. 12. 2007
Alfonso Medina Urrea (Instituto de Ingenieria, UNAM (Universidad Nacional
Autonoma de Mexico), Ciudad de Mexico)
Towards the measurement of morphological variation in diachronic corpora
Abstract:
Morphological profiles of a wide variety of languages can be gathered
applying unsupervised methods for the morphological segmentation of
graphical words. Comparison of different profiles of one given language from
different diachronic states can be used for obtaining general measurements of
variation at the morphological level.
In this presentation, quantitative data for three centuries of the Spanish
language spoken in Mexico will be presented (XVI, XVIII and XX centuries) with
the intent of corroborating (or not) intuitions put forward by philologists.