Ustav Ceskeho narodniho korpusu FF UK
Ustav teoreticke a komputacni lingvistiky FF UK
Vas srdecne zvou na prednasku
RPC: An annotated parallel aligned corpus
of Slavic languages, and what it's good for
kterou proslovi
Ruprecht von Waldenfels
Institut fuer Slavistik, Universitaet Regensburg
v patek 16. listopadu v 10:00 hodin
na FF UK, nam. J. Palacha, c. 104
Abstract:
In this talk I describe technical and practical issues in the building and use
of the Regensburg Parallel corpus, a small parallel corpus of Slavic languages
that at this point includes texts in Russian, Belorussian, Ukrainian, Polish,
Czech, Slovak, Serbian, Croatian and Bulgarian. In the first part of the talk, I
give an overview of design considerations, formats and other decisions involved
in the creation of the corpus. In the second part, I will give an example of its
use as a heuristic device instrument in linguistic comparative research focusing
on the comparison of the cognate items da(va)t / da(wa)c / da(va)t' plus
infinitive in Russian, Polish and Czech.