ParTree - Parallel Treebanks: A multilingual corpus of movie subtitles.

Ref. 2253

Resource

Resource type

Corpus

Resource description

A multilingual parallel corpus of movie subtitles. It contains raw data (SRT subtitle files), sentence-aligned text, and parsed sentences (CoNLL-U format with UD-style annotations).
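
As an illustration of the parsed layer, the following is a minimal sketch of reading CoNLL-U sentences in Python. It relies only on the standard CoNLL-U layout (ten tab-separated columns, `#` comment lines, blank lines between sentences). The function name `read_conllu` and the two-token sample are hypothetical and are not taken from the corpus; the actual file names and directory layout of the resource are not described here.

```python
from typing import Dict, Iterator, List

# The ten columns defined by the CoNLL-U format, in order.
CONLLU_FIELDS = [
    "id", "form", "lemma", "upos", "xpos",
    "feats", "head", "deprel", "deps", "misc",
]


def read_conllu(text: str) -> Iterator[List[Dict[str, str]]]:
    """Yield one sentence at a time as a list of token dictionaries."""
    sentence: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Sentence-level comments (e.g. "# text = ...") are skipped here.
            continue
        columns = line.split("\t")
        if len(columns) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(columns)}: {line!r}")
        sentence.append(dict(zip(CONLLU_FIELDS, columns)))
    if sentence:
        # Handle input that does not end with a blank line.
        yield sentence


# Invented sample for illustration only; not data from the corpus.
SAMPLE = (
    "# text = Hello world\n"
    "1\tHello\thello\tINTJ\t_\t_\t0\troot\t_\t_\n"
    "2\tworld\tworld\tNOUN\t_\t_\t1\tvocative\t_\t_\n"
    "\n"
)

if __name__ == "__main__":
    for tokens in read_conllu(SAMPLE):
        print([(t["form"], t["upos"], t["head"], t["deprel"]) for t in tokens])
```

All column values are kept as strings, since the `id` column may hold ranges for multiword tokens and decimals for empty nodes, which do not convert cleanly to integers.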

Keywords

parallel corpus, subtitles, multilingual, Universal Dependencies annotation scheme, treebank, dependency parsing

Validation information

-

Participants

-