Conference Publication Details
Mandatory Fields
Zhechev V.;Way A.
Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference
Automatic generation of parallel treebanks
2008
December
Published
1
()
Optional Fields
1105
1112
The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data-oriented applications. In this paper we introduce a novel platform for fast and robust automatic generation of parallel treebanks. The software we have developed based on this platform has been shown to handle large data sets. We also present evaluation results demonstrating the quality of the derived treebanks and discuss some possible modifications and improvements that can lead to even better results. We expect the presented platform to help boost research in the field of data-oriented machine translation and lead to advancements in other fields where parallel treebanks can be employed. © 2008. Licensed under the Creative Commons.
Grant Details