Conference Publication Details
Mandatory Fields
Valia Kordoni, Antal van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx, Matthias Huck and Andy Way
LREC 2016, Tenth International Conference on Language Resources and Evaluation
Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
2016
May
Published
0
()
Optional Fields
MOOCs, statistical machine translation, crowdsourcing, CrowdFlower, entity recognition, sentiment analysis
Portorož, Slovenia
23-MAY-16
28-MAY-16
The present work is an overview of the TraMOOC (Translation for Massive Open Online Courses) research and innovation project, a machine translation approach for online educational content. More specifically, videolectures, assignments, and MOOC forum text is automatically translated from English into eleven European and BRIC languages. Unlike previous approaches to machine translation, the output quality in TraMOOC relies on a multimodal evaluation schema that involves crowdsourcing, error type markup, an error taxonomy for translation model comparison, and implicit evaluation via text mining, i.e. entity recognition and its performance comparison between the source and the translated text, and sentiment analysis on the students' forum posts. Finally, the evaluation output will result in more and better quality in-domain parallel data that will be fed back to the translation engine for higher quality output. The translation service will be incorporated into the Iversity MOOC platform and into the VideoLectures.net digital library portal.
EU H2020, SFI
http://www.computing.dcu.ie/~away/PUBS/2016/TraMOOC.pdf
Grant Details
Science Foundation Ireland (SFI)
13/RC/2106