Conference Publication Details
Mandatory Fields
Haithem Afli, Walid Aransa, Pintu Lohar and Andy Way
CICLING 2016: 17th International Conference on Intelligent Text Processing and Computational Linguistics
From Arabic User-Generated Content to Machine Translation: Integrating Automatic Error Correction
2016
April
Published
1
Optional Fields
Automatic error correction, machine translation, pre-processing, Arabic user-generated content
Konya, Turkey
03-APR-16
09-MAY-16
With the spread of social media and online forums, individual users have been able to participate actively in generating online content in different languages and dialects. Arabic is one of the fastest-growing languages on the Internet, but dialects (such as Egyptian and Saudi Arabian) account for a large share of Arabic online content. The many differences between Dialectal Arabic and Modern Standard Arabic pose challenges for machine translation of informal Arabic. In this paper, we investigate the use of an automatic error correction method to improve the quality of Arabic user-generated texts and their automatic translation. Our experiments show that the new system, with its automatic correction module, outperforms the baseline system with a relative improvement of nearly 22.59%.
https://www.researchgate.net/profile/Haithem_Afli/publication/301552184_From_Arabic_User-Generated_Content_to_Machine_Translation_Integrating_Automatic_Error_Correction/links/5735b71e08ae298602e08b1a/From-Arabic-User-Generated-Content-to-Machine-Translati
Grant Details
Science Foundation Ireland (SFI)
13/RC/2106