GUCL Talk: Binyam Ephrem Siyoum: Resource Building for Parsing Amharic Text

GU Computational Linguistics (GUCL) guest speaker

Binyam Ephrem Siyoum, Addis Ababa University

Resource Building for Parsing Amharic Text: Morphological-rich language with less resource

In recent years, different language processing applications are demanding the state-of-the-art parsers. Specifically, high-quality parsers are required for applications like question answering, machine translation and information summarization. In order to train or develop efficient parser then it has become a trained to create high-quality treebank, a linguistically annotated corpus which includes morphological annotations and syntactic annotations. Treebanks play a role in promoting research in parsing natural languages but also contributes for linguistics theory and corpus-based language analysis. Furthermore, treebanks are important resources for the respective language to building and testing data-driven tools serving as gold standard. Such a resource has been developed for highly resourced language. However, building a resource for a morphologically-rich and less resourced language is difficult. In this presentation I will talk about the general tasks involved in the development of a treebank for Amharic parser. I will explain the problems at each stages and the solution we proposed.

Bio: Binyam Ephrem obtained an M.Phil. in Linguistics from Norwegian University of Science and Technology (NTNU) and expected to complete his Ph.D. in Language Technology at Addis Ababa University. He has over ten years of professional experience in teaching and research in higher education in Addis Ababa University, Ethiopia. Mr. Seyoum has involved in an international as well as local projects related to language technology on Ethiopian languages. He has presented academic papers including recently at the LREC 2108 and COLING 2018 conferences. He has led projects which are composed of dynamic team members that boosted his enthusiasm for diligence and hard work in communicating with various IT and Language experts. Mr. Binyam is skilled in Natural Language Processing, Text Processing, and Machine learning.

Friday, November 16 at 2:00pm

Poulton Hall, 230
1421 37th St., N.W., Washington

Departments

Georgetown College, Linguistics

Event Contact Name

Conor Sinclair

Subscribe
Google Calendar iCal Outlook

Recent Activity