2018
COLING
COLING 2018
Contemporary Amharic Corpus: Automatically Morpho-Syntactically Tagged Amharic Corpus
Abstract
AbstractWe introduced the contemporary Amharic corpus, which is automatically tagged for morpho-syntactic information. Texts are collected from 25,199 documents from different domains and about 24 million orthographic words are tokenized. Since it is partly a web corpus, we made some automatic spelling error correction. We have also modified the existing morphological analyzer, HornMorpho, to use it for the automatic tagging.
🌉
Interdisciplinary Bridge
— Computer Science and Natural Language Processing
🧭
Keyword Pioneer
— morpho-syntactic tagging
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio