EACL 2017
Joining Hands: Exploiting Monolingual Treebanks for Parsing of Code-mixed Data
Abstract
In this paper, we propose efficient and less resource-intensive strategies for parsing code-mixed data. These strategies are not constrained by in-domain annotations; rather, they leverage pre-existing monolingual annotated resources for training. We show that these methods produce significantly better results than an informed baseline. Due to the lack of an evaluation set for code-mixed structures, we also present a data set of 450 Hindi and English code-mixed tweets of Hindi multilingual speakers for evaluation.
Topics
Natural Language Processing > Understanding > Parsing
Natural Language Processing > Understanding > Syntax
Natural Language Processing > Resources & Methods > Multilingual NLP
Machine Learning > Learning Paradigms > Transfer Learning
Machine Learning > Learning Types > Transfer Learning
Natural Language Processing > Applications > Text Generation