Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble

Peerat Limkonchotiwat; Wannaphong Phatthiyaphaibun; Raheem Sarwar; Ekapol Chuangsuwanich; Sarana Nutanong

2020 EMNLP EMNLP 2020

Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble

Abstract

AbstractLike many Natural Language Processing tasks, Thai word segmentation is domain-dependent. Researchers have been relying on transfer learning to adapt an existing model to a new domain. However, this approach is inapplicable to cases where we can interact with only input and output layers of the models, also known as “black boxes”. We propose a filter-and-refine solution based on the stacked-ensemble learning paradigm to address this black-box limitation. We conducted extensive experimental studies comparing our method against state-of-the-art models and transfer learning. Experimental results show that our proposed solution is an effective domain adaptation method and has a similar performance as the transfer learning method.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — filter-and-refine approach

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Peerat Limkonchotiwat , Wannaphong Phatthiyaphaibun , Raheem Sarwar , Ekapol Chuangsuwanich , Sarana Nutanong

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Application Areas > Domain Adaptation Machine Learning > Learning Types > Transfer Learning Machine Learning > Learning Paradigms > Federated Learning Machine Learning > Learning Types > Ensemble Learning Machine Learning > Learning Types > Domain Adaptation Natural Language Processing > Applications > Text Processing

Keywords

transfer learning domain adaptation ensemble learning word segmentation thai language black-box model stacked ensemble filter-and-refine approach black-box limitation thai word segmentation

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020