2025
AAAI
AAAI 2025
Linear Decoding of Morphology Relations in Language Models (Student Abstract)
Abstract
Abstract The recent success of transformer language models owes much to their conversational fluency, which includes linguistic and morphological proficiency. An affine Taylor approximation has been found to be a good approximation for transformer computations over certain factual and encyclopedic relations. We show that the truly linear approximation W s, where s is a early layer representation of the base form and W is a local model derivative, is necessary and sufficient to approximate morphological derivation, achieving above 80% top-1 accuracy across most morphological tasks in the Bigger Analogy Test Set. We argue that many morphological forms in transformer models are likely linearly encoded.
🌉
Interdisciplinary Bridge
— Deep Learning and Interdisciplinary and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— affine taylor approximation
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Representation Learning
Natural Language Processing > Understanding > Semantic Analysis
Natural Language Processing > Resources & Methods > Large Language Models
Interdisciplinary > Linguistics > Morphology
Natural Language Processing > Resources & Methods > Language Modeling
Deep Learning > Models > Transformers