2025
COLING
COLING 2025
Challenges in Technical Regulatory Text Variation Detection
Abstract
AbstractWe present a preliminary study on the feasibility of using current natural language processing techniques to detect variations between the construction codes of different jurisdictions. We formulate the task as a sentence alignment problem and evaluate various sentence representation models for their performance in this task. Our results show that task-specific trained embeddings perform marginally better than other models, but the overall accuracy remains a challenge. We also show that domain-specific fine-tuning hurts the task performance. The results highlight the challenges of developing NLP applications for technical regulatory texts.
🌉
Interdisciplinary Bridge
— Computer Science and Interdisciplinary and Natural Language Processing
🧭
Keyword Pioneer
— variation detection
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Natural Language Processing > Applications > Information Retrieval
Natural Language Processing > Applications > Text Classification
Computer Science > Applications > Information Retrieval
Natural Language Processing > Applications > Natural Language Inference
Interdisciplinary > Linguistics > Syntax