Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Natural Language Processing
›
Applications
›
Text Processing
421 directly classified papers
Papers per year
2013: 1
2016: 1
2017: 15
2018: 34
2019: 58
2020: 51
2021: 49
2022: 58
2023: 53
2024: 54
2025: 47
Papers
Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing
EMNLP 2024
Overview of the 9th Social Media Mining for Health Applications (#SMM4H) Shared Tasks at ACL 2024 – Large Language Models and Generalizability for Social Media NLP
ACL 2024
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting
EMNLP 2024
ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs
EMNLP 2024
Anonymization Through Substitution: Words vs Sentences
ACL 2024
Enhanced Optical Character Recognition by Optical Sensor Combined with BERT and Cosine Similarity Scoring (Student Abstract)
AAAI 2024
TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse
EMNLP 2024
Distributional Properties of Subword Regularization
EMNLP 2024
Towards Context-aware Normalization of Variant Characters in Classical Chinese Using Parallel Editions and BERT
ACL 2024
Chinese Spelling Correction as Rephrasing Language Model
AAAI 2024
Automatic sentence segmentation of clinical record narratives in real-world data
EMNLP 2024
Arabic Diacritics in the Wild: Exploiting Opportunities for Improved Diacritization
ACL 2024
Neural Search Space in Gboard Decoder
EMNLP 2024
InsertGNN: A Hierarchical Graph Neural Network for the TOEFL Sentence Insertion Problem
EMNLP 2024
Edit-Constrained Decoding for Sentence Simplification
EMNLP 2024
SumTablets: A Transliteration Dataset of Sumerian Tablets
ACL 2024
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks
EMNLP 2024
LLMs to Replace Crowdsourcing For Parallel Data Creation? The Case of Text Detoxification
EMNLP 2024
Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains
EMNLP 2024
Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks
EMNLP 2024
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation
EMNLP 2024
Two Issues with Chinese Spelling Correction and A Refinement Solution
ACL 2024
Improving Grammatical Error Correction via Contextual Data Augmentation
ACL 2024
Enhancing Swedish Parliamentary Data: Annotation, Accessibility, and Application in Digital Humanities
EMNLP 2024
SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
NIPS 2024
<
1
2
3
4
5
…
17
>