Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Data Science & Analytics
›
Methods
›
Data Mining
643 directly classified papers
Papers per year
2001: 1
2004: 2
2006: 8
2007: 1
2008: 3
2009: 8
2010: 12
2011: 13
2012: 10
2013: 19
2014: 10
2015: 9
2016: 15
2017: 44
2018: 37
2019: 52
2020: 66
2021: 62
2022: 57
2023: 67
2024: 92
2025: 55
Papers
Mining the Past: A Comparative Study of Classical and Neural Topic Models on Historical Newspaper Archives
NAACL 2025
Map&Make: Schema Guided Text to Table Generation
ACL 2025
Health Sentinel: An AI Pipeline For Real-time Disease Outbreak Detection
ACL 2025
Assessing Critical Thinking Components in Romanian Secondary School Textbooks: A Data Mining Approach to the ROTEX Corpus
ACL 2025
Overview of the Fifth Workshop on Scholarly Document Processing
ACL 2025
Proactive Data-driven Scheduling of Business Processes
IJCAI 2025
Cognitive Geographies of Catastrophe Narratives: Georeferenced Interview Transcriptions as Language Resource for Models of Forced Displacement
COLING 2025
ml4xcube: Machine Learning Toolkits for Earth System Data Cubes
AAAI 2025
A cultural shift in Western perceptions of Palestine
COLING 2025
UPSC2M: Benchmarking Adaptive Learning from Two Million MCQ Attempts
ACL 2025
STARQA: A Question Answering Dataset for Complex Analytical Reasoning over Structured Databases
EMNLP 2025
Identifying and analyzing ‘noisy’ spelling errors in a second language corpus
NAACL 2025
Measuring Mental Health Variables in Computational Research: Toward Validated, Dimensional, and Transdiagnostic Approaches
NAACL 2025
Statistical and Neural Methods for Hawaiian Orthography Modernization
EMNLP 2025
Understanding Microtargeting Pattern on Social Media
AAAI 2025
Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space
AAAI 2025
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
AAAI 2025
Temporal Streaming Batch Principal Component Analysis for Time Series Classification (Student Abstract)
AAAI 2025
Efficient Numerical Integration in Reproducing Kernel Hilbert Spaces via Leverage Scores Sampling
JMLR 2025
Reliability of Topic Modeling
NAACL 2025
Media of Langue: Exploring Word Translation Network
NAACL 2025
DSBC : Data Science task Benchmarking with Context engineering
AACL 2025
Towards Event Extraction with Massive Types: LLM-based Collaborative Annotation and Partitioning Extraction
EMNLP 2025
Bidirectional Topic Matching: Quantifying Thematic Intersections Between Climate Change and Climate Mitigation News Corpora Through Topic Modelling
ACL 2025
Quality Assessment of Tabular Data using Large Language Models and Code Generation
EMNLP 2025
<
1
2
3
4
5
…
26
>