Factuality Evaluation Using Reasoning and World Modeling
Abstract
Abstract Large language models (LLMs) have rapidly become primary tools for searching and generating information given a carefully designed prompt (may contain few-shot examples). However, these models frequently produce factually incorrect statements that are not consistent with verifiable facts and reliable sources, raising fundamental questions about how these models store, update, and reason with facts. Improving factuality, therefore, requires more than surface-level mitigation strategies: it demands a deeper understanding of how LLMs construct and maintain world models, and how reasoning processes can be guided to remain faithful to the verifiable information. Existing strategies, such as retrieval-augmented generation, training-time alignment, post hoc verification, etc., partly address these challenges but do not provide a holistic account of how facts are internally stored, updated, or grounded in external knowledge sources. My research addresses this gap by studying factuality through the dual lens of reasoning and world modeling, asking how LLMs encode facts, how adversarial or linguistic perturbations compromise factual reasoning, and how interpretability tools can reveal and correct model vulnerabilities. In this work, I aim to develop a framework in which an LLM interacts with an explicit external knowledge source, thereby forming a robust world model for factual evaluation.