2025
AAAI
AAAI 2025
Visual Question Answering for Peruvian Cuisine in Regional Spanish
Abstract
Abstract This project leverages Visual Question Answering (VQA) to promote Peruvian gastronomy by utilizing a culturally rich dataset and advanced models such as LLaVA-1.5 and GPT-2 Large. The evaluation will comprise both automated metrics and culinary expert assessments. This system addresses regional variations in dish names, promotes inclusivity by involving Peruvians from diverse regions in dataset construction, and enhances cultural representation.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Computer Vision and Natural Language Processing
🧭
Keyword Pioneer
— regional spanish
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Multimodal Learning
Natural Language Processing > Applications > Machine Reading Comprehension
Artificial Intelligence > Core AI > Large Language Models
Computer Vision > Core AI > Multimodal Learning
Natural Language Processing > Applications > Visual Question Answering