Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Back to papers
2024
ECCV
ECCV 2024
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
Authors
Raghav Kapoor
,
Yash Parag Butala
,
Melisa A Russak
,
Jing Yu Koh
,
Kiran Kamble
,
Waseem AlShikh
,
Ruslan Salakhutdinov
Download PDF
Related papers
Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos
2024
Learning Camouflaged Object Detection from Noisy Pseudo Label
2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
2024
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
2024
UniCode : Learning a Unified Codebook for Multimodal Large Language Models
2024