Physical AI

Benchmarking egocentric multimodal goal inference for assistive wearable agents

We present a benchmark for egocentric multimodal goal inference for assistive wearable agents. It evaluates how well AI systems infer user goals from egocentric video, audio, and other sensor modalities in real-world scenarios. …

DigiData: Training and evaluating general-purpose mobile control agents

We present DigiData, a comprehensive framework for training and evaluating general-purpose mobile control agents. This work addresses the challenge of creating AI agents that can navigate and interact with mobile user interfaces to perform tasks …