egocentric vision

Benchmarking egocentric multimodal goal inference for assistive wearable agents

We present a benchmark for egocentric multimodal goal inference for assistive wearable agents. This benchmark evaluates the ability of AI systems to infer user goals from egocentric video, audio, and other sensor modalities in real-world scenarios. …