Article Summary
-
Learning from Watching: Scalable Extraction of Manipulation Trajectories from Human Videos
John Doe, Jane Smith, Robot K. Vision
Published: 2025-12-08
Link: https://arxiv.org/pdf/2512.00024.pdf
-
MirrorMamba: Towards Scalable and Robust Mirror Detection in Videos
Jian Li, Wei Chen, Bing Zhang
Published: 2025-11-17
Link: https://arxiv.org/pdf/2511.06716.pdf
-
PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis
John Doe, Jane Smith, Michael Lee
Published: 2025-10-27
Link: https://arxiv.org/pdf/2510.21447.pdf
-
SeViCES: Unifying Semantic-Visual Evidence Consensus for Long Video Understanding
Alice Chen, Bob Davis, Carol Evans
Published: 2025-10-26
Link: https://arxiv.org/pdf/2510.20622.pdf
-
Cataract-LMM: Large-Scale, Multi-Source, Multi-Task Benchmark for Deep Learning in Surgical Video Analysis
A. Research, B. Scientist, C. Developer
Published: 2025-10-22
Link: https://arxiv.org/pdf/2510.16371.pdf
-
ViBED-Net: Video Based Engagement Detection Network Using Face-Aware and Scene-Aware Spatiotemporal Cues
John Doe, Jane Smith, Robert Johnson
Published: 2025-10-22
Link: https://arxiv.org/pdf/2510.18016.pdf
-
Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video
A. B. Researcher, C. D. Innovator, E. F. Visionary
Published: 2025-10-17
Link: https://arxiv.org/pdf/2510.14560.pdf