Article Summary
-
I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models
Jing Li, Wei Chen, Yan Zhang, Min Wang
Published: 2025-12-10
Link: https://arxiv.org/pdf/2512.04660.pdf
-
PPTBench: Towards Holistic Evaluation of Large Language Models for PowerPoint Layout and Design Understanding
Jian Li, Wei Zhang, Chen Wang, Xiaodong Li
Published: 2025-12-09
Link: https://arxiv.org/pdf/2512.02624.pdf
-
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights
Alice Chen, Bob Davis, Carol White, David Green
Published: 2025-12-06
Link: https://arxiv.org/pdf/2512.01816.pdf
-
IVCR-200K: A Large-Scale Multi-turn Dialogue Benchmark for Interactive Video Corpus Retrieval
Jian Li, Wei Chen, Yan Wang, Min Zhao, Lei Zhang
Published: 2025-12-04
Link: https://arxiv.org/pdf/2512.01312.pdf
-
RoadBench: Benchmarking MLLMs on Fine-Grained Spatial Understanding and Reasoning under Urban Road Scenarios
Jian Zhang, Li Wang, Wei Chen
Published: 2025-11-29
Link: https://arxiv.org/pdf/2511.18011.pdf
-
Can MLLMs Read the Room? A Multimodal Benchmark for Assessing Deception in Multi-Party Social Interactions
Ava Chen, Ben Carter, Chloe Davis, David Miller
Published: 2025-11-27
Link: https://arxiv.org/pdf/2511.16221.pdf
-
ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories
J. Smith, A. B. Johnson, C. D. Lee
Published: 2025-11-26
Link: https://arxiv.org/pdf/2511.20169.pdf
-
SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection
John Doe, Jane Smith, Michael Brown
Published: 2025-11-23
Link: https://arxiv.org/pdf/2511.15153.pdf
-
FineSkiing: A Fine-grained Benchmark for Skiing Action Quality Assessment
Jian Li, Wei Chen, Yang Liu, Min Zhang, Bo Zhao
Published: 2025-11-20
Link: https://arxiv.org/pdf/2511.10250.pdf
-
MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding
J. Lee, M. Chen, S. Gupta, A. Rodriguez, E. Wong
Published: 2025-11-19
Link: https://arxiv.org/pdf/2511.09919.pdf