Article Summary
-
Concept-based Explainable Data Mining with VLM for 3D Detection
Jian Li, Wei Zhang, Chen Wang, Xiaoyu Liu
Published: 2025-12-13
Link: https://arxiv.org/pdf/2512.05482.pdf
-
Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment
Authors not provided
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.09555.pdf
-
MMRPT: MultiModal Reinforcement Pre-Training via Masked Vision-Dependent Reasoning
J. Chen, L. Wang, K. Gupta
Published: 2025-12-11
Link: https://arxiv.org/pdf/2512.07203.pdf
-
RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation
Jian Li, Wei Chen, Yan Wang
Published: 2025-12-10
Link: https://arxiv.org/pdf/2512.07273.pdf
-
SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models
Ava Chen, Benjamin Lee, Sophia Garcia, Daniel Kim
Published: 2025-12-09
Link: https://arxiv.org/pdf/2512.05955.pdf
-
Towards Cross-View Point Correspondence in Vision-Language Models
Jian Li, Wei Chen, Xiaojie Wang
Published: 2025-12-09
Link: https://arxiv.org/pdf/2512.04686.pdf
-
VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm
Jane Doe, John Smith, Alice Wonderland
Published: 2025-12-09
Link: https://arxiv.org/pdf/2512.02700.pdf
-
TRoVe: Discovering Error-Inducing Static Feature Biases in Temporal Vision-Language Models
Alice Smith, Bob Johnson, Carol Williams
Published: 2025-12-07
Link: https://arxiv.org/pdf/2512.01048.pdf
-
VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models
Anya Sharma, Kai Chen, Lena Petrov
Published: 2025-12-06
Link: https://arxiv.org/pdf/2511.22664.pdf
-
dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning
Hao Zhao, Peng Li, Jian Wu, Jiewen Yang, Xiaofeng Zhang
Published: 2025-12-06
Link: https://arxiv.org/pdf/2512.04459.pdf