Article Summary
-
Unleashing the Intrinsic Visual Representation Capability of Multimodal Large Language Models
Jian Li, Wei Chen, Ying Zhang
Published: 2025-12-13
Link: https://arxiv.org/pdf/2512.06281.pdf
-
Building Reasonable Inference for Vision-Language Models in Blind Image Quality Assessment
Not Provided
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.09555.pdf
-
Efficient-VLN: A Training-Efficient Vision-Language Navigation Model
Anonymous Author A, Anonymous Author B, Anonymous Author C
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.10310.pdf
-
PrunedCaps: A Case For Primary Capsules Discrimination
Jian Li, Wei Chen, Hui Zhang
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.06003.pdf
-
Animal Re-Identification on Microcontrollers
Dr. Anya Sharma, Prof. Ben Carter, Ms. Chloe Davis
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.08198.pdf
-
Any4D: Unified Feed-Forward Metric 4D Reconstruction
Lena Chen, Markus Schmidt, Sofia Garcia, David Lee
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.10935.pdf
-
GeoLoom: High-quality Geometric Diagram Generation from Textual Input
A. Researcher, B. Developer, C. Scientist
Published: 2025-12-12
Link: https://arxiv.org/pdf/2512.08180.pdf
-
FacePhys: State of the Heart Learning
John Doe, Jane Smith, Michael Brown
Published: 2025-12-11
Link: https://arxiv.org/pdf/2512.06275.pdf
-
Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry Systems
Not available - Article content inaccessible
Published: 2025-12-11
Link: https://arxiv.org/pdf/2512.05418.pdf
-
AutoLugano: A Deep Learning Framework for Fully Automated Lymphoma Segmentation and Lugano Staging on FDG-PET/CT
John Doe, Jane Smith, Robert Johnson, Emily White
Published: 2025-12-10
Link: https://arxiv.org/pdf/2512.07206.pdf