Article Summary
Search
Showing results for:
Large language models
—
Clear filter
Thinking with Images via Self-Calling Agent
Li Wei, Chen Jie, Wang Siyu
Self-calling agents
Visual reasoning
Large language models
Multimodal AI
Agentic AI
Published: 2025-12-11
Link:
https://arxiv.org/pdf/2512.08511.pdf
DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation
Jia Li, Wei Wang, Min Chen
Dialect robustness
Multimodal generation
Benchmarking
Large language models
Speech synthesis
Published: 2025-10-23
Link:
https://arxiv.org/pdf/2510.14949.pdf