ML + Vision Top-6 Agent Survey - ICLR 2025 - Page 3 of 3¶
Overview | Previous: ICLR 2025 p2 | Page 3 / 3 | Next: CVPR 2023 p1
- Venue: International Conference on Learning Representations
- Year: 2025
- Page: 3 / 3
- Papers: 61-74 / 74
Papers
Bridging Compressed Image Latents and Multimodal Large Language Models Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Mitigating Object Hallucination in MLLMs via Data-augmented Phrase-level Alignment Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Language Agents Meet Causality - Bridging LLMs and Causal World Models Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Do LLM Agents Have Regret? A Case Study in Online Learning and Games Paper
Abstract
Not stated in metadata.
Claim
Not stated in abstract.
Overview | Previous: ICLR 2025 p2 | Page 3 / 3 | Next: CVPR 2023 p1