mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation Paper • 2505.24073 • Published May 29, 2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models Paper • 2506.15645 • Published Jun 18, 2025 • 4
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems Paper • 2506.07564 • Published Jun 9, 2025 • 6
BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage Paper • 2506.02479 • Published Jun 3, 2025
Generative AI for Autonomous Driving: Frontiers and Opportunities Paper • 2505.08854 • Published May 13, 2025 • 1
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving Paper • 2412.15208 • Published Dec 19, 2024
Can Large Vision Language Models Read Maps Like a Human? Paper • 2503.14607 • Published Mar 18, 2025 • 10
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Paper • 2412.15206 • Published Dec 19, 2024
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Paper • 2502.13146 • Published Feb 18, 2025 • 1
GrowLength: Accelerating LLMs Pretraining by Progressively Growing Training Length Paper • 2310.00576 • Published Oct 1, 2023 • 2
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning Paper • 2401.01325 • Published Jan 2, 2024 • 27