Safety at Scale: A Comprehensive Survey of Large Model Safety Paper • 2502.05206 • Published Feb 2, 2025 • 3
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law Paper • 2507.18576 • Published Jul 24, 2025 • 8
Simulated Ensemble Attack: Transferring Jailbreaks Across Fine-tuned Vision-Language Models Paper • 2508.01741 • Published Aug 3, 2025 • 1
Imperceptible Jailbreaking against Large Language Models Paper • 2510.05025 • Published Oct 6, 2025 • 33
Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes? Paper • 2506.14805 • Published Jun 3, 2025 • 3
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs Paper • 2511.12710 • Published Nov 16, 2025 • 38