TruthfulVQA The First Benchmark for Multimodal Truthfulness Evaluation Align Anything Training All-Modality Models to Follow Instructions with Language Feedback Debate with Images Detecting Deceptive Behaviors in Multimodal Large Language Models Eval-Anything Comprehensive Safety Evaluation for Any-to-Any Multimodal Models