MSTS: A Multimodal Safety Test Suite for Vision-Language Models • Paper • 2501.10057 • Published 7 days ago
Language Model Council: Benchmarking Foundation Models on Highly Subjective Tasks by Consensus • Paper • 2406.08598 • Published Jun 12, 2024