CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models • May 24, 2024
Llama 3.3 Evals Collection: This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.3 models, including the configurations • 1 item • Updated about 1 month ago
Llama 3.3 Collection: This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated about 1 month ago