CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models
Meta releases CyberSecEval 2, a comprehensive evaluation framework measuring cybersecurity risks and capabilities of LLMs. The tool tests malicious code generation, vulnerability exploitation, and attack defense across models including Llama.