KQL Benchmark Dashboard

Comprehensive AI evaluation framework testing large language models' ability to generate cybersecurity detection rules using real-world attack scenarios

Model Performance Comparison

Performance vs. Cost Analysis

Performance Over Time

Dive Deeper into the Benchmark

Explore our comprehensive methodology, detailed model analysis, and the complete dataset of cybersecurity scenarios used to evaluate AI performance.