KQL Benchmark Dashboard
A comprehensive AI evaluation framework that uses real-world attack scenarios to test large language models' ability to generate cybersecurity detection rules
Model Performance Comparison
Performance vs. Cost Analysis
Performance Over Time
Dive Deeper into the Benchmark
Explore our comprehensive methodology, detailed model analysis, and the complete dataset of cybersecurity scenarios used to evaluate AI performance.