Question Analysis
A Windows host shows chrome.exe starting with a --load-extension parameter. What folder name was specified in that flag?
7a714703-9f6b-461c-b06d-e6aeac650f27
Question ID
1/16
Models Correct
1
Valid Answer
6%
Success Rate
Reference Answer
Human-validated correct answer for this question
extension
Model Performance Comparison
See how different AI models performed on this question
Model | Answer | KQL Query | Status | Attempts | Time | Cost |
---|---|---|---|---|---|---|
gemini-2.5-flash-preview-04-17 | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0322 |
gpt-35-turbo | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0116 |
gpt-4-turbo-2024-04-09 | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.2184 |
gpt-4.1 | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0380 |
gpt-4.1-finetuned | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0360 |
gpt-4.1-mini | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0050 |
gpt-4.1-nano | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0019 |
gpt-4o | $env:TEMP\extension\ | DeviceProcessEvents
| where ProcessCommandLine contains "chrome.exe" and ProcessCommandLine contains "--load-extension"
| extend ExtensionFolder = extract("--load-extension=([^\\s]*)", 1, ProcessCommandLine)
| project DeviceId, DeviceName, ProcessCommandLine, ExtensionFolder | Correct | 2 | 19.71s | 0.0276 |
grok-3-beta | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0845 |
grok-3-mini-beta | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0075 |
o1-high | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.6913 |
o1-low | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.6311 |
o3-mini-high | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0359 |
o3-mini-low | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0369 |
o4-mini-high | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0436 |
o4-mini-low | No answer provided | No query generated | Incorrect | 5 | 0.00s | 0.0368 |