DELGMark Results Dashboard

Telemetry and results breakdown of saved model benchmark records.

Model Records15
Models Tested15
Highest Score87%
Average Score70%

Model Records

Evaluation Detail (%)

qwen3.5-397b-a17bUpdated: Jun 9, 05:45 PM
83%
Coding85%
Debugging91%
Refactoring93%
Agent Tooling68%
Security67%
DELG Knowledge97%
Context Window80%