Devin Review
Tracked across 27 snapshots (2026-02-03 → 2026-06-12).
Rating timeline
Dimension trajectory
Autonomy
13/20Integration
14/20Context
13/20Compliance
15/20Viability
17/20Interface
14/20Cap timeline
Notable events
- 2026-02-06Cap +pricing-opacity
- 2026-02-06Drop-6 rating
- 2026-02-26Cap +unvalidated-benchmarks
- 2026-02-26Cap −pricing-opacity
- 2026-04-20Cap −unvalidated-benchmarks
- 2026-04-20Jump+5 rating
- 2026-05-04Cap +unvalidated-benchmarks
- 2026-05-04Drop-15 rating
- 2026-05-05Jump+10 rating
- 2026-05-16Jump+5 rating
What would move this next
UP: (1) Independent benchmarks published for Devin Review specifically (accuracy, false-positive rate) — removes unvalidated-benchmarks cap and is the primary gate to Validated. (2) GitLab reaches full GA for Review. (3) GHES reaches full feature parity (comment posting, review submission, merge). (4) Review-specific named enterprise customers with production metrics (current references are for the Devin agent). (5) Hands-on internal trial with measured catch/false-positive rate. DOWN: (1) Accuracy/false-positive complaints emerge at scale on Review specifically. (2) Pricing restructure adds opacity or billing-surprise complaints spike. (3) GitLab preview dropped or stalls. (4) CodeRabbit/competitor signs major joint deals locking out Review. (5) Material adverse change in Cognition health (now low risk post-$1B raise).