TDD Challenge · advanced · 35–50 min · 225 XP

M-069: Build a Quality Regression Detector

Description

Nebula Corp just updated their chatbot's prompt and needs to verify the change didn't break anything. Build a regression detection system that compares evaluation results from two versions (baseline vs candidate), identifies statistically significant regressions per category, and produces a go/no-go deployment recommendation.
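
One possible shape for such a detector is sketched below in Python. Everything in it is an assumption rather than part of the challenge statement: the input format (per-category pass/total counts), the names `CategoryResult`, `regression_p_value`, and `detect_regressions`, the one-sided two-proportion z-test, the 0.05 significance level, and the intermediate `REVIEW` verdict for regressions outside the safety category. The challenge only specifies that the output contains `BLOCK`, the regressed category name, or `DEPLOY` in the respective cases.

```python
import math
from dataclasses import dataclass


@dataclass
class CategoryResult:
    """Hypothetical input shape: pass/total counts for one eval category."""
    passed: int
    total: int

    @property
    def rate(self) -> float:
        return self.passed / self.total if self.total else 0.0


def regression_p_value(base: CategoryResult, cand: CategoryResult) -> float:
    """One-sided two-proportion z-test.

    Returns the p-value for the hypothesis that the candidate pass rate
    is lower than the baseline pass rate (small value = likely regression).
    """
    n1, n2 = base.total, cand.total
    if n1 == 0 or n2 == 0:
        return 1.0  # no data, so no evidence of a regression
    pooled = (base.passed + cand.passed) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        return 1.0  # both versions pass everything or fail everything
    z = (cand.rate - base.rate) / se
    # Standard normal CDF evaluated at z, via the error function.
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))


def detect_regressions(baseline, candidate, alpha=0.05, critical=("safety",)):
    """Compare two {category: CategoryResult} dicts and return a report string."""
    regressed = [
        cat
        for cat in sorted(baseline)
        if cat in candidate
        and regression_p_value(baseline[cat], candidate[cat]) < alpha
    ]
    if not regressed:
        return "DEPLOY: no significant regressions"
    # Assumed policy: a regression in a critical category blocks the release;
    # any other regression is flagged for manual review.
    verdict = "BLOCK" if any(cat in critical for cat in regressed) else "REVIEW"
    return f"{verdict}: significant regression in {', '.join(regressed)}"


if __name__ == "__main__":
    baseline = {"safety": CategoryResult(98, 100), "creative": CategoryResult(90, 100)}
    candidate = {"safety": CategoryResult(80, 100), "creative": CategoryResult(90, 100)}
    print(detect_regressions(baseline, candidate))
```

A z-test on pass rates is only one option; with small per-category sample sizes an exact test or a fixed minimum-drop threshold may be more appropriate.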

Test Cases (3)

1. Detects safety regression
   Safety regression should block deployment
   Input: testSafetyRegression()
   Expected: CONTAINS:BLOCK

2. Finds creative regression
   Should detect creative category regression
   Input: testCreativeRegression()
   Expected: CONTAINS:creative

3. Clean results deploy
   Identical results should recommend deploy
   Input: testCleanDeploy()
   Expected: CONTAINS:DEPLOY
