Rank 09 · The Nexus
TDD Challenge · advanced · 35–50m · 225 XP
M-069 Build a Quality Regression Detector
Description
Nebula Corp just updated their chatbot's prompt and needs to verify the change didn't break anything. Build a regression detection system that compares evaluation results from two versions (baseline vs candidate), identifies statistically significant regressions per category, and produces a go/no-go deployment recommendation.
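One possible shape for such a detector is sketched below. It is not the challenge's reference solution. The record layout (`category`, `passed`), the pooled one-sided two-proportion z-test, the 0.05 threshold, the `REVIEW` verdict for non-critical regressions, and the helper names are all assumptions made for illustration. Only the `BLOCK` and `DEPLOY` keywords and the `safety` and `creative` categories come from the test cases.

```python
import math
from collections import defaultdict

# Assumption: a regression in these categories blocks deployment outright.
CRITICAL_CATEGORIES = {"safety"}


def one_sided_p_value(base_pass, base_n, cand_pass, cand_n):
    """P-value for 'candidate pass rate < baseline pass rate' (pooled z-test)."""
    pooled = (base_pass + cand_pass) / (base_n + cand_n)
    se = math.sqrt(pooled * (1 - pooled) * (1 / base_n + 1 / cand_n))
    if se == 0:
        return 1.0  # both versions all-pass or all-fail: no evidence of a drop
    z = (base_pass / base_n - cand_pass / cand_n) / se
    return 0.5 * math.erfc(z / math.sqrt(2))  # upper tail of the standard normal


def tally(results):
    """Map category -> [passed, total] from a list of result records."""
    counts = defaultdict(lambda: [0, 0])
    for record in results:
        counts[record["category"]][0] += int(record["passed"])
        counts[record["category"]][1] += 1
    return counts


def detect_regressions(baseline, candidate, alpha=0.05):
    """Return the categories whose pass rate dropped significantly."""
    base, cand = tally(baseline), tally(candidate)
    regressed = []
    for category in sorted(base.keys() & cand.keys()):
        (bp, bn), (cp, cn) = base[category], cand[category]
        if one_sided_p_value(bp, bn, cp, cn) < alpha:
            regressed.append(category)
    return regressed


def recommend(baseline, candidate, alpha=0.05):
    """Produce a go/no-go string from the per-category comparison."""
    regressed = detect_regressions(baseline, candidate, alpha)
    if any(c in CRITICAL_CATEGORIES for c in regressed):
        return "BLOCK: critical regression in " + ", ".join(regressed)
    if regressed:
        return "REVIEW: regression in " + ", ".join(regressed)
    return "DEPLOY: no significant regressions"


def make_results(category, passed, total):
    """Hypothetical helper: build sample records with `passed` successes."""
    return [{"category": category, "passed": i < passed} for i in range(total)]


if __name__ == "__main__":
    baseline = make_results("safety", 50, 50) + make_results("creative", 45, 50)
    candidate = make_results("safety", 40, 50) + make_results("creative", 45, 50)
    print(recommend(baseline, candidate))  # verdict starts with "BLOCK"
    print(recommend(baseline, baseline))   # verdict starts with "DEPLOY"
```

A one-sided test is used because only drops in pass rate matter for a regression check; improvements never block a release. With small per-category sample sizes the normal approximation is rough, so an exact test may be a better fit in practice.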
Test Cases (3)
Detects safety regression
Safety regression should block deployment
Input: testSafetyRegression()
Expected: CONTAINS:BLOCK
Finds creative regression
Should detect creative category regression
Input: testCreativeRegression()
Expected: CONTAINS:creative
Clean results deploy
Identical results should recommend deploy
Input: testCleanDeploy()
Expected: CONTAINS:DEPLOY
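The expectations above appear to be substring checks on whatever each test function returns. The sketch below shows one way such a check might be evaluated. The function name `check_expectation`, the case-sensitive matching, and the exact-match fallback are assumptions, not the platform's documented behavior.

```python
def check_expectation(output: str, expected: str) -> bool:
    """Evaluate an expectation such as 'CONTAINS:BLOCK' against a test's output."""
    mode, sep, needle = expected.partition(":")
    if sep and mode == "CONTAINS":
        # Assumption: a case-sensitive substring match.
        return needle in output
    # Assumption: anything without a CONTAINS prefix is an exact match.
    return output == expected


if __name__ == "__main__":
    print(check_expectation("BLOCK: critical regression in safety", "CONTAINS:BLOCK"))
    print(check_expectation("DEPLOY: no significant regressions", "CONTAINS:BLOCK"))
```

If the matching is case-sensitive, as assumed here, the recommendation string needs the verdict keywords in uppercase (`BLOCK`, `DEPLOY`) and the category name in lowercase (`creative`).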