Testing for Bias in Educational AI Assistants: Methodology, Results and Remediation
The first published methodology for systematic bias evaluation of conversational AI in schools. We test the Marge assistant across 46 matched query pairs spanning 8 protected characteristics, reporting a mean bias score of 3.35 (Minor) with candid findings on the limits of prompt engineering.
Download Report