#Tech#AI Safety#Policy
I Built an Open-Source Tool That Audits AI Models for Cultural Bias in 11 Languages
After six months of development, here's what I learned about how Western-trained models fail non-Western users.
EG SEALVERIFIEDJun 19, 2026
41 upvotes · 0 comments
The Problem
Most AI bias research focuses on English-language outputs. But the world doesn't run on English. When I tested five major frontier models against culturally-specific prompts in Mandarin, Arabic, Swahili, and Tamil, the failure modes were striking — and rarely documented.
What I Built
Lingua-Audit is an open-source evaluation framework that:
- Tests for 14 distinct cultural failure modes
- Supports 11 languages out of the box
- Generates reports auditable by non-technical policy teams
- Is fully reproducible — every prompt and grading rubric is public
Top Three Findings
- Honorific collapse: Models routinely fail at hierarchical address systems in Japanese, Korean, and Tamil.
- Religious sensitivity drift: Outputs on Islamic jurisprudence often default to a single school of thought without disclosure.
- Diaspora erasure: Models treat diaspora identities (Tamil-Singaporean, Lebanese-Brazilian) as edge cases, often defaulting to country-of-origin assumptions.
Try It Yourself
The full repo is open. PRs welcome.
Endorsements & feedback
No endorsements yet. Be the first.