December 2025, Healthcare LLM Evaluations, Sakhi's story, and Splice Beta

Benchmarking AI models for maternal health across English, Hindi, and Marathi. The story behind Sakhi — WhatsApp-based maternal health literacy. Splice Beta 2025 in Chiang Mai and GTSInnovationalogue with Carnegie India.

SakhiHealth AI

We are benchmarking leading AI models using the Sakhi Dataset, a parallel set of maternal health questions undergoing expert validation in English, Hindi, and Marathi. Early findings reveal a critical safety concern: AI models perform inconsistently across languages, which could deliver incomplete or inaccurate maternal health guidance at scale.

Sakhi is a WhatsApp-based maternal health literacy conversational agent providing reliable, verified information in local languages. Over recent months, we surveyed women in Jalgaon, Maharashtra, and piloted Sakhi using a human-in-the-loop approach, where healthcare professionals reviewed content and handled queries beyond our curated, expert-verified knowledge base.

AI healthcare tools reach millions in India, but safety evaluations focus overwhelmingly on English. We don't know how models perform in Hindi and Marathi, risking misleading guidance at scale for the populations who need it most. Our goal is to help ensure that future health AI systems provide safe and equitable guidance across all languages and communities.

We attended the Splice Beta 2025 conference in Chiang Mai, Thailand, connecting with the global community working at the intersection of AI, digital discourse, and social impact. We also attended the GTSInnovationalogue, co-hosted by Carnegie India and the Ministry of External Affairs, India, to discuss AI adoption frameworks.

Builds, bugs, and breakthroughs

Policy watch

Cool research to follow

Useful data drops

← Earlier · November 2025

November 2025, Happy Thanksgiving from SimPPL — introducing Arbiter

Later → January 2026

January 2026, SimPPL's 2025 Wrapped and Arbiter detects something before YouTube does