Research

Methodology, findings, and technical details behind Vigil.

Coming soon

Vigil: An Evaluation Framework for AI Safety in Mental Health Conversations

A comprehensive look at why existing AI safety benchmarks fail to capture the nuance of mental health interactions, and how Vigil's four-stage pipeline addresses this gap. Covers the design philosophy, evaluation dimensions, and initial findings across eight vulnerability states.

Methodology Findings

Coming soon

Technical Deep Dive: Multi-Turn Scenario Generation and Evidence-Based Judging

How Vigil generates realistic multi-turn conversations, validates scenario diversity, and performs two-phase evidence-based scoring. Includes implementation details, prompt engineering decisions, and lessons learned from iterating on the pipeline.

Technical Implementation