NAVIGATOR
Continuous auditing and expert escalation for mental-health LLM safety
Benchmarks drift and ground-truth labels contain errors. We use a multi-agent, human-in-the-loop process that improves labeling accuracy by 16% and reduces human review by up to 85×, continuously learning from expert labels to tune the process maps. Read the paper ↗
Harbor Dataset
HarborNo scored conversations yet
Run the scoring script to process conversations
Mental Health Datasets
Click a dataset to view • ♡ datasets you want scoredHarbor Dataset
Harbor Native DataConversations created directly in Harbor with human-expert annotations
Dataset Card for "mental_health_counseling_conversations_sharegpt" More Information needed
CREDIT: Dataset Card for "heliosbrahma/mental_health_chatbot_dataset" Dataset Description Dataset Summary This dataset contains conversational pair of questions and answers in a single text related to Mental Health. Dataset was curated from popular healthcare blogs l
Mental Health Queries and Personality Dataset Overview This dataset encompasses a collection of mental health queries paired with personality scores and responses generated by a Large Language Model (LLM). It aims to provide insights into the interplay between personality trait