VoiceClinic (Clinical Intake) — AI Pre-Consultation Voice Agent

Name: VoiceClinic Review
Item: VoiceClinic
Rating: 3
Author: ToolyHealth Editors

A Node.js voice agent that conducts bilingual pre-consultation intake and generates a structured audio podcast for physician review. Last updated: 2026-06-21.

One-sentence verdict: A production-oriented voice-intake prototype with Docker, JWT dashboard, and integration tests — a solid scaffold, but not HIPAA-ready out of the box.

What the System Is

VoiceClinic — the repository is published as clinical-intake — is a Node.js / Express application that automates pre-consultation clinical interviews. The patient speaks into a browser microphone; Groq transcribes the speech, an LLM (DeepSeek via an OpenAI-compatible client) selects the next clinical question, and ElevenLabs speaks it back. After the encounter, the system compiles a downloadable MP3 "podcast" summary and exposes a JWT-secured physician dashboard.

The deploy guide reports the service was started on port 5005.

Key data
Category	Clinical Intake
Language	Node.js / Express
License	MIT
Self-hosted	Yes
AI providers	Groq STT, DeepSeek LLM, ElevenLabs TTS
Database	File-system sessions; no persistent relational DB
Auth	JWT for physician dashboard
Tests	19 integration tests (documented as passing)

How to Install and Deploy

The repository includes a Dockerfile and docker-compose.yml. The documented flow is:

cd /data2/docker/going-global/repos/clinical-intake
cp .env.example .env
# Add GROQ_API_KEY, LLM_API_KEY, ELEVENLABS_API_KEY
docker compose up -d

Alternatively, the deploy guide uses npm directly:

cd /data2/docker/going-global/repos/clinical-intake
npm install
OPENAI_API_KEY=sk-xxx GROQ_API_KEY=gsk-xxx PORT=5005 npm start

Then open http://localhost:5005 for the patient intake, and http://localhost:5005/login.html for the physician dashboard.

How to Test

The documented test flow is:

Open the patient intake URL and start a voice encounter in English or French.
Answer the spoken clinical questions.
Wait for the encounter to complete and verify that a podcast / MP3 summary is generated.
Log in to the physician dashboard with the configured credentials and review session history.
Run npm test to execute the 19 integration tests, which spawn a local Express server with mock API credentials.

Privacy & Compliance

VoiceClinic is not HIPAA compliant as shipped. The README describes an "anonymous by design" flow with no patient identifiers stored and transient audio processing, but there are no documented BAAs, audit logs, encryption at rest, or access-control policies. Every voice request is sent to Groq, the LLM provider, and ElevenLabs. Do not use it for real patient data without a compliance review.

VoiceClinic vs Commercial Clinical Documentation

Dimension	VoiceClinic	Commercial Scribes (e.g., Nuance DAX, Ambience Healthcare, Abridge)
Cost	Free / self-hosted + API usage	Subscription per clinician
Deployment	Docker one-command or npm	Cloud SaaS or enterprise install
Voice pipeline	Groq + DeepSeek + ElevenLabs	Proprietary ASR/NLU/TTS
EMR integration	None; MP3 + dashboard only	EHR-integrated notes
Bilingual support	English / French	Varies by vendor
Security posture	JWT login, no documented HIPAA controls	BAA, audit logs, encryption
Customization	Full source code; question files are plain text	Vendor-controlled
Clinical validation	None claimed	Often clinically reviewed

Who Should Use It

Developers building a voice-first intake prototype.
Teams that want a Dockerized starting point with STT/LLM/TTS wired together.
Organizations that need a bilingual (EN/FR) intake demo and are willing to add compliance controls themselves.

Who Shouldn't Use It

Clinics needing HIPAA-compliant intake without additional engineering.
Practices that require EHR integration or structured note export.
Users expecting a commercially supported medical device.

FAQ

Is VoiceClinic free?

Yes. The code is MIT-licensed. You pay for your own server and any Groq, LLM, and ElevenLabs API usage.

What languages does it support?

The checked-in question files support English and French clinical questionnaires.

Does it store audio recordings?

The README states audio is processed transiently and only the final structured podcast persists. Verify this against the deployed configuration before using with sensitive data.

Verdict

VoiceClinic is the most deployment-ready voice-intake prototype in this batch. It has Docker, a test suite, a physician dashboard, and a clear STT→LLM→TTS pipeline. The main gaps are compliance (no HIPAA controls), no structured data export beyond the podcast, and reliance on third-party API keys. It is a strong scaffold for a development team, not a drop-in clinical product.

Ratings: Deployability 4/5 · Value vs Commercial 3/5 · Privacy Compliance 2/5