We turned ChatGPT into a real voice agent

No API. No SDK. No backend. Just raw audio routing.
This isn't a product. It isn't even a proper integration.
But it works. And it freaks people out.

👀 Here's what happened

Some founders wanted to build a voice agent. They estimated:

⏱ 6 months
👨‍💻 3 devs
☁️ Cloud infrastructure

We said "nah." And built a working prototype in 2 hours – without writing a single line of code.

Just ChatGPT Desktop + Jitsi + Voicemeeter + curiosity.

🎥 Demo (Split Screen)

This is not a mockup – it actually worked.

🔊 Join the Live Room

The voice agent may or may not be active right now. The room is always open. Try speaking. If the ghost responds – it's real ChatGPT.

🛠️ Build It Yourself

Install Voicemeeter Potato (free)
Install the ChatGPT desktop app and sign in with a ChatGPT Plus account
Enable Advanced Voice Mode in the ChatGPT settings
Set the Windows default input/output devices for ChatGPT and Jitsi (see below)
Launch a free Jitsi room as the operator (use the Edge browser)

Send the Jitsi room link to the guest user, or embed it with an iframe like this:

<iframe
  class="w-full h-full"
  src="https://meet.jit.si/YOUR_ROOM_NAME#lang=en&config.startWithVideoMuted=true&config.disableVideo=true&config.startWithAudioMuted=false&config.prejoinPageEnabled=false"
  allow="microphone; fullscreen; display-capture"
  allowfullscreen
  sandbox="allow-same-origin allow-scripts allow-popups allow-forms"></iframe>
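
If you generate rooms on the fly, the src URL is easy to assemble by hand. Here is a minimal Python sketch that rebuilds the exact URL from the snippet above. The names jitsi_embed_url and DEFAULT_OVERRIDES are ours, purely for illustration, and the only overrides included are the ones already shown in the iframe.

```python
from urllib.parse import quote

# Overrides copied from the iframe snippet; they travel in the URL fragment.
DEFAULT_OVERRIDES = {
    "lang": "en",
    "config.startWithVideoMuted": True,
    "config.disableVideo": True,
    "config.startWithAudioMuted": False,
    "config.prejoinPageEnabled": False,
}


def jitsi_embed_url(room_name, overrides=None, base="https://meet.jit.si"):
    """Return a room URL with the config overrides appended after '#'."""
    merged = {**DEFAULT_OVERRIDES, **(overrides or {})}
    parts = []
    for key, value in merged.items():
        # Booleans must be written as lowercase "true"/"false",
        # not Python's "True"/"False".
        text = str(value).lower() if isinstance(value, bool) else str(value)
        parts.append(f"{key}={quote(text, safe='')}")
    return f"{base}/{quote(room_name, safe='')}#{'&'.join(parts)}"


print(jitsi_embed_url("YOUR_ROOM_NAME"))
```

Pass the result as the iframe src. Room names with spaces or other special characters get percent-encoded by quote.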

Open the ChatGPT app and the Jitsi room on the same machine
Use Voicemeeter to route the audio as shown below

Audio Routing Diagram:

Jitsi output (the user's voice) → VAIO AUX IN
VAIO AUX IN → B1 → ChatGPT hears user
ChatGPT output → VAIO3
VAIO3 → B2 → Jitsi hears ChatGPT
A1 → Operator local monitor
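
The easiest way to break this setup is a feedback loop, where ChatGPT ends up listening to itself. As a sanity check, the routing above can be modeled in a few lines of Python. This is a rough sketch, not a Voicemeeter API: the dictionaries just mirror the diagram, and sending both strips to A1 for monitoring is our assumption.

```python
# Virtual input (strip) each app plays its audio into.
APP_PLAYS_INTO = {"Jitsi": "VAIO AUX IN", "ChatGPT": "VAIO3"}

# Bus each app records from (its "microphone").
APP_RECORDS_FROM = {"ChatGPT": "B1", "Jitsi": "B2"}

# Buses each strip is sent to inside Voicemeeter.
# Assumption: both strips also go to A1 so the operator can monitor.
STRIP_TO_BUSES = {
    "VAIO AUX IN": {"B1", "A1"},
    "VAIO3": {"B2", "A1"},
}


def listeners(source_app):
    """Return the apps whose microphone bus carries source_app's audio."""
    buses = STRIP_TO_BUSES[APP_PLAYS_INTO[source_app]]
    return sorted(app for app, bus in APP_RECORDS_FROM.items() if bus in buses)


def has_feedback():
    """Return True if any app would hear its own output."""
    return any(app in listeners(app) for app in APP_PLAYS_INTO)


print(listeners("Jitsi"))    # apps that hear the user's voice
print(listeners("ChatGPT"))  # apps that hear ChatGPT
print(has_feedback())
```

If you change the routing (say, also sending VAIO3 to B1), update STRIP_TO_BUSES and has_feedback() will flag the loop before your ears do.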

Windows Audio Settings:

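Windows default output: Voicemeeter In 3 (VAIO3)
Windows default input: Voicemeeter Out B1
Jitsi speaker: Voicemeeter AUX Input (VAIO AUX IN in the routing diagram)
Jitsi mic: Voicemeeter Out B2
ChatGPT mic: Voicemeeter Out B1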
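
Before blaming ChatGPT for silence, check that Windows actually exposes the Voicemeeter devices. Below is a small Python sketch that does this with the third-party sounddevice package (pip install sounddevice). The device labels are the ones named in this post. Exact Windows names differ between Voicemeeter versions, so the match is a case-insensitive substring and the list in REQUIRED_DEVICES is an assumption you may need to adjust.

```python
# Labels taken from the settings above; matched as substrings.
REQUIRED_DEVICES = {
    "output": ["Voicemeeter In 3"],                         # playback device
    "input": ["Voicemeeter Out B1", "Voicemeeter Out B2"],  # recording devices
}


def missing_devices(devices):
    """Return (kind, label) pairs for required devices that are absent.

    devices: dicts with 'name', 'max_input_channels', 'max_output_channels',
    which is the shape sounddevice.query_devices() yields.
    """
    devices = list(devices)
    missing = []
    for kind, wanted in REQUIRED_DEVICES.items():
        channel_key = f"max_{kind}_channels"
        names = [d["name"].lower() for d in devices if d[channel_key] > 0]
        for label in wanted:
            if not any(label.lower() in name for name in names):
                missing.append((kind, label))
    return missing


def check_this_machine():
    """Query the real audio devices; needs sounddevice installed."""
    import sounddevice as sd
    return missing_devices(sd.query_devices())
```

On the routing machine, check_this_machine() returns an empty list when every device is present. Anything it returns is a device to fix in the Windows sound settings first.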

[Screenshot: Windows Sound Mixer settings showing the Voicemeeter configuration]

[Screenshot: Voicemeeter Potato interface with audio routing controls]

[Screenshot: Jitsi meeting room audio device selection]

⚠️ Important Notes:

No API accessed – this drives the ChatGPT desktop app purely through audio routing
May violate OpenAI's terms regarding voice redistribution
Not a product – for research and demos only
Requires ChatGPT Plus for Advanced Voice Mode

⚔️ This is not a polished SaaS

This is what happens when engineers stop waiting for permission.
No startup. No prompt engineering. Just pipes.

Think you can improve this? Make it browser-only? Auto-on?
Then do it. And send us the link.

Built something weird with ChatGPT Voice? Email us. Let's compare madness.

Built in 2 hours. No servers. No code. Just intent and pipes.