No API. No SDK. No backend. Just raw audio routing.
This isn't a product. It isn't even a proper integration.
But it works. And it freaks people out.
Some founders wanted to build a voice agent. They estimated:
We said "nah." And built a working prototype in 2 hours – without writing a single line of code.
Just ChatGPT Desktop + Jitsi + Voicemeeter + curiosity.
This is not a mockup – it actually worked.
The voice agent may or may not be active right now. The room is always open. Try speaking. If the ghost responds – it's real ChatGPT.
<iframe class="w-full h-full" src="https://meet.jit.si/YOUR_ROOM_NAME#lang=en&config.startWithVideoMuted=true&config.disableVideo=true&config.startWithAudioMuted=false&config.prejoinPageEnabled=false" allow="microphone; fullscreen; display-capture" allowfullscreen sandbox="allow-same-origin allow-scripts allow-popups allow-forms" />
VAIO AUX IN ← Jitsi mic (User) B1 → ChatGPT hears user ChatGPT → VAIO3 B2 → Jitsi hears ChatGPT A1 → Operator local monitor
Output: Voicemeeter In 3 (VAIO3) Input: Voicemeeter Out B1 Jitsi mic: Voicemeeter Out B2 ChatGPT mic: Voicemeeter Out B1
This is what happens when engineers stop waiting for permission.
No startup. No prompt engineering. Just pipes.
Think you can improve this? Make it browser-only? Auto-on?
Then do it. And send us the link.
Built something weird with ChatGPT Voice? Email us. Let's compare madness.