Ask HN: What's the best realtime, local, TTS solution? Live call interpretation

So I'm trying to build a system that listens to calls as they're happening. All implementations antigravity/codex/cursor throws out have been really janky and ineffective. Spent a couple days prompt engineering without finding an elegant solution. Anybody have insights?

6 points | by Wright007 1 day ago

1 comments

  • ahmedgagan 1 day ago
    What about recording the audio of your output? device and then transcribing it