Built a privacy-first real-time translation platform with Ollama
We've been building PolyTalk, an open-source real-time translation platform powered by Ollama.
Unlike most translation tools, it's not limited to speech-to-speech translation. It can translate audio from microphones, browser tabs, meetings, videos, and other audio sources in real time.
Current stack: • faster-whisper for speech-to-text • Ollama-compatible models for translation • Piper for text-to-speech
Privacy was a major goal, so the platform can run entirely on your own infrastructure.
Would love feedback from the community, especially around multilingual models and real-time translation workloads.