Realtime AI infrastructure for interactive agents and voice applications
Inworld AI builds realtime generative models and infrastructure for conversational agents, with particular depth in voice AI. The tech stack reveals a platform-as-a-service architecture: Node.js + Python backend, Unreal Engine + Unity for client integration, PyTorch for model work, and orchestration via Docker + Kubernetes. Active projects span TTS APIs, model routing, billing systems, and CI/CD automation. The pain-point list (streaming concurrency, low-latency voice serving, interop between C++ and Node.js) matches the engineering-heavy hiring mix and signals a company scaling realtime inference at the infrastructure layer, not just fine-tuning off-the-shelf models.
Inworld AI provides realtime generative models and agent infrastructure for developers building interactive AI applications—companions, educational assistants, health tools, and enterprise agents. The platform includes voice AI models, intelligent model routing and optimization, and an Agent Runtime designed to handle millions of concurrent users. Founded in 2021 by former DeepMind and Google (Dialogflow) engineers, the company operates as a research-led product team of 51–200 employees, headquartered in Mountain View. Hiring spans engineering, research, and go-to-market roles across the US, Canada, Switzerland, Germany, and Serbia.
Node.js, Python, C++, C#, PyTorch (ML), Docker, Kubernetes (orchestration), Unreal Engine, Unity (client SDKs), Next.js + React (web), GitHub Actions + Jenkins (CI/CD), Terraform (infrastructure).
TTS API, voice infrastructure scaling, agent runtime, dynamic A/B experiments, consumer AI platform, API-based model services, interactive demos, system-wide billing, and CI/CD pipeline automation for applications and infrastructure.
Other companies in the same industry, closest in size