Realtime AI infrastructure for interactive agents and voice applications
Inworld AI builds infrastructure for realtime generative AI applications—voice models, agent runtimes, and multimodal inference at scale. The tech stack (PyTorch, vLLM, CUDA, Kubernetes, Ray, Terraform) reflects a systems-heavy engineering org optimizing for low-latency inference and multi-tenant serving. Active projects on model serving optimization, dynamic A/B experiments, and distributed inference scaling, paired with hiring across research and infrastructure roles, signal a company solving the hard problems of deploying interactive AI at concurrent scale rather than building consumer products.
Inworld AI develops realtime AI infrastructure and generative models for interactive applications—companion apps, educational agents, and enterprise AI assistants. The company serves developers building AI experiences that require sub-second latency and sophisticated agent behavior. Founded in 2021 by former DeepMind and Google (Dialogflow) leaders, Inworld operates as a research-driven infrastructure company rather than a traditional application layer. The product surfaces include optimized voice and multimodal models, an Agent Runtime for orchestration, and intelligent model routing across cloud infrastructure (GCP, Azure, Oracle). The 51–200-person team is concentrated in engineering and research, with expanding sales efforts in North America.
Python, C++, PyTorch, vLLM, CUDA, Kubernetes, Ray, Terraform, Terragrunt, ArgoCD, Ansible, GCP, Azure, and Oracle Cloud. Also Unreal Engine and Unity for client integration, and JavaScript/TypeScript for API services.
Mountain View, California. Hiring spans United States, Canada, Switzerland, Serbia, and Germany.
Inworld AI's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.