Speak builds conversational AI tutors using OpenAI, with a tech stack anchored on PyTorch, RAG, BigQuery, and Snowflake for LLM training and data pipelines. The company is actively scaling across multiple markets and languages while managing core challenges around LLM output quality, content hygiene, and platform reliability. Hiring velocity is accelerating with engineering and marketing leading growth — a pattern consistent with simultaneous product expansion (new markets, B2B entry, app redesign) and distribution challenges in competitive language-learning segments.
Notable leadership hires: Head of Growth, Head of Marketing
Speak operates an AI language tutoring application that emphasizes conversational practice over traditional lesson structures. The product serves learners across 15+ languages with a mobile-first experience (iOS/Android via Swift, SwiftUI, RxJava). The company achieved market leadership in South Korea since its 2019 launch and now operates across multiple geographies with offices in San Francisco, Seoul, Tokyo, Taipei, and Ljubljana. Revenue operations are supported by HubSpot, Amplitude for analytics, and Looker for reporting. The business model spans consumer (B2C) and education institution (B2B) channels, with recent focus on B2B expansion and market entry into Japan.
Speak uses OpenAI models as the core LLM, supplemented with RAG (retrieval-augmented generation) and PyTorch for training and fine-tuning custom language learning models.
Speak's backend runs on Python, PostgreSQL, BigQuery, Snowflake, and Redshift for data infrastructure; mobile apps use Swift/SwiftUI (iOS) and RxJava (Android); frontend uses React and TypeScript; deployment via Jenkins and Bitrise CI/CD.
Speak's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.