Sesame is building custom embedded operating systems and firmware to run large language models on consumer wearables in real time. The stack—Linux, RTOS (Zephyr, FreeRTOS, QNX), C/C++, PyTorch, vLLM, SGLang—reflects a hardware-software integration challenge: they're not just training models but optimizing them for power, latency, and memory on constrained devices. Engineering dominates hiring (40 of 71 roles), with 50 senior-level positions, signaling deep technical depth required to bridge firmware, AI inference, and manufacturing at scale.
Sesame designs and manufactures AI-enabled wearable consumer products, founded in 2023 and headquartered in San Francisco. The company operates across the full hardware stack: custom embedded operating systems, real-time firmware (including audio capture and rendering), LLM inference serving (via vLLM and SGLang), and end-to-end manufacturing. Active projects span firmware development workflows, test fixture design, and scaling LLM training and inference on edge devices. Hiring is accelerating across engineering and manufacturing, with recent expansion into Taiwan, indicating volume production ramp.
Custom embedded RTOS platforms built on Zephyr, FreeRTOS, and QNX, alongside Linux and AOSP for AI inference layers.
PyTorch for training, vLLM and SGLang for LLM inference serving, with Ray for distributed workloads.
Other companies in the same industry, closest in size