Multimodal AI platform spanning vision, language, and generative models
SenseTime is a public AI software company building generative, vision, and decision-intelligence models at scale. The tech stack reflects a mature ML infrastructure operation: TensorFlow, PyTorch, MLflow, and Langchain for model development; Docker and Kubernetes for orchestration; and NVIDIA hardware (including Jetson edge devices) for inference. Active projects range from pretraining large language models to embedded systems for robotics, signaling diversified commercialization across both cloud and edge deployments. Hiring has decelerated but remains concentrated in engineering and product roles, with a notable pipeline of interns—typical for research-led AI orgs scaling from academic roots.
Founded in 2014 and publicly traded, SenseTime operates as an AI software platform company with 5,001–10,000 employees headquartered in Hong Kong. The company's technical focus spans multimodal model research (vision, natural language processing, decision intelligence) and infrastructure work in AI chips, sensors, and computing systems. Projects under way include medical AI databases, quadruped robot embedded integration, and large language model pretraining. The organization sells into enterprise and automotive verticals; they also maintain standards-development and governance activities in AI ethics. Sales and marketing functions exist but are smaller than engineering—consistent with a research-first business model.
TensorFlow, PyTorch, and MLflow for model training and lifecycle management; Langchain and RAG (Retrieval-Augmented Generation) for language applications. Infrastructure runs on Docker, Kubernetes, and NVIDIA GPUs including Jetson edge devices.
Active projects include pretraining large language models, medical AI product databases, quadruped robot embedded system integration, deep learning software stack optimization, and machine learning development ecosystems.
Other companies in the same industry, closest in size