Abaka AI provides high-quality training data and annotation services for advanced AI systems, with deep technical focus on computer vision, video understanding, and 3D/4D datasets. The stack reveals a research-forward organization — heavy use of PyTorch, CLIP, Diffusers, plus specialized 3D tools (COLMAP, Open3D, Blender) — and active projects around NeRF, Gaussian splatting, and video-language model evaluation signal heavy investment in generative and spatial reasoning tasks rather than commodity labeling.
Abaka AI, founded in 2019 and based in Palo Alto, delivers training datasets and data annotation services for frontier AI companies building generative, multimodal, and autonomous systems. The platform spans the full data lifecycle: dataset sourcing, cleaning, collection, annotation, and model evaluation, plus proprietary annotation tooling. The company operates with a research-heavy team structure and active technical projects in scalable pipelines, 3D dataset design, and performance optimization, indicating product development alongside services delivery.
Core: PyTorch, CLIP, Diffusers. Specialized tools: Blender, COLMAP, Open3D for 3D/4D work. Collaboration: Slack, Discord. Creative/design: Figma, Photoshop, Canva. Development: Python, Cursor.
Scalable multimodal training pipelines, 3D/4D dataset design, video-language model evaluation, NeRF and Gaussian splatting research, automated data discovery, and benchmark standardization for spatial reasoning tasks.
Other companies in the same industry, closest in size