Data annotation and engineering services for generative AI model development
Innodata is a data engineering and annotation company serving AI builders at scale. The tech stack reveals a services operation: annotation tooling (Labelbox, CVAT, Prodigy), cloud infrastructure (Docker, Kubernetes, SageMaker Ground Truth), and sales/marketing automation (Salesforce, Pardot). With 577 data professionals, 287 roles posted in the last 30 days, and active hiring across 25+ countries, the company is visibly scaling production capacity for LLM training and evaluation work — specifically linguistic annotation, chain-of-thought data creation, and bias/accuracy remediation.
Notable leadership hires: Sales Director
Innodata is a publicly traded (NASDAQ: INOD) data engineering and AI services company founded in 1988 and headquartered in New Jersey. The company provides data annotation, data extraction, and data engineering services to technology companies and enterprises building generative and traditional AI systems. Core offerings include image and video annotation, linguistic data curation, LLM training data preparation, and content review for AI response quality. The workforce of 5,000–10,000 is distributed globally and heavily weighted toward mid-level data annotators and junior operators, reflecting a labor-intensive, geographically distributed production model.
Innodata has active hiring across 25+ countries: United States, China, Canada, Spain, Brazil, Israel, Bangladesh, India, Philippines, Indonesia, Japan, Mexico, Thailand, Italy, South Korea, Ireland, France, Saudi Arabia, Uzbekistan, Qatar, Germany, Tunisia, Estonia, Colombia, and Armenia.
The company's annotation stack includes Labelbox, CVAT, and Prodigy for data labeling; Amazon SageMaker Ground Truth for training data; and cloud infrastructure (Docker, Kubernetes) for deployment and scaling.
Other companies in the same industry, closest in size