Zyte operates a dual-revenue model: a self-serve API for automated web data extraction and a managed services team handling custom scraping work. The tech stack reveals a heavy ML orientation—PyTorch, TensorFlow, scikit-learn, plus OpenAI and Anthropic integrations—paired with distributed systems infrastructure (Kubernetes, Kafka, Erlang), suggesting the API is shifting toward AI-driven content parsing and validation rather than pure pattern matching. Active projects center on a multi-tenant model registry, GenAI validation pipelines, and core platform scaling to handle high-throughput extraction workloads.
Notable leadership hires: Platform Team Lead, MLOps Team Lead, ML Ops Lead, Head of Product
Zyte provides web scraping solutions for data-driven organizations, operating from Cork, Ireland with 201–500 employees distributed across 15 countries. The company offers two primary products: Zyte API, a self-serve platform for automated web data collection and extraction, and Zyte Data, a white-glove managed services offering for complex custom extraction projects. The company maintains Scrapy, an open-source web scraping framework, and has built an in-house legal function focused on web data extraction compliance. Engineering dominates the organization, with significant investment in data infrastructure, reflecting the technical complexity of building IP-aware extraction systems that operate at scale.
Zyte's stack includes Python, PyTorch, TensorFlow, Kubernetes, Apache Kafka, Go, Rust, Java, and integrations with OpenAI and Anthropic for AI-powered extraction and validation.
Zyte is headquartered in Ballincollig, Cork, Ireland. The company employs 201–500 people across Argentina, Hungary, Portugal, Brazil, Romania, Croatia, Uruguay, Poland, Slovenia, Spain, India, Canada, UK, Ireland, and the US.
Other companies in the same industry, closest in size