Modash operates a creator discovery and search platform indexing 400M+ influencer profiles, built on a data-heavy stack (Elasticsearch, Milvus, PySpark, Airflow, SageMaker) that signals serious ML and vector-search capabilities. Active hiring skews 10 of 18 roles toward data and senior-level positions, reflecting internal focus on scaling search latency, embedding generation, and turning unstructured social data into structured insights—all documented pain points. The platform integrates deeply with Shopify and has shifted from discovery toward API-first positioning and pricing expansion.
Modash provides an end-to-end influencer marketing platform for Shopify brands, combining creator search, discovery, and campaign management. The core product indexes over 400 million influencer profiles and enables brands to search, filter, and manage creator partnerships at scale. Internally, the company operates a substantial data and ML infrastructure built on AWS (Kinesis, Lambda, EMR, Glue, Athena, SageMaker), vector databases (Milvus), and processing pipelines (PySpark, Airflow) to extract collaborations from raw social data and generate multi-modal embeddings. Founded in 2018 and based in Tallinn, Estonia, the company employs 51–200 people and is actively expanding engineering, data, and sales capacity across Estonia, Portugal, Canada, Spain, Romania, and the UK.
Frontend: Vue and Pinia. Backend: Node.js and TypeScript on AWS (Lambda, ECS, Kinesis). Data: PySpark, Apache Airflow, Elasticsearch, Milvus, SageMaker, Glue, Athena, EMR. Storage: MongoDB, DynamoDB, PostgreSQL, Aurora, Apache Iceberg. Also uses Shopify and GCP.
Creator search engine indexing 400M+ profiles, multi-modal embeddings from images/video/text/audio, AI-assisted search with LLMs, low-latency indexing and querying pipelines, collaboration extraction from social data, API-first customer journey, and org scaling toward €30M ARR.
Modash's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.