OSINT and public intelligence platform powered by ML and vector search
Tadaweb operates an open-source intelligence (OSINT) and web intelligence platform that draws on publicly available information and is built on Go, Python, and a multi-cloud stack (AWS, Azure, GCP). Active projects center on RAG, vector search, and semantic indexing, layered on PostgreSQL, MongoDB, BigQuery, and Spanner, which indicates a shift toward AI-driven retrieval and relevance ranking rather than raw data collection alone. Security and engineering roles dominate the hiring mix, and the company acknowledges pain points around GDPR compliance, latency, and cost optimization, which suggests a mature platform facing operational complexity at scale.
Tadaweb delivers an OSINT and web intelligence platform designed for corporate investigations, risk management, and intelligence workflows. Founded in 2011 and based in Luxembourg, the company combines public-source data collection with machine learning and search to surface actionable intelligence. The product stack spans data ingestion (Kafka, Airflow), storage (PostgreSQL, MongoDB, BigQuery, Spanner), ML (TensorFlow, PyTorch, scikit-learn), and retrieval (vector search via pgvector, Azure Cognitive Search). Current work includes a brand-new integrated platform, medallion architecture implementation, and end-to-end ML pipeline buildout. The 51–200-person organization operates primarily across the UK, US, Luxembourg, and Germany.
The technology stack includes Go, Python, TypeScript, PostgreSQL, MongoDB, BigQuery, Spanner, pgvector, Kafka, Apache Airflow, TensorFlow, and PyTorch, with cloud deployment on AWS, Azure, and GCP.
Active projects include RAG systems, a vector search strategy, a semantic search platform, OSINT collection improvements, end-to-end ML pipelines, a medallion data architecture, and a brand-new integrated platform that consolidates multiple services.