Veridion Tech Stack

Private-company data platform covering 130M+ businesses with ML-driven enrichment

Information Services · Bucharest · 11–50 employees · Founded 2019 · Privately Held

Veridion operates a data enrichment platform focused on private-company intelligence, built on a distributed architecture (Spark, Cassandra, Kafka, Kubernetes) designed to handle petabyte-scale ingestion and processing. The tech stack and project list point to an organization scaling data extraction and normalization at the infrastructure level (web scraping via Puppeteer/Playwright, NLP model training, and API resilience) while working to close coverage gaps and extract data from the unstructured web. The balanced mix of engineering and data hiring (11 of 17 roles) suggests the company treats data pipeline maturity as a core competitive moat.

Tech Stack (25 technologies)

Core Stack: Apache Spark, Cassandra, Elasticsearch, PostgreSQL, Node.js, TypeScript, Scala, Playwright, Kubernetes, Kafka, RabbitMQ, Python, Java, AWS, Jenkins, Apache Cassandra, HDFS, Puppeteer, Excel, SQL, R, GCP, Azure, Bitbucket Pipelines, Travis CI

What Veridion Is Building

Challenges

Scaling data processing
Managing petabytes of data
Scaling client-facing APIs
Scaling distributed data systems
Enforcing engineering best practices
Coverage gaps in data extraction
Data fragmentation
Scaling data ingestion
Scaling data extraction from unstructured web

Active Projects

Company events planning
Office operations management
PoC development
NLP model development and deployment
Single source of truth data platform
Data extraction and processing mechanisms
Normalizing data
Scaling client-facing APIs
Building clean, distributed, and resilient services
Single source of truth product

Hiring Activity

Steady · 15 roles · 6 in 30d

Department: Engineering, Data, Sales

Seniority: Mid, Junior, Senior, Intern

About Veridion

Veridion is a Romanian data intelligence company founded in 2019, providing business enrichment datasets covering over 130 million private companies. The platform serves procurement, insurance, underwriting, and market intelligence teams with real-time classification and supplier-sourcing data. Built on Apache Spark, Cassandra, and Kafka, the product ingests and normalizes company data from web sources using NLP and machine learning, then exposes it via client-facing APIs. The team operates from Bucharest with 11–50 employees, hiring primarily in Romania.

Headquarters: Bucharest

Company Size: 11–50 employees

Founded: 2019

Hiring Markets: Romania

Frequently Asked Questions

What tech stack does Veridion use?

Veridion's core stack includes Apache Spark for distributed data processing, Cassandra for distributed storage, Kafka for streaming, Kubernetes for orchestration, Elasticsearch for search, PostgreSQL for transactional data, Python, Scala, and Java for processing code, and Puppeteer/Playwright for web extraction.

What data sources does Veridion cover?

Veridion covers 130M+ private companies globally with enrichment data. The platform extracts and normalizes company information from unstructured web sources using NLP and machine learning to support procurement, underwriting, and market intelligence use cases.