中数科技有限公司 Tech Stack

Data middleware platform for supply-chain analytics and web data collection

Wholesale · Chongqing, Chongqing · ~4 employees

中数科技 operates a data middleware stack built on Java, Kafka, Spark, Flink, and ClickHouse, all standard components for ETL and real-time streaming. The technology choices point to a company focused on high-volume, multi-source data pipelines: Hadoop/HDFS for storage, Hudi for incremental updates, SeaTunnel for data movement, and web-scraping tools (Scrapy, Pyspider, Nutch) for ingestion. Active projects center on data collection, distributed crawling, and a middle-platform architecture. The listed challenges include ETL performance, crawler efficiency, and handling concurrent load, which are typical friction points for early-stage data infrastructure companies.

Tech Stack 26 technologies

Core Stack: Java, Linux, Hadoop, Kafka, Apache Spark, Apache Flink, ClickHouse, Python, Spring Cloud, Dubbo, Yarn, HBase, Hudi, SeaTunnel, DolphinScheduler, HDFS, MapReduce, Hive, Storm, StarRocks, Scrapy, Pyspider, Nutch, Jsoup, Excel, Axure RP

What 中数科技有限公司 Is Building

◆Challenges

Optimizing ETL performance
Quality issues
Project cost control
Improving crawler efficiency
Contract compliance
Low user engagement
Developing promotion channels
Handling high concurrency
High availability
Litigation support

▲Active Projects

Data middle platform development
Data collection
Distributed crawler and scheduler system
Data quality module design
Platform product market expansion
Evaluate business project economics
Marketing channel promotion
Supplier related system
Supply chain system
On-site password assessment

Hiring Activity

Minimal · 45 roles · 0 in 30d

Department: Product, Engineering, Marketing, Sales, Finance, Data, Construction, Executive

Seniority: Mid, Senior, Manager, Director, Junior, Intern

Notable leadership hires: Marketing Director, Sales Director

About 中数科技有限公司

中数科技 is a China-based data platform company building infrastructure for data collection, transformation, and supply-chain analytics. The product stack spans web data acquisition (distributed crawlers and schedulers), ETL pipelines (SeaTunnel, DolphinScheduler), and analytics storage (ClickHouse, StarRocks). The team is small (~4 core) but maintains a broad product and engineering focus across data, supply systems, and market expansion. Current operational challenges include optimizing pipeline performance, maintaining data quality at scale, and expanding adoption beyond initial customer segments.

Headquarters: Chongqing, Chongqing

Company Size: ~4 employees

Hiring Markets: China

Frequently Asked Questions

What technology does 中数科技 use for data processing?

中数科技 uses Java, Spring Cloud, Kafka, Apache Spark, Flink, HBase, and ClickHouse for streaming and batch analytics. Web data collection runs on Scrapy, Pyspider, and Nutch, with DolphinScheduler handling orchestration.

What is 中数科技 working on?

中数科技 is working on data middleware platform development, distributed crawler and scheduler systems, data quality modules, supply-chain systems, and market expansion. Focus areas include ETL performance optimization and high-concurrency handling.