AI chip design and software stack for data center inference
Enflame designs hardware and software for AI inference at scale, built on a deep compiler and systems stack (LLVM, CUDA, TensorFlow, PyTorch, NCCL, DeepSpeed). The project list—chip interconnect, quantization tools, operator development, and chiplet architecture—reveals a vertically integrated play from silicon through ML framework adaptation. Heavy senior engineering hiring (20 of 28 open roles), combined with active work on bandwidth efficiency and power consumption, points to a company solving hard performance and cost problems in domestic AI infrastructure.
Enflame, founded in 2018 and based in Shanghai, builds AI chips and software systems for data center workloads. The company operates across the full stack: hardware design (chiplet and interconnect architecture), compiler tooling (day-0 adaptation for custom AI processors), and runtime optimization (quantization, sparsification, operator kernels). Active development spans distributed communication, inference optimization, and toolchains for their custom GPU-class processors. The organization is engineering-concentrated, with R&D leadership drawn from semiconductor and systems backgrounds.
Core stack includes LLVM, GCC, CUDA, RISC-V, PyTorch, TensorFlow, Caffe, MXNet, PaddlePaddle, TensorRT, NCCL, DeepSpeed, MPI, and Open MPI, spanning compilers, ML frameworks, and distributed training/inference.
Active projects include AI chip interconnect and chiplet architecture design, low-bit quantization and sparsification tools, custom AI operator development, and day-0 software adaptation toolchains for their processors.
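To make the low-bit quantization work concrete, here is a minimal sketch of symmetric per-tensor int8 post-training quantization, the standard technique behind such tools. The function names are illustrative, not Enflame APIs, and real toolchains add per-channel scales, calibration, and hardware-specific rounding.

```python
def quantize_int8(weights):
    """Map float weights to int8 using a single symmetric scale.

    The scale maps the largest-magnitude weight to 127, the int8 limit.
    (Illustrative sketch only, not a vendor API.)
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs else 1.0
    # Round to nearest integer and clamp to the int8 range.
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from int8 codes."""
    return [v * scale for v in q]


weights = [0.5, -1.27, 0.003, 1.27]
q, scale = quantize_int8(weights)
recon = dequantize(q, scale)
```

Per-tensor symmetric scaling like this halves memory traffic fourfold versus fp32, which is why it pairs naturally with the bandwidth-efficiency work mentioned above; reconstruction error is bounded by half a quantization step.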