echoloc

Enflame Tech Stack

AI chip design and software stack for data center inference

Semiconductor Manufacturing 上海市, 上海市 501–1,000 employees Founded 2018 Partnership

Enflame designs hardware and software for AI inference at scale, built on a deep compiler and systems stack (LLVM, CUDA, TensorFlow, PyTorch, NCCL, DeepSpeed). The project list—chip interconnect, quantization tools, operator development, and chiplet architecture—reveals a vertically integrated play from silicon through ML framework adaptation. Heavy senior engineering hiring (20 of 28 roles) against active work on bandwidth efficiency and power consumption points to a company solving hard performance and cost problems in domestic AI infrastructure.

Tech Stack 46 technologies

Core StackLinux C++ Python TensorFlow PyTorch LLVM GCC C/C++ CUDA RISC-V OpenCL OpenMP clang-tidy CDC C Caffe MXNet PaddlePaddle RDMA TensorRT MPI Open MPI NCCL DeepSpeed VTune Bash DPDK SPDK PCIe Intel+16 more

What Enflame Is Building

Challenges

  • Ai model sparsification design
  • Improving bandwidth efficiency
  • Optimizing inference performance on domestic ai chips
  • Next-gen ai chip design
  • Balancing business and r&d priorities
  • Ai model performance optimization
  • Performance bottlenecks
  • Ai system competitiveness
  • Reducing power consumption
  • Aligning market demands with chip development

Active Projects

  • Ai chip interconnect design
  • Build low-bit hardware-aware quantization tools
  • Ai model sparsification design
  • Quantization operator development
  • Ai operator development on custom ai chip
  • D2d interconnect architecture design
  • Develop day-0 adaptation toolchain for gcu
  • Pd separation architecture implementation on gcu
  • Distributed communication component development
  • Multi-die/chiplet architecture for ai server chip

Hiring Activity

Accelerating30 roles · 25 in 30d

Department

Engineering
22
Ops
1
Product
1
Security
1

Seniority

Senior
20
Mid
3
C-Level
2
Company intelligence

Find more companies like Enflame by tech stack, pain points and active projects

Get started free

About Enflame

Enflame, founded in 2018 and based in Shanghai, builds AI chips and software systems for data center workloads. The company operates across the full stack: hardware design (chiplet and interconnect architecture), compiler tooling (day-0 adaptation for custom AI processors), and runtime optimization (quantization, sparsification, operator kernels). Active development spans distributed communication, inference optimization, and tool chains for their custom GPU-class processors. The organization is engineering-concentrated, with R&D leadership drawn from semiconductor and systems backgrounds.

Headquarters上海市, 上海市
Company Size501–1,000 employees
Founded2018
Hiring MarketsChina

Frequently Asked Questions

What tech stack does Enflame use?

Core stack includes LLVM, GCC, CUDA, RISC-V, PyTorch, TensorFlow, Caffe, MXNet, PaddlePaddle, TensorRT, NCCL, DeepSpeed, MPI, and Open MPI for compiler, ML frameworks, and distributed training/inference.

What is Enflame working on?

Active projects include AI chip interconnect and chiplet architecture design, low-bit quantization and sparsification tools, custom AI operator development, and day-0 software adaptation toolchains for their processors.

Similar Companies in Semiconductor Manufacturing

Other companies in the same industry, closest in size