SRIBD is a government-backed research institute in Shenzhen specializing in large-language-model optimization and biomedical AI. The tech stack—PyTorch, CUDA, vLLM, TensorRT, MLIR, and NVIDIA GPUs—reflects a deep focus on inference acceleration and model compilation. Active pain points around KV-cache optimization, edge deployment, and low-precision quantization, paired with projects on LLM pretraining and single-cell foundation models, signal a research agenda bridging cloud-scale inference and resource-constrained deployment.
SRIBD was established in 2016 with government approval and operates as an independent research institution under Shenzhen's Peacock talent program. Located on the CUHK (Shenzhen) campus, the institute targets precision medicine, intelligent cities, and communications infrastructure. The team comprises doctoral-level researchers with overseas experience. Current work spans large-model pretraining and fine-tuning, single-cell AI recognition, and optimization for edge devices—supported by access to the National Supercomputing Center in Shenzhen. Hiring has accelerated, with internship-heavy staffing in research roles and emerging engineering capacity.
Primary tools: Python, PyTorch, CUDA, Linux. Model optimization: vLLM, TensorRT, MLIR, llama.cpp, ONNX Runtime, Triton. Graph databases: Neo4j, TigerGraph. Medical imaging: DICOM, NIfTI, SimpleITK. All work targets GPU-accelerated inference on NVIDIA hardware.
Focus areas: large-model pretraining and fine-tuning, single-cell foundation models, cell-image AI recognition, and biomedical database construction. Infrastructure: KV-cache optimization, scheduling and inference optimization for edge devices, and low-precision model compression for deployment.
Other companies in the same industry, closest in size