on the Path to AGI

Qinlong works on AI infrastructure at Bytedance Seed, focusing on robust and elastic distributed system for data processing, training and inference. At Bytedance, he contributes to the Robust LLM Training Infrastructure, especially in NCCL hang & straggler detection. He was also responsible for the data and RL training engineering in Seed3D. Before joining Bytedance, he initiated the DLRover, a open source project to stabilize the LLM training and was the core contributor of ElasticDL at AntGroup.

Open Source Softwares (Python, Golang, C++)

DLRover, An Automatic Distributed Deep Learning System, #1 contributor, 1251 commits
ElasticDL, A Kubernetes-native Deep Learning Framework, #1 contributor, 320 commits