Qinlong Wang

Qinlong works on AI infrastructure at Bytedance Seed, focusing on robust and elastic distributed system for data processing, training and inference. At Bytedance, he contributes to the Robust LLM Training Infrastructure, especially in NCCL hang & straggler detection. He was also responsible for the data and RL training engineering in Seed3D. Before joining Bytedance, he initiated the DLRover, a open source project to stabilize the LLM training and was the core contributor of ElasticDL at AntGroup.

Open Source Softwares (Python, Golang, C++)

Papers

Tech Reports

Blogs