cv
This is a description of the page. You can modify it in '_pages/cv.md'. You can also change or remove the top pdf download button.
Work
-
2023.09 - 2024.07 Research Intern
AntGroup AI Infrastructure Group
Build efficient training system over heterogeneous GPUs.
- (co-)Design quick checkpoint scheme for Large Language Models training.
- Optimize distributed parallelism for Parameter Efficient Fine-tuning (PEFT).
Education
Projects
- 2023.03 - 2024.07
DLRover
DLRover is an automatic system aiming to train large AI models easy, stable, fast and green.
- Deep Learning
- AutoML
- Distributed Computing