University of Electronic Science and Technology of China
Work Experience
x
xDan AI
AI Data Engineer Python
July 2023 - present
company
xDan AI
title
AI Data Engineer Python
overview
- Primarily responsible for AI Data Synthesis and Alignment Training Algorithms, achieving automated data cleaning
- Developed and implemented reusable ETL pipelines for processing large structured and unstructured datasets
- Utilized the xDAN-Distilabel framework for question generation, enhancing High-Quality Generalization, diversity, and complexity from seed questions. Designed and implemented an Automated Model Evaluation System
- Built an Automated AI Data Processing and quality assessment method using xDAN LLM models for handling noisy data
- Managed and visualized large language model datasets using NOMIC on Hugging Face
C
CGSP Georgetown University
Data Analyst AND Visualization Specialist Python AWS