•Worked in data platform team and collaborated with the international growth team, focusing on KPIs such as user retention rate and user conversion rate for new features to ensure a high growth rate of TikTok.
• Built and maintained a terabyte-scale user data analytics workflow with SparkSQL, collaborating with machine learning engineer, maintaining Hive tables and supporting a streaming data pipeline for offline deep learning model training.
• Developed Python programs for automatically generating SQL to improve the efficiency of data team .
• Responsible for Spark tuning, improved computing efficiency by setting appropriate cluster parallelism, optimizing memory usage, adjusting the logic in data pipelines and eliminating data skew.