- Incorporated raw profile and ads texts and retained key tabular features, reducing derived features and manual engineering.Docker, Git, Spark, Generative AI, LLM, Hadoop, Scala, Python, Airflow Sunnyvale, USA
- Designed trait-based prompts for LLM to detect fraud ads accounts, increasing label coverage by 20% at 80% precision
- Trained XGBoost model by adding GPT-generated labels with system labels, improving precision by 30% over current system
- Built DSPy extensions and adapters, fully automating LLM prompt generation and optimization without manual prompting
- Built an OpenConnect pipeline with Docker to automate data processing, LLM labeling, evaluation and review workflows
B
Bosch Research
Software Engineer Machine Learning Intern
Shanghai, CN
February 2023 - August 2023
Software Engineer Intern
February 2023 - May 2023
M
Mitacs Globalink
Research Internship
July 2022 - February 2023
Skills
AirflowAnomaly DetectionApache HadoopApache SparkAutomationCachingCombinatoricsComputer VisionComputing PlatformsContinuous IntegrationDatabase AdministrationData ProcessingDeep LearningDjango Web FrameworkDockerE2e TestingElk StackGenerative AIGitGitlab-ciGoogle MapsGPTInstant Messaging TechnologyKnowledge of EngineeringKnowledge of Packaging and ProcessingKubernetesLabelingLarge Language ModelsLinear ProgrammingLinuxLog AnalysisMachine LearningMagnetic Resonance ImagingMicroservicesMicrosoft AzureModel View Controller (MVC)MySQLNode.JsOperations ResearchPython (Programming Language)Recommender SystemsRedisReinforcement LearningRestful APIsSelenium WebdriverSpring-bootStrategic ThinkingTeam WorkingTransfer LearningVue.jsWeb ApplicationsWeb DevelopmentWeb Performance OptimizationWorkflowsXgboost