Data Scientist, 4+ yrs: Python/R/SQL,end-to-end ML, NLP & CV; Spark, Airflow; AWS/Azure/GCP; MLflow MLOps; dashboards; A/B tests & stats; Agile team player AI
Location
Denton, TX
Education
U
University of North Texas
August 2023 - May 2025
degree
Master's
major
Computer Science
Work Experience
c
codelanceit
data scientist
Dallas, TX, United States, 75398
January 2025 - present
company
codelanceit
title
data scientist
overview
Built LightGBM-based churn prediction models with SHAP explainability, integrated with CRM to drive retention campaigns.
Developed BERT-based NLP pipelines to extract intent, sentiment, and key entities from unstructured support logs.
Deployed Prophet and ARIMA time-series models on AWS Lambda for forecasting key KPIs like default rates.
Orchestrated model training and deployment pipelines with GitHub Actions, MLflow, and Docker containers.
Built a distributed ETL pipeline using PySpark on Databricks, processing 1TB+ datasets daily from financial systems.
Created Streamlit apps for internal teams to interact with models and visualize scenario-based predictions.
Led containerization of AI services with Docker, deployed and monitored on Kubernetes clusters.
Implemented model monitoring dashboards using Prometheus + Grafana for drift and accuracy alerts.
Integrated model predictions into Power BI dashboards using Python and REST APIs for real-time decision-making.
Used TensorFlow with mixed precision and data augmentation to improve training times for fraud detection use case.
Designed and conducted A/B experiments to evaluate the business impact of ML-driven personalization strategies.
Collaborated cross-functionally with data engineers and business leaders to align technical solutions with use cases.
Conducted model retraining automation with DVC versioning and metadata tracking for audit trails.
Delivered quarterly stakeholder demos, showcasing model impact on KPIs like NPS, conversion, and CLTV.
A
Accenture
ASE
Bangalore, INDIA, IN
February 2022 - August 2023
I
ICICI Bank
data analyst
January 2021 - February 2022
Passion
Passionate about transforming data into actionable insights, building ethical and scalable AI/ML solutions, and driving innovation through data storytelling, automation, and continuous learning in real-world applications.
Skills
Languages
EnglishSpanish
Skills
AirflowAlteryxAmazon DynamoDBAmazon Elastic Compute CloudAmazon RedshiftAmazon S3Amazon Web ServicesAnalytical ThinkingApache HadoopApache HiveApache SparkAttention to DetailAutomationAWS GlueAzure Data FactoryBash ShellBig DataBusiness IntelligenceCassandraCloud ComputingCommunication SkillsConsumer BehaviorCoordination SkillsCustomer ExperienceCustomer Relationship ManagementDashboardsData AnalysisDatabasesData CleansingData GovernanceData LakesData ModelingData PipelinesData QualityData StreamingData VisualizationData WarehousingDecision Making SkillsDialectical Behavior TherapyDockerE-CommerceExtract Transform Load (ETL)GitGovernanceHadoop Distributed File SystemHealth Insurance Portability and Accountability Act ComplianceInformation EngineeringInformation TechnologyKnowledge of CampaignsKnowledge of Purchasing ProcessesKnowledge of StatisticsLooker AnalyticsMarketingMatplotlibMicrosoft AzureMicrosoft ExcelMicrosoft OfficeMicrosoft PowerPointMicrosoft SQL ServerMongoDBMySQLNormalization ProcessesNumPyPandasPivot TablesPostgreSQLPower BIPredictive Data AnalysisProblem SolvingProgramming LanguagesPython (Programming Language)RapidMinerR (Programming Language)SnowflakeSoftware Version ControlSPSS (Software)SQL DatabasesSQL Server Integration ServicesSQL Server Reporting ServicesStakeholder ManagementStock ControlStreamlineTableau (Software)TalendTeam WorkingVba Programming LanguageVisualizationWorkflows