Data Scientist II – Model Validation and Monitoring

Vytwo

Data Scientist II – Model Validation and Monitoring

Prosper, TX
Full Time
Paid
  • Responsibilities

    Role : Data Scientist II – Model Validation and Monitoring

    Location: Scottsdale AZ (Onsite)

    *US Citizen & GC Only

    **Must be legally authorized to work in US without need for employer sponsorship now or at any time in the future.

    Overall, Purpose

    This position serves as a data science team member in the Model Validation and Monitoring Team delivering leading edge machine learning models to our clients. This includes providing effective challenges to model development, conduct model monitoring and performance tracking, provide root cause analysis of model performance, exploring, building, validating, and deploying models.

    Essential Functions

    Lead model monitoring activities, including tracking performance metrics, detecting model and data drift, identifying data quality issues, providing root cause analysis, and recommending remediation strategies.

    Conduct rigorous model validation by providing effective challenges during model development phases, including performance testing, benchmarking, provide remediation plan, and documentation to ensure models meet business, technical, and regulatory standards.

    Explore and aggregate data independently to uncover data anomalies that impact algorithm performance

    Write production level code in a dynamic, start-up environment

    Solve complex problems using terabyte size data sets

    Apply of a variety of machine learning techniques to a business problem to arrive at optimal approach

    Partner with Product and Engineering teams to solve problems and identify trends and opportunities

    Explain and visualize results and algorithm performance to non-technical audiences

    Minimum Qualifications

    A minimum of 2 years of data science, engineering, mathematics, or related work experience is required.

    Experience developing data science pipelines & workflows in Python, R or equivalent programming language. Experience in writing and tuning SQL. Experience handling terabyte size datasets with Spark language.

    Experience applying various machine learning techniques, and understanding the key parameters that affect model performance

    Experience using ML libraries, such as scikit-learn, mllib, etc.

    Experience using data visualization tools

    Able to write production level code, which is well-written and explainable

    Ability to effectively communicate findings from complex analyses to non-technical audiences.

    Preferred Qualifications

    Experience of using advanced ML algorithms building, testing, and deploying fraud models.

    Hands-on experience with PySpark

    Industry experience in building or validating machine learning models

    Experience exploring data and finding hidden patterns and data anomalies