Automation Engineer

Throtle

Red Bank, NJ
Full Time
Paid
  • Responsibilities

    Throtle, Inc. seeks an Automation Engineer in Red Bank, NJ to manage the building and maintenance of the company's proprietary identity graph. The role evaluates incoming data, such as MAIDs (mobile advertising IDs) and audience segment data (demographics, behavior, etc.), from dozens of sources (third-party data aggregators/verifiers, online sync partners), performs specialized data hygiene, and standardizes and reformats the data for consumption by multiple processes.

    The Automation Engineer will be part of the team responsible for transforming and maintaining the several billion records that make Throtle’s data onboarding solution work. You’ll need to be able to evaluate incoming data, perform specialized hygiene, and standardize it for consumption by multiple processes. You’ll be working in a fast-paced, high-volume processing environment where quality and attention to detail are paramount.

    PRIMARY RESPONSIBILITIES

    Link raw data sources for consistency and validity, analyze the results of processing, identify issues, and recommend actions such as adding, rejecting, merging, or quarantining data.

    Develop tools and use existing ones such as Spark/Scala, AWS Athena, and data hygiene tools to transform disparate data into a consistent format for data hygiene, scoring, and further processing.

    Troubleshoot and resolve big data issues in SQL Server, AWS S3, and Spark DataFrames; fine-tune queries; and design tables, views, and indexes to maintain and enforce data security policies.

    Participate in the analysis, design, development, and testing phases of the software development lifecycle (SDLC), and design, develop, and maintain the device identity graph using Scala and the Hadoop/Spark framework on AWS.

    Design and build utility programs for customer match tests and onboarding files, then implement customized automated tools for big data extraction and customized reporting using Scala and the Hadoop/Spark framework with AWS Lambda functions.

    Maintain program design documents and data infrastructure documentation, and participate in the testing process through unit, component, and integration testing and bug fixing.

    Research and propose new technologies, frameworks, and tools to facilitate big data processes, and create data extracts and customized reports.

    Be a key member of the team that builds and maintains our identity graph, which includes billions of data records.

    Design, build and maintain code that transforms the data into useful formats.

    Build logic that ensures the resultant data is accurate and consistently processed.

    Proactively monitor database systems ensuring availability and performance.

    Troubleshoot and resolve database issues.

    Participate in capacity monitoring and planning.

    Develop, maintain and enforce data security policies.

    Maintain data infrastructure and code documentation.

    Participate in the testing process through unit testing and bug fixes.

    KNOWLEDGE AND SKILL REQUIREMENTS

    Must have a Master's degree or foreign equivalent in Software Systems, Computer Science, Information Technology, or a related field, plus 24 months of experience as a Software Engineer in the analysis, design, development, and testing phases of the Software Development Life Cycle (SDLC).

    Experience with the design, development, and maintenance of a device identity graph using Scala and the Hadoop/Spark framework on AWS.

    Experience must include the use of Scala 2.10, Spark 2.10, Play, MS SQL Server, and AWS Lambda/S3/Athena/Glue.

    Submit resume to nanderson@throtle.io with “ML2019-EMPWS” in the subject line.