Sorry, this listing is no longer accepting applications. Don’t worry, we have more awesome opportunities and internships for you.

Senior Data Engineer, Geodata

Strategi.biz

Senior Data Engineer, Geodata

San Francisco, CA
Full Time
Paid
  • Responsibilities

    Sr. Data Engineer

    Job Description:

    • Work with many geospatial data sets, specifically road networks, buildings, POI and address data

    • Implement distributed pipelines using Airflow and Spark to process geospatial data

    • Integrate third-party data sources from different geographic areas into the basemap

    • Interface with engineers from other teams to analyze their needs for geospatial data and solve their data problems

    • Implement automated quality metrics to ensure we are continuously delivering high-quality data to our customers

    • Participating in design and code reviews

    • Mentor other software developers to develop all aspects of their engineering skill sets, run point on projects

    • Create new data products by aggregating proprietary sources and derived data from sensors and aerial imagery

    Required Skills and Experience

    • 7+ years of professional SDLC experience

    • Bachelor's degree or higher - in Computer Science or related field

    • Experience with AWS or another cloud provider.

    • Proficiency in at least one modern programming language (Python, Scala, Java, JavaScript, …) suitable for data processing

    • Proficiency in a query language like SQL

    • Strong experience with data processing and developed judgment to implement new data pipelines and develop best practices around it.

    • Familiarity working with Spark or other Hadoop based technologies

    • Familiarity with CI/CD processes

    • Familiarity handling processing and normalizing many different datasets into a single coherent product.

    • Ability to communicate complex concepts to both peers and leadership. Strong verbal and written communication skills.

    • Experience with introducing quality and operational metrics into a data ETL pipeline.

    • High-performing team player that can create consensus.

    • Deliver key results quickly and resolve ambiguity in the customer's favor.

    • Ability and willingness pivot to new languages, skills, techniques quickly.