Sr. Data Engineer
Job Description:
Work with many geospatial data sets, specifically road networks, buildings, POI and address data
Implement distributed pipelines using Airflow and Spark to process geospatial data
Integrate third-party data sources from different geographic areas into the basemap
Interface with engineers from other teams to analyze their needs for geospatial data and solve their data problems
Implement automated quality metrics to ensure we are continuously delivering high-quality data to our customers
Participating in design and code reviews
Mentor other software developers to develop all aspects of their engineering skill sets, run point on projects
Create new data products by aggregating proprietary sources and derived data from sensors and aerial imagery
Required Skills and Experience
7+ years of professional SDLC experience
Bachelor's degree or higher - in Computer Science or related field
Experience with AWS or another cloud provider.
Proficiency in at least one modern programming language (Python, Scala, Java, JavaScript, …) suitable for data processing
Proficiency in a query language like SQL
Strong experience with data processing and developed judgment to implement new data pipelines and develop best practices around it.
Familiarity working with Spark or other Hadoop based technologies
Familiarity with CI/CD processes
Familiarity handling processing and normalizing many different datasets into a single coherent product.
Ability to communicate complex concepts to both peers and leadership. Strong verbal and written communication skills.
Experience with introducing quality and operational metrics into a data ETL pipeline.
High-performing team player that can create consensus.
Deliver key results quickly and resolve ambiguity in the customer's favor.
Ability and willingness pivot to new languages, skills, techniques quickly.