Principal Data Engineer to support the Products Data Exchange (PDE) initiative—part of a strategic program focused on modernizing our data landscape to enable global reporting, AI/ML capabilities, and real-time analytics. This role is responsible for designing, implementing, and scaling cloud-based data solutions, with deep expertise in Azure, AWS, and SAP environments.
Key Responsibilities:
- Architect and deliver scalable data platforms and pipelines
- Translate business needs into production-ready data solutions
- Lead ELT development, data modeling, and ingestion
- Drive stakeholder engagement and provide technical mentorship
- Ensure best practices in DevOps, change management, and documentation
- Contribute to broader PDS 2030 digital and analytics vision
Mandatory Skills:
- Azure, AWS, SAP (expert level)
- ELT development, data modeling, data integration (expert level)
- Databricks, Azure Data Factory, Synapse, SQL DB, Redshift, Glue, Stream Analytics, Airflow, Kinesis
- GitHub, GitHub Actions, Azure DevOps, SonarQube, PyTest
- Python (strong development background)
Preferred Skills:
- Experience leading scrum teams or managing small technical teams
- Exposure to planning tools (e.g., BPC) or documentation tools (e.g., MKDocs)
- Familiarity with scientific computing, seismic, or subsurface data
- Energy, oil and gas, or trading domain experience
Project Background:
Products Data Exchange (PDE) is a centralized cloud repository for harmonized data supporting our Downstream and Trading organizations. It enables real-time access to curated datasets and supports global intra-day reporting and analytics aligned to our PDS 2030 Vision.