This is not a standard data engineer role.
We are looking for a deeply technical, hands-on individual contributor who can:
- Diagnose performance, latency, and cost issues in a large-scale cloud data platform
- Take a top-down, platform-level view across multiple projects
- Improve architecture, efficiency, and cost, not just write Spark code
- Act as a technical problem-solver and mentor, guiding other data engineers
This person is expected to make the platform better, not just execute tasks.
Current Platform & Architecture (Very Important)
Data Flow:
- On-premise systems → Cloud (Azure)
- Streaming ingestion → Azure Data Lake Storage (ADLS)
- Data processed into two separate containers:
  - Crude trading
  - Product trading
Technologies in Use:
- Qlik Replicate (formerly Attunity): streams data from on-prem to Azure
- Azure Data Lake Storage (ADLS)
- Databricks
- Delta Live Tables (DLT)
- Spark / PySpark
- Python
- SQL (complex queries and procedures)
Key Challenges the Role Is Meant to Solve
1. Data Latency
- High-volume streaming data
- End-to-end latency issues that need root-cause analysis
2. Databricks / DLT Cost Spikes
- DLT costs are far higher than expected
- Known contributors:
  - Very high data volume (expected)
  - Inefficient lookup logic used to split data into the two containers
- The current solution works but is not optimal
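To make the cost problem concrete: routing each record once by a key it already carries is a single-pass operation, whereas joining every record against a lookup table re-scans data and inflates DLT compute. A minimal sketch in plain Python (the `trade_type` / `trade_id` schema is hypothetical, not the real one):

```python
from collections import defaultdict

def split_single_pass(records, key="trade_type"):
    """Route every record into its bucket in one pass over the data,
    using a key the record already carries (no lookup join needed)."""
    buckets = defaultdict(list)
    for rec in records:
        buckets[rec[key]].append(rec)
    return dict(buckets)

# Hypothetical sample records, standing in for the streaming feed
records = [
    {"trade_id": 1, "trade_type": "crude"},
    {"trade_id": 2, "trade_type": "product"},
    {"trade_id": 3, "trade_type": "crude"},
]

buckets = split_single_pass(records)
# buckets["crude"] holds trade_ids 1 and 3; buckets["product"] holds trade_id 2
```

In Databricks terms, the same idea roughly corresponds to a single `partitionBy("trade_type")` write, or one filtered write per container, instead of a per-record lookup join inside the DLT pipeline; whether that applies here depends on the real schema and routing rules.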
This role exists because generic recommendations are not enough.
What We Do Not Want
- Someone who has:
  - Only written Spark notebooks
  - Only followed architectural guidance
  - Only worked at a surface level
- Someone who:
  - Needs strict 9–5 boundaries
  - Avoids ambiguity or deep technical investigation
- Someone whose resume is “AI-polished” but does not reflect real experience
What We Do Want
Technical Depth
- Deep understanding of:
  - Databricks internals
  - Spark engine behavior
  - Performance tuning and optimization
- Ability to:
  - Analyze pipelines end-to-end
  - Identify architectural inefficiencies
  - Propose and prove better approaches via POCs
- Comfortable challenging Databricks as a product:
  - Gathering evidence
  - Supporting escalation discussions with Databricks engineers
Programming & Data Skills
- Strong Python (mandatory)
- PySpark (advanced, not basic)
- Advanced SQL:
  - Complex queries
  - Stored procedures
  - Analytical logic
Working Style
- Hands-on individual contributor
- Collaborative with data engineers
- Willing to:
  - Review others’ solutions
  - Build POCs independently
  - Demonstrate better outcomes (performance, cost, scalability)
Role Scope
- Will work across multiple projects
- Acts as a cross-platform technical expert
- Evaluates:
  - Architecture
  - Cost drivers
  - Scalability
  - Reusability for future programs