Data Lake Engineer

SOSi

Doral, FL
Full Time
Paid
  • Responsibilities

    Job Description

    SOSi is seeking a Data Lake Engineer to support a DoD customer's mission requirements through a structured approach to developing, integrating, and sustaining a scalable, federated data ecosystem that enhances interoperability, governance, and mission-driven analytics. The program's primary objective is to bridge operational gaps among DoD, IC, interagency, and non-traditional international partners, enabling real-time information sharing, dynamic data integration, and mission-tailored analytical capabilities.

    Essential Job Duties:

    • The contractor shall design, implement, and maintain scalable Data Lake architectures to support structured and unstructured data ingestion, ensuring efficient data access and retrieval.
    • The contractor shall configure and manage the integration interface between the Data Lake and the knowledge graph platform (Stardog), including SPARQL endpoint access, metadata federation, and catalog alignment.
    • The contractor shall follow access control policies and usage scope defined by the Government and other coordinated Work Orders.
    • The contractor shall confirm compliance with access policies on a quarterly basis and document the results in the Data Governance & Compliance Report.
    • The contractor shall optimize ETL pipelines for high-volume data transformation, ensuring compliance with DoD IL-4/IL-5 security standards.
    • The contractor shall implement storage tiering strategies and access controls, ensuring data is properly classified, retained, and accessed per DoD governance requirements.
    • The contractor shall submit the Data Lake Performance & Optimization Report, detailing ingestion efficiency, access control improvements, and storage utilization metrics.
  • Qualifications

    • Active TS/SCI Clearance.
    • Master’s degree or higher (e.g., Ph.D.) in Computer Science, Information Technology, Systems Engineering, Data Science, Business Administration, Engineering Management, or a closely related field; or a minimum of eleven (11) years of experience managing complex technical projects in enterprise data architecture, Databricks administration, and cloud-based data platforms.
    • Knowledge of and ability to support Data Lake platform administration and enterprise data architecture for DoD data-driven projects.
    • Skilled in Data Lake platform administration, including workspace management and configuration, cluster optimization and performance tuning, cloud integration, and Unity Catalog integration for secure data governance.
    • Proficient in ETL/ELT pipeline development, Delta Lake architecture and optimization, AI/ML workflow integration, and Data Lakehouse optimization for DoD analytics and mission-critical data workflows.
    • Experienced in SysEngOps, DevSecOps, version control systems (Git), and CI/CD pipelines to streamline Data Lake development and deployment.
    • Knowledgeable in identity and access management (IAM), role-based access control (RBAC), and cloud security best practices across AWS, Azure, and GCP.
    • Hands-on expertise in Python, SQL/NoSQL, Apache Spark, Databricks SQL, Terraform, and cloud-native data services for large-scale data processing and analytics.

    Additional Information

    Work Environment

    • Normal office conditions

    Working at SOSi

    All interested individuals will receive consideration and will not be discriminated against for any reason.