Gokaraju Rangaraju Institute of Engineering and Technology
Work Experience
C
CyberGuard Solutions
ETL/Pyspark Developer Full time
Indianapolis, IN, US
January 2022 - present
company
CyberGuard Solutions
title
ETL/Pyspark Developer Full time
overview
- Responsibilities
- Spearheaded the Agile development of ETL data pipelines, reducing processing time by 30% through automation and optimization for CSV/JSON and database integrations, utilizing GIT and Jupyter within an Anaconda environment
- Innovated a data extraction framework from Splunk to Parquet, boosting data management efficiency by 40%, leveraging
- Architected a Python-based automation module for dynamic metadata and SQL query generation from Excel, improving
- Elevated data integrity and accuracy post-ETL by 35% through strategic validation processes, championing Agile and Scrum
- Focused on optimizing data flow, storage access, operational efficiency, code quality, and application performance
- Environments: Experienced with a broad range of technologies including Git, Python, Spark, Hadoop, Jira, JSON, Oracle
- MySQL, Redshift, SQL Developer, AWS, and Airflow