- Texas
- Led the development and optimization of ETL pipelines using Python, SQL, Apache Spark, and Hadoop to ensure
- Used Airflow monitoring and alerting capabilities to identify and troubleshoot data pipeline issues, minimizing data
- Used Spark SQL, DataFrames, and dbt to perform complex data transformations and improve
- Reduced data pipeline development time by 50% through Fivetran's pre-built connectors and transformations
- Integrated Kafka for real-time data streaming, achieving a measurable reduction in data latency and improving the processing speed of customer transactions
- Implemented AWS Glue to automate data catalog creation and schema management for our data lake on Snowflake
- Built interactive Power BI reports leveraging advanced DAX and Power Query modeling techniques to unlock
- Designed and implemented serverless AWS Lambda functions to process real-time data streams, achieving
Tobacco Consumption and Consequences Trends in US GCP