- Developing robust ETL pipelines using Apache Spark, Python, and SQL to integrate diverse financial data
- Designing and implementing robust data quality frameworks using Informatica and Talend, reducing data errors
- Conducting root cause analysis of performance issues and data discrepancies using Azure Operator Insights
- Contributing to improving operational efficiency through data-driven insights, enabling proactive decisionmaking and strategic resource allocation based on real-time analytics
- Implementing mobile-friendly dashboards and reports in Power BI, providing executives and stakeholders with access to critical financial insights anytime, anywhere, and on any device
- Executing real-time data processing and analytics using Azure Databricks Streaming, enabling immediate
- Developing ER diagrams to represent the relationships between entities such as customers, accounts
- Executing UAT test scripts using established methodologies and tools (JIRA, TestRail) to validate ETL pipelines
- Leveraging MapReduce programming model to process large volumes of financial data efficiently across
- Using Matplotlib to create a variety of charts and plots to visualize financial data trends, such as line charts for time series analysis of transaction volumes or bar charts for comparing account balances
T
Tata Consultancy Services
Data Engineer
IN
June 2021 - June 2022
Skills
Acceptance TestingAgile MethodologyAirflowAlgorithmsAmazon Elastic Compute CloudAmazon S3Amazon Web ServicesApache BeamApache HadoopApache HiveApache KafkaApache SparkApple Mac SystemsAutomationBalance SheetsBanking ServicesBig DataBusiness Analytics ApplicationsBusiness EfficiencyBusiness PlanningBusiness Process ImprovementBusiness StrategiesClinical Decision SupportCloud ComputingCommunication SkillsComputer ProgrammingConsultingCritical ThinkingDashboardsData AnalysisDatabasesDatabricksData GovernanceData InfrastructureData LakesData ManagementData MiningData ModelingData NormalizationData PipelinesData ProcessingData QualityData StreamingData SystemsData TransformationData VisualizationData WarehousingDecision Making SkillsDirected Acyclic Graph (Directed Graphs)Distributed SystemsDockerElectronic Medical RecordsExtract Transform Load (ETL)Financial AnalysisFinancial Data AnalysisFriendlinessGitGithubGoogle CloudGovernanceGrafanaHadoop Distributed File SystemHealth CareImporting and Exporting of GoodsInformation EngineeringInformation TechnologyIntelliJ IDEAJenkinsJIRAJSONJupyter NotebookKnowledge of FinanceKnowledge of StatisticsKubernetesLinuxLoad BalancingLookup TableMachine LearningMapReduceMatplotlibMicrosoft AccessMicrosoft AzureMicrosoft ExcelMicrosoft SQL ServerMicrosoft Visual StudioMicrosoft WindowsMongoDBMySQLNatural Language ProcessingNumPyOperational SystemsPandasPerformance ManagementPivot TablesPostgreSQLPower BIPredictive Data AnalysisPresentationsProblem SolvingProcess AutomationProgramming LanguagesPrometheusPysparkPython (Programming Language)Real Time DataRegular ExpressionsRelational DatabasesReliabilityResource AllocationResource EfficiencyRetail CommerceRoot Cause AnalysisSafety PrinciplesScalabilitySchedulingScikit LearnSciPySelf MotivationShell ScriptSnowflakeSoftware Version ControlSolution ArchitectureSpark StreamingSQL DatabasesSQL Server Analysis ServicesSQL Server Integration ServicesSQL Server Management StudioSQL Server Reporting ServicesStakeholder ManagementStrategic ResourcesStrategic ThinkingStreamlineSystems Development Life CycleTableau (Software)TalendTeam WorkingTechnical SkillsTensorflowTerraformTesting SkillsTestrailTest ScriptsTime SeriesVisualization