Aditi Shivaji Choudhary


Location

Houston, TX
Education
    University of Cincinnati-Main Campus
    August 2022 - April 2024
    degree
    Master's
    major
    Computer Science
    TKR college of Engineering and Technology
Work Experience
    AIG
    Data Engineer
    May 2023 - present
    company
    AIG
    title
    Data Engineer
    overview
    • Streamlined data migration processes by employing Hive SQL to transition a legacy SQL codebase to Azure; achieved seamless data accessibility for 15+ analysts, improving overall data manipulation efficiency. • Utilized SQL Server Integration Services (SSIS) to streamline the import and export of databases, designing 15+ ETL packages that decreased data load times by 40% while ensuring data integrity and consistency across multiple sources. • Engineered a scalable ETL pipeline using Apache Spark on Hadoop, handling over 2TB of log data. Employed PySpark and Scala for transformations and incorporated Hive for structured data querying and storage, boosting data processing efficiency by 35%. • Created Azure Data Factory pipelines to process and analyze datasets exceeding 1TB and designed interactive Tableau dashboards that improved decision-making speed by 30% through real-time project insights. • Optimized MS SQL Server and MySQL databases, ensuring data integrity through advanced queries and stored procedures. • Architected and enhanced Kubernetes-based, high-availability data pipelines, automating lifecycle with Jenkins CI/CD, reducing deployment time by 40% and improving processing performance by 25%.
    Cognizant Technology
    Programmer Analyst
    Indianapolis, IN, United States, 46298
    December 2021 - July 2022
    Vivma Software
    Data Engineer
    Indianapolis, IN, United States, 46298
    January 2021 - November 2021
Skills
AdaptabilityAgile MethodologyAmazon RedshiftAmazon S3Amazon Web ServicesAnalytical ThinkingApache FlinkApache HadoopApache HiveApache SparkAutomationAutomation of TestsBig DataBusiness EfficiencyBusiness StrategiesCloud ComputingCloud Platform SystemCloudwatchCodebaseCommunication SkillsContinuous IntegrationCritical ThinkingCursor (Graphical User Interface Elements)DashboardsData AnalysisDatabasesData CleansingData CollectionData IngestionData IntegrationData IntegrityData LoggingData MartData MigrationData MiningData PipelinesData ProcessingData ReportingData RetrievalData Storage TechnologiesData VisualizationData WarehousingDecision Making SkillsDockerExtract Transform Load (ETL)Fault ToleranceForecasting SkillsGenerative AIGitGitlab-ciHadoop Distributed File SystemInformation EngineeringInformation TechnologyInfrastructure ManagementIntegrated Development EnvironmentsInteroperabilityJenkinsJupyter NotebookKnowledge of EngineeringKubernetesLinuxLogistic RegressionMachine LearningMaintenanceMapReduceMatplotlibMetricsMicrosoft AccessMicrosoft AzureMicrosoft Certified ProfessionalMicrosoft ExcelMicrosoft OfficeMicrosoft SQL ServerMicrosoft Visual StudioMicrosoft WindowsMySQLNLTK (NLP Analysis)NumPyOperational SystemsPandasPerformance TuningPivot TablesPostgreSQLPower BIPredictive ModellingPresentationsProblem SolvingProgramming LanguagesPython (Programming Language)Real Time DataRelational DatabasesReliabilityReliability of SystemsRequirements AnalysisResource UtilizationR (Programming Language)S3 BucketScalabilitySciencesScientific ComputatingScikit LearnSciPySelf MotivationSensitive Compartmented InformationSnowflakeSoftware DebuggingSoftware EngineeringSoftware QualitySoftware Version ControlSQL AzureSQL DatabasesSQL Server Integration ServicesSQL Stored ProceduresStakeholder ManagementStorage SystemsStreamlineSupport Vector MachineSystem AvailabilitySystems Development Life CycleTableau (Software)Team WorkingTensorflowTesting SkillsVisualizationWeb ServicesWorkflows