Shyam Sundhar Yathirajam


Location
    San Marcos, CA
Education
    California State University-San Marcos
    August 2022 - May 2024
    Master's, Computer Science
    Jawaharlal Nehru Technological University Hyderabad
Work Experience
    Ally Financial
    Data Scientist
    US
    January 2024 - present
    overview
    - Worked in Agile environments to deliver iterative solutions and adapt to changing project requirements
    - Conducted hypothesis tests to evaluate the significance of relationships between variables and support data-driven decisions
    - Used ggplot2, the R data visualization package based on the grammar of graphics, to create customizable graphics
    - Applied machine learning algorithms including linear regression, multivariate regression, Naive Bayes, Random Forests, K-means, and KNN
    - Used AWS S3, DynamoDB, AWS Lambda, and AWS EC2 for data storage and model deployment
    - Created and maintained Tableau reports showing the status and performance of deployed models and algorithms
    - Implemented, tuned, and deployed machine learning models using AWS Lambda and SageMaker
    - Worked with CI/CD pipelines to ensure seamless deployment of machine learning models
    - Integrated machine learning models with RESTful APIs for real-time predictions
    - Implemented, tuned, and tested the model on AWS Lambda with the best-performing algorithm and parameters
    - Identified and assessed available machine learning and statistical analysis libraries (including regressors, classifiers, statistical tests, and clustering algorithms)
    - Worked with the NLTK library on NLP data processing and pattern finding
    - Wrote custom SQL queries in Microsoft SQL Server Management Studio to pull specified data for analysis and report building
    - Implemented deep learning with TensorFlow to create semantic word representations of customer data
    - Performed data cleaning and feature selection using the MLlib package in Spark, and worked with deep learning frameworks such as TensorFlow
    - Used Google BigQuery for advanced data analysis, data pipelines, and handling large datasets in a cloud environment
    California State University
    Research Data Scientist
    US
    June 2023 - December 2023
    Rlogical Techsoft
    Data Scientist
    IN
    January 2020 - July 2022
    Groovy Web
    Data Analyst
    IN
    January 2019 - December 2019
Skills
    A/B Testing, Agile Methodology, Algorithms, Amazon DynamoDB, Amazon Elastic Compute Cloud, Amazon Redshift, Amazon S3, Amazon Web Services, Analysis of Variance (ANOVA), Apache Spark, Artificial Intelligence, Artificial Neural Networks, Automation, AWS Lambda, Bayes' Theorem (Bayesian Statistics), BigQuery, Bootstrap (Software), Cassandra, Cisco Nexus Switches, Cloud Computing, Cloud Storage, Cluster Analysis, Communication Skills, Computer Vision, Continuous Integration, Continuous Monitoring, C++ (Programming Language), Critical Thinking, Crystal Reports (Reporting Software), C Sharp (Programming Language), Customer Data Management, Customer Retention, Dashboards, Data Analysis, Databases, Databricks, Data Cleansing, Data Ingestion, Data Integrity, Data Mining, Data Modeling, Data Pipelines, Data Processing, Data Reporting, Data Science, Data Storage Technologies, Data Streaming, Data Systems, Data Transformation, Data Validation, Data Visualization, Decision Making Skills, Decision Trees, Deep Learning, Docker, Extract Transform Load (ETL), Feature Selection, Forecasting Skills, Front End Software Development, Generative AI, Google BigQuery, Google Cloud, Graphic Design, Groovy, Information Technology, Java (Programming Language), JIRA, JMP (Statistical Software), Keras, Knowledge of Finance, Knowledge of Statistics, Kubernetes, Large Language Models, Latent Dirichlet Allocation, Linear Regression, Logistic Regression, Machine Learning, Mathematical Modeling, Matplotlib, Microsoft Excel, Microsoft SQL Server, Microsoft Visual Studio, Microsoft Word, MongoDB, Multivariate Regression, MySQL, Naive Bayes, Natural Language Processing, NLTK (NLP Analysis), NumPy, Nvidia CUDA, Oracle Applications, Oscillation, Pandas, Pattern Recognition, Performance Improvement, Personalization, Power BI, Predictive Data Analysis, Predictive Modelling, Presentations, Problem Solving, Publishing Skills, Python (Programming Language), PyTorch, Regression Analysis, Research Methodologies, Research Skills, RESTful APIs, R (Programming Language), SAP Business Objects, SAS (Software), Scalability, Scikit Learn, SciPy, Scrum Methodology, Semantics, Sentiment Analysis, Snowflake, Software Engineering, Spelling and Grammar, SQL Databases, Statistical Hypothesis Testing, Storytelling, Strategic Thinking, Support Vector Machine, Systems Development Life Cycle, Tableau (Software), TensorFlow, Teradata SQL, Testing Skills, Text Mining, Time Series, Toad (Software), Unsupervised Learning, Waterfall Model, Web Portals, Wind Farming, XGBoost