Ryan Liao


Location

San Diego, CA
Education
    University of California-San Diego
    September 2019 - April 2024
    degree
    Bachelor's
    major
    Data Science
Work Experience
    U S Bureau of Fiscal Service
    Intern
    June 2023 - present
    company
    U S Bureau of Fiscal Service
    title
    Intern
    overview
    - Fiscal Data Hub: Improving accessibility of data lake containing all fiscal related data for the federal government - Utilized DataBricks to make requests using the USASpending public API, and various other sources to analyze data, create - Created onboarding notebook to help new users learn DataBricks and Python alongside getting the data they need - Consulting other domains and offices regarding data-related activities - Data Stewards: Providing seminars and recommendations about data maturity and best practices - Data Exchanges Working Group: Improving the bureau's data exchange infrastructure by analyzing metadata and using advanced ROI & opportunity cost metrics - Analyzed data exchange metadata in Python using EDA visualizations and interactive Dash & Power BI dashboards - Classified Salesforce ticket requests automatically using NLP vectorization methods such as Doc2Vec and TF-IDF - Automated savings bonds transfer documentation for TreasuryDirect - Read, cleaned & combined text files from emails and then inserted into an access database - Projects
    San Diego
    October 2023 - April 2024
    Clothing Size Recommender System Algorithm
    September 2022 - December 2022
Skills
AlgorithmsAmazon Web ServicesApache SparkApplication Programming Interfaces (APIs)Artificial Neural NetworksAutomationBig DataCascading Style Sheets (CSS)CensusCloud ComputingComputer ProgrammingConsultingC++ (Programming Language)DashboardsDaskData AnalysisDatabasesDatabricksData GovernanceData HubData LakesData ManagementData ProcessingData ScienceData StructuresDecision TreesElectronic Data Interchange (EDI)Employee OnboardingForecasting SkillsHTMLInformation EngineeringInfrastructure ManagementInnovation ManagementJava (Programming Language)JavaScript LibrariesJavaScript (Programming Language)Knowledge of StatisticsLinear RegressionLogistics OperationsMachine LearningMatplotlibMetadataMetricsMicrosoft OfficeNumPyPandasPassionatePlotlyPolitical SciencePostgreSQLPower BIPredictive ModellingProgramming LanguagesPublic PoliciesPython (Programming Language)Random ForestRecommender SystemsSalesforce.ComScikit LearnSciPySearch AlgorithmsSentiment AnalysisSQL DatabasesSQLiteTechnical SkillsTensorflowText FilesText MiningUnsupervised LearningUsability TestingVisualizationXgboost