Below are some of my projects; you can find more on my GitHub.
YouTube Trending Videos ETL Pipeline and Analysis
Built a serverless ETL pipeline that collects data on YouTube trending videos every day. The data is stored in an AWS RDS MySQL database, analyzed with SQL, and visualized in AWS QuickSight.
Skills: Data collection via API, Python, MySQL, AWS, Docker
Analysis of California 30-Day Inpatient Readmission Rates
Performed EDA and found that higher readmission rates were associated with patients aged 65+, male patients, patients covered by Medicare or Medi-Cal, and counties with lower ratios of rehabilitation clinics.
Skills: Python, EDA, Data Visualization
Airbnb Reviews Text Mining and Sentiment Analysis
Conducted text mining and sentiment analysis on Airbnb reviews from New York City (2018-2022) to find out what guests cared about most and why they left negative reviews.
Skills: R, Text Mining, Sentiment Analysis, EDA, Data Visualization