Hi, I am Soon Hao
I have always been passionate about data, the way it can uncover patterns, answer tough questions, and drive better decisions. Whether it is crunching numbers or building tools, I enjoy using data to create clarity out of complexity.
I am also a huge sports fan. I support Manchester United in football and also a long-time LeBron James fan in basketball. Naturally, my love for sports often crosses over into my work, which is why many of my data projects are built around sports stats and analytics.
My goal is to keep blending my interests in data, sports, and automation to build things that are insightful, useful, and a little bit fun.
Work & Internship Experience
Business Architecture Analyst
Accenture
Oct 2024 – Present
Robotic Process Automation Intern
CPF Board
Jan 2024 – Sep 2024
Business Analyst Intern
Shopee
Aug 2023 – Dec 2023
IT Business Analyst Intern
Johnson & Johnson
Jan 2023 – Jun 2023
Data Analyst Intern
National Community Leadership Institute (NACLI)
Jun 2022 – Aug 2022
Testing & Data Analyst Intern
Tracesafe
Jun 2021 – Aug 2021
Projects Overview
In my free time, I enjoy diving into data-driven projects that blend analytics, prediction, and storytelling. Whether it is analyzing MVP trends, debating the GOAT in basketball, or finding patterns in lottery numbers, I love using data to explore ideas and uncover insights.
- NBA MVP Prediction: Predicting MVP outcomes using machine learning and season stats.
- LeBron vs Jordan Analysis: Deep-dive player comparison based on data and era-adjusted metrics.
- Toto Dashboard: Interactive number analysis and prediction tool.
From Stats to Superstars: Using Data Science to predict the 2024–2025 NBA MVP
As a basketball fan and data enthusiast, I was curious whether we could predict MVP shares using players and teams season stats. This curiosity led me to build a machine learning model that forecasts MVP voting outcomes based on historical data.
I experimented with an ensemble model (Linear Regression, Gradient Boosting, XGBoost). Throughout the project, I focused on model interpretability, cross-validation, and feature importance to understand which metrics truly influence MVP results.
Read the full article on MediumLeBron vs. Jordan: The Definitive GOAT Debate by the Numbers
The LeBron vs Jordan debate has always been filled with strong opinions, so I wanted to cut through the noise and approach the question from a more analytical angle. Using a blend of statistics, historical context, and weighted metrics, I compared the two players across career longevity, regular season dominance, playoff success, and awards.
I built this as a structured, data-first deep dive to spotlight patterns fans often overlook, like how scoring trends changed by era. Beyond just the stats, the project pushed me to think critically about framing narratives with data and storytelling.
Read the full article on MediumToto Predictor: Data-Driven Analysis and Forecasting Tool
This project combines data analytics and machine learning to help users make more informed Toto predictions. The dashboard offers interactive tools to explore past draw trends and run predictions using a model trained on historical results.
Users can dive into number patterns, frequency stats, and model-generated forecasts all in one place. Built with usability in mind, this tool is a practical example of applying data science to real-world games of chance.