Data Science Projects

Wisconsin County and MSA Income and Employment (Data Visualization)

An interactive dashboard shows the breakdown of employment types and salary for metropolitan statistical areas in and neighboring Wisconsin. This visualization was created using the javascript library d3.js.

Go to Visualization

Has the minimum wage kept up with inflation? (Data Visualization)

In order to answer this question, the minimum wage - starting in different years - is increased at the annual rate of inflation until 2016. This visualization was created using the javascript library d3.js.

Go to Visualization

Meanwhile in Luxembourg... (Data Visualization)

While looking at data in the OECD 2013 world factbook, one country kept sticking out. This visualization was created using the javascript library d3.js.

Go to Visualization

Distributed Matrix Factorization Algorithm For Recommendation Systems (Machine Learning)

I implemented the Alternating Least Squares algorithm in Apache Spark (pyspark). The ALS algorithm is used for collaborative filtering recommendation systems. It trained on the Netflix Prize Data set of over 100,000,000 ratings in 4 hours.

Go to Github

Content-based Recommendation System - for Steam games (Machine Learning)

I created an algorithm that recommends games to users based on attributes of games they've played.

Go to Github

Superpixel Hierarchical Clustering (SPHC) - Image Segmentation Algorithm (Machine Learning)

Image segmentaion is helpful for image analysis tasks such as classification. Therefore, I created a image segmentation algorithm that combines neighboring superpixels.

Go to Github

Milk and Money Case Study (Linear Regression Analysis)

A business decides to put a floor on production costs but exchange futures options don't match up with their product. My team (a team member and I) used linear regression to recomend a put option and strike price to use.

Go to PDF

Future Bankruptcy Rates Case Study (Time Series Analysis)

Using historic monthly Canadian bankruptcy rates from 1987 to 2010, my team predicted rates for 2011.

Go to PDF