Topp Nattawat

Logo

Leverage data to answer business and social questions that matter

View the Project on GitHub tnattawat/Portfolio

Welcome to Topp’s GitHub Profile!

drawing  drawing  drawing  drawing

Data Projects

Project #1: TV, Halftime Shows, and the Big Game of Super Bowl (Click!)

Whether or not you like football, the Super Bowl is a spectacle. There’s drama in the form of blowouts, comebacks, and controversy in the games themselves. There are the ridiculously expensive ads, some hilarious, others gut-wrenching, thought-provoking, and weird. In this project, you will find the answer to interesting questions like:

The dataset used in this project was scraped and polished from Wikipedia. It is made up of three CSV files, one with game data, one with TV data, and one with halftime musician data for all 52 Super Bowls through 2018.

Project #2: The Android App Market on Google Play (Click!)

Mobile apps are everywhere. They are easy to create and can be lucrative. Because of these two factors, more and more apps are being developed. With data scraped from the Google Play Store, this project will do a comprehensive analysis of the Android app market by comparing over ten thousand apps in Google Play across different categories. You’ll find insights to devise strategies to drive growth and retention such as:

The data was scraped from the Google Play website. The data files include ‘apps.csv’ containing 13 features that describe apps on Google Play, and ‘user_reviews.csv’ containing 100 reviews for each app.

Project #3: The GitHub History of the Scala Language (Click!)

Open source projects contain entire development histories, such as who made changes, the changes themselves, and code reviews. This project takes the challenge to read in, clean up, and visualize the real-world project repository of Scala that spans data from a version control system (Git) as well as a project hosting site (GitHub). With almost 30,000 commits and a history spanning over ten years. Scala is a mature language. You will find the answers to fun questions like:

The dataset includes the project history of Scala retrieved from Git and GitHub as a set of CSV files.

Project #4: A Visual History of Nobel Prize Winners (Click!)

The Nobel Prize is perhaps the world’s most well known scientific award. Every year it is given to scientists and scholars in chemistry, literature, physics, medicine, economics, and peace. This project uses data manipulation and visualization libraries in Python to explore patterns and trends over 100 years worth of Nobel Prize winners. You will find out the following:

The dataset used in this project is from The Nobel Foundation on Kaggle.

Project #5: The Discovery of Handwashing (Click!)

In 1847, the Hungarian physician Ignaz Semmelweis made a breakthough discovery: he discovers handwashing. Contaminated hands was a major cause of childbed fever and by enforcing handwashing at his hospital he saved hundreds of lives. This project will reanalyze the medical data Semmelweis collected and answer the following questions:

The dataset used in this project is from one of the most important discoveries of modern medicine: handwashing.

Project #6: Predicting Credit Card Approvals (Click!)

Commercial banks receive a lot of applications for credit cards. Many of them get rejected for many reasons, like high loan balances, low income levels, or too many inquiries on an individual’s credit report, for example. Manually analyzing these applications is mundane, error-prone, and time-consuming (and time is money!). Luckily, this task can be automated with the power of machine learning and pretty much every commercial bank does so nowadays. In this project builds an automatic credit card approval predictor using machine learning techniques, just like the real banks do.

The dataset is the Credit Card Approval dataset from the UCI Machine Learning Repository.

Project #7: Predicting Loan Risk using SparkML on IBM Cloud (Click!)

This project will create a machine learning model to predict customer churn. It will build the prediction model using the SparkML library, and walk you through these steps:

This project is part of my training at IBM Digital Developer Conference 2020 on Data & AI.

Project #8: School Budgeting with Machine Learning (Click!)

Data science isn’t just for predicting ad-clicks-it’s also useful for social impact! This project explores a problem related to school district budgeting. By building a model to automatically classify items in a school’s budget, it makes it easier and faster for schools to compare their spending with other schools. It uses natural language processing to prepare the budgets for modeling and applies different techniques to make the model most accurate.

Skills & Certficates

drawing  drawing  drawing  drawing  drawing  drawing  drawing  drawing