What I Bring to the Table

Machine Learning

Advanced statistical models in Python and R

Statistical Inference

Deduction of underlying probability distributions

Creative Problem-Solving

Merging scientific methods & divergent thinking

Data Visualization

Mastery of ggplot2, matplotlib, and Tableau

Text Mining

Sentiment analysis of complex customer data

Advanced SQL Querying

End-to-end pipelines for easy data analysis

Featured Projects

Here are a few select projects I’ve completed. The code can also be found on my GitHub account.

Pet Adoption Time 2.0 - More than Tabular Data

A major improvement and refactoring of the previous Pet Adoption Time project. Adds image feature extraction, image metadata, entity sentiment, and text mining to the initial model.

Skills: Python, Keras, LightGBM, Google Vision, Google NLP, Convolutional Neural Network, TF-IDF

Sonic Strata

Front-end web app that visualizes and analyzes the audio features of an artist’s tracks through Spotify’s Web API. It identifies an artist’s most unique and most ordinary tracks and recommends similar songs.
(Note: Please allow a few moments for Heroku to spin up the initial dyno.)

Skills: Python, App Development, CSS, API, Data Visualization
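
One way to think about "most unique" and "most ordinary" tracks is as distance from an artist's average track in audio-feature space. The sketch below is only an illustration under that assumption, not the app's actual method: the track names and feature values are invented, and the function name rank_by_uniqueness is hypothetical. The feature names (danceability, energy, valence) follow Spotify's audio features.

```python
import math

# Hypothetical audio features per track (values invented for illustration).
tracks = {
    "Track A": {"danceability": 0.80, "energy": 0.70, "valence": 0.60},
    "Track B": {"danceability": 0.75, "energy": 0.65, "valence": 0.55},
    "Track C": {"danceability": 0.20, "energy": 0.95, "valence": 0.10},
    "Track D": {"danceability": 0.70, "energy": 0.60, "valence": 0.65},
}


def rank_by_uniqueness(tracks):
    """Rank track names from most to least distant from the artist's centroid."""
    features = sorted(next(iter(tracks.values())))
    n = len(tracks)
    # The centroid is the artist's "average" track across all features.
    centroid = {f: sum(t[f] for t in tracks.values()) / n for f in features}

    def distance(track):
        # Euclidean distance between one track and the centroid.
        return math.sqrt(sum((track[f] - centroid[f]) ** 2 for f in features))

    return sorted(tracks, key=lambda name: distance(tracks[name]), reverse=True)


ranking = rank_by_uniqueness(tracks)
most_unique, most_ordinary = ranking[0], ranking[-1]
print("Most unique:", most_unique)
print("Most ordinary:", most_ordinary)
```

Euclidean distance treats every feature equally, which works here because these features all share a 0 to 1 scale. Features on other scales, such as tempo, would need standardizing first.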

Neural Style Transfer

Extracts deep image feature representations and merges the content of one image with the style of a second image

Skills: Python, TensorFlow, Computer Vision, Convolutional Neural Network

Breakfast MBA

Increasing profitability of a bakery through Association Rule Learning

Skills: R, Association Rule Learning, Market Basket Analysis
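
For readers unfamiliar with Association Rule Learning, the three standard measures behind a market basket analysis (support, confidence, and lift) can be computed by hand. The project itself is written in R. The Python sketch below is a simplified illustration limited to item pairs, not the project's code: the transactions are invented and pair_rules is a hypothetical helper.

```python
from collections import Counter
from itertools import combinations

# Hypothetical bakery transactions, invented purely for illustration.
transactions = [
    {"coffee", "croissant"},
    {"coffee", "croissant", "jam"},
    {"coffee", "muffin"},
    {"tea", "croissant"},
    {"coffee", "croissant"},
]


def pair_rules(baskets, min_support=0.4):
    """Return (antecedent, consequent, support, confidence, lift) for item pairs."""
    n = len(baskets)
    item_counts = Counter(item for basket in baskets for item in basket)
    pair_counts = Counter(
        pair for basket in baskets for pair in combinations(sorted(basket), 2)
    )
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of baskets containing both items
        if support < min_support:
            continue
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]  # P(y in basket | x in basket)
            lift = confidence / (item_counts[y] / n)  # confidence relative to P(y)
            rules.append((x, y, support, confidence, lift))
    # Strongest associations (highest lift) first.
    return sorted(rules, key=lambda rule: rule[4], reverse=True)


rules = pair_rules(transactions)
for x, y, support, confidence, lift in rules:
    print(f"{x} -> {y}: support={support:.2f}, "
          f"confidence={confidence:.2f}, lift={lift:.2f}")
```

A lift above 1 suggests the two items are bought together more often than chance would predict, which is the kind of signal a bakery could use for bundling or product placement. A lift below 1 suggests the opposite.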

Pet Adoption Time

Predicting pet adoption time using a decision tree ensemble model

Skills: Python, Supervised Ensemble Learning, XGBoost, Decision Trees, χ² Analysis, EDA

NLP & Sentiment Analysis

Predicting polarity of user reviews through text mining of IMDB reviews without NLTK’s SentimentAnalyzer library

Skills: Python, NLP, Sentiment Analysis, Text Mining

Predicting House Prices in Ames, Iowa

An XGBoost ensemble model predicting house prices in the Ames, Iowa real estate market

Skills: Python, Imputation, Regression, Feature Engineering, XGBoost, Ensemble Models

Titanic Survival Prediction

Predicting survival of passengers aboard the Titanic using a binary classification model

Skills: Python, EDA, Feature Engineering, Imputation, Classification, Neural Networks

Recent Publications


Gene expression variability in the human population is likely to be important in tissue development and identity, methylation, and in natural biological aging. The expression variability of a gene is an important functional characteristic of the gene itself and the classification of a gene as one with Hyper-Variability or Hypo-Variability in a human population or in a specific tissue should be useful in the identification of important genes that functionally regulate development or disease.

Blog

A collection of cool things I’ve worked on for my personal enjoyment

An interactive infographic highlighting the UFC’s Anti-Doping Program

An interactive choropleth visualization of my travels over the years

Web scraping, archiving, and search retrieval of monsters scraped from DND Beyond

Adding GIFs to a Valence (Song Positivity) Analysis of Gorillaz Discography

Exploring the rise of the Death Metal genre

Contact Me