I’ve been a data practitioner my entire career, first as an experimental physics researcher and later as a machine learning scientist.
You can find a selection of my data analysis work below, both from my work in physics and from side projects. Additional project work can also be found on GitHub and in my blog.
Examples of Data Science and Machine Learning Work
Semantic Search for Product Recommendations
I built a recommendation system at The Knot that leverages text embeddings and performs semantic search in order to recommend wedding venues to engaged couples. The recommendations feature LLM-generated explanations for why they are a good match. Read more…
Based on data on house transactions in the Seattle area, I used different regressors, feature selection and feature generation techniques to build a regression model capable of predicting the sales price based on the other properties of a house. Read more…
In this project, I analyzed a data set of entry and exit data of turnstiles in New York’s subway stations to study commute in the city as well as the ridership distribution over the course of a day and a year. I also used a linear regression model to predict ridership based on holidays, weather, and other features. Read more…
Machine Learning: Classifying Wine
I used a variety of classification algorithms implemented in scikit-learn to build models capable of predicting which of three cultivars a wine sample belongs to based on 13 chemical constituents. In order to visualize the different models, I used principal component analysis to reduce the dimensionality of the features. Read more…
This python program imports gpx files recorded with the Runkeeper app, extracts statistics, visualizes runs (in elevation and pace profiles, and on maps) and compares multiple runs. Read more…
This is a program which automatically performs a complex analysis of an experimental data set. The resulting information are valuable for benchmarking new models which are being developed in computational physics. Read more…
Multidimensional Spectroscopy Data
This is a brief overview of the types of analyses I’ve performed on multidimensional data obtained from spectroscopy experiments. Read more…


You must be logged in to post a comment.