Writing
Notes on statistics, Python, machine learning systems, LLM evaluation, and the places where implementation details change the conclusion.

A refreshed version of my original Quarto note on building a Japan prefecture choropleth with Leaflet in R.
A practical guide to using Bayes' rule, Beta priors, credible intervals, and posterior comparison for A/B tests.
A practitioner deep dive into latency measurement, cache locality, allocation avoidance, branch predictability, atomics, false sharing, syscalls, and p99 discipline.
A practitioner deep dive into bootstrap aggregation, decorrelated trees, majority votes, OOB error, and why random forests work so well on tabular data.
A compact note on using posterior predictive checks to catch quiet distribution shifts before model quality moves.
Designing an evaluation loop that treats prompts, datasets, and model settings as versioned artifacts.

This post is a review of my short stint as a researcher in undergrad.

This post is a walkthrough of various power analyses in R.

This post is a walkthrough of a structural equation model in R.

The XGBoost Boosting Algorithm and an Application to the Forest Cover Type Dataset

My topic of my 2021 master's thesis during my final semester.

One of the most difficult projects I did in a statistics class during grad school.

A quick notebook on how to create and interpret naive bayes models.

