MQ

Writing

Notes on statistics, Python, machine learning systems, LLM evaluation, and the places where implementation details change the conclusion.

Fast Paths in lavaan: A Performance Study preview

A research-style account of a lavaan performance PR, using matched profiling runs to connect internal fast paths with measured latency and allocation reductions.

RlavaanPerformanceStructural Equation ModelingBenchmarking
Schematic of a confidence interval overlapping an interval null
May 9, 20266 min read
Hard

A rigorous tutorial on second-generation p-values, interval null hypotheses, frequentist interpretation, and applied R workflows with the sgpv package.

StatisticsRFrequentist InferenceData Visualization
Leaflet logo

A JavaScript Leaflet walkthrough for joining GeoJSON boundaries to official 2024 population metrics and turning the result into an interactive choropleth.

JavaScriptLeafletMapsData Visualization
May 7, 202618 min read
Hard

A from-scratch C++ guide to dot products, transposes, matrix-vector multiply, cache locality, allocation control, blocking, SIMD-aware loops, and measurement discipline.

C++Linear AlgebraPerformanceMachine LearningLow Latency
Made with Leaflet in R
May 7, 20266 min read
Medium

A refreshed version of my original Quarto note on building a Japan prefecture choropleth with Leaflet in R.

RLeafletMapsData Visualization
May 5, 20264 min read
Medium

A practical guide to using Bayes' rule, Beta priors, credible intervals, and posterior comparison for A/B tests.

StatisticsBayesianA/B TestingExperimentation

A practitioner deep dive into latency measurement, cache locality, allocation avoidance, branch predictability, atomics, false sharing, syscalls, and p99 discipline.

C++SystemsPerformanceLow Latency

A practitioner deep dive into bootstrap aggregation, decorrelated trees, majority votes, OOB error, and why random forests work so well on tabular data.

MLStatisticsRandom ForestsEnsembles

A compact note on using posterior predictive checks to catch quiet distribution shifts before model quality moves.

Statisticsml-monitoringPython

Designing an evaluation loop that treats prompts, datasets, and model settings as versioned artifacts.

llmsevaluationPython
Being a Researcher preview
May 11, 20242 min read
Easy

This post is a review of my short stint as a researcher in undergrad.

StatisticsR
Power Analysis Calculator preview
Nov 13, 20225 min read
Medium

This post is a walkthrough of various power analyses in R.

RStatisticsDesigns
Structural Equation Modelling preview
Aug 12, 20226 min read
Hard

This post is a walkthrough of a structural equation model in R.

StatisticsR
XGBoost preview
Dec 8, 202115 min read
Hard

The XGBoost Boosting Algorithm and an Application to the Forest Cover Type Dataset

MLStatisticsPython
Cluster Validity Indices preview
Feb 28, 202120 min read
Hard

My topic of my 2021 master's thesis during my final semester.

StatisticsPythonRML
Designing Randomized Schemas preview
Aug 10, 202010 min read
Hard

One of the most difficult projects I did in a statistics class during grad school.

StatisticsR
Naive Bayes preview
Jan 29, 20204 min read
Medium

A quick notebook on how to create and interpret naive bayes models.

StatisticsR
Stylized carrot logo for caret in R
Oct 26, 201920 min read
Medium

A rewritten lab-led tutorial on logistic regression, odds ratios, model diagnostics, and fitting a binomial GLM with caret in R.

RStatisticsMachine Learning
Bike Sharing preview
Aug 22, 201715 min read
Medium

Bike Sharing

StatisticsR