Writing

Notes on statistics, Python, machine learning systems, LLM evaluation, and the places where implementation details change the conclusion.

May 24, 2026 · 18 min read

Hard

Fast Paths in lavaan: A Performance Study

A research-style account of a lavaan performance PR, using matched profiling runs to connect internal fast paths with measured latency and allocation reductions.

R, lavaan, Performance, Structural Equation Modeling, Benchmarking

Schematic of a confidence interval overlapping an interval null

May 9, 2026 · 6 min read

Hard

Second-Generation p-Values

A rigorous tutorial on second-generation p-values, interval null hypotheses, frequentist interpretation, and applied R workflows with the sgpv package.

Statistics, R, Frequentist Inference, Data Visualization

May 8, 2026 · 4 min read

Medium

Building a Leaflet.js Choropleth Map of Japan

A JavaScript Leaflet walkthrough for joining GeoJSON boundaries to official 2024 population metrics and turning the result into an interactive choropleth.

JavaScript, Leaflet, Maps, Data Visualization

systems

C++

May 7, 2026 · 18 min read

Hard

Linear Algebra in C++

A from-scratch C++ guide to dot products, transposes, matrix-vector multiply, cache locality, allocation control, blocking, SIMD-aware loops, and measurement discipline.

C++, Linear Algebra, Performance, Machine Learning, Low Latency

May 7, 2026 · 6 min read

Medium

Mapping Japan with Leaflet in R

A refreshed version of my original Quarto note on building a Japan prefecture choropleth with Leaflet in R.

R, Leaflet, Maps, Data Visualization

inference

STAT

May 5, 2026 · 4 min read

Medium

Bayesian A/B Testing

A practical guide to using Bayes' rule, Beta priors, credible intervals, and posterior comparison for A/B tests.

Statistics, Bayesian, A/B Testing, Experimentation

systems

C++

May 5, 2026 · 20 min read

Hard

Low-Latency C++ Techniques for the Hot Path

A practitioner deep dive into latency measurement, cache locality, allocation avoidance, branch predictability, atomics, false sharing, syscalls, and p99 discipline.

C++, Systems, Performance, Low Latency

inference

STAT

May 5, 2026 · 7 min read

Hard

Random Forests, Variance, and Out-of-Bag Error

A practitioner deep dive into bootstrap aggregation, decorrelated trees, majority votes, OOB error, and why random forests work so well on tabular data.

ML, Statistics, Random Forests, Ensembles

inference

STAT

Feb 18, 2026 · 7 min read

Medium

Bayesian drift detection for feature pipelines

A compact note on using posterior predictive checks to catch quiet distribution shifts before model quality moves.