Gojiberries (Page 4)

Bandwidth Selection Without Grid Search

Bandwidth selection is often framed as "scan a grid and pick the CV minimum." For common univariate problems, you can do better. The leave‑one‑out (LOO) cross‑validation objectives admit closed‑form derivatives in the bandwidth $h$. With analytic gradients (and Hessians when useful), a one‑dimensional

Rank‑Preserving Calibration for Multiclass Probabilities

We often need to adjust a fitted $E[Y|X]$ so that the aggregate $E[Y]$ (or class totals) matches known targets (see here). In binary classification, a single logit shift (or a global temperature) is a monotone transform of the model scores, so if person A had a higher

Generalization Gap in Over‑Parameterized Models

Textbook bias–variance intuition implies that worst‑case test error should eventually rise as model capacity overtakes the sample size because the variance term in the bound grows while the bias term has already bottomed out (see Vapnik and Chervonenkis, 1971; Bartlett and Mendelson, 2002). Those worst‑case guarantees shrink

Burning ₹168 to Earn ₹100

Tamil Nadu’s civil service hopefuls give up nearly 1.68 times as much in lost earnings (and coaching fees) as the state will ever pay in salary (see Table 2.5 in Mangal, 2023 (PDF); see MR as well). At first glance, this looks like over-dissipation: candidates appear to

A Lightweight ALS Solver for Iterative GLS

When generalized least squares spans hundreds of equations, the error covariance becomes the problem. Every loop that updates $\beta$ drags along a $K \times K$ inverse; storing it costs about $O(K^2)$ memory and inverting it costs about $O(K^3)$ time. Even when we model the covariance statistically

Streaming Calibration

Modern applications—from ad platforms calibrating click-through predictions to polling systems incorporating responses to ML algorithms adapting fairness thresholds—share a common challenge: maintaining calibrated weights on live data streams. To address streaming data, we recast raking as a streaming convex optimization problem: minimize the squared error between current weighted

npm fund

In November 2019, npm introduced the npm fund command. If you've run npm install recently, you've seen the gentle reminder: "4 packages are looking for funding. Run npm fund for details." As npm’s former CEO, Isaac Schlueter, noted, maintainers have historically had “very

Beam-GD

Gradient descent commits to a single direction at each step based on the local gradient. This myopic approach can be suboptimal when gradients are noisy, local geometry is misleading, or the loss landscape has multiple competing descent directions. The algorithm makes irrevocable decisions based on local information, potentially missing better

Good Enough: Satisficing in Production Machine Learning

Herbert Simon observed that managers rarely chase the global optimum. Instead, they set an aspiration level, a “good enough” performance, and quit searching once they find an option that meets it. Simon called this satisficing. That habit makes sense because every decision is a trade‑off. In modeling, the benefit