Beeminder Forum

Beeminder's Turquoise Swath?


#1

What sort of fit does Beeminder perform when you enable the “Turquoise Swath”?

I’m wondering because I noticed wild variations sometimes appearing before or after the existing data points on various graphs that have it enabled. Have the bees considered using Gaussian processes to perform the fitting a la the Automatic Statistician? Depending on the kernel selected, you can provide a much stronger prior on what curves should look like, reducing those wild swings outside the existing data.
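
For concreteness, here's a minimal sketch of what that could look like with scikit-learn's GP regressor; the datapoints are made-up stand-ins for a goal's daily totals and the kernel choice is just illustrative:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

# Made-up stand-ins for a goal's datapoints: day number vs. cumulative total.
days = np.array([0, 1, 2, 4, 5, 7, 8, 10])[:, None]
total = np.array([0.0, 1.0, 2.5, 4.0, 4.5, 6.5, 7.0, 9.0])

# The RBF kernel encodes a prior that the curve is smooth; WhiteKernel
# absorbs day-to-day noise so the fit doesn't chase every datapoint.
kernel = RBF(length_scale=3.0) + WhiteKernel(noise_level=0.5)
gp = GaussianProcessRegressor(kernel=kernel).fit(days, total)

# Outside the data, predictions revert toward the prior mean (with
# widening uncertainty) instead of swinging wildly like a polynomial can.
grid = np.linspace(-2, 14, 50)[:, None]
mean, std = gp.predict(grid, return_std=True)
```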

There has even been some work on compositional kernel construction (e.g. this paper), which might be useful for capturing structure like a predictable flurry of activity at the end of the month or breaks on the weekend.
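
As a rough sketch of the compositional idea, reusing the setup above: kernels combine with + and *, so structure like a weekly cycle can be encoded directly. These particular kernels are my guesses at Beeminder-ish structure, not anything from the paper:

```python
from sklearn.gaussian_process.kernels import RBF, ExpSineSquared, WhiteKernel

# A slowly varying trend, times a weekly cycle (periodicity=7 days) for
# weekend breaks, plus observation noise.
trend = RBF(length_scale=30.0)
weekly = ExpSineSquared(length_scale=1.0, periodicity=7.0)
kernel = trend + trend * weekly + WhiteKernel(noise_level=0.5)
# Drop this into the GaussianProcessRegressor above in place of the RBF kernel.
```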

I’m not entirely sure what the computational demands of fitting something like a Gaussian process are compared to what Beeminder does now, but I wanted to throw the idea out there in case it sparks ideas for features down the road.


#2

It’s a polynomial fit! Here’s the very, very complete answer: https://github.com/beeminder/road/blob/master/src/polyfit.js [1]
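
In numpy terms the gist is just an unregularized least-squares fit, something like this sketch (not the actual code, which is JavaScript):

```python
import numpy as np

# Made-up datapoints: day number vs. cumulative total.
days = np.array([0, 1, 2, 4, 5, 7, 8, 10])
total = np.array([0.0, 1.0, 2.5, 4.0, 4.5, 6.5, 7.0, 9.0])

coeffs = np.polyfit(days, total, deg=3)              # least-squares cubic
swath = np.polyval(coeffs, np.linspace(-2, 14, 50))  # extrapolation can swing wildly
```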

Thanks for the pointer to something potentially much better! I have a bunch of notes collected on that question as well…


[1] That’s right, all the code that generates the graphs is open source now! Well, as soon as we finish the migration to the new Beebrain: [oops, I originally linked to an internal thread here]


#3

Thanks - this is great!

It looks like the fit is always a cubic - is that right? Is there any particular reason why we should expect Beeminder trajectories to be cubic?

Just curious, but what sorts of ideas do you have for improving the Turquoise Swath?


#4

Yeah, there’s no good justification for that choice of polyfit!

Here’s our list of ideas/notes:

  1. Ridge regression, scipy.optimize.fmin_l_bfgs_b (see the sketch after this list)
  2. http://en.wikipedia.org/wiki/Tikhonov_regularization
  3. http://en.wikipedia.org/wiki/Kriging
  4. Butterworth filters?
  5. https://en.wikipedia.org/wiki/Kernel_density_estimation
  6. R’s geom_smooth with Loess smoother
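
To make items 1 and 2 concrete (they’re really the same idea: ridge regression is Tikhonov regularization), here’s a sketch using scikit-learn’s Ridge rather than hand-rolling the optimization with scipy; the data and penalty strength are made up:

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Made-up datapoints: day number vs. cumulative total.
days = np.array([0, 1, 2, 4, 5, 7, 8, 10])[:, None]
total = np.array([0.0, 1.0, 2.5, 4.0, 4.5, 6.5, 7.0, 9.0])

# Same cubic basis as the current fit, but the L2 penalty (alpha)
# shrinks the coefficients, which tames the swings at the edges.
model = make_pipeline(PolynomialFeatures(degree=3), Ridge(alpha=1.0))
model.fit(days, total)
swath = model.predict(np.linspace(-2, 14, 50)[:, None])
```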

#5

Cool!

Perhaps I’m using it incorrectly, but my primary use case for the swath is less to smooth out existing data and more to make sane predictions. Whatever improves that is a win in my book :grin:.

One other option to consider, somewhat similar to Kriging, is some sort of Bayesian regression in which you perform inference over both the type of curve (sinusoidal, linear, parabolic, cubic, …) and the specific parameters of the curve. Depending on your setup, you can trade off between the complexity of the curve and the quality of the fit in a principled way.
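
A crude illustration of that tradeoff, using BIC as a cheap stand-in for full Bayesian model comparison (the curve families and data here are made up for the example):

```python
import numpy as np

# Made-up datapoints: day number vs. cumulative total.
days = np.array([0, 1, 2, 4, 5, 7, 8, 10], dtype=float)
total = np.array([0.0, 1.0, 2.5, 4.0, 4.5, 6.5, 7.0, 9.0])

# Candidate curve families, each as a design-matrix builder.
families = {
    "linear":     lambda t: np.column_stack([np.ones_like(t), t]),
    "parabolic":  lambda t: np.column_stack([np.ones_like(t), t, t**2]),
    "cubic":      lambda t: np.column_stack([np.ones_like(t), t, t**2, t**3]),
    "sinusoidal": lambda t: np.column_stack(
        [np.ones_like(t), t, np.sin(2 * np.pi * t / 7), np.cos(2 * np.pi * t / 7)]),
}

n = len(days)
for name, basis in families.items():
    X = basis(days)
    coef, *_ = np.linalg.lstsq(X, total, rcond=None)
    rss = np.sum((total - X @ coef) ** 2)
    # BIC penalizes extra parameters, trading fit against complexity;
    # lower is better.
    bic = n * np.log(rss / n) + X.shape[1] * np.log(n)
    print(f"{name}: BIC = {bic:.1f}")
```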


#6

I may be completely wrong here @dreev, but I think kriging is a special case of Gaussian process regression.


#7

@mufflon - that’s my understanding, too. It looks like kriging originally came out of geostatistics.

I’ll link a couple other resources that were useful for me when first learning about Gaussian Processes more generally: