Documentation of aggregation methods would be nice


#29

I ought to leave the straw polls to @dreev, apparently.

Other than the eponymous @jolly, seems as though everyone would welcome a redefinition of ‘binary’ to count the existence of any non-zero datapoint as aggregating to ‘1’, and ‘0’ if there are no datapoints or all datapoints are themselves zero.

All of which may have added to the confusion, of course. Sounds like a quick and easy #UVI.


#30

No, no, don’t make them strings! 1 != '1' (yes I’m just teasing)


#31

Is there any evidence that @jolly wanted 0 to be 1?


#32

No but yes. (:

The ‘jolly’ aggregation method was intended to enforce frequency of measurement-taking: it counts days on which something was entered, regardless of the measured value.

His particular items of interest would be unlikely to result in a zero reading, but the request was specifically about datapoints entered, yes.

That aggregation method has a valid use, so let’s not change it. That use is just not the one that most of us seem to have expected when reading the code…
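To make the distinction concrete, here is a minimal sketch of the two behaviours being discussed. The function names come from this thread, but the bodies are my assumption of the intended semantics, not Beeminder's actual code.

```python
def jolly(values):
    """1 if anything at all was entered that day, else 0 (the values are ignored)."""
    return 1 if len(values) > 0 else 0


def nonzero(values):
    """1 if at least one of the day's datapoints is non-zero, else 0."""
    return 1 if any(v != 0 for v in values) else 0


# The two only disagree on a day whose datapoints are all zero.
print(jolly([0]), nonzero([0]))
```

So a day with a single datapoint of 0 counts under ‘jolly’ but not under the proposed ‘nonzero’ behaviour.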


#33

None whatsoever. :slight_smile: I was using this for a goal where I wanted to enforce me entering data, while ignoring the value of the data entered. And for that goal, 0 was never a value I would have entered…but it might be for someone else.


#34

What was the upshot of this? I don’t see ‘jolly’ in the list. Has ‘binary’ been changed to have the ‘nonzero’ behavior? Because the ‘nonzero’ behavior is actually exactly ideal for my exercise goal (though several others work fine too).


#35

The ‘jolly’ function has been deprecated and removed from the selectable list, though it still works on goals where it had already been set.

Looks as though we haven’t yet made the ‘binary’ vs ‘unary’ distinction in the code.

That should be a quick #UVI though, @dreev. I looked at the code and the only thing stopping me from making the changes is that my dev env is broken and I wouldn’t be able to test it.


#36

Done! Made nonzero a new aggday method and updated the wikified post above.


#37

This is super! But, I don’t see it in the list on my exercise goal. Is there an overnight refresh or something?


#38

Sorry about that! I deployed it for the back end (and thus working via the API) but I forgot to update the dropdown in the UI. Done now!


#39

Awesome. Two of my goals are now sporting it : )


#40

Aaaaaand nonzero aggmode breaks pessimistic presumptions on the Do Less goals, apparently. One of my goals is still sporting it though : D


#41

Can you point us to the problematic one? (Can move this to support@beeminder.com if it makes sense to.)


#42

Sure thing - I emailed the deets to the support line with your name in the subject.


#43

Let me take another stab? If there was some possible universe in which I could get the comments for my aggday, I would have a good use for it.

My new Beelint goal is trying to track the number of goals that are in violation each day, and currently uses min to not fault you for violations that come up during the day. Sort of like how Gmail Zero works.

The problem is that I might go from A,B to B,C and it looks like those are both size 2, but C is new and shouldn’t be counted. I’d like to just count the problems that have been present ALL day. So what I’d really like is an aggday that looks like

lambda comments: len(set.intersection(*[set(c.split(',') if c else set()) for c in comments]))

If there’s no hope I can print/read a state file but I was hoping to keep my thingy stateless. :slight_smile:
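Spelled out a bit more defensively, the idea would look roughly like the sketch below. It assumes, hypothetically, that an aggday could be handed the day's comments as a list of comma-separated strings; no such hook exists today, and `persistent_violations` is just an illustrative name.

```python
def persistent_violations(comments):
    """Count the goal names that appear in every one of the day's comments."""
    if not comments:
        return 0  # no datapoints today, so nothing was in violation all day
    # An empty comment means "no violations", which becomes an empty set.
    sets = [
        {name.strip() for name in c.split(',') if name.strip()}
        for c in comments
    ]
    return len(set.intersection(*sets))


# Going from A,B to B,C: only B was present all day.
print(persistent_violations(["A,B", "B,C"]))
```

Unlike the one-liner, this version returns 0 for a day with no comments at all instead of raising an error.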


#44

skatesum : min(rfin, np.sum(x)), # only count the daily min

Can someone explain this one? What is rfin?


#45

Per a daily beemail by @dreev

Thanks to user kyshoc for suggesting a new aggday method that turned out to be super easy to add to the list. I’m not sure I agree it’s a good idea but here’s the use case:

Auto-ratcheting (setting “max safe days”) to never let you accumulate safety buffer only prevents you from getting whole days off. Even set to zero, it still lets you almost get a day off, like having just one more step to get the next day. This new aggday method prevents you from getting more safety buffer at all (than you already have). Like if your road is set to 100/day then anything more than 100 that you report for a day just doesn’t get plotted.

It’s kind of like using a binary goal of “did it or not” but letting you store the richer data about how much exactly you did. The reason I don’t like it is that the graph is not plotting your actual data. But I’ll be interested to hear if anyone else likes it.

It’s called skatesum because it’s used for auto-summing goals like do-more where all datapoints for a day are summed and the cumulative total is plotted, but then that daily sum is capped at the daily rate. So if you’re skating the edge of the road now (meaning the bare minimum due is the full daily average you have to maintain) then you’ll permanently be skating it.
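As a rough sketch of that capping, assuming `rfin` in the aggday list stands for the road's daily rate (my reading of the description above, not something confirmed by the code), and using the built-in `sum` in place of `np.sum`:

```python
def skatesum(values, rfin):
    """Sum the day's datapoints, but never report more than the daily rate rfin."""
    return min(rfin, sum(values))


# With a road of 100/day, reporting 60 and then 70 is plotted as 100, not 130.
print(skatesum([60, 70], 100))
```

A day that falls short of the rate is unaffected: `skatesum([30, 40], 100)` is just the plain sum.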

PS: After explaining all that it occurs to me that the right way to do this is to improve and generalize the auto-ratchet feature. But skatesum is what we have in the meantime. If anyone requests it I can throw in a version for non-auto-summing roads.


#46

Thanks! Also, can someone clarify how clocky works?

I’m thinking that if there are 2n or 2n + 1 data points from midnight to midnight, it returns sum over i from 1 to n of f(2i - 1) - f(2i) where f(x) is the xth data point, and just ignores f(2n + 1) to start again at g(1) the next day. (clock in, clock out pairs.)

Or does it include g(1) - f(2n + 1)? Then what does it do for the other g values?

Or does it just sum f(j + 1) - f(j) over j from 1 to m - 1, where there are m data points?


#47

Thanks for asking for that clarification!

# Sum of differences of pairs, eg, [1,2,6,9] -> 2-1 + 9-6 = 1+3 = 4
def clocky(l):
  if len(l) % 2 != 0: l = l[:-1] # ignore last entry if unpaired
  return sum([end-start for [start,end] in partition(l,2,2)])

(The partition function turns a list into a list of pairs.)
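For anyone who wants to run this on their own, here is a self-contained sketch. The `partition` helper below is my guess at what the real one does (chunks of length n, starting every d items, incomplete chunks dropped); the actual helper isn't shown here.

```python
def partition(l, n, d):
    """Split l into chunks of length n, starting a new chunk every d items.

    Incomplete trailing chunks are dropped.
    """
    return [l[i:i + n] for i in range(0, len(l) - n + 1, d)]


def clocky(l):
    """Sum of differences of consecutive pairs of datapoints."""
    if len(l) % 2 != 0:
        l = l[:-1]  # ignore last entry if unpaired
    return sum(end - start for start, end in partition(l, 2, 2))


print(clocky([1, 2, 6, 9]))  # 2-1 + 9-6 = 4
```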

Does that answer it? Do you agree with the answer?


#48

You definitely answered my question. But the way it’s implemented will only work if you’re never clocked in at midnight. For instance, suppose I log the start and end time of every meditation session and use clocky to total the times. If I start meditating at 11:30pm and stop at 12:30am, the 11:30pm time will get ignored and the 12:30am time will be treated as a start time, making that day’s numbers inaccurate.

I’m not sure how easy that would be to fix - you’d have to either subtract 24 from any leftover entry and pass it to the next day’s list, or else, anytime a list has an odd number of entries, tack midnight on to the end of that list and start the next day’s list with a midnight entry as well (essentially clocking out at midnight minus epsilon and back in at midnight plus epsilon). Hopefully that can be done without needing to make a bunch of changes to the rest of the code.
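The second option might look something like the sketch below. It assumes datapoints are clock times in hours (0 to 24) and that the aggregation can see each day's list in order; `clocky_across_midnight` is a made-up name, and I don't know how well this would fit the real code.

```python
def clocky_across_midnight(days):
    """Total clocked-in hours per day, splitting sessions that span midnight.

    days is a list of per-day lists of clock times in hours, in date order.
    """
    totals = []
    carried_in = False  # True if still clocked in from the previous day
    for times in days:
        times = list(times)
        if carried_in:
            times.insert(0, 0.0)  # clock back in at midnight
        carried_in = len(times) % 2 != 0
        if carried_in:
            times.append(24.0)  # clock out at midnight
        totals.append(
            sum(times[i + 1] - times[i] for i in range(0, len(times), 2))
        )
    return totals


# Meditating from 11:30pm to 12:30am gives half an hour to each day.
print(clocky_across_midnight([[23.5], [0.5]]))
```

The obvious downside is that a genuinely forgotten clock-out would now be counted as being clocked in until midnight, and would flip the pairing on the following day too.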

