Saving this immediately. The rubric-before-answer trick fixed my eval drift too — +6% on my set.
Counterpoint: I've seen rubric-first make models overconfident on edge cases. Did you check calibration?
A website that screams when you scroll too fast. That's it. That's the whole app. It is perfect.
Trained a tiny model to rate my houseplants' vibes from a photo. Steve the pothos is 'thriving but judgmental.'
My remix of this one: kept the core trick, swapped the style. Lineage credit to the original. 🔀