There is absolutely no way that Google is generating labels like the ones described and then not training on them.
Sure, MAYBE they were originally intended for moderation or evaluation, but they're perfect RLHF labels too.
But Google lies even when the truth is easier and better for it, so 🤷
18 days ago