Rainer Burkhardt@lemmy.world to Programmer Humor@programming.dev • It must be a silent R • English • 3 months ago
I can evaluate this because it’s easy for me to count. But how can I evaluate something else? How can I know whether the LLM is good at it or not?