The AI evals field chose a flawed tool and stuck with it
Session one left me with two things I hadn’t resolved.1 The first was a line the instructor said almost in passing: “the hard part is scalability, not automation.” I wrote it down because it piqued something, but I couldn’t quite work out what problem it was pointing at. The second was a question I kept […]
The AI evals field chose a flawed tool and stuck with it Read More »

