Podcast Banner

Podcasts

Paul, Weiss Waking Up With AI

Model Metrics: Benchmarking AI

In this episode of "Paul, Weiss Waking Up With AI," Katherine Forrest and Anna Gressel discuss AI benchmarking, exploring how these standardized tests evaluate AI models against each other and human capabilities, helping developers and deployers assess performance, safety and progress toward artificial general intelligence.

Stream here or subscribe on your
preferred podcast app:

Episode Transcript

Katherine Forrest: Well hello, everyone, and welcome to another episode of “Paul, Weiss' Waking Up With AI.” I'm Katherine Forrest.

Anna Gressel: And I am Anna Gressel.

Katherine Forrest: And so, Anna, we're recording this on a Friday. But as you know, we are now back in the office four days a week, Monday through Thursday. And Friday's a remote day. So where are you?

Anna Gressel: I'm in the city, I'm just not in the office.

Katherine Forrest: You're still in one of your places in waiting for the fire problem to be fully remediated, et cetera, et cetera, right?

Anna Gressel: Yeah, we're waiting for like the 10th mold remediation report. So it's a joy.

Katherine Forrest: You poor thing. Anyway, I am upstate New York, and I have this like really peaceful place. But what I was reminded of this morning is that I have a super race of frogs that actually live outside of my house up here.

Anna Gressel: Like a super race of them? What does that mean?

Katherine Forrest: Total super race. That means that they make more sound than any group of frogs should ever make. And I will tell you the story if you're game.

Anna Gressel: I would love nothing more on a rainy Friday afternoon when we're recording this podcast.

Katherine Forrest: To hear the story. And by the way, it is going to intersect with AI. That's going to be the extraordinary thing that's going to happen. I'm going make my super race of frogs intersect with AI. So here it is. So the short version, and this is the short version, is that last summer there was a patch of land at the same place where I am now that got a puddle. And there was a rainstorm one night, and a puddle developed, and for whatever reason, some frog decided to lay its tadpole eggs in that puddle. Now this caused me a great deal of consternation because the puddle would each week get smaller and smaller and smaller, and I would then have to water the puddle because the tadpoles were in great danger of drying up. Now in the great circle of life, that may have been what they were supposed to have done, right? But I instead launched something called Operation Tadpole.

Anna Gressel: To the consternation of all the evolutionary biologists who would have just let them perish.