Paul, Weiss Waking Up With AI

Demystifying the Mixture-of-Experts Approach

Katherine and Anna break down the concept of “Mixture of Experts,” an innovative AI technique that enhances efficiency and performance and powers some of today’s most advanced large language models.

Episode Transcript

Katherine Forrest: Hey, good morning everyone and welcome to another episode of “Waking Up With AI,” a Paul, Weiss podcast. I’m Katherine Forrest.

Anna Gressel: And I’m Anna Gressel.

Katherine Forrest: And Anna, today I am actually in Maine. I’m in Maine with my travel mic. It’s a new version—I got it off of Amazon—from the one that my dog ate. A new travel mic. And I’ve got it now wedged against a nice little stand here and I’m ready to go in this stunningly beautiful place in Maine.

Anna Gressel: I can look out your window and it actually does look stunningly beautiful.

Katherine Forrest: All right. So now, what’s on our plate for this episode?

Anna Gressel: So we have a really interesting topic for us today, Katherine: Mixture-of-Experts models. This one is much more of a technical and engineering deep dive than we normally do.

Katherine Forrest: Okay, we’re not going to—for the audience, don’t despair. We’re not going to lose you. What we’re going to try to do, Anna, is as we take this dive down, we’ll baseline our listeners on what a Mixture of Experts is and why we should think about them. So why don’t you go ahead and just kick us off.