Catching up with Scale AI: A Conversation with founder and CEO Alexandr Wang
Index partner Mike Volpi recently caught up with Alexandr Wang, founder and CEO of Scale AI, at Scale’s new San Francisco office. They discussed the future of AI, the critical role of high-quality data, and the challenges of moving AI projects from prototype to production. Alex also shared his thoughts on the importance of AI safety and the broader geopolitical dynamics shaping the industry today.
https://scale.com/
https://www.indexventures.com/
https://www.indexventures.com/perspectives/catching-up-with-scale-ai-founder-and-ceo-alexandr-wang/
Video Transcript
So these are the new digs on Iowa.
New office.
It's been a great setup.
We have a responsibility to produce all the data,
truly generate the data necessary to fuel this next era.
What worked before is not going to keep working.
Our responsibility is to utilize the exact same infrastructure and data foundry that we've built
to support every enterprise in their own journey
to make use of all their proprietary data
towards building customized specialized agents
for their own businesses.
You must still hire a lot right out of college.
I mean, obviously I dropped out to start the company, so I have a strong conviction
in talent right out of college.
We have an office in New York, an office in DC, we just started an office in London.
That's amazing.
Thank you very much.
You were sort of born in the AI era, so where do you think we are in the development of
this?
Are we getting to the stage where, you know, musical chairs is over, we know who the main
players are, this game is done? Or are we in the first inning, the
third inning, are we, you know, getting settled into it?
To me, the first inning was sort of the tinkering phase of modern deep learning. So from ImageNet
and AlexNet: ImageNet was the first large-scale labeled image
dataset, and AlexNet was the very first use of deep neural networks to solve
that problem. There was the Google result where they could recognize cats
in YouTube videos. All of that, call it roughly 2009 through 2020, was the first inning.
And while it lasted quite a while, it was really
a lot of tinkering with different kinds of model architectures and different kinds of data sets.
It was the first demonstration that scaling these models up really worked. A lot of
the progress from GPT-2 through GPT-4 was all in pre-training, all training on
bigger and bigger chunks of the internet with more and more GPUs. And then
basically all the gains from that point, which was, you know, March of last year, through
today, August of 2024, have been through gains in post-training. And this is
through better SFT (supervised fine-tuning), RLHF (reinforcement learning from human feedback),
and DPO (direct preference optimization) on the models,
and the use of data sets of increasing complexity,
going into more and more expert areas
and really driving the performance of the models
with quality over quantity.
So smaller data sets, but of very, very high quality.
Now you're actually creating effectively data sets
that are unique to the explicit purpose
that the model maker wants, right?
Exactly, yeah.
So we think about our role now as more of a data foundry than a data annotator.
The industry is very, very excited about agents.
There's very little data you can train these models on that captures the series of actions
a human takes and their internal thought process as they go through each of those steps.
We actually view one of our most important roles, especially over the next few years,
as laying the groundwork for agents to actually become a possibility.