- Tubelator AI
- >
- Videos
- >
- Science & Technology
- >
- Claude 3.7 Sonnet: Exploring the Latest Features and Innovations
Claude 3.7 Sonnet: Exploring the Latest Features and Innovations
Discover the exciting new features of Claude 3.7 Sonnet, an advanced thinking model by Anthropic. Learn about its innovative capabilities, including building a complex snake game with AI snakes and a super food block. Explore the developments in the Cloud series models and Cloud Code for agentic coding.
Video Summary & Chapters
No chapters for this video generated yet.
Video Transcript
Clawed 3.7 Sonnet was just released and I just got done testing it.
I built a complex snake game that allowed two AI snakes to battle each other.
I added a super food that creates a block that can destroy one of the snakes
that actually moves and follows the snake around.
And all of this was done on the very first try.
And I'm going to show you more about that later in the video.
But now let me tell you a little bit about Clawed 3.7 Sonnet.
So, two things were actually released just now.
We have Cloud 3.7 Sonnet, which is a big but still a dot upgrade to the Cloud series of
models.
And then we also have Cloud Code, which is a command line interface for agentic coding.
Now for Cloud 3.7 Sonnet, it is a thinking model.
And this is the first thinking model by Anthropic.
I am pretty surprised that this is not Cloud 4.
And I find it a little bit weird that this jump is from 3.5 to 3.7 versus just straight to 4,
which makes me think 4 is in the works and it's going to be much, much better,
but we don't know that for sure. But what we do know is that this
minor version increment is a big jump. This is the first.
hybrid reasoning model on the market.
That means Cloud 3.7 is both capable of generating near instant replies
to whatever prompt you have in the more traditional LLM way.
And it also has thinking so it can take its time using chain of thought
before replying to you.
Very similar to O1, O3 and Grok3.
But both of those come from a single model.
Now, just like other thinking models, Cloud 3.7 has a scratchpad
in which it's doing chain of thought.
So it's actually iterating on its thinking, it's reflecting, it's trying different potential
results and then finally summarizing everything or kind of choosing the best one and then
showing it to you.
And they actually do show the chain of thought, which I thought was surprising because Anthropic
is kind of known for being really closed source and very big on security.
Now whether or not they're actually showing the true full chain of thought, I'm not actually
sure, but it kind of does look like they are.
And if you have API access, there's actually a dial in which you can tell Cloud 3.7 how long to think for.
And you can actually specify the number of tokens up to the context window maximum, which is 128,000 tokens.
which is definitely on the smaller side of context windows.
So as an API user, if you're building API applications and you're using
Cloud 3.7 SONNET to power it all, you do want to specify how many tokens maximum
so that you just don't blow your budget overnight.
Let's look at some of these results.
So this is SWE Bench verified.
Here is Cloud 3.7 SONNET.
This is a 20% increase versus the other models listed here.
This is Claude 3.5 Sonnet new, a 103 mini high and DeepSeek R1.
All four of these models come in right around the 49%
and then with Claude 3.7 Sonnet, we reach 70%.
Now there's a caveat here.
This kind of lighter pink area says with custom scaffolding.
That just means they use customized chain of thought techniques