Will Devin AI Take Your Job? A Detailed Look at Cognition Labs' New AI Tool
Discover the truth behind Devin AI, a new tool by Cognition Labs that is causing a stir in the tech world. Are the claims about its capabilities exaggerated? Learn what Devin can and cannot do, and whether it poses a real threat to software engineers' jobs.
Video Summary & Chapters
1. Introduction 🌐
Overview of Devin AI and its impact on software engineering jobs.
2. Devin's Capabilities 💡
Exploring the impressive claims and capabilities of Devin AI.
3. Addressing Concerns 🤖
Discussing common fears and misconceptions surrounding Devin AI.
4. GitHub Performance Analysis 📊
Analyzing Devin's performance in solving GitHub issues and its implications.
5. AI in Issue Documentation 📝
The importance of well-documented GitHub issues and pull requests.
6. Evaluation of Devin AI 🧠
Concerns about Devin AI's evaluation on a 25% subset of data.
7. AI Learning Abilities 📚
Discussion on how Devin AI can learn from blog articles and resources.
8. Bug Fixing Capabilities 🐞
Exploring Devin AI's ability to find and fix bugs in specific code.
9. Devin Writing Test Cases
Devin AI writes test cases based on the developer's prompts.
10. Devin's Work on Upwork
Devin accomplishes tasks on Upwork by implementing models.
11. AI Tools Empowering Developers
AI tools like Devin empower developers by aiding in coding tasks.
Video Transcript
If you've been on Twitter or YouTube over the last week, you've definitely heard of Devin,
the brand new AI tool that supposedly acts and works just like a software engineer,
and a lot of people are worried that this is going to be the thing that takes over your job as
a software engineer. There are a lot of really impressive claims that Devin is making,
but how true are they actually, and how impressive is this AI tool? I've gone through,
I've done the research, read the papers, looked at all the different claims that they're making,
and I really think Devin is not nearly as impressive or scary as people are making it out to be.
In this video, I want to talk about what Devin is, what it actually can accomplish,
and some of the things that it really cannot do.
Welcome back to Web Dev Simplified.
My name is Kyle, and my job is to simplify the web for you so you can start building your
dream project sooner.
And today we're going to be talking about Cognition Labs' newest AI, which is Devin.
This is pretty much a brand new company that really hasn't released anything at all
before releasing this Devin AI.
Now, they put out a blog article, which I'm going to link in the description of this
video.
This blog article goes through quite a few different things about Devin, what it's
capable of and everything it can do, and it is really showcasing all of the best-case scenarios
for Devin because they want this to look as good as possible.
And that's because most of the time what these AI companies are trying to accomplish
is actually getting tons and tons of funding.
If we actually scroll to the top of this page, you can see that they've already raised
$21 million in funding pretty much immediately from announcing this and all of that stuff going
along with this.
So really the goal of these types of blog articles and all of this information is to
drum up as much hype as possible to get as much funding as possible into these particular
AIs, so they want it to look as good as possible on paper.
Now, there are a few different things I want to talk about in this video, specifically
the things people are most scared of.
So, if we scroll down to this Devin's capabilities section, there's a bunch of different videos that
we can go through that talk about the different things Devin can do, and I want to focus on
some of the main ones and why they're maybe not as scary as you think.
The first one here is that Devin can learn how to use unfamiliar technology.
This one is scary to a lot of people because the AI essentially can teach itself using
existing blog articles, videos, documentation and so on, which sounds really scary, but honestly,
when we dive into this, it's not that bad. Another thing that we want to talk about is how
it can actually find and fix bugs for you autonomously, which is very misleading compared to what they
actually do in the video. Again, I'll dive deeper into why this is not nearly as scary as they make
it out to be, especially based on the video that they show you. And then finally here, if we go
down a little bit further, we can see that Devin is actually able to accomplish real-world jobs
on Upwork, which is again something that's really scary for people because it's essentially replacing
jobs that people could do, but again this may not be as scary as you think it is.
Now if we scroll all the way down to the bottom here, you may see this chart. This is probably
something you've seen if you've heard people talk about Devin, and essentially it's saying that Devin
is able to resolve 13.86% of GitHub issues. And that's how a lot of people present it,
but essentially it's just using SWE-bench, which is a paper and a benchmark for testing
AI against GitHub issues. And if we go to the actual site for this, you'll notice that
this is actually much less of a scary thing than people think. They may think that,
okay, it can solve essentially, what is it, 13.8% of all GitHub issues. But really, what