Anthropic says its new AI model can work almost an entire workday straight
- Anthropic unveiled its latest Claude 4 lineup at its first-ever developer conference on Thursday, introducing two advanced AI models designed for complex data analysis and task execution.
- The launch follows Anthropic's shift from chatbots to complex AI tasks like coding and research, amid a heated AI arms race involving major tech firms.
- Claude Opus 4 maintained focus on a complex coding project for nearly seven hours, outperforming competitors with a 72.5% score on the SWE-bench software engineering benchmark.
- Anthropic said Opus 4 is the best coding model globally and reduces reward hacking by 65%, while both models can use search engines and tools in parallel for deeper reasoning.
- This advancement signals a shift toward AI as genuine collaborators capable of day-long autonomous work, with significant implications for software development and knowledge work.
Insights by Ground AI
Does this summary seem wrong?
27 Articles
27 Articles
All
Left
5
Center
10
Right
Anthropic overtakes OpenAI: Claude Opus 4 codes seven hours nonstop, sets record SWE-Bench score and reshapes enterprise AI
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to day-long collaborator.
·San Francisco, United States
Read Full ArticleCoverage Details
Total News Sources27
Leaning Left5Leaning Right0Center10Last UpdatedBias Distribution67% Center
Bias Distribution
- 67% of the sources are Center
67% Center
L 33%
C 67%
Factuality
To view factuality data please Upgrade to Premium
Ownership
To view ownership data please Upgrade to Vantage