MobileMall BlogMobileMall BlogMobileMall Blog
  • #Explore
  • Business
  • Technology
    • Gaming
    • Headphones
    • Laptops
    • Mobile Accessories
    • Home Networking
    • PCs
    • Printers
    • Smart Watches
    • Speakers
    • Streaming Devices
    • Tablets
    • Wearables
    • Smart Office
  • Security
  • Buying Guides
  • Contribute
Reading: Anthropic Drops Claude Opus 4.6: Beats GPT-5.2, Gets 1M Token Context, and Ships Agent Teams
Share
Font ResizerAa
MobileMall BlogMobileMall Blog
Font ResizerAa
  • #Explore
  • Business
  • Technology
  • Security
  • Buying Guides
  • Contribute
  • #Explore
  • Business
  • Technology
    • Gaming
    • Headphones
    • Laptops
    • Mobile Accessories
    • Home Networking
    • PCs
    • Printers
    • Smart Watches
    • Speakers
    • Streaming Devices
    • Tablets
    • Wearables
    • Smart Office
  • Security
  • Buying Guides
  • Contribute
2025 © Mobilemall. All Rights Reserved.
Home » Blog » Anthropic Drops Claude Opus 4.6: Beats GPT-5.2, Gets 1M Token Context, and Ships Agent Teams
News

Anthropic Drops Claude Opus 4.6: Beats GPT-5.2, Gets 1M Token Context, and Ships Agent Teams

Mohammad Ahsan
Last updated: February 5, 2026 6:08 pm
Mohammad Ahsan
Share
Anthropic 4.6 launched
SHARE

Contents

  1. The Numbers That Matter
  2. What’s Actually New
  3. What Partners Are Saying
  4. The Safety Angle
  5. Pricing
  6. The Bottom Line

Feb 5 2026, Anthropic just launched Claude Opus 4.6 — and the benchmarks are brutal for competitors.

The new model outperforms OpenAI’s GPT-5.2 by 144 Elo points on real-world work tasks, scores highest in the industry on agentic coding, and becomes the first Opus-class model with a 1 million token context window.

It’s live now on claude.ai and the API.

The Numbers That Matter

  • 144 Elo points ahead of GPT-5.2 on GDPval-AA (knowledge work tasks in finance, legal, and other domains)
  • 190 Elo points ahead of its own predecessor, Claude Opus 4.5
  • #1 score on Terminal-Bench 2.0 (agentic coding evaluation)
  • #1 score on Humanity’s Last Exam (complex reasoning)
  • #1 score on BrowseComp (finding hard-to-find information online)
  • 76% vs 18.5% — Opus 4.6 vs Sonnet 4.5 on long-context retrieval (MRCR v2)
  • 90.2% on BigLaw Bench (legal reasoning) — highest of any Claude model
  • 38 out of 40 blind cybersecurity investigations won against Claude 4.5

What’s Actually New

1M Token Context Window (Beta)

Opus 4.6 is the first Opus-class model that can handle a million tokens. Premium pricing kicks in above 200k tokens ($10/$37.50 per million input/output).

Agent Teams in Claude Code

You can now spin up multiple AI agents that work in parallel, coordinate autonomously, and tackle tasks together. Best for codebase reviews and read-heavy work that splits into independent chunks.

Adaptive Thinking

Previously, extended thinking was either on or off. Now the model decides when deeper reasoning would actually help — and developers can tune how selective it is.

Effort Controls

Four levels: low, medium, high (default), max. Dial down if the model is overthinking simple tasks. Dial up for complex problems.

Context Compaction

Long-running tasks used to hit the context window and stall. Now Claude can automatically summarize older context and keep going.

128K Output Tokens

Opus 4.6 can generate outputs up to 128,000 tokens in a single response — no need to break large tasks into multiple requests.

Claude in PowerPoint (Research Preview)

New. Claude can now build presentations from scratch, read your layouts and fonts, and stay on brand. Pairs with the upgraded Claude in Excel for full document workflows.


What Partners Are Saying

The early access feedback is unusually strong:

“Claude Opus 4.6 is the biggest leap I’ve seen in months. I’m more comfortable giving it a sequence of tasks across the stack and letting it run.” — Austin Ray, Staff Software Engineer, Ramp

“Opus 4.6 handled a multi-million-line codebase migration like a senior engineer. It planned up front, adapted its strategy as it learned, and finished in half the time.” — Gregor Stewart, Chief AI Officer, SentinelOne

“Claude Opus 4.6 autonomously closed 13 issues and assigned 12 issues to the right team members in a single day, managing a ~50-person organization across 6 repositories.” — Yusuke Kaji, General Manager AI, Rakuten

“Both hands-on testing and evals show Claude Opus 4.6 is a meaningful improvement for design systems and large codebases. It also one-shotted a fully functional physics engine.” — Eric Simons, CEO, Bolt.new


The Safety Angle

Anthropic says Opus 4.6 shows “an overall safety profile as good as, or better than, any other frontier model in the industry.”

Key points:

  • Lowest rate of over-refusals (refusing harmless requests) of any recent Claude model
  • Low rates of deception, sycophancy, and misuse cooperation in automated audits
  • Six new cybersecurity probes to detect misuse of the model’s enhanced hacking capabilities
  • Full system card published with comprehensive testing details

Pricing

Unchanged: $5 per million input tokens / $25 per million output tokens

Premium pricing for 1M context (above 200k tokens): $10/$37.50 per million.

US-only inference available at 1.1× token pricing for compliance-sensitive workloads.


The Bottom Line

Claude Opus 4.6 isn’t a minor version bump. It’s a significant capability jump that:

  • Dominates benchmarks across coding, reasoning, search, and professional work tasks
  • Finally gives Opus users the million-token context window Sonnet already had
  • Ships genuinely new features (agent teams, adaptive thinking, context compaction)
  • Maintains or improves safety alignment despite the intelligence gains

Available now on claude.ai, the API, Amazon Bedrock, and Google Cloud Vertex AI.

Model string for developers: claude-opus-4-6

Samsung’s Galaxy Z Fold7 Just Got More Expensive in the US
TECNO Spark 40C Now Available in Pakistan
Huawei Pura 80 Series Set to Launch on June 11, Camera Upgrades Expected
Apple’s Cheap MacBook Is Apparently Coming Next Month
Google Translate Is 20, and It Just Got the One Thing It Was Missing

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
[mc4wp_form]
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Copy Link Print
Share
ByMohammad Ahsan
Follow:
is a creative writer & a BBA Student from Karachi Pakistan. He is Co-Admin at Mobilemall.pk. Mostly share ideas about Mobile Phones, Technology, SEO, SEM, PPC, etc.
Previous Article Qatar's Smartphone Industry The Gulf's Most Connected Market Qatar’s Smartphone Industry: The Gulf’s Most Connected Market
Next Article Samsung Galaxy S26 Samsung Galaxy S26 Series Has an Official Launch Date Reservations Are Already Open

Latest News

samsung-galaxy-z-fold8-ultra
Samsung Z Fold8 Ultra Reportedly Jumps to 5,000mAh Battery — No Extra Weight
News Samsung
alphabet-80-billion-raise
Alphabet Raised $80 Billion in One Move. Berkshire Was Part of It.
Google News
crypto-casinos
Crypto Casinos Pulled In $81 Billion Last Year, and That’s Five Times What They Made in 2022
Business Entertainment
How AI Room Redesign Apps Turn a Phone Photo Into a Photorealistic Makeover
How AI Room Redesign Apps Turn a Phone Photo Into a Photorealistic Makeover in Under 10 Seconds
Artificial Intelligence
The Truth About Megapixels
The Truth About Megapixels: Why a Higher Number Does Not Mean a Better Camera
Camera & Photo Innovation
WHEN ADS STOP WORKING
Brand Recall From Digital Ad Screens Stops Climbing Around the 8th Exposure — Here’s What the Habituation Data Actually Shows
Digital Marketing
Your Phone's Touch Latency Might Be Costing You Bets You Thought You Placed in Time
Your Phone’s Touch Latency Might Be Costing You Bets You Thought You Placed in Time
Phone Review
Comparing AI Strategies in Betting Platforms
Esports Betting Platforms Now Run on AI — From Odds to Support Tickets to Fraud Detection
Data Science

You Might also Like

OPPO-Reno15-Series
News

Oppo Reno15 Series Details

Sagar Bakre
Sagar Bakre
2 Min Read
Google Pixel 9a
GoogleNewsPhone Leak

Google Pixel 9a’s full specifications and price revealed through latest leak

Peter Brandt (Google Guy)
Peter Brandt (Google Guy)
1 Min Read
Hot 50i
NewsPhone Launch

Infinix Officially Launches Hot 50i with Impressive Specs and Features.

Sagar Bakre
Sagar Bakre
2 Min Read

About us

Mobilemall.co blog is an informative and engaging platform that offers readers the latest news and insights on mobile phones and accessories. The blog covers a wide range of topics, including product reviews, industry trends, and tips on how to get the most out of your mobile device.

Contact Us:
[email protected]

Categories Link

  • Business
  • Mobile
  • Technology
  • Gaming
  • Phone Review
  • Android

Must Read

s27-pro-camera
Samsung Galaxy S27 Pro camera leak
Phone Leak Samsung
Why Your TikTok Videos Are Not Getting Views (And How to Fix It)
Why Your TikTok Videos Are Not Getting Views (And How to Fix It)
Social Media

Quick Links

  • Privacy Policy
  • Tech Write For Us
  • Contact Us
  • Facebook
  • Instagram
  • YouTube
  • LinkedIn
2026 © Mobilemall. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up