Azmat Ullah Babar | AI Engineer & Tech Editor

Sakana Fugu Explained: The AI Model That Is Really a Powerful LLM Orchestrator

June 22, 2026 by Azmat

Sakana Fugu infographic for How Sakana Fugu Works In Simple Words

Sakana Fugu has arrived with the kind of claim that makes AI people immediately open three tabs and start arguing. Is it a new frontier model? Is it a wrapper? Is it just a router wearing a pufferfish costume? The clean answer is this: Sakana Fugu is a learned LLM orchestrator delivered through a single … Read more

GLM 5.2: Benchmarks, API Pricing, and the Developer Routing Playbook

June 17, 2026 by Azmat

GLM 5.2 feature image for benchmarks, pricing, and developer routing playbook

GLM 5.2 lands at a funny moment in AI. Developers are tired of paying luxury prices for every autocomplete, every log summary, every “please explain this stack trace” moment. At the same time, nobody wants a cheap model that gets clever in the wrong direction and turns a simple bug fix into interpretive dance. That … Read more

Anthropic Mythos And Claude Fable 5: The First AI Release That Feels Like A Controlled Detonation

June 10, 2026 by Azmat

Anthropic Mythos feature image showing controlled AI release in a safety chamber

1. The Dawn Of Anthropic Mythos Most model launches feel like product updates. Anthropic Mythos feels more like someone quietly wheeled a jet engine into the office and asked whether the furniture was bolted down. The strange thing about this release is not just that Claude Fable 5 is powerful. Frontier models are always powerful, … Read more

Gemma 4 12B Local Setup Guide: Requirements, Download, Commands, And Benchmarks

June 4, 2026 by Azmat

Gemma 4 12B Local Setup Guide feature image with local AI workstation

There’s a quiet shift happening in AI, and it sounds suspiciously like your laptop fan. For years, the default story was simple: serious models live in data centers, everyone else rents tokens. Gemma 4 12B makes that story feel old. Not dead, not silly, just no longer the only sane option. Google’s new 12B dense … Read more

MiniMax M3: Sparse Attention, 1M Context, and the Agent Model Nobody Should Evaluate Lazily

June 2, 2026June 2, 2026 by Azmat

MiniMax M3 feature image showing sparse attention and 1M context evaluation

MiniMax M3 arrives with the kind of launch copy that makes engineers both curious and allergic. Frontier coding. One million tokens. Native multimodality. Open weights coming soon. Cheaper than the usual suspects. Somewhere, a product manager is already updating a roadmap slide with fireworks. The interesting part is not the fireworks. It’s the engineering bet … Read more

Claude Opus 4.8: Brilliant Agentic AI Or A Token-Burning Trap?

May 30, 2026 by Azmat

Claude Opus 4.8 feature image for Brilliant Agentic AI or Token-Burning Trap

Claude Opus 4.8 is the kind of release that makes two tribes shout past each other. The benchmark crowd sees a serious engineering model, sharper at code, better at tool use, and less willing to bluff. The daily users see a brilliant assistant that sometimes answers “Good morning” like it’s defending a PhD thesis on … Read more

Qwen 3.7 Max Review: 35-Hour Agents, Real Benchmarks, And The Awkward GPT-5.5 Question

May 22, 2026 by Azmat

Qwen 3.7 Max review feature image showing long-running agent engineering workflow

Qwen 3.7 Max is the kind of model launch that makes engineers put down their coffee and squint. Not because Alibaba has discovered a magic new species of intelligence, but because the claim is practical and weirdly concrete: a model that can keep working for 35 hours, call tools more than a thousand times, and … Read more