Azmat Ullah Babar | AI Engineer & Tech Editor

MiniMax M2.1 Review: The Fast Path To API Access, Local Runs, Real Pricing, And Benchmarks That Hold Up

December 23, 2025 by Azmat

MiniMax M2.1 cover showing API, local, pricing, benchmarks

Every few months, the internet discovers a new “coding beast” model and immediately does what it always does. Someone posts a chart, someone posts a slick UI demo, and then a thousand developers ask the only questions that matter, … Read more

GLM-4.7 Review: From $3 Agentic Workflows to Local Uncensored Roleplay

December 23, 2025 (updated January 18, 2026) by Azmat

A developer working on code at a desk with monitors displaying the GLM-4.7 review title.

Sudden benchmark glow-ups always give me the same feeling as a too-clean Git history. Interesting, maybe impressive, but I want to see what got squashed. GLM-4.7 landed with that exact energy, a flagship model claiming big jumps in coding, … Read more

Anthropic Bloom Guide: Automating LLM Red Teaming And Benchmarking Claude Opus 4.5 Vs GPT-5

December 21, 2025 (updated February 8, 2026) by Azmat

Engineer in server lab analyzing Anthropic Bloom data for AI red teaming.

I used to “test” models the way most of us do at first. A dozen prompts, a quick skim, a shrug. It feels responsible. It’s also a lie we tell ourselves because writing good … Read more

T5Gemma 2 Explained: Why Google Is Betting Big On Encoder-Decoders (Again)

December 20, 2025 (updated January 18, 2026) by Azmat

A Google engineer working on T5Gemma 2 code on a laptop in a bright office.

Decoder-only models have been winning the popularity contest for a while. They are great at talking. You give them a prompt, they keep the autocomplete train rolling, and suddenly you have code, essays, or a questionable poem about … Read more

Chain of Thought Monitorability: Panopticon Or Protection? Inside OpenAI’s Strategy To Catch Deceptive Reasoning

December 19, 2025 (updated January 18, 2026) by Azmat

AI researcher analyzing Chain of Thought Monitorability on a glass interface.

Reasoning models did something quietly radical. They turned “thinking” into an explicit artifact. Instead of jumping straight to an answer, they often generate an internal chain-of-thought and only then produce the user-facing output. That shift is exciting, and it’s also a new … Read more

Inside GPT 5.2 Codex: Benchmarks, Cybersecurity, and the React Vulnerability

December 19, 2025 by Azmat

Editorial magazine cover showing a glowing tech core representing GPT 5.2 Codex.

Most “AI coding assistants” are fancy autocomplete with a chat box. They speed you up on the easy parts, then tap out the moment the work turns into real engineering: tracing a bug across modules, running tools, chasing flaky tests, … Read more

GPT 5 Math Breakthrough: How Solving An Open Geometry Optimization Problem Signals The AI Tipping Point

December 18, 2025 (updated January 18, 2026) by Azmat

A researcher watches chaotic data transform into glowing complex geometry on a holographic interface, symbolizing the GPT 5 math breakthrough.

There is a specific sound a field makes right before it changes. It is not applause. It is the quieter noise of people updating their defaults. This week’s example is a short paper by Johannes Schmitt, where research-grade AI systems helped discover and prove a … Read more