Azmat Ullah Babar | AI Engineer & Tech Editor

Gemini Omni Review: Is Google’s “Nano Banana For Video” Worth The Hype?

May 21, 2026 by Azmat

Gemini Omni arrives with the kind of demo that makes product teams whisper, “Well, there goes the roadmap.” Type a prompt, feed it an image, add audio, point it at a rough clip, and suddenly the model is editing video as if it has a tiny film crew trapped inside the weights. That is the …

Gemini 3.5 Flash Review: Fast Agents, Expensive Tokens, And The New Flash Era

May 20, 2026 by Azmat

The old meaning of “Flash” was simple: cheap, fast, good enough, and slightly disposable. Gemini 3.5 Flash breaks that mental model. This is not a bargain-bin chatbot with a racing stripe. It is Google’s attempt to turn the Flash tier into an agent engine, the kind of model you point at a terminal, a workflow, …

Codex Sandbox: How Windows Security Failed the AI Agent Test (And Linux Didn’t)

May 14, 2026 by Azmat

The Codex Sandbox story starts with a wonderfully ordinary developer problem: a coding agent wants to run a command. That sounds harmless until the command is reading your SSH config, editing the wrong folder, or phoning home with a token it found in a forgotten .env file. The old software model was simple. Humans click …

Diffusion Language Models: Inside MIT’s ELF And Kaiming He’s Continuous Breakthrough

May 13, 2026 by Azmat

For most of the modern AI boom, text generation has had one very good trick: predict the next token, then do it again, and again, and again, until the machine either writes a sonnet or apologizes for being unable to help with your toaster. Diffusion language models ask a different question. What if language didn’t …

Project Maven: How Palantir And Claude Were Reportedly Used In Iran

May 7, 2026 by Azmat

The first thing to understand about Project Maven is that it was never just “AI for war” in the Hollywood sense. No glowing red robot eye. No machine calmly deciding the fate of cities. The more unsettling reality is much more ordinary: a giant data pipeline, trained models, sensor feeds, maps, alerts, rankings, dashboards, and …

Multimodal Chain Of Thought: DeepSeek’s Visual Primitives And The 7056x Compression Trick

May 4, 2026 by Azmat

The weird thing about vision models is that they can often describe an image beautifully, then fall apart when asked to do something a five-year-old does with a finger. Count the bears on the ground. Trace the line from the crown icon. Navigate the maze. Don’t hallucinate a shortcut through a wall. That is where …

GPT 5.5 Review: Benchmarks, Pricing, And The Agentic Era

April 24, 2026 by Azmat

A funny thing happened on the way to the next chatbot upgrade: the chatbot started behaving less like a chatbot. GPT 5.5 arrived on April 23, 2026, and the GPT 5.5 release date matters because this launch is less about prettier paragraphs and more about handing work to software that can plan, click, debug, browse, …