Back to Articles
News

This Week in AI Coding: Grill-with-docs Skill and Deepseek / Composer / Qwen

May 27, 2026
5 min read

Hey hey, Useful Laravel links to read/watch for this week of May 27, 2026.

If you want to get such list every Wednesday, subscribe at the bottom of the website.


For Premium Members

I Tried /grill-with-docs Skill: Massive Difference

26-minute video. I went through /grill-with-docs session based on a project description from a client, and then gave the result docs to Composer 2.5 to implement the code. I will show you the cost and the final code result.


I Tried Planning with Opus and Building with Deepseek Flash

18-minute video. One of the ways to save money on tokens is to prepare the plan with expensive model like Opus/GPT, and to give implementation to a cheaper model like Deepseek or new Cursor Composer 2.5. I tried this scenario and will show you code quality and actual cost.


Benchmark of 12 LLMs on React/Typescript: 7 Tests with Playwright

12-minute video. I have executed the same prompt to create 7 React component, on 12 AI models, 5 times on each. Let me show the results and the conclusions.


From My YouTube Channel

I Tried NEW Qwen-3.7-Max on Three Projects

Another new LLM was released, and I hurried to test it out, comparing to other models on the same benchmark.


Your LLM Prompt Result Depends on THIS Factor

I made an experiment with 8 different LLM, giving the same prompt on two different-quality codebases. Did any model even try to refactor bad code?


I Tried to Plan with Opus and Build with Deepseek Flash / Composer 2.5

Shorter free version of Premium video from above.


Benchmark of 12 LLMs on React/Typescript: 7 Tests with Playwright

Shorter free version of Premium video from above.


AI Coding Community

AI usage is getting too expensive?
x.com

Look at the news. - MS is restricting Claude Code usage as their bill went too high - Uber used their CC yearly budget by April - GitHub Copilot prices skyrocket since June 1st. And more.


We’ve shipped a security-guidance plugin for Claude Code
x.com

It helps identify and fix vulnerabilities as you’re writing code. Available for all Claude Code users. Install from the plugin marketplace (/plugins).


OpenRouter on X: "Today we’re announcing our $113M Series B"
x.com

Over the last 6 months, weekly volume on OpenRouter grew from 5T to 25T tokens as AI rapidly shifts from experimentation into production. We’re excited for what comes next.


Peter Steinberger on X: "Folks: when you write skills, ask your agent to be token efficient, relax grammer."
x.com

I see too many skills that write books in the skill description, and all that crap is loaded into every context. I wrote a skill that finds the worst offenders.


James Long on X: "we built a diff viewer in opencode! available now"
x.com

More and more work is moving into coding agents, I don't live in my editor anymore but you gotta keep an eye on these little goblins, they write bad code.


DeepSeek on X: "We are making our discount permanent!"
x.com

DeepSeek-v4-Pro stays with 75% discount. Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life!


OpenAI on X: Now your Mac doesn’t have to be unlocked for Codex to use your computer.
x.com

From your phone, Codex can securely use apps on your Mac, even when the screen is off and locked.


In Claude Code: run /usage to see a breakdown of which Skills, Agents, MCPs, and Plugins are using your tokens
x.com

CLI today, coming to Desktop next


Theo on X: "Gemini 3.5 Flash is a really interesting release."
x.com

It's super fast and surprisingly smart. It's also more expensive (3x more per token) and super token hungry. The result - it costs 2x more to run than Gemini 3.1 Pro on similar tasks. It's more expensive than GPT-5.5 Medium.


Claude Code vs Codex vs Cursor (an honest comparison)
youtube.com

Video on YouTube by Theo. The three main coding agents right now are Claude Code, Codex, and Cursor. Which one should you use?


Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks.
x.com

On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.


That's it for this week, see you in the next newsletter issue!

Again, if you wanna get this to your inbox every Wednesday, subscribe at the bottom of the website.

Share this article

Povilas Korop

Get Weekly AI Coding News

You'll also get TWO free tutorials:
"My Favorite 10 Tips & Tricks" on Claude Code and Codex CLI!

Sent every Wednesday. No spam, ever. Unsubscribe anytime.