Claude Code Daily Briefing - 2026-05-07

Release Summary

Version	Date	Key Changes
v2.1.132	5/6	`CLAUDE_CODE_SESSION_ID` env var, fullscreen disable option, 10GB+ memory leak fix, sleep/wake blank screen fix
v2.1.129	5/6	`--plugin-url` flag, auto-update setting, `skillOverrides`, Ctrl+R history picker, prompt cache TTL fix

Full Release Notes

New Features & Practical Usage

Code with Claude SF Key Announcements — Dreaming, Outcomes & Multi-agent for Managed Agents (5/6)

Yesterday’s Code with Claude SF brought no new model release, but three major additions to Claude Managed Agents.

1. Dreaming (Research Preview): Agents review their own past sessions to extract patterns and automatically curate memory. Think of it as overnight playbook generation. Harvey reported ~6x improvement in completion rates when agents could remember workarounds between sessions. You can choose automatic updates or manual review.

2. Outcomes (Public Beta): Define success criteria as a rubric, and a separate grader evaluates agent output in its own context window — isolated from the agent’s reasoning. This delivers up to 10-point success rate improvements over standard prompting, with +8.4% on docx generation and +10.1% on pptx generation.

3. Multi-agent Orchestration (Public Beta): A lead agent breaks complex jobs into pieces and delegates to specialist agents running in parallel on a shared filesystem with persistent event history. Netflix’s platform team uses this to analyze build logs across hundreds of sources.

Anthropic Blog | Simon Willison Live Blog

SpaceX Colossus Deal — Claude Code Usage Limits Doubled (5/6)

Anthropic signed a deal for full access to SpaceX’s Colossus 1 data center — 300+ MW of infrastructure and 220,000+ NVIDIA GPUs (H100, H200, GB200) available within one month.

Immediate impact:

Claude Code’s 5-hour usage limits doubled for Pro, Max, Team, and Enterprise
Peak-time throttling removed for Pro/Max accounts
Claude Opus API rate limits substantially increased

The companies also plan to develop multi-gigawatt orbital AI compute capacity.

Bloomberg | CNBC

/ultrareview — Cloud Multi-agent Code Review (5/6)

Claude Code now includes /ultrareview, which launches a fleet of reviewer agents in a remote sandbox to analyze your branch or PR. Reviews take 5–10 minutes and run as background tasks so you can keep working.

Already used by every team at Anthropic, it integrates with GitHub to leave comments directly on PRs.

# Review current branch
/ultrareview

# Review a specific PR
/ultrareview --pr 123

Claude Blog | Claude Code Docs

Developer Workflow Tips

v2.1.129 — Remote Plugin Loading with `--plugin-url`

Building on v2.1.128’s ZIP plugin support, v2.1.129 adds --plugin-url to fetch plugin archives directly from URLs. Combined with CLAUDE_CODE_PACKAGE_MANAGER_AUTO_UPDATE for background auto-updates, plugin distribution across teams is now seamless.

# Load plugin from remote URL
claude --plugin-url https://example.com/my-plugin.zip

v2.1.132 — 10GB+ Memory Leak Fix and Fullscreen Disable

A critical fix in v2.1.132 addresses unbounded memory growth (10GB+ RSS) caused by non-protocol MCP server output being buffered indefinitely.

The new CLAUDE_CODE_DISABLE_ALTERNATE_SCREEN=1 env var opts out of the fullscreen renderer — useful for terminals with compatibility issues or when piping output.

# Disable fullscreen mode
export CLAUDE_CODE_DISABLE_ALTERNATE_SCREEN=1

# CLAUDE_CODE_SESSION_ID is now auto-set in Bash tool subprocesses
# Useful for automation and logging pipelines

GitHub v2.1.132 | GitHub v2.1.129

Security & Limitations

Google Chrome has been found to automatically download and install Gemini Nano model weights (~4GB) without explicit user consent. It supports the “Help me write” feature and occupies disk space regardless of usage.

Developers with limited disk space or data-capped environments should be aware. Disable via chrome://flags/#optimization-guide-on-device-model.

GeekNews

Ecosystem & Plugins

Cloudflare + Stripe — Agents Can Autonomously Create Accounts, Buy Domains & Deploy (5/7)

Through a new protocol co-designed by Cloudflare and Stripe, AI coding agents can now autonomously create Cloudflare accounts, start paid subscriptions, register domains, and deploy to production.

The three-stage flow (Discovery → Authorization → Payment) uses Stripe as identity provider with automatic Cloudflare account provisioning. Payment details are never exposed to agents, with a $100/month default spending cap.

# Initialize via Stripe CLI
stripe projects init
# Then ask your agent to "build and deploy to a new domain"

GeekNews | Cloudflare Blog

Anthropic API Traffic Up 17x — Infrastructure Expansion Roadmap

Anthropic disclosed 17x year-over-year API traffic growth at Code with Claude SF. Beyond SpaceX, compute capacity is being secured through Amazon (5GW), Google & Broadcom (5GW, from 2027), Microsoft & NVIDIA ($30B Azure), and Fluidstack ($50B US AI infrastructure).

Community News

Code with Claude SF Extended — Today (5/7): The indie developer and early-stage startup session continues today with Applied AI team deep dives. No livestream available. Code with Claude SF Extended
“AI Didn’t Delete Your Database — You Did”: Analysis arguing that production database deletion incidents stem from system design flaws enabling destructive API access, not AI itself. A reminder to use hooks and denyRules for system-level safeguards. GeekNews
Go Achieves FIPS 140-3 Certification: The Go Cryptographic Module has been certified under federal security standards, reducing compliance burden for Go-based MCP servers and infrastructure tools in regulated environments. GeekNews

Minor Changes

skillOverrides setting: Control skill behavior with off, user-invocable-only, or name-only in v2.1.129
Ctrl+R history picker: Now defaults to showing all prompts; narrow with Ctrl+S
Gateway model discovery opt-in: Set CLAUDE_CODE_ENABLE_GATEWAY_MODEL_DISCOVERY=1 to enable (previously default-on)
/context ASCII grid removed: Eliminated ~1.6k token waste from debug output
Prompt cache TTL fix: 1-hour TTL was being downgraded to 5 minutes — directly impacts cache hit rates
External SIGINT graceful shutdown: Terminal modes now properly restored on external signals

Interesting Projects & Tools

oh-my-free-models — Local proxy that routes to the fastest free LLM: Automatically handles rate limiting, latency variance, and quota exhaustion across free LLM providers. Point your OpenAI/Anthropic-compatible agent to localhost, and it routes to the fastest available model with real-time latency measurement. Rate-limited models get a 10-minute cooldown. 13 points. GitHub
Brewery — macOS GUI for Homebrew package management: A native macOS app for managing Homebrew packages without terminal commands. Search, install, update, and remove packages through a visual interface. GeekNews