NOTES
BLOG
OSS
TALKS
WIKI
ABOUT

AI discovery: 2026/02

Date

2026/02/13

Cover

Tags

discoveryai

02.02.2026 – 13.02.2026

Product

Claude Opus 4.6 launches as Anthropic’s new flagship model, significantly boosting coding and agentic workflow performance
GitHub Copilot rolls out GPT‑5.3‑Codex plus Claude Opus 4.6 across VS Code, web, mobile, CLI and the Copilot coding agent
ChatGPT ships deep‑research upgrades (targeted sources, editable plans, progress tracking) and a GPT‑5.2 Instant style update for clearer advice
Devin adds Scheduled Devins and Devin Review draft‑PR support, moving closer to “always‑on” agentic dev workflows
Cursor increases agent usage limits, creates separate Auto/Composer vs API pools, and improves large‑repo indexing for faster semantic search

UX/UI

ChatGPT deep research gets a redesigned sidebar entry point, fullscreen report view, and mid‑run plan editing for more controllable research sessions
GitHub improves the PR “Files changed” view with CODEOWNERS validation, layout tweaks, and performance gains for large reviews
Cursor introduces a new usage‑visibility page so users can track separate Auto/Composer and API usage pools directly in the editor
Devin’s new Schedules page makes it easier to configure recurring agent runs with prompts and playbooks from a dedicated UI
ChatGPT Voice update improves how the assistant follows user instructions and uses tools like web search, especially for free‑tier and fallback Voice

Engineering

GitHub Copilot’s new GPT‑5.3‑Codex model delivers stronger reasoning for long agentic workflows and up to 25% faster performance versus GPT‑5.2‑Codex
Claude Opus 4.6 is positioned as a premium model for large codebases and complex agent chains, with hybrid “instant vs extended” reasoning controls
Cursor shares results from week‑long “very long‑running” coding agents, peaking at over 1,000 commits per hour across hundreds of agents in research preview
Cognition rebuilds Devin’s Agent Preview on Claude Sonnet 4.5, claiming 2× speed and 12% better scores on internal junior‑developer evals
Claude Code’s February changelog highlights new task‑dependency management, background agents, plugin system, and tighter permission enforcement for agent workflows

DevOps

Devin’s v3 API adds enterprise IP access‑list endpoints for centrally managing which networks can hit Devin in production
Northflank publishes a practical guide to sandboxing AI agents (microVMs, gVisor, strict isolation) to contain code execution and tool abuse
GitHub Actions’ early‑February updates improve runner enforcement, ship new Actions features, and open‑source the Dependabot Proxy for supply‑chain security
Devin’s Scheduled Devins feature effectively turns agents into cron‑like jobs, integrating recurring automation into CI/CD‑style workflows
OpenCode’s CLI docs, updated through Feb 13, emphasize deeper GitHub and MCP integration, including GitHub mock events for running agents in CI

Infosec

Microsoft’s “Cyber Pulse” report warns 2026 will be the “Year of AI Agents,” with 80%+ of Fortune 500 already building agents and a growing risk of “double‑agent” abuse
Anthropic reports Claude Opus 4.6 uncovered 500+ previously unknown high‑severity flaws in open‑source projects during internal security testing
OWASP GenAI Security Project’s new ASI Top 10 (Top 10 for Agentic Applications) lays out 2026’s key risks like Excessive Agency and Rogue Agents
Northflank’s sandboxing playbook walks through defenses against prompt‑injection, context poisoning, and tool abuse via strong isolation and policy gates
Devin’s new IP allowlists give enterprises another hard boundary around where agent traffic can originate, aligning with stricter governance models

Tricks & Tips

NxCode publishes a 2026 OpenCode beginner’s guide: install, provider setup (GPT, Gemini, local), Plan vs Build modes, and AGENTS.md for better context
Stack Overflow blog interviews Eno Reyes on “code smells for AI agents,” outlining patterns that signal brittle or over‑autonomous workflows
OWASP’s ASI Top 10 article doubles as a 2026 security checklist for anyone deploying autonomous coding or ops agents
Northflank’s sandboxing guide gives concrete recipes (microVMs, container hardening, policy gates) you can adapt to your own AI tooling stack
CSSAuthor’s Feb roundup of “Best Open Source AI Coding Agents 2026” highlights Aider, Cline and others, with notes on when to pick each tool

Reddit & Twitter

Official r/ClaudeAI post announces Claude Opus 4.6, framing it as better at long‑horizon coding, agentic search, and large‑codebase work
Cursor’s X thread shares research on long‑running coding agents hitting 1,000+ commits per hour, sparking discussion on “hands‑off” AI development
YouTube creators dissect Claude Opus 4.6, with one calling it “the biggest AI jump” they’ve covered and noting Reddit’s view that it’s better at coding than writing

OSS

CSSAuthor’s Feb 1 guide ranks the best open‑source AI coding agents of 2026, led by Aider and Cline for terminal‑first workflows
Comparateur‑IA updates its 2026 open‑source AI tools directory, including IDE‑native assistants and agents like Smol Developer
SecondTalent’s “Top 7 Open‑Source AI Coding Assistants in 2026” (Continue, Tabby, Cody, Aider, Cursor OSS components, CodeGeeX, FauxPilot) remains a key reference as teams compare tools
NxCode’s OpenCode tutorial emphasizes its OSS, local‑first model‑agnostic design (45k+ GitHub stars, 650k+ monthly users) as an alternative to closed agents
GitHub formalizes support for using Copilot subscriptions inside the open‑source OpenCode agent, bridging proprietary models with OSS tooling

© f3l1x