Model ≠ System
Agent = model + harness. 80% of the return comes from the wrapper: tools, memory, evals, guardrails, orchestration.
Context Graph
A 90-min transcript = 15K tokens every query. A graph parses once, stores structurally, answers in ~5ms. Semantic + episodic + procedural memory.
MCP + AI Gateway
Single auth + policy layer. SSO, role-based access, token budgets, DLP, policy engine, full audit. 88% of orgs reported agent security incidents.
Four Agent Topologies
Sequential · parallel · chat · business-process · hybrid. Pick for the flow, not the other way around.
Generator / Discriminator
Generator → Judge agents (practical, data freshness, language, substance, completeness) → score → ship or iterate.
DGM & Hyperagents
Darwin Gödel Machine: a meta-agent improves the task agent. Hyperagent: improves how it evaluates, proposes improvements, stores knowledge.
L0 → L3 Progression
2024: 80% disengaged. 2025: 50% competent users, 20% technical builders. Tracking AI level as a people metric.
Router economics
70-80% → Haiku/Flash ($0.25/1M). 20-30% → Opus/Sonnet ($15/1M). 60× cost delta — route or lose.
«Я могу себе представить, что в будущем каждому инженеру в нашей компании понадобится годовой токен-бюджет. Может достигать базовой зарплаты.»
— Jensen Huang, CEO NVIDIA, GTC 2026
«50% офисных сотрудников пишут код, только 20% имеют IT-образование.»
— Nicolai Tangen, CEO Norwegian Sovereign Wealth Fund