2026 Hermes Agent Skills — продвинутый гид: SKILL.md → GEPA самоэволюция, Metal/unified memory

В начале 2026 Nous Research выпустил Hermes Agent; за два месяца GitHub Stars перевалили за 160 000 — один из самых быстрорастущих open-source Agent-проектов. Ставка не в размере модели, а в "the agent that grows with you": агент накапливает операционное знание между сессиями. Под капотом — Skills: стандартизированная, эволюционирующая процедурная память. Этот гид пропускает onboarding и идёт в deep dive: Skills vs Memory/Prompt, формат SKILL.md и трёхуровневая progressive disclosure, Skill Bundles, условная активация, GEPA+DSPy самоэволюция, Tap-публикация и экосистема. Вывод: «как делать» — в переносимый SKILL.md; workflow — в Bundles одной командой; качество — через GEPA без fine-tune весов. На Mac с параллельными Xcode/FCP/ComfyUI каждый лишний КБ в Level 0 — это unified memory, которую вы не отдаёте под Metal-sidecar MLX или batch-рендер. Ниже: матрица концепций — разбор формата — Bundle/условная активация — community Tap — GEPA пять стадий — техники написания — блог-кейс — FAQ — ресурсы — Mac 7×24 acceptance checklist.

1. Узкие места: почему Skills — отдельный слой инфраструктуры

① Prompt — одноразовый: ~800 токенов deploy-runbook на каждый чат, шаги теряются при compaction. ② Memory — факты, не процедуры: «предпочитаешь TypeScript» ≠ «как открыть PR по team SOP». ③ Token burn неконтролируем: все runbook'и в system prompt — Level 0 съедает десятки тысяч токенов до первого tool call. ④ Нет кросс-платформенного reuse: каждый Agent — свой формат конфигов. Hermes Skills следуют открытому стандарту agentskills.io и переносятся между Hermes, Claude Code, Cursor, OpenCode — инфраструктурная ставка 2026, а не cosmetic feature.

На Apple Silicon unified memory prompt bloat конкурирует с Metal compute: 16 GB MacBook Air при одновременном Hermes Gateway + локальном MLX sidecar быстро упирается в swap — progressive disclosure Skills снимает фиксированную цену контекста до момента активации.

2. Ядро: Skills ≠ Prompts, Skills ≠ Memory

Измерение	Prompt	Memory	Skills
Персистентность	Текущая сессия	Межсессионная, постоянная	Межсессионная, постоянная
Момент загрузки	Всегда в контексте	Auto-inject каждую сессию	On-demand (ключевое отличие)
Token cost	Каждый запрос	Малый, стабильный	Ноль до активации
Тип контента	Произвольное намерение	Предпочтения/факты	Процедурные шаги (как сделать X)
Кто поддерживает	Пользователь вручную	Agent автоматически	Пользователь + Agent
Sharability	Неудобно	Приватно	Публикация как community Tap

Мнемоника: Prompt = стикер (одна сессия); Memory = блокнот (всегда под рукой); Skill = SOP-мануал (открываешь когда нужен runbook).

3. SKILL.md — разбор формата (agentskills.io)

Все Hermes Skills следуют спецификации agentskills.io для кросс-Agent portability:

---
name: my-skill                    # required: lowercase+hyphens, ≤64 chars
description: |                    # required: ≤1024 chars, start with "Use when..."
  Use when the user needs to [...].
  Handles [...] and [...].
version: 1.0.0
license: MIT
compatibility: Requires git, docker
allowed-tools: Bash(git:*) Read   # pre-authorized tools (experimental)
metadata:
  hermes:
    tags: [devops, automation]
    category: software-development
    related_skills: [github-pr-workflow, test-driven-development]
    requires_toolsets: [terminal]
    fallback_for_toolsets: [web]
---

# My Skill Title

## Overview
1-2 paragraphs: what it does and why it exists.

## When to Use
- Use for: [конкретные сценарии]
- Don't use for: [явные исключения]

## Procedure
1. Шаг один (точные команды)
2. Шаг два
3. Шаг три

## Common Pitfalls
1. Типичные сбои и fix

## Verification Checklist
- [ ] Checkpoint 1
- [ ] Checkpoint 2

3.1 Layout каталога (модульность)

~/.hermes/skills/
└── my-category/
    └── my-skill/
        ├── SKILL.md              # core steps (target ≤500 lines)
        ├── references/
        │   ├── api-docs.md       # on-demand load
        │   └── examples.md
        ├── templates/
        │   └── config.yaml
        └── scripts/
            └── setup.sh          # Agent executes directly

3.2 Progressive Disclosure — три уровня загрузки

Уровень	Содержимое	Триггер	Token cost
Level 0	`name` + `description`	Старт сессии, все skills	~3K (суммарно)
Level 1	Полный SKILL.md body	`/skill-name` или LLM match	Зависит от длины файла
Level 2	references/ scripts/	LLM решает во время execution	On-demand, per file

Engineering note: description — единственный сигнал Level 0; LLM по нему решает, грузить ли полный skill. «Когда использовать» важнее «что это». Валидация: skills-ref validate ./my-skill. На Mac с 50+ skills в Tap-репо Level 0 ~3K vs 50× full SKILL.md — разница в GB unified memory pressure при длинных agent-сессиях.

4. Skill Bundles — один slash-команда, полный workflow

Skill Bundles — фича Hermes 2026: лёгкий YAML, упаковывающий несколько skills в одну slash-команду. При /bundle-name все перечисленные skills загружаются одновременно. Путь: ~/.hermes/skill-bundles/<slug>.yaml

name: backend-dev
description: |
  Full backend feature workflow — code review, TDD, and PR management.
skills:
  - github-code-review
  - test-driven-development
  - github-pr-workflow
instruction: |
  Always write failing tests first before implementation.
  Open PRs with co-author tags for pair-programming sessions.
  Never push directly to main.

Production patterns: AI research — arxiv + deep-research + plan + excalidraw; MLOps deploy — vllm + llama-cpp + github-pr-workflow + systematic-debugging.

Priority rules: Bundle vs single Skill с одинаковым именем — Bundle wins; missing skills — skip без crash + warning; Bundle не трогает system prompt → prompt cache остаётся valid (token-friendly). CLI:

hermes bundles create backend-dev \
  --skills github-code-review,test-driven-development,github-pr-workflow \
  --instruction "Always write failing tests first"

5. Conditional Activation — environment-aware skills

Skills автоматически show/hide по доступности toolsets в текущей сессии. Конфиг в metadata.hermes SKILL.md:

Поле	Логика
`requires_toolsets`	Skill скрыт, если toolset отсутствует
`requires_tools`	Skill скрыт, если tool отсутствует
`fallback_for_toolsets`	Skill скрыт, если toolset присутствует (fallback path)
`fallback_for_tools`	Skill скрыт, если tool присутствует

Классика free/paid switch: при FIRECRAWL_KEY / BRAVE_SEARCH_KEY активируется paid web_search; DuckDuckGo skill (fallback_for_tools: [web_search]) исчезает из prompt — экономия Level 0 tokens. API down — fallback всплывает автоматически. Через hermes skills TUI — per-platform toggle (CLI, Telegram, Discord).

6. Skills Hub и open-source экосистема

# Official optional skills
hermes skills install official/research/arxiv

# Install from HTTP URL
hermes skills install https://example.com/SKILL.md --name my-skill

# Install from GitHub
hermes skills install github:openai/skills/k8s

# Custom Tap (subscribe entire skill repo)
hermes skills tap add github:my-org/my-skills

Репозиторий	Описание	Highlights
awesome-hermes-skills	Curated production skills	Deep Research, MLOps, Apple integration; 23 skills с GitHub Copilot
hermeshub	Community registry	Security scan, API/marketplace, prompt injection detection
ai-agent-skills	191 skills, 28 categories	Hermes/Claude Code/Cursor one-click install
hermes-agent	Official repo	Authoritative source, skill authoring spec

7. Публикация Skill Tap — team/community sharing

my-skills-tap/
├── skills.sh.json              # category config (optional)
├── mlops/
│   ├── vllm-deploy/SKILL.md
│   └── model-benchmark/SKILL.md
├── research/
│   ├── paper-summarizer/SKILL.md
│   └── citation-finder/SKILL.md
└── README.md

Team deploy flow:

# One-click team subscribe
hermes skills tap add github:your-org/your-skills-tap

# Private repo (GitHub Token)
hermes skills tap add github:your-org/private-skills --token $GH_TOKEN

# Update all Tap skills
hermes skills tap update

# List subscribed Taps
hermes skills tap list

Version control: ~/.hermes/skills/ в Git; cross-device sync — git pull && hermes skills reset для rebuild built-in skills.

8. Self-Evolving Skills: GEPA + DSPy

GEPA (Genetic-Pareto Prompt Evolution) — ICLR 2026 Oral, интегрирован в hermes-agent-self-evolution. Без fine-tune весов: анализ execution traces → генерация вариантов → multi-objective Pareto optimization текста skill. Cost ~$2–10/run (pure API, GPU не нужен) — идеально для overnight eval на remote Mac, не съедая local Metal throughput.

GEPA — пять стадий:

Stage 1 Trace collection (SQLite, full reasoning traces) → Stage 2 Reflective failure analysis (LLM: actionable «почему упало») → Stage 3 Targeted mutation (10–20 SKILL.md variants per failure mode) → Stage 4 Multi-objective Pareto eval (success rate × token efficiency × latency) → Stage 5 Human PR review (best variant → PR → merge).

git clone https://github.com/NousResearch/hermes-agent-self-evolution
cd hermes-agent-self-evolution && pip install -r requirements.txt
export HERMES_AGENT_PATH=~/.hermes

# Synthetic data evolution (entry point)
python -m evolution.skills.evolve_skill \
    --skill github-code-review --iterations 10 --eval-source synthetic

# Real session data (better results)
python -m evolution.skills.evolve_skill \
    --skill github-code-review --iterations 10 --eval-source sessiondb

# Mixed Claude Code / Gemini traces (experimental)
python -m evolution.skills.evolve_skill \
    --skill github-code-review --iterations 10 --eval-source mixed \
    --trace-dirs ~/.claude/traces,~/.hermes/sessions

Четыре guardrails: ① Full test suite pytest tests/ -q — 100% pass; ② Size limits Skills ≤15KB, tool descriptions ≤500 chars; ③ Prompt cache compatibility; ④ Semantic preservation (skill purpose не drift'ит).

Phase	Target	Engine	Status
Phase 1	Skill files (SKILL.md)	DSPy + GEPA	✅ Shipped
Phase 2	Tool descriptions	DSPy + GEPA	🔲 Planned
Phase 3	System prompt fragments	DSPy + GEPA	🔲 Planned
Phase 4	Tool implementation code	Darwinian Evolver	🔲 Planned
Phase 5	Continuous improvement loop	Automation pipeline	🔲 Planned

9. Plugin Skills — расширение границ Hermes

Plugins namespace skills (plugin:skill): не попадают в default skills_list (меньше system prompt noise); opt-in activation; sibling skills внутри plugin visible при load.

# Load plugin skill (namespace)
skill_view("superpowers:writing-plans")

# plugin.yaml skill declaration
name: my-hermes-plugin
skills:
  - name: writing-plans
    path: skills/writing-plans/SKILL.md
  - name: editing
    path: skills/editing/SKILL.md

10. Advanced authoring — engineer lens

10.1 description = activation precision: ❌ «Helps with code.» → ✅ «Use when reviewing a pull request, checking for code quality issues, security vulnerabilities... Do NOT use for writing new code.»

10.2 Pitfalls — quality gate: конкретные failure modes, root cause, actionable fix (fragile CSS selectors, GitHub API rate limits, large diff token overflow).

10.3 Scripting: в Procedure явно: Agent runs scripts/extract_schema.py --input $FILE; on failure load references/manual-extract.md. Heavy scripts — на remote Mac с NVMe throughput, local unified memory не трогаем.

10.4 Size control: <500 lines — всё в SKILL.md; 500–1000 → references/; >1000 — split strongly; >15KB — GEPA hard limit, must split.

10.5 skill_manage — agent self-maintenance:

skill_manage(action='patch', name='github-code-review',
    old_string='Check for obvious bugs',
    new_string='Check for: null pointer, SQL injection, XSS, logic errors')

# Human approval gate
# config.yaml: skills.agent_writes_require_approval: true

11. Case study: tech blog workflow Skills

# ~/.hermes/skill-bundles/blog-workflow.yaml
name: blog-workflow
description: Full tech blog writing workflow.
skills:
  - seo-keyword-research
  - outline-generator
  - code-example-validator
  - bilingual-checker
  - publish-to-platform
instruction: |
  Always research SEO keywords before writing.
  Ensure all code examples are tested and runnable.
  Generate both Chinese and English title options.

Custom seo-keyword-research: на старте сессии — RU/EN long-tail («X tutorial», «как настроить X»), cross-ref Habr trending, Dev.to, HN; output 3–5 primary + 10–15 long-tail matrix. RU vs EN query intent diverges («Agent» vs «агент» vs «ИИ-агент»).

12. Five-step rollout checklist

Step 1 — Install Hermes + browse official Skills: hermes skills install official/research/arxiv.
Step 2 — First SKILL.md в ~/.hermes/skills/ с чётким description trigger.
Step 3 — Bundle YAML для частых workflow; hermes bundles create для быстрого scaffold.
Step 4 — Conditional activation (free/paid fallback) для снижения token noise.
Step 5 — Team Tap: hermes skills tap add github:your-org/your-skills-tap; advanced — clone self-evolution, run GEPA.

13. FAQ и citable numbers

Q: Skills vs MCP? Skills — procedural knowledge (как действовать); MCP — tool interface (дополнительные API calls). Complementary layers.
Q: Skill обновлён, Agent использует старую версию? Mid-session не применяется; /reset для новой сессии или install с --now (invalidates prompt cache).
Q: GEPA-evolved skills безопасны? Четыре guardrails + human PR review; semantic drift detection.
Q: Reuse Hermes Skills в Claude Code? Copy SKILL.md → ~/.claude/skills/ или ai-agent-skills multi-platform install.
Q: RU content в Skill — token penalty? ~1–1.5 token/char для кириллицы; description держите на EN для sharper LLM matching.

Citable numbers: ① Hermes Agent 160k+ GitHub Stars (early 2026, two months). ② Level 0 all skills ~3K tokens. ③ GEPA single run $2–10 (no GPU). ④ GEPA size limit ≤15KB. ⑤ ai-agent-skills 191 cross-platform skills.

14. Further reading

Official: Hermes Agent docs · Chinese docs · Skills system · agentskills.io
Open source: hermes-agent-self-evolution · gepa-ai/gepa · stanfordnlp/dspy
Community: SegmentFault RU/CN guides · Dev.to Self-Improving Agent · YouTube GEPA & Skill Bundles tutorials

15. Deep case: Hermes Skills + remote Mac 7×24 evolution loop

«Tech media team упаковала blog writing в blog-workflow Bundle: local Hermes — SEO research + outline (Level 0 burn ~3K tokens), GEPA weekly оптимизирует outline-generator Pitfalls по real session traces — success rate 72% → 91%, avg token −18%. Heavy scripts (code validation, multilingual publish) — remote Mac node по SSH; local unified memory не забита overnight queue. Tap repo для 8 человек — hermes skills tap add one-click; private skills via GitHub Token.»

Перекрёстные ссылки: OpenRouter Hermes usage, Cursor Agent Skills guide. Cursor Skills — IDE on-demand load; Hermes Skills + GEPA — «gets better with use» loop. Windows/Linux run Hermes CLI, но macOS выигрывает при parallel Xcode/FCP/ComfyUI, launchd-resident Gateway, Metal sidecar inference. GEPA eval или Bundle heavy scripts с long test runs / batch render — laptop unified memory saturates fast. Skills define how; remote Mac defines where.

Если Hermes Skills уже структурируют ваш workflow и нужен stable, rentable Apple Silicon для GEPA eval, scripts и 7×24 agents — MACGPU remote Mac node: evolution evals и batch jobs на dedicated hardware с predictable Metal throughput; Hermes orchestration и Skill authoring на laptop — unified memory для thinking, compute для overnight queues.