2026 Hermes Agent Skills 심화 가이드: SKILL.md부터 GEPA 자기진화까지

2026년 초 Nous Research가 공개한 Hermes Agent는 두 달 만에 GitHub Star 16만+을 돌파하며 AI Agent 오픈소스 중 성장률 최상위권에 올랐다. 핵심은 더 큰 모델이 아니라 "the agent that grows with you" — 쓸수록 나를 이해하는 Agent. 구현체가 Skills(스킬) 시스템: 표준화·진화 가능·세션 간 지속되는 절차적 메모리. 본문은 입문을 건너뛰고 심화로 직행: Skills vs Memory/Prompt, SKILL.md 포맷·3단계 점진 로딩, Skill Bundles, 조건 활성화, GEPA+DSPy 자기진화, Tap 배포·오픈 생태계. 결론: 「어떻게 할지」를 이식 가능한 SKILL.md로 쓰고, Bundles로 워크플로 원클릭, GEPA로 스킬을 점진 개선 — 모델 가중치는 건드리지 않는다. 구성: 개념 대조표 — 포맷 해석 — Bundle/조건 활성화 — 커뮤니티 Tap — GEPA 5단계 — 작성 팁 — 블로그 실전 — FAQ — 리소스.

1. 페인포인트: Skills를 따로 파고들 가치

① Prompt는 일회성: 매 대화마다 800자 배포 Runbook 반복 → Token 낭비·단계 누락. ② Memory는 사실, 프로세스 아님: 「TypeScript 선호」를 기억한다고 팀 PR 규범을 아는 건 아님. ③ Token 폭주: 모든 Runbook을 시스템 프롬프트에 넣으면 Level 0만 수만 Token. ④ 크로스플랫폼 불가: Agent마다 별도 설정 → 팀 공유 불가. Hermes Skills는 agentskills.io 오픈 표준을 따르며 Hermes·Claude Code·Cursor·OpenCode 간 이식 가능 — 2026년 Agent 인프라에 베팅할 만한 축.

2. 핵심 개념: Skills ≠ Prompts, Skills ≠ Memory

차원	일반 Prompt	Memory(기억)	Skills(스킬)
지속성	현재 대화	세션 간 영구	세션 간 영구
로드 시점	매번 컨텍스트에 포함	매 세션 자동 주입	온디맨드 로드(핵심 차이)
Token 비용	매번 소비	소·안정	활성화 전 제로
내용 유형	임의 의도 설명	사용자 선호/사실	절차적 단계(어떻게 할지)
유지 주체	사용자 수동	Agent 자동	사용자 + Agent
공유성	불편	비공개	커뮤니티 Tap으로 배포

암기법: Prompt = 포스트잇(당회 유효); Memory = 메모장(영구 노트, 항상 손끝); Skill = SOP 매뉴얼(단계화 프로세스, 필요할 때 펼침).

3. SKILL.md 포맷 심층 해석(agentskills.io 오픈 표준)

모든 Hermes Skills는 agentskills.io 규격을 따르며 Agent 간 이식성을 보장한다:

---
name: my-skill                    # 필수: 소문자+하이픈, ≤64자
description: |                    # 필수: ≤1024자, "Use when..." 권장
  Use when the user needs to [...].
  Handles [...] and [...].
version: 1.0.0
license: MIT
compatibility: Requires git, docker
allowed-tools: Bash(git:*) Read   # 도구 사전 승인(실험적)
metadata:
  hermes:
    tags: [devops, automation]
    category: software-development
    related_skills: [github-pr-workflow, test-driven-development]
    requires_toolsets: [terminal]
    fallback_for_toolsets: [web]
---

# My Skill Title

## Overview
1-2 paragraphs: what it does and why it exists.

## When to Use
- Use for: [구체적 시나리오]
- Don't use for: [명확한 제외 시나리오]

## Procedure
1. 단계 1(정확한 명령 포함)
2. 단계 2
3. 단계 3

## Common Pitfalls
1. 흔한 문제 및 수정 방법

## Verification Checklist
- [ ] 검증 항목 1
- [ ] 검증 항목 2

3.1 스킬 디렉터리 구조(모듈 설계)

~/.hermes/skills/
└── my-category/
    └── my-skill/
        ├── SKILL.md              # 메인(핵심 단계, ≤500행 권장)
        ├── references/
        │   ├── api-docs.md       # API 참조(온디맨드)
        │   └── examples.md
        ├── templates/
        │   └── config.yaml
        └── scripts/
            └── setup.sh          # Agent가 직접 실행 가능

3.2 Progressive Disclosure 3단계 로딩

로드 레벨	내용	트리거	Token 비용
Level 0	`name` + `description`	세션 시작, 모든 스킬	~3K(전체 합산)
Level 1	SKILL.md 전문	`/skill-name` 또는 LLM 판단	파일 길이에 비례
Level 2	references/ scripts/ 파일	실행 중 LLM 판단	파일 단위 온디맨드

작성 팁: description이 Level 0의 전부 — LLM이 전체 스킬 로드 여부를 여기서 결정. 「무엇인지」보다 「언제 쓸지」가 중요. skills-ref validate ./my-skill로 규격 검증 가능.

4. Skill Bundles: 한 명령으로 전체 워크플로

Skill Bundles는 Hermes 2026 신규 강력 기능. Bundle은 경량 YAML로 여러 관련 스킬을 슬래시 명령 하나에 묶는다. /bundle-name 실행 시 나열된 스킬이 동시 로드 — 개별 트리거 불필요. 경로: ~/.hermes/skill-bundles/<slug>.yaml

name: backend-dev
description: |
  Full backend feature workflow — code review, TDD, and PR management.
skills:
  - github-code-review
  - test-driven-development
  - github-pr-workflow
instruction: |
  Always write failing tests first before implementation.
  Open PRs with co-author tags for pair-programming sessions.
  Never push directly to main.

심화 시나리오: AI 리서처 워크플로 — arxiv + deep-research + plan + excalidraw; MLOps 배포 — vllm + llama-cpp + github-pr-workflow + systematic-debugging.

Bundle 우선순위: Bundle과 단일 Skill 동명 시 Bundle 우선; 미설치 Skill은 에러 없이 스킵+누락 안내; Bundle은 시스템 프롬프트 미변경 → Prompt Cache 유지(Token 친화). CLI 빠른 생성:

hermes bundles create backend-dev \
  --skills github-code-review,test-driven-development,github-pr-workflow \
  --instruction "Always write failing tests first"

5. 조건 활성화(Conditional Activation): 환경 인지

스킬은 현재 세션의 도구 가용성에 따라 자동 표시/숨김. SKILL.md metadata.hermes에서 설정:

필드	동작
`requires_toolsets`	나열 toolset 없으면 스킬 숨김
`requires_tools`	나열 도구 없으면 스킬 숨김
`fallback_for_toolsets`	나열 toolset 있으면 스킬 숨김(대안)
`fallback_for_tools`	나열 도구 있으면 스킬 숨김

클래식 — 무료/유료 도구 스마트 전환: FIRECRAWL_KEY / BRAVE_SEARCH_KEY 설정 시 유료 web_search 활성 → DuckDuckGo 스킬(fallback_for_tools: [web_search])이 프롬프트에서 자동 제거, Token 절약; API 불가 시 대안 자동 노출. hermes skills TUI로 CLI·Telegram·Discord 등 플랫폼별 스킬 개별 on/off 가능.

6. Skills Hub & 오픈 커뮤니티

# 공식 선택 스킬 설치
hermes skills install official/research/arxiv

# HTTP URL 직접 설치
hermes skills install https://example.com/SKILL.md --name my-skill

# GitHub 저장소 설치
hermes skills install github:openai/skills/k8s

# 커스텀 Tap 추가(스킬 저장소 구독)
hermes skills tap add github:my-org/my-skills

저장소	설명	하이라이트
awesome-hermes-skills	프로덕션급 큐레이션	Deep Research·MLOps·Apple 통합; 23개 스킬 GitHub Copilot 연동
hermeshub	커뮤니티 스킬 레지스트리	보안 스캔·인증·API·마켓·프롬프트 인젝션 탐지
ai-agent-skills	191개 스킬, 28개 카테고리	Hermes/Claude Code/Cursor 원클릭 설치
hermes-agent	공식 메인	권위 소스, Skill 작성 규범 포함

7. Skill Tap 배포: 팀·커뮤니티 공유

my-skills-tap/
├── skills.sh.json              # 카테고리 설정(선택)
├── mlops/
│   ├── vllm-deploy/SKILL.md
│   └── model-benchmark/SKILL.md
├── research/
│   ├── paper-summarizer/SKILL.md
│   └── citation-finder/SKILL.md
└── README.md

팀 배포 플로우:

# 팀원 원클릭 구독
hermes skills tap add github:your-org/your-skills-tap

# 프라이빗 저장소(GitHub Token 필요)
hermes skills tap add github:your-org/private-skills --token $GH_TOKEN

# 모든 Tap 스킬 업데이트
hermes skills tap update

# 구독 Tap 목록
hermes skills tap list

버전 관리: ~/.hermes/skills/를 Git으로 관리, 기기 간 git pull && hermes skills reset 후 내장 스킬 재구성.

8. Self-Evolving Skills: GEPA + DSPy 자동 진화

GEPA(Genetic-Pareto Prompt Evolution)는 2026 ICLR Oral 논문 성과로 Nous Research가 hermes-agent-self-evolution에 통합. 핵심: 모델 가중치 미세조정 없이 실행 궤적 분석 → 변형 생성 → 다목적 Pareto 최적화로 스킬 텍스트 자체를 개선. 비용 약 $2–10/회(순수 API, GPU 불필요).

GEPA 5단계 진화:

Stage 1 실행 궤적 수집(SQLite, 전체 추론 궤적) → Stage 2 반성적 실패 분석(LLM이 「왜 실패했는지」 actionable side-info 생성) → Stage 3 타깃 변이(실패 원인별 SKILL.md 변형 10–20개) → Stage 4 다목적 Pareto 평가(성공률 × Token 효율 × 속도 동시 최적화) → Stage 5 인간 PR 리뷰(최적 변형 PR 생성, 승인 후 배포).

git clone https://github.com/NousResearch/hermes-agent-self-evolution
cd hermes-agent-self-evolution && pip install -r requirements.txt
export HERMES_AGENT_PATH=~/.hermes

# 합성 데이터 진화(입문 권장)
python -m evolution.skills.evolve_skill \
    --skill github-code-review --iterations 10 --eval-source synthetic

# 실제 세션 데이터(효과 우수)
python -m evolution.skills.evolve_skill \
    --skill github-code-review --iterations 10 --eval-source sessiondb

# Claude Code / Gemini 궤적 병합(실험적)
python -m evolution.skills.evolve_skill \
    --skill github-code-review --iterations 10 --eval-source mixed \
    --trace-dirs ~/.claude/traces,~/.hermes/sessions

4대 안전 가드레일: ① 전체 테스트 pytest tests/ -q 100% 통과; ② 크기 제한 Skills ≤15KB·도구 설명 ≤500자; ③ Prompt 캐시 호환; ④ 의미 보존 검사(스킬 원래 목적 이탈 금지).

단계	최적화 대상	엔진	상태
Phase 1	Skill 파일(SKILL.md)	DSPy + GEPA	✅ 구현됨
Phase 2	도구 설명	DSPy + GEPA	🔲 계획
Phase 3	시스템 프롬프트 조각	DSPy + GEPA	🔲 계획
Phase 4	도구 구현 코드	Darwinian Evolver	🔲 계획
Phase 5	지속 개선 루프(완전 자동)	자동화 파이프라인	🔲 계획

9. Plugin 스킬: Hermes 경계 확장

플러그인은 스킬을 네임스페이스(plugin:skill)로 묶어: 기본 skills_list 미노출(시스템 프롬프트 노이즈↓); 사용자 명시 호출 시만 활성(Opt-in); 플러그인 내 스킬 상호 참조. 로드 시 같은 플러그인 형제 스킬 자동 표시.

# 플러그인 스킬 로드(네임스페이스)
skill_view("superpowers:writing-plans")

# plugin.yaml 스킬 선언
name: my-hermes-plugin
skills:
  - name: writing-plans
    path: skills/writing-plans/SKILL.md
  - name: editing
    path: skills/editing/SKILL.md

10. 스킬 작성 심화 팁(엔지니어 관점)

10.1 description이 활성화 정확도를 결정: ❌「Helps with code.」→ ✅「Use when reviewing a pull request, checking for code quality issues, security vulnerabilities... Do NOT use for writing new code.」

10.2 Pitfalls가 품질 분수령: 구체적 실패 패턴·근본 원인·실행 가능한 수정(CSS selector 취약, GitHub API rate limit, 대형 diff Token 오버플로 등).

10.3 스크립트화: Procedure에 Agent가 scripts/extract_schema.py --input $FILE 실행 명시, 실패 시 references/manual-extract.md 로드.

10.4 크기 제어: <500행 SKILL.md 전체; 500–1000행 references/로; >1000행 분할 강력 권장; >15KB GEPA 한도 초과 시 필수 분할.

10.5 skill_manage로 Agent 자가 유지:

skill_manage(action='patch', name='github-code-review',
    old_string='Check for obvious bugs',
    new_string='Check for: null pointer, SQL injection, XSS, logic errors')

# 인간 승인 게이트
# config.yaml: skills.agent_writes_require_approval: true

11. 실전: 기술 블로그 워크플로 Skills

# ~/.hermes/skill-bundles/blog-workflow.yaml
name: blog-workflow
description: Full tech blog writing workflow.
skills:
  - seo-keyword-research
  - outline-generator
  - code-example-validator
  - bilingual-checker
  - publish-to-platform
instruction: |
  Always research SEO keywords before writing.
  Ensure all code examples are tested and runnable.
  Generate both Korean and English title options.

커스텀 seo-keyword-research: 블로그 세션 시작 시 한국어 롱테일(「X 사용법」「X 튜토리얼」)과 영어 롱테일(「X tutorial」「how to X」「X vs Y」) 각각 검색, Velog·Dev.to trending·HN 교차 참조, 주 키워드 3–5개 + 롱테일 10–15개 매트릭스 출력. 동일 개념의 한·영 검색어 차이(「Agent」vs「에이전트」vs「AI 에이전트」) 주의.

12. 5단계 랜딩 체크리스트

Step 1 — Hermes Agent 설치·공식 Skills 탐색: hermes skills install official/research/arxiv.
Step 2 — ~/.hermes/skills/에 첫 SKILL.md, description 트리거 조건 작성.
Step 3 — 자주 쓰는 워크플로 Bundle YAML, hermes bundles create로 빠른 생성.
Step 4 — 조건 활성화(무료/유료 fallback)로 Token 노이즈 감소.
Step 5 — 팀 공유: Tap 저장소 + hermes skills tap add github:your-org/your-skills-tap; 심화는 self-evolution 클론 후 GEPA 최적화.

13. FAQ & 인용 수치

Q: Skills vs MCP? Skills = 절차적 지식 문서(Agent에게 방법 교육), MCP = 도구 인터페이스(추가 도구 호출). 상호 보완.
Q: Skill 수정했는데 Agent가 구버전? 현재 세션 미반영 — /reset으로 새 세션, 또는 설치 시 --now(Prompt Cache 무효화).
Q: GEPA 진화 스킬 안전? 4대 가드레일 + 인간 PR 리뷰; 의미 드리프트 탐지로 원래 목적 이탈 방지.
Q: Claude Code에서 Hermes Skills 재사용? SKILL.md를 ~/.claude/skills/에 복사, 또는 ai-agent-skills 원클릭 멀티 설치.
Q: Skill 한국어가 Token 효율에 영향? 한글 약 1–1.5 token/자, 영어와 유사; description은 영어 유지 권장 — LLM 매칭 정확도↑.

인용 수치: ① Hermes Agent 160k+ GitHub Stars(2026년 초, 2개월). ② Level 0 전체 스킬 합산 ~3K Token. ③ GEPA 1회 최적화 $2–10(GPU 불필요). ④ GEPA Skills 크기 한도 ≤15KB. ⑤ ai-agent-skills 191개 크로스플랫폼 스킬.

14. 확장 읽기 & 리소스

공식: Hermes Agent 문서 · 中文文档 · Skills 시스템 · agentskills.io
오픈소스: hermes-agent-self-evolution · gepa-ai/gepa · stanfordnlp/dspy
커뮤니티: SegmentFault 실전 가이드 · Dev.to Self-Improving Agent 해부 · YouTube GEPA & Skill Bundles 튜토리얼

15. 심층 케이스: Hermes Skills + 원격 Mac 7×24 진화 루프

「한 기술 미디어 팀이 블로그 작성을 blog-workflow Bundle로 캡슐화: 로컬 Hermes는 SEO 리서치·아웃라인(Level 0 ~3K Token), GEPA가 매주 실제 세션 궤적으로 outline-generator Pitfalls 최적화 — 성공률 72%→91%, 평균 Token 18%↓. 무거운 스크립트(코드 검증·다국어 배포)는 SSH 원격 Mac 노드 실행, 로컬 통합 메모리 야간 큐 미점유. Tap 저장소로 8인 팀 hermes skills tap add 원클릭 동기화, 프라이빗 스킬은 GitHub Token 구독.」

본 사이트 OpenRouter Hermes 사용량 해석·Cursor Agent Skills 가이드와 상호 보완: Cursor Skills = IDE 온디맨드, Hermes Skills + GEPA = 「쓸수록 강해지는」 루프. Windows/Linux에서도 Hermes CLI 가능하지만 Xcode/FCP/ComfyUI 병행·launchd Gateway 상주·Metal 사이드카는 macOS가 여전히 유리. GEPA 진화·Bundle 무거운 스크립트가 장시간 테스트·배치 렌더를 돌리면 노트북 통합 메모리 포화 — Skill=방법, 원격 Mac=실행 위치.

Hermes Skills로 워크플로 정리했는데 안정적·임대 가능한 Apple Silicon 연산으로 GEPA 평가·스크립트·7×24 Agent가 필요하면 MACGPU 원격 Mac 노드 검토: 진화 평가·배치 전용, 로컬은 Hermes 오케스트레이션·Skill 작성만 — 통합 메모리는 사고용, 연산은 야간 큐용.