AI复盘
ai-retrospective
by amoshc
>
安装
claude skill add --url https://github.com/openclaw/skills文档
AI Collaboration Retrospective
Post-session systematic review tool. Eight-dimension deep analysis drives a continuous improvement loop for AI-assisted development.
Core Principles
- Conversation context is the data source: The complete conversation history of the current session is already in context — no external data fetching needed
- Progressive loading: Detailed evaluation criteria live in
references/analysis_dimensions.md— load on demand - Self-reflection first: Examine the AI's own shortcomings before analyzing user-side improvements. This is NOT about criticizing the user — it's about finding efficiency gains in the "AI + Human" collaboration
- Quantify everything: Every finding must reference specific conversation turns, wasted operations, and include counterfactual reasoning ("If X had been done, Y turns could have been saved")
- Dig deep: Don't settle for "no findings." Complete the self-check list for each dimension before declaring it clean
Execution Model
This skill is pure LLM instruction-driven — no scripts, no external dependencies. It works on any AI assistant that can:
- Access the current conversation history
- Read reference files from this skill's directory
- Write output files to the workspace
Capability adaptation: The workflow below references file operations and memory updates. If your AI tool doesn't support a specific capability, skip that step and note it in the report. The analysis itself only requires conversation context access.
Workflow (Six Steps)
Step 1: Conversation Review — Extract Key Events + Tag Waste Points
Scan the entire conversation context and extract these key events into a timeline:
| Event Type | Recognition Signal |
|---|---|
| Tool invocations | Command execution, file reading/writing, web searches, code generation |
| File changes | Files created, modified, or deleted |
| Errors & fixes | Error messages, lint failures, debugging cycles |
| Repeated modifications | Same file/feature modified multiple times, user providing multiple clarifications |
| Decision points | Technology choices, architecture decisions, trade-offs |
| Automation/plugin usage | Any skill, agent, plugin, or extension triggered during the session |
| User clarifications | User adding context because the AI misunderstood intent |
| Verification rounds | User providing test data/feedback, AI analyzing verification results |
| AI misjudgments | AI providing wrong conclusions, missing critical issues, or jumping to premature conclusions |
Filter rule: System initialization events (bootstrap files, identity setup, etc.) are excluded from analysis.
Critical step — Waste point tagging:
After building the timeline, interrogate each event in reverse:
- Could this step have been avoided? If something had been done earlier, would this step be unnecessary?
- Could this step have happened sooner? Did the AI delay something it should have proactively done?
- Did this step duplicate prior work? Was the AI hand-writing logic that could have been reused?
Tag events where the answer is "yes" with [⚠ Optimizable] and record the reason. These tags are the core input for Step 2.
Output format: Chronological event list with type labels and brief descriptions. Waste points tagged separately.
Step 2: Eight-Dimension Deep Analysis
Load references/analysis_dimensions.md for detailed evaluation criteria, self-check lists, and common patterns per dimension. Analyze conversation events dimension by dimension to identify improvement opportunities.
Eight dimensions overview:
- AI Self-Reflection ⭐ — AI's mistakes, delayed reactions, missed judgments in this session (highest priority, must be analyzed first)
- Verification Strategy — Did the AI proactively define verification criteria and expected outcomes, or passively wait for user feedback?
- Automation Opportunities — Repetitive workflows or hand-written scripts that could be encapsulated into reusable automations
- Existing Automation Tuning — Were any existing automations/skills/templates used? Did they have gaps, unclear instructions, or output issues?
- Tool Integration Opportunities — Operations that would benefit from dedicated tool integrations, plugins, or API connections
- Knowledge Persistence — Preferences, conventions, and technical decisions from this session that should be persisted for future sessions
- Documentation Updates — Project docs, coding standards, or architecture notes that need updating
- Workflow Efficiency — Sequential steps that could be parallel, repeated labor, suboptimal tool choices
Analysis requirements (mandatory):
For each dimension:
- Run through the dimension's self-check list (defined in
references/analysis_dimensions.md) - For findings, output: Specific event reference (which turn, what operation) + Counterfactual reasoning (if X had been done, Y could be saved) + Recommendation + Priority
- Only after all self-check items pass can a dimension be declared "no findings" and skipped
Step 3: Generate Retrospective Report
Load assets/report_template.md for the report template. Fill the template with results from Step 1 and Step 2 to produce a complete Markdown retrospective report.
Report save path: {workspace}/retrospectives/{topic}_retrospective.md
Naming rules:
{topic}uses 2-4 English words joined by hyphens, summarizing the session's core task (e.g.,multithread-scope-collection,login-flow-refactor)- Multiple retrospectives on same topic: If the file already exists, append the new report at the end (separated by
---and a new date heading) — don't create a new file
If the retrospectives/ directory doesn't exist, create it first.
Note: The save path above is a sensible default. Adapt to your project's conventions if they differ.
Step 4: Display Full Analysis in Conversation
The complete analysis must be shown directly in the conversation — don't just output a summary and point to the file. The file is an archive; the primary reading experience is in the conversation.
Output content (show in full, no trimming):
- Session summary: One-sentence overview
- Efficiency score: Optimizable turns / total turns
- Event timeline: Complete table with waste point tags
- All dimension findings: Each with event reference, problem, counterfactual reasoning, recommendation (this is the core content — never abbreviate or reduce)
- Pending action list (if any)
- Report archive location
Format: Use Markdown tables and headings for clear structure. Better to be thorough than to cut valuable analysis.
Step 5: Automatic Execution — Knowledge Persistence
For items identified in the "Knowledge Persistence" dimension (Dimension 6), execute persistence operations available in your AI tool:
- If your tool supports persistent memory (e.g., memory APIs, memory files,
.memorydirectories), write new preferences/conventions directly - If your tool supports project-level notes or config, update those
- If your tool has no persistence mechanism, list the items that should be persisted and recommend the user save them manually
Briefly state what was updated after each operation. Skip this step if no knowledge needs persisting.
Step 6: Pending Action List
For the following types of improvement suggestions, do not auto-execute — list them for user selection:
| Action Type | Examples |
|---|---|
| Create new automation | Reusable workflow, script template, custom command |
| Tune existing automation | Modify instructions, parameters, or trigger conditions |
| Create/update project rules | Coding standards, review checklists, conventions |
| Update project documentation | Architecture docs, API references, onboarding guides |
| Create tool integration | Custom plugin, API connection, webhook |
List format: Numbered list, each item includes "Action type + Specific content + Expected benefit." User can reply with numbers to select which actions to execute.
If no pending actions, skip this step and state "No additional actions needed for this session."
Edge Cases
Very short sessions: If the conversation is only a few turns with simple content, output a brief summary and state "This session was brief — no significant improvement opportunities identified." Don't force analysis.
Compressed/summarized history: If the conversation history appears compressed or truncated, analyze based on available context and note in the report: "Some conversation history was compressed; analysis is based on visible context."
Tool capability limitations: If the AI tool being used lacks certain capabilities referenced in this workflow (e.g., no file writing, no memory persistence), adapt gracefully — perform the analysis steps that are possible and clearly note any skipped steps with the reason.
相关 Skills
Claude接口
by anthropics
面向接入 Claude API、Anthropic SDK 或 Agent SDK 的开发场景,自动识别项目语言并给出对应示例与默认配置,快速搭建 LLM 应用。
✎ 想把Claude能力接进应用或智能体,用claude-api上手快、兼容Anthropic与Agent SDK,集成路径清晰又省心
RAG架构师
by alirezarezvani
聚焦生产级RAG系统设计与优化,覆盖文档切块、检索链路、索引构建、召回评估等关键环节,适合搭建可扩展、高准确率的知识库问答与检索增强应用。
✎ 面向RAG落地,把知识库、向量检索和生成链路系统串联起来,做架构设计时更清晰,也更少踩坑。
多智能体架构
by alirezarezvani
聚焦多智能体系统架构设计,梳理 Supervisor、Swarm、分层和 Pipeline 等模式,覆盖角色定义、通信协作与性能评估,适合规划稳健可扩展的 AI agent 编排方案。
✎ 帮你系统解决多智能体应用的架构设计与协同编排难题,适合构建复杂 AI 工作流,成熟度高、社区认可也很亮眼。
相关 MCP 服务
知识图谱记忆
编辑精选by Anthropic
Memory 是一个基于本地知识图谱的持久化记忆系统,让 AI 记住长期上下文。
✎ 帮 AI 和智能体补上“记不住”的短板,用本地知识图谱沉淀长期上下文,连续对话更聪明,数据也更可控。
顺序思维
编辑精选by Anthropic
Sequential Thinking 是让 AI 通过动态思维链解决复杂问题的参考服务器。
✎ 这个服务器展示了如何让 Claude 像人类一样逐步推理,适合开发者学习 MCP 的思维链实现。但注意它只是个参考示例,别指望直接用在生产环境里。
PraisonAI
编辑精选by mervinpraison
PraisonAI 是一个支持自反思和多 LLM 的低代码 AI 智能体框架。
✎ 如果你需要快速搭建一个能 24/7 运行的 AI 智能体团队来处理复杂任务(比如自动研究或代码生成),PraisonAI 的低代码设计和多平台集成(如 Telegram)让它上手极快。但作为非官方项目,它的生态成熟度可能不如 LangChain 等主流框架,适合愿意尝鲜的开发者。