Files
document_analyzer/docs/QE_AGENT_WORKFLOW.html
T
pzhang_zywl ae0ff5d4de
CI / test (pull_request) Successful in 8s
test: 统一 Agent Issue 轮询 label 体系与创建规则 - Closes #40
- test-dev → test-code:QE-Agent 一致化 label
- Dev-Agent 新增 product-code label + [product] 前缀规则
- agent_poller.py 新增 create-issue action
- QE/Dev Agent 轮询改为多轮递进:label → title 前缀 → 无标识分析

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-06-02 14:16:51 +08:00

214 lines
10 KiB
HTML
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
<!DOCTYPE html>
<html lang="zh-CN">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>QE-Agent Workflow</title>
<style>
:root { --bg:#0d1117; --card:#161b22; --border:#30363d; --text:#c9d1d9;
--green:#3fb950; --red:#f85149; --yellow:#d2991d; --blue:#58a6ff; --purple:#bc8cff; }
* { box-sizing:border-box; margin:0; padding:0; }
body { background:var(--bg); color:var(--text); font:14px/1.6 -apple-system,BlinkMacSystemFont,sans-serif; max-width:960px; margin:0 auto; padding:24px; }
h1 { font-size:24px; border-bottom:1px solid var(--border); padding-bottom:12px; margin-bottom:24px; }
h2 { font-size:18px; margin-top:32px; margin-bottom:12px; color:var(--blue); }
h3 { font-size:15px; margin-top:20px; margin-bottom:8px; }
.card { background:var(--card); border:1px solid var(--border); border-radius:8px; padding:16px; margin:12px 0; }
.flow { display:flex; flex-wrap:wrap; gap:8px; align-items:center; margin:16px 0; font-size:13px; }
.flow .step { background:var(--card); border:1px solid var(--border); border-radius:6px; padding:8px 14px; white-space:nowrap; }
.flow .arrow { color:var(--blue); font-weight:bold; }
.pass { color:var(--green); }
.fail { color:var(--red); }
.warn { color:var(--yellow); }
table { width:100%; border-collapse:collapse; margin:12px 0; font-size:13px; }
th, td { border:1px solid var(--border); padding:8px 12px; text-align:left; }
th { background:var(--card); }
code { background:var(--card); padding:2px 6px; border-radius:4px; font-size:13px; }
pre { background:var(--card); border:1px solid var(--border); border-radius:6px; padding:12px; overflow-x:auto; font-size:13px; }
ul, ol { padding-left:24px; margin:8px 0; }
li { margin:4px 0; }
.badge { display:inline-block; padding:2px 8px; border-radius:12px; font-size:12px; font-weight:600; }
.badge-qe { background:var(--purple); color:#fff; }
.badge-dev { background:var(--blue); color:#fff; }
.badge-pass { background:var(--green); color:#000; }
.badge-fail { background:var(--red); color:#fff; }
</style>
</head>
<body>
<h1>QE-Agent Workflow</h1>
<p>QE-Agent 是一个自动化质量工程代理,专注于 <strong>main branch 的发布质量</strong>
通过三层验收测试(Schema / Coverage / LLM Audit)验证 IR 管道的输出质量,
并与 Dev-Agent 通过 Gitea Issue 协同工作。</p>
<div class="card">
<strong>启动方式</strong><br>
<code>bash scripts/start_qe_agent.sh</code> — 三种模式:单次 / 持续轮询 / 交互<br>
<code>claude --agent agents/QE_AGENT.md</code> — 直接启动交互模式(默认 /loop 10m 轮询)
</div>
<h2>1. 角色与边界</h2>
<table>
<tr><th></th><th><span class="badge badge-qe">QE-Agent</span></th><th><span class="badge badge-dev">Dev-Agent</span></th></tr>
<tr><td>关注范围</td><td>main branch 健康</td><td>功能开发与 bug 修复</td></tr>
<tr><td>代码</td><td><code>tests/acceptance/</code></td><td><code>skills/</code> <code>scripts/</code></td></tr>
<tr><td>测试</td><td>验收测试 (三层)</td><td>UT/IT</td></tr>
<tr><td>分支</td><td><code>test/issue-N</code></td><td><code>dev/issue-N-*</code></td></tr>
<tr><td>Commit</td><td><code>test: ... - Closes #N</code></td><td><code>fix: ... - Closes #N</code></td></tr>
<tr><td>签名</td><td><code>[qe-agent: qa-01]</code></td><td><code>[da-01]</code></td></tr>
<tr><td>Issue 标签</td><td><code>test-code</code></td><td><code>agent-task</code> <code>ci-failure</code></td></tr>
</table>
<h2>2. 三层验收测试</h2>
<div class="flow">
<div class="step">Layer A<br><strong>Schema</strong><br>确定性验证</div>
<div class="arrow"></div>
<div class="step">Layer B<br><strong>Coverage</strong><br>结构溯源覆盖率</div>
<div class="arrow"></div>
<div class="step">Layer C<br><strong>QE Audit</strong><br>LLM 专家审计</div>
<div class="arrow"></div>
<div class="step"><strong>Report</strong><br>JSON 报告</div>
</div>
<table>
<tr><th>Layer</th><th>方法</th><th>阈值</th><th>LLM</th></tr>
<tr><td>A — Schema</td><td>IR 结构验证 (rule_id / trigger / sources / actions)</td><td>0 errors</td><td>不需要</td></tr>
<tr><td>B — Coverage</td><td>IR sources[] 对文档内容单元的引用率</td><td>≥ 70%</td><td>不需要</td></tr>
<tr><td>C — QE Audit</td><td>LLM 逐章节评估 IR 覆盖充分性</td><td>inadequate ≤ 30%</td><td>deepseek-v4-flash</td></tr>
</table>
<div class="card">
<strong>最终判决</strong>: 三层全部 PASS → <span class="pass">releasable ✓</span> | 任意一层 FAIL → <span class="fail">blocked ✗</span>
</div>
<h2>3. Issue 工作流</h2>
<h3>3.1 轮询</h3>
<pre>python scripts/agent_poller.py --action list --labels test-code
python scripts/agent_poller.py --action list --labels acceptance-failure</pre>
<h3>3.2 test-code Issue 闭环</h3>
<div class="flow">
<div class="step">1. 领取<br>comment</div>
<div class="arrow"></div>
<div class="step">2. 开发<br>tests/acceptance/</div>
<div class="arrow"></div>
<div class="step">3. 本地验证<br>pytest</div>
<div class="arrow"></div>
<div class="step">4. 提交<br>test/issue-N</div>
<div class="arrow"></div>
<div class="step">5. PR + CI</div>
<div class="arrow"></div>
<div class="step">6. merge</div>
<div class="arrow"></div>
<div class="step">7. close</div>
</div>
<h3>3.3 e2e 验证流程</h3>
<ol>
<li>识别 dev-agent 修复完毕(关联 dev issue 已关闭)</li>
<li><code>git pull origin main</code></li>
<li><code>python scripts/run_pipeline.py --parsed &lt;path&gt; --test</code></li>
<li>分析三层报告</li>
<li>全部 PASS → 关闭 test-code issue</li>
<li>仍有 FAIL → 重开 dev issue + 更新 test-code issue</li>
</ol>
<h2>4. Issue 生命周期规则</h2>
<div class="card">
<h3>关闭规则</h3>
<ul>
<li>QE 测试通过 → 关闭 test-code issue</li>
<li>QE 测试失败 + 新问题 → 开 dev issue (agent-task)test-code <strong>保持 open</strong></li>
<li>QE 测试失败 + dev issue 已存在 → test-code <strong>保持 open</strong></li>
<li><strong>绝不</strong>在问题未修复时关闭 test-code issue</li>
</ul>
</div>
<div class="card">
<h3>重开规则</h3>
<ul>
<li>Dev issue 被关但 QE 重验仍失败 → <strong>重开 dev issue</strong></li>
<li>必须加 <code>## REOPEN by [qe-agent: qa-01]</code> 评论,包含:<ol>
<li>已修复项(肯定进展)</li>
<li>仍存在的问题(具体数据 + 阈值对比)</li>
<li>结论:为什么修复不完整</li>
</ol></li>
<li>重开后同步更新关联 test-code issue</li>
</ul>
</div>
<h2>5. Agent 间通信协议</h2>
<div class="card">
<p><strong>Issue 状态是唯一通信渠道</strong>。两个 agent 共用 <code>pzhang_zywl</code> Gitea 账号,通过签名区分:</p>
<ul>
<li><span class="badge badge-qe">QE</span> 评论末尾: <code>[qe-agent: qa-01]</code></li>
<li><span class="badge badge-dev">Dev</span> 评论末尾: <code>[da-01]</code></li>
</ul>
<p><strong>QE → Dev</strong>: 发现问题 → 开 dev issue (agent-task) / 重开已有 dev issue</p>
<p><strong>Dev → QE</strong>: 修复完成 → 关闭 dev issue(自验证后)</p>
<p><strong>QE 验收</strong>: 拉取 main → 重跑 e2e → 通过就关 test-code,不通过就重开 dev issue</p>
</div>
<h2>6. 命令速查</h2>
<table>
<tr><th>操作</th><th>命令</th></tr>
<tr><td>轮询 issue</td><td><code>agent_poller.py --action list --labels test-code</code></td></tr>
<tr><td>查看 issue</td><td><code>agent_poller.py --action get --issue &lt;N&gt;</code></td></tr>
<tr><td>评论</td><td><code>agent_poller.py --action comment --issue &lt;N&gt; --body "..."</code></td></tr>
<tr><td>生命周期</td><td><code>agent_poller.py --action lifecycle --issue &lt;N&gt;</code></td></tr>
<tr><td>创建 PR</td><td><code>agent_poller.py --action create-pr --issue &lt;N&gt; --branch test/issue-&lt;N&gt;</code></td></tr>
<tr><td>查 PR CI</td><td><code>agent_poller.py --action pr-status --pr &lt;N&gt;</code></td></tr>
<tr><td>合并 PR</td><td><code>agent_poller.py --action merge-pr --pr &lt;N&gt;</code></td></tr>
<tr><td>跑管道</td><td><code>python scripts/run_pipeline.py --parsed &lt;path&gt; --test</code></td></tr>
<tr><td>验收测试</td><td><code>pytest tests/acceptance/ -v --run-acceptance</code></td></tr>
<tr><td>仅 Layer A+B</td><td><code>pytest tests/acceptance/ -v --run-acceptance -k "not test_layer_c"</code></td></tr>
</table>
<h2>7. 文件结构</h2>
<pre>
tests/acceptance/
├── conftest.py # Pytest 配置、fixtures、LLM client
├── ir_schema.py # IR schema 验证
├── report.py # 三层 JSON 报告
└── test_main_health.py # Layer A → B → C
scripts/
├── agent_poller.py # Gitea API 工具
├── run_pipeline.py # 端到端管道运行器
├── start_qe_agent.sh # QE-Agent 启动脚本
└── .env # Token 配置 (gitignored)
agents/
├── QE_AGENT.md # QE-Agent 系统指令
└── DEV_AGENT.md # Dev-Agent 系统指令
.gitea/workflows/
├── ci.yml # CI (push/PR)
└── acceptance.yml # 手动触发验收
</pre>
<h2>8. 本 Session 处理记录</h2>
<table>
<tr><th>Issue</th><th>内容</th><th>结果</th></tr>
<tr><td>#10</td><td>移除硬编码路径,适配 config.py</td><td><span class="pass">closed</span></td></tr>
<tr><td>#12</td><td>实现端到端验收测试流程</td><td><span class="pass">closed</span></td></tr>
<tr><td>#14</td><td>跑完整 e2e 测试</td><td><span class="pass">closed</span></td></tr>
<tr><td>#15</td><td>Dev: IR rules=[] (多次 reopen)</td><td><span class="pass">closed</span></td></tr>
<tr><td>#18</td><td>再跑 e2e 测试</td><td><span class="warn">open</span></td></tr>
<tr><td>#21</td><td>P0: 覆盖率不足 (多次 reopen)</td><td><span class="fail">reopened</span></td></tr>
<tr><td>#22</td><td>P1: trigger.operator 为空</td><td><span class="pass">closed</span></td></tr>
</table>
<p style="margin-top:24px;color:var(--border);font-size:12px;">QE-Agent [qe-agent: qa-01] — document_analyzer project</p>
</body>
</html>