<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>QE-Agent Workflow</title>
<style>
  :root { --bg:#0d1117; --card:#161b22; --border:#30363d; --text:#c9d1d9;
          --green:#3fb950; --red:#f85149; --yellow:#d2991d; --blue:#58a6ff; --purple:#bc8cff; }
  * { box-sizing:border-box; margin:0; padding:0; }
  body { background:var(--bg); color:var(--text); font:14px/1.6 -apple-system,BlinkMacSystemFont,sans-serif; max-width:960px; margin:0 auto; padding:24px; }
  h1 { font-size:24px; border-bottom:1px solid var(--border); padding-bottom:12px; margin-bottom:24px; }
  h2 { font-size:18px; margin-top:32px; margin-bottom:12px; color:var(--blue); }
  h3 { font-size:15px; margin-top:20px; margin-bottom:8px; }
  .card { background:var(--card); border:1px solid var(--border); border-radius:8px; padding:16px; margin:12px 0; }
  .flow { display:flex; flex-wrap:wrap; gap:8px; align-items:center; margin:16px 0; font-size:13px; }
  .flow .step { background:var(--card); border:1px solid var(--border); border-radius:6px; padding:8px 14px; white-space:nowrap; }
  .flow .arrow { color:var(--blue); font-weight:bold; }
  .pass { color:var(--green); }
  .fail { color:var(--red); }
  .warn { color:var(--yellow); }
  table { width:100%; border-collapse:collapse; margin:12px 0; font-size:13px; }
  th, td { border:1px solid var(--border); padding:8px 12px; text-align:left; }
  th { background:var(--card); }
  code { background:var(--card); padding:2px 6px; border-radius:4px; font-size:13px; }
  pre { background:var(--card); border:1px solid var(--border); border-radius:6px; padding:12px; overflow-x:auto; font-size:13px; }
  ul, ol { padding-left:24px; margin:8px 0; }
  li { margin:4px 0; }
  .badge { display:inline-block; padding:2px 8px; border-radius:12px; font-size:12px; font-weight:600; }
  .badge-qe { background:var(--purple); color:#fff; }
  .badge-dev { background:var(--blue); color:#fff; }
  .badge-pass { background:var(--green); color:#000; }
  .badge-fail { background:var(--red); color:#fff; }
</style>
</head>
<body>

<h1>QE-Agent Workflow</h1>

<p>QE-Agent is an automated quality-engineering agent focused on the <strong>release quality of the main branch</strong>.
It validates the output of the IR pipeline with three layers of acceptance tests (Schema / Coverage / LLM Audit)
and coordinates with Dev-Agent through Gitea issues.</p>

<div class="card">
<strong>How to start</strong><br>
<code>bash scripts/start_qe_agent.sh</code>: three modes (single run / continuous polling / interactive)<br>
<code>claude --agent agents/QE_AGENT.md</code>: starts interactive mode directly (polls with /loop 10m by default)
</div>

<h2>1. Roles and Boundaries</h2>

<table>
<tr><th></th><th><span class="badge badge-qe">QE-Agent</span></th><th><span class="badge badge-dev">Dev-Agent</span></th></tr>
<tr><td>Scope</td><td>Health of the main branch</td><td>Feature development and bug fixes</td></tr>
<tr><td>Code</td><td><code>tests/acceptance/</code></td><td><code>skills/</code> <code>scripts/</code></td></tr>
<tr><td>Tests</td><td>Acceptance tests (three layers)</td><td>UT/IT</td></tr>
<tr><td>Branch</td><td><code>test/issue-N</code></td><td><code>dev/issue-N-*</code></td></tr>
<tr><td>Commit</td><td><code>test: ... - Closes #N</code></td><td><code>fix: ... - Closes #N</code></td></tr>
<tr><td>Signature</td><td><code>[qe-agent: qa-01]</code></td><td><code>[da-01]</code></td></tr>
<tr><td>Issue labels</td><td><code>test-code</code></td><td><code>agent-task</code> <code>ci-failure</code></td></tr>
</table>
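
<p>The branch and commit conventions in the table can be sketched as small helpers. This is a minimal illustration, not code from the repository: the function names are hypothetical, and filling the <code>*</code> in <code>dev/issue-N-*</code> with a slug of the summary is an assumption.</p>
<pre>
```python
import re


def qe_branch(issue):
    """QE-Agent branch name, following the test/issue-N pattern."""
    return f"test/issue-{issue}"


def dev_branch(issue, summary):
    """Dev-Agent branch name; the slug suffix is an assumed reading of the '*'."""
    slug = re.sub(r"[^a-z0-9]+", "-", summary.lower()).strip("-")
    return f"dev/issue-{issue}-{slug}"


def commit_message(kind, summary, issue):
    """Commit subject; kind is 'test' for QE-Agent and 'fix' for Dev-Agent."""
    return f"{kind}: {summary} - Closes #{issue}"
```
</pre>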

<h2>2. Three-Layer Acceptance Tests</h2>

<div class="flow">
<div class="step">Layer A<br><strong>Schema</strong><br>Deterministic validation</div>
<div class="arrow">→</div>
<div class="step">Layer B<br><strong>Coverage</strong><br>Structural traceability coverage</div>
<div class="arrow">→</div>
<div class="step">Layer C<br><strong>QE Audit</strong><br>LLM expert audit</div>
<div class="arrow">→</div>
<div class="step"><strong>Report</strong><br>JSON report</div>
</div>

<table>
<tr><th>Layer</th><th>Method</th><th>Threshold</th><th>LLM</th></tr>
<tr><td>A: Schema</td><td>IR structure validation (rule_id / trigger / sources / actions)</td><td>0 errors</td><td>Not required</td></tr>
<tr><td>B: Coverage</td><td>Rate at which IR sources[] reference the document's content units</td><td>≥ 70%</td><td>Not required</td></tr>
<tr><td>C: QE Audit</td><td>LLM assesses, section by section, whether IR coverage is adequate</td><td>inadequate ≤ 30%</td><td>deepseek-v4-flash</td></tr>
</table>

<div class="card">
<strong>Final verdict</strong>: all three layers PASS → <span class="pass">releasable ✓</span> | any layer FAIL → <span class="fail">blocked ✗</span>
</div>
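
<p>The verdict rule can be sketched as follows. The thresholds come from the table in this section; the class, field, and function names are hypothetical, and the real report may expose these numbers differently.</p>
<pre>
```python
from dataclasses import dataclass

# Thresholds as stated in this section.
MIN_COVERAGE = 0.70    # Layer B: at least 70% of content units referenced
MAX_INADEQUATE = 0.30  # Layer C: at most 30% of sections rated inadequate


@dataclass
class LayerResults:
    schema_errors: int       # Layer A: number of schema validation errors
    coverage: float          # Layer B: fraction of content units referenced
    inadequate_ratio: float  # Layer C: fraction of sections rated inadequate


def final_verdict(results):
    """Return 'releasable' only when all three layers pass, else 'blocked'."""
    layer_a = results.schema_errors == 0
    layer_b = results.coverage >= MIN_COVERAGE
    # Layer C passes when the inadequate ratio does not exceed the limit.
    layer_c = not (results.inadequate_ratio > MAX_INADEQUATE)
    return "releasable" if (layer_a and layer_b and layer_c) else "blocked"
```
</pre>
<p>Values exactly at a threshold (70% coverage, 30% inadequate) count as passing in this sketch, matching the ≥ and ≤ signs in the table.</p>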

<h2>3. Issue Workflow</h2>

<h3>3.1 Polling</h3>
<pre>python scripts/agent_poller.py --action list --labels test-code
python scripts/agent_poller.py --action list --labels acceptance-failure</pre>

<h3>3.2 Closing the Loop on a test-code Issue</h3>
<div class="flow">
<div class="step">1. Claim<br>comment</div>
<div class="arrow">→</div>
<div class="step">2. Develop<br>tests/acceptance/</div>
<div class="arrow">→</div>
<div class="step">3. Verify locally<br>pytest</div>
<div class="arrow">→</div>
<div class="step">4. Commit<br>test/issue-N</div>
<div class="arrow">→</div>
<div class="step">5. PR + CI</div>
<div class="arrow">→</div>
<div class="step">6. merge</div>
<div class="arrow">→</div>
<div class="step">7. close</div>
</div>

<h3>3.3 e2e Verification Flow</h3>
<ol>
<li>Detect that dev-agent has finished the fix (the linked dev issue is closed)</li>
<li><code>git pull origin main</code></li>
<li><code>python scripts/run_pipeline.py --parsed &lt;path&gt; --test</code></li>
<li>Analyze the three-layer report</li>
<li>All PASS → close the test-code issue</li>
<li>Any FAIL remaining → reopen the dev issue and update the test-code issue</li>
</ol>

<h2>4. Issue Lifecycle Rules</h2>

<div class="card">
<h3>Close rules</h3>
<ul>
<li>QE tests pass → close the test-code issue</li>
<li>QE tests fail with a new problem → open a dev issue (agent-task); the test-code issue <strong>stays open</strong></li>
<li>QE tests fail and a dev issue already exists → the test-code issue <strong>stays open</strong></li>
<li><strong>Never</strong> close a test-code issue while the problem is still unfixed</li>
</ul>
</div>
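
<p>The close rules can be read as a small decision function. The sketch below is illustrative only: the function name and the action strings are hypothetical, and the actual agent acts on these rules through its prompt and <code>agent_poller.py</code> rather than through this code.</p>
<pre>
```python
def actions_after_qe_run(passed, dev_issue_exists):
    """Map a QE test result to the actions the close rules call for."""
    if passed:
        return ["close test-code issue"]
    actions = []
    if not dev_issue_exists:
        # A failure with no dev issue yet is treated as a new problem.
        actions.append("open dev issue (agent-task)")
    # On any failure the test-code issue is never closed.
    actions.append("keep test-code issue open")
    return actions
```
</pre>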

<div class="card">
<h3>Reopen rules</h3>
<ul>
<li>A dev issue was closed but QE re-verification still fails → <strong>reopen the dev issue</strong></li>
<li>A <code>## REOPEN by [qe-agent: qa-01]</code> comment is required, containing:<ol>
<li>What has been fixed (acknowledge the progress)</li>
<li>What is still failing (concrete data compared against the thresholds)</li>
<li>Conclusion: why the fix is incomplete</li>
</ol></li>
<li>After reopening, update the linked test-code issue as well</li>
</ul>
</div>
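
<p>A REOPEN comment with the three required parts could be assembled as shown below. Only the first heading line and the signature come from the rules above; the function name, the sub-heading names, and the bullet layout are assumptions made for illustration.</p>
<pre>
```python
QE_SIGNATURE = "[qe-agent: qa-01]"  # QE-Agent signature from this document


def build_reopen_comment(fixed, remaining, conclusion):
    """Build the comment body: fixed items, remaining problems, conclusion."""
    lines = [f"## REOPEN by {QE_SIGNATURE}", ""]
    lines.append("### Fixed")
    lines.extend(f"- {item}" for item in fixed)
    lines.append("")
    lines.append("### Still failing")
    lines.extend(f"- {item}" for item in remaining)
    lines.append("")
    lines.append("### Conclusion")
    lines.append(conclusion)
    lines.append("")
    lines.append(QE_SIGNATURE)  # QE comments end with the signature
    return "\n".join(lines)
```
</pre>
<p>The returned string could then be passed as the <code>--body</code> argument of the <code>comment</code> action listed in the command reference.</p>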

<h2>5. Inter-Agent Communication Protocol</h2>

<div class="card">
<p><strong>Issue state is the only communication channel</strong>. Both agents share the <code>pzhang_zywl</code> Gitea account and are told apart by their signatures:</p>
<ul>
<li><span class="badge badge-qe">QE</span> at the end of each comment: <code>[qe-agent: qa-01]</code></li>
<li><span class="badge badge-dev">Dev</span> at the end of each comment: <code>[da-01]</code></li>
</ul>
<p><strong>QE → Dev</strong>: problem found → open a dev issue (agent-task) or reopen an existing dev issue</p>
<p><strong>Dev → QE</strong>: fix complete → close the dev issue (after self-verification)</p>
<p><strong>QE acceptance</strong>: pull main → rerun e2e → on pass, close the test-code issue; on failure, reopen the dev issue</p>
</div>
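
<p>Because both agents post from the same account, the author of a comment has to be inferred from its trailing signature. A minimal sketch of that check, assuming the comment body is available as a plain string (the function name and the role labels are hypothetical):</p>
<pre>
```python
# Signatures as listed above; the role labels are illustrative.
SIGNATURES = {
    "[qe-agent: qa-01]": "qe",
    "[da-01]": "dev",
}


def comment_role(body):
    """Return 'qe' or 'dev' from the signature ending a comment, else None."""
    text = body.rstrip()  # ignore trailing newlines and spaces
    for signature, role in SIGNATURES.items():
        if text.endswith(signature):
            return role
    return None
```
</pre>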

<h2>6. Command Quick Reference</h2>

<table>
<tr><th>Operation</th><th>Command</th></tr>
<tr><td>Poll issues</td><td><code>agent_poller.py --action list --labels test-code</code></td></tr>
<tr><td>View an issue</td><td><code>agent_poller.py --action get --issue &lt;N&gt;</code></td></tr>
<tr><td>Comment</td><td><code>agent_poller.py --action comment --issue &lt;N&gt; --body "..."</code></td></tr>
<tr><td>Lifecycle</td><td><code>agent_poller.py --action lifecycle --issue &lt;N&gt;</code></td></tr>
<tr><td>Create a PR</td><td><code>agent_poller.py --action create-pr --issue &lt;N&gt; --branch test/issue-&lt;N&gt;</code></td></tr>
<tr><td>Check PR CI</td><td><code>agent_poller.py --action pr-status --pr &lt;N&gt;</code></td></tr>
<tr><td>Merge a PR</td><td><code>agent_poller.py --action merge-pr --pr &lt;N&gt;</code></td></tr>
<tr><td>Run the pipeline</td><td><code>python scripts/run_pipeline.py --parsed &lt;path&gt; --test</code></td></tr>
<tr><td>Acceptance tests</td><td><code>pytest tests/acceptance/ -v --run-acceptance</code></td></tr>
<tr><td>Layers A+B only</td><td><code>pytest tests/acceptance/ -v --run-acceptance -k "not test_layer_c"</code></td></tr>
</table>

<h2>7. File Structure</h2>

<pre>
tests/acceptance/
├── conftest.py          # Pytest configuration, fixtures, LLM client
├── ir_schema.py         # IR schema validation
├── report.py            # Three-layer JSON report
└── test_main_health.py  # Layer A → B → C

scripts/
├── agent_poller.py      # Gitea API tool
├── run_pipeline.py      # End-to-end pipeline runner
├── start_qe_agent.sh    # QE-Agent launch script
└── .env                 # Token configuration (gitignored)

agents/
├── QE_AGENT.md          # QE-Agent system instructions
└── DEV_AGENT.md         # Dev-Agent system instructions

.gitea/workflows/
├── ci.yml               # CI (push/PR)
└── acceptance.yml       # Manually triggered acceptance run
</pre>

<h2>8. Issues Handled in This Session</h2>

<table>
<tr><th>Issue</th><th>Description</th><th>Result</th></tr>
<tr><td>#10</td><td>Remove hard-coded paths; adapt to config.py</td><td><span class="pass">closed</span></td></tr>
<tr><td>#12</td><td>Implement the end-to-end acceptance test flow</td><td><span class="pass">closed</span></td></tr>
<tr><td>#14</td><td>Run the full e2e test</td><td><span class="pass">closed</span></td></tr>
<tr><td>#15</td><td>Dev: IR rules=[] (reopened multiple times)</td><td><span class="pass">closed</span></td></tr>
<tr><td>#18</td><td>Rerun the e2e test</td><td><span class="warn">open</span></td></tr>
<tr><td>#21</td><td>P0: insufficient coverage (reopened multiple times)</td><td><span class="fail">reopened</span></td></tr>
<tr><td>#22</td><td>P1: trigger.operator is empty</td><td><span class="pass">closed</span></td></tr>
</table>

<p style="margin-top:24px;color:var(--border);font-size:12px;">QE-Agent [qe-agent: qa-01] - document_analyzer project</p>

</body>
</html>