Compare commits

...

4 Commits

Author SHA1 Message Date
pzhang_zywl 440cd5812b fix: step2 prompt 增加功能完整性要求 - Closes #75
CI / test (pull_request) Successful in 7s
新增规则 #9:要求 LLM 覆盖上下文包中的每个表格行和每条文字描述,
确保不遗漏任何数据来源。

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-06-02 19:24:37 +08:00
pzhang_zywl 55dcfc1b3e Merge pull request 'fix: [bug] Layer C QE Audit 持续 REJECT — 1/5 adequate 需提升至 ≥70% - 来自 #18 - Closes #75' (#77) from dev/issue-75-round2-ensemble-temp into main
CI / test (push) Successful in 9s
2026-06-02 18:55:49 +08:00
pzhang_zywl 4a8032665f fix: ensemble 温度从 3 个增至 4 个增加多样性 - Closes #75
CI / test (pull_request) Successful in 8s
新增 t=0.5 温度变体,提高 ensemble 多样性以捕获更多功能单元。

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-06-02 18:55:16 +08:00
pzhang_zywl 6536c7fa9d Merge pull request 'fix: [bug] Layer C QE Audit 持续 REJECT — 1/5 adequate 需提升至 ≥70% - 来自 #18 - Closes #75' (#76) from dev/issue-75-retry-3 into main
CI / test (push) Successful in 10s
2026-06-02 18:35:44 +08:00
3 changed files with 6 additions and 3 deletions
+2 -1
View File
@@ -86,7 +86,8 @@ COVERAGE_TARGET = float(os.environ.get("IR_COVERAGE_TARGET", "0.95"))
ENSEMBLE_TEMPERATURES = [ ENSEMBLE_TEMPERATURES = [
float(os.environ.get("IR_ENSEMBLE_T1", "0.0")), float(os.environ.get("IR_ENSEMBLE_T1", "0.0")),
float(os.environ.get("IR_ENSEMBLE_T2", "0.3")), float(os.environ.get("IR_ENSEMBLE_T2", "0.3")),
float(os.environ.get("IR_ENSEMBLE_T3", "0.7")), float(os.environ.get("IR_ENSEMBLE_T3", "0.5")),
float(os.environ.get("IR_ENSEMBLE_T4", "0.7")),
] ]
@@ -186,6 +186,8 @@
8. **开关关闭状态**:开关关闭时所有限制失效,这也必须作为一条规则输出(path: ["...", "开关关闭", "无限制"])。 8. **开关关闭状态**:开关关闭时所有限制失效,这也必须作为一条规则输出(path: ["...", "开关关闭", "无限制"])。
9. **功能完整性要求(重要)**:上下文包中的每个表格行、每条文字描述、每个逻辑树路径都必须被至少一条规则覆盖。仔细检查上下文包,确保不遗漏任何数据来源。如果上下文包中有表格,每条表格行至少生成一条对应规则。
{format_feedback} {format_feedback}
## 输出格式 ## 输出格式
+2 -2
View File
@@ -83,8 +83,8 @@ def test_output_dir_structure():
def test_ensemble_temperatures_count(): def test_ensemble_temperatures_count():
"""Should have exactly 3 ensemble temperatures.""" """Should have exactly 4 ensemble temperatures."""
assert len(config.ENSEMBLE_TEMPERATURES) == 3 assert len(config.ENSEMBLE_TEMPERATURES) == 4
def test_max_tokens_is_int(): def test_max_tokens_is_int():