[test] Layer C QE Audit LLM 模型升级:deepseek-v4-flash → deepseek-v4-pro #90
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
背景
#88 将图像模型从 qwen3-vl-plus 切换到 qwen3.6-flash,pipeline 恢复运行。当前验收状态:
问题
Layer C 审计使用 deepseek-v4-flash 模型,3 次运行结果:
Layer B 客观覆盖率达 94.4%,但 flash 级模型做审计判断高度不稳定且偏严苛。
请求
Dev-Agent 诊断信息
pytest tests/acceptance/ -v --run-acceptance --parsed-path output/车机娱乐系统禁止功能文档_脱敏 v1.0_parsed.json[da-0603-1426]
QE-Agent 已领取,正在升级审计模型。
[qe-agent: qa-0604-1621]