doc_parser_skill:
- New: verify_flowchart.py (flowchart validation)
- Updated: LLM.py (multi-provider: DeepSeek + DashScope)
- Updated: image_parser.py (logic tree support, external prompts)
- Updated: SKILL.md, prompts/image_prompt.md

conflict_detection_skill:
- Updated: LLM.py (multi-provider sync)
- Updated: detect_conflicts.py (logic tree text conversion)

ir_generation_skill:
- Replaced old scripts/LLM.py + ir_generator.py with standalone project
- New: main.py, config.py, step1-3_*.py, ensemble_merge.py
- New: prompts/, tests/ subdirectories

tests:
- New: acceptance/ test suite with schema validation
- Fixed: conftest no longer globally skips non-acceptance tests
- Updated: test_sample.py for new ir_generation structure

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
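@@ -29,7 +29,10 @@ description: Parses documents (.docx, .pdf) to extract images and text structure, and
 
 The skill produces a structured JSON file, named after the input document's base name followed by '_parsed.json', containing:
 - `sections`: the document's text structure, grouped by heading
 - `image_sources`: a mapping from image identifiers to their locations in the document
-- `image_analysis`: the type and content description of each image, as determined by the vision LLM
+- `image_analysis`: the type, content description, and (where applicable) structured logic tree of each image, as determined by the vision LLM
+  - `type`: image type (flowchart/architecture/state/sequence/activity/other)
+  - `description`: textual description of the image
+  - `logic_tree` (optional, diagram types only): structured logic tree JSON containing `root` (a description of the root node) and a `nodes` array. Node types: `decision`, `action`, `state`, `start`, `end`. A decision node carries `condition` and `branches` fields; every other node carries a `description` field.
 
 ## Integration points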
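To make the `logic_tree` shape in the diff concrete, here is a minimal Python sketch of one `image_analysis` entry and a structural check for it. The source does not show several details, so these are assumptions: that each node names its type under a `type` key, that `branches` is a mapping from branch label to target description, and the sample values themselves. `validate_logic_tree` is a hypothetical helper, not the project's `verify_flowchart.py`.

```python
# Node types listed in the SKILL.md diff.
NODE_TYPES = {"decision", "action", "state", "start", "end"}

# Hypothetical image_analysis entry; all values are made up for illustration.
EXAMPLE_IMAGE_ANALYSIS = {
    "type": "flowchart",
    "description": "Login flow",
    "logic_tree": {
        "root": "User login",
        "nodes": [
            {"type": "start", "description": "User opens the login page"},
            {
                "type": "decision",
                "condition": "Are the credentials valid?",
                # Assumed shape: branch label -> description of the next step.
                "branches": {"yes": "Show dashboard", "no": "Show error"},
            },
            {"type": "action", "description": "Show dashboard"},
            {"type": "end", "description": "Flow finished"},
        ],
    },
}


def validate_logic_tree(tree):
    """Return a list of structural problems; an empty list means none found."""
    errors = []
    if "root" not in tree:
        errors.append("missing 'root'")
    nodes = tree.get("nodes")
    if not isinstance(nodes, list):
        errors.append("'nodes' must be a list")
        return errors
    for i, node in enumerate(nodes):
        if not isinstance(node, dict):
            errors.append(f"nodes[{i}]: expected an object")
            continue
        node_type = node.get("type")  # assumed key name
        if node_type not in NODE_TYPES:
            errors.append(f"nodes[{i}]: unknown type {node_type!r}")
        elif node_type == "decision":
            # Decision nodes carry 'condition' and 'branches'.
            for field in ("condition", "branches"):
                if field not in node:
                    errors.append(f"nodes[{i}]: decision missing '{field}'")
        elif "description" not in node:
            # All other node types carry 'description'.
            errors.append(f"nodes[{i}]: {node_type} missing 'description'")
    return errors


if __name__ == "__main__":
    print(validate_logic_tree(EXAMPLE_IMAGE_ANALYSIS["logic_tree"]))
```

The check only covers the field rules stated in the diff. It does not verify that branches point at existing nodes, since the source does not say how nodes reference each other.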