DocumentToolkit
Document toolkit for parsing documents and support Q&A.
Support backends:
- Chunkr: https://github.com/lumina-ai-inc/chunkr
-
pymupdf: https://github.com/pymupdf/PyMuPDF
-
[ ] unify the filepath cache logic (also suppoort audio_toolkit, image_toolkit)
DocumentToolkit
Bases: AsyncBaseToolkit
源代码位于: utu/tools/document_toolkit.py
18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 | |
tools_map
property
Lazy loading of tools map. - collect tools registered by @register_tool
__init__
__init__(config: ToolkitConfig = None) -> None
Initialize the DocumentToolkit, with configed parser and llm.
源代码位于: utu/tools/document_toolkit.py
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 | |
document_qa
async
Get file content summary or answer questions about attached document.
Supported file types: pdf, docx, pptx, xlsx, xls, ppt, doc
参数:
| 名称 | 类型 | 描述 | 默认 |
|---|---|---|---|
document_path
|
str
|
Local path or URL to a document. |
必需 |
question
|
str
|
The question to answer. If not provided, return a summary of the document. |
None
|
源代码位于: utu/tools/document_toolkit.py
55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 | |
get_tools_map_func
Get tools map. It will filter tools by config.activated_tools if it is not None.
源代码位于: utu/tools/base.py
56 57 58 59 60 61 62 63 64 65 | |
get_tools_in_agents
get_tools_in_agents() -> list[FunctionTool]
Get tools in openai-agents format.
源代码位于: utu/tools/base.py
67 68 69 70 71 72 73 74 75 76 77 78 | |
get_tools_in_openai
Get tools in OpenAI format.
源代码位于: utu/tools/base.py
80 81 82 83 | |
get_tools_in_mcp
get_tools_in_mcp() -> list[Tool]
Get tools in MCP format.
源代码位于: utu/tools/base.py
85 86 87 88 | |
call_tool
async
Call a tool by its name.
源代码位于: utu/tools/base.py
90 91 92 93 94 95 96 | |