feat: add a new internal analyze_files tool with supporting infrastructure for file attachment handling #368

cristian-groza · 2025-12-19T16:56:55Z

Key Changes

New analyze_files tool: Internal tool for analyzing file attachments with configurable analysis types
Job attachments module: Extracted job attachment handling logic into dedicated job_attachments.py module with improved organization
Job attachment wrapper: New wrapper class for managing file attachments in agent workflows
Internal tool factory: Factory pattern implementation for creating and managing internal tools
Dependency updates: Updated uipath and jsonschema-pydantic-converter package versions

Technical Highlights

JSON Schema Converter

The converter solves a critical challenge with dynamically generated Pydantic models by:

Creating a shared pseudo-module (jsonschema_pydantic_converter._dynamic) in sys.modules that acts as a namespace for all dynamically created types
Recursively collecting and registering all nested BaseModel types with proper module attribution
Ensuring get_type_hints() can resolve forward references when models are used with external libraries like LangGraph

Why this was needed: Dynamically generated Pydantic models need proper module references for type introspection to work correctly. Without this, frameworks that rely on typing.get_type_hints() (like LangGraph) fail to resolve forward references in nested models, breaking validation and serialization.

src/uipath_langchain/agent/react/job_attachments.py

src/uipath_langchain/agent/react/init_node.py

src/uipath_langchain/agent/react/job_attachments.py

src/uipath_langchain/agent/tools/internal_tools/analyze_files_tool.py

andreitava-uip · 2025-12-22T14:25:51Z

src/uipath_langchain/agent/tools/internal_tools/analyze_files_tool.py

+class AnalyzeFileTool(StructuredToolWithOutputType, ToolWrapperMixin):
+    pass


this is ok for now, but we will have to revise this at some point.

It doesn't provide a nice way to compose functionalities (like file handling & static args)

It risks other code relying on isinstance(AnalyzeFileTool). In my design, systems would only search of Mixins

src/uipath_langchain/agent/wrappers/job_attachment_wrapper.py

andreitava-uip · 2025-12-22T14:32:30Z

src/uipath_langchain/agent/react/types.py

    """Agent Graph state for standard loop execution."""

    messages: Annotated[list[AnyMessage], add_messages] = []
+    job_attachments: Annotated[dict[str, Attachment], add_job_attachments] = {}


nit: this is ok for now but we will likely change our internal state management in the near future

src/uipath_langchain/agent/react/job_attachments.py

andreitava-uip · 2025-12-22T15:47:07Z

src/uipath_langchain/agent/react/job_attachments.py

+    results = []
+    for json_path in json_paths:
+        expr = parse(json_path)
+        matches = expr.find(data)
+        results.extend([match.value for match in matches])


what happens if some of the paths overlap?
ie: $.attachments[*] vs $attachments[0]

In that case we will have duplicate matches in the result. Is that a problem?
If we assume we will get disjoint paths (which is the case for the paths you extract earlier), we could at least make it clear in the docstring.

If the paths overlap the function will return duplicates. Currently this is not a problem because the get_json_paths_by_type function is producing disjoint paths by design. I will update the docstring to make this behavior clear.

src/uipath_langchain/agent/react/job_attachments.py

andreitava-uip · 2025-12-22T16:04:38Z

src/uipath_langchain/agent/react/jsonschema_pydantic_converter.py

+def create_model(
+    schema: dict[str, Any],
+) -> Type[BaseModel]:
+    model, namespace = transform_with_modules(schema)
+    corrected_namespace: dict[str, Any] = {}


i'm really not a fan of this... but there is no way around this.

Our InputModel is dynamically generated from json schema -> unavoidable

We need to dynamically extend the AgentGraphState with the InputModel, otherwise we do not have the inputs in the state -> almost unavoidable

AgentGraphState is then inspected by langgraph internals with get_type_hints(), which will fail to resolve dynamic types unless we do this. We have no mechanism to pass a local namespace through.

andreitava-uip

Please squash commits before merging

cristian-groza force-pushed the feat/add-analyze-file-tool branch from 58d7fc8 to f31733d Compare December 22, 2025 09:32

cristian-groza changed the title ~~WIP add analyze file tool~~ feat: add a new internal analyze_files tool with supporting infrastructure for file attachment handling Dec 22, 2025

cristian-groza force-pushed the feat/add-analyze-file-tool branch from f91813c to 958edcc Compare December 22, 2025 12:15

cotovanu-cristian reviewed Dec 22, 2025

View reviewed changes

src/uipath_langchain/agent/react/job_attachments.py Show resolved Hide resolved

cotovanu-cristian reviewed Dec 22, 2025

View reviewed changes

src/uipath_langchain/agent/react/init_node.py Outdated Show resolved Hide resolved

andreitava-uip reviewed Dec 22, 2025

View reviewed changes

cotovanu-cristian reviewed Dec 22, 2025

View reviewed changes

src/uipath_langchain/agent/react/job_attachments.py Outdated Show resolved Hide resolved

cotovanu-cristian reviewed Dec 22, 2025

View reviewed changes

src/uipath_langchain/agent/react/job_attachments.py Outdated Show resolved Hide resolved

andreitava-uip reviewed Dec 22, 2025

View reviewed changes

cristian-groza requested review from andreitava-uip and cotovanu-cristian December 23, 2025 09:43

cristian-groza added 10 commits December 23, 2025 11:45

feat: add jsonchema pydantic converter

8a8ad0e

fix: tool args

fb8cda8

fix: refactor code

39ce271

fix: update uipath and jsonschema-pydantic-converter versions

a2fb69c

fix: refactored code

62ca99d

fix: linting issues

8738ab4

fix: linting issues

c4d7c54

fix: linting issues

9ca88f0

fix: ruff format issues

a3165a9

fix: address PR comments

c9e633a

cristian-groza force-pushed the feat/add-analyze-file-tool branch from 7c81f33 to abcc021 Compare December 23, 2025 09:52

cristian-groza added 2 commits December 23, 2025 12:02

fix: pr comments

23a8548

fix: increment package version

5fe66f5

cristian-groza force-pushed the feat/add-analyze-file-tool branch from 6a9b2fb to 5fe66f5 Compare December 23, 2025 10:02

cristian-groza added 3 commits December 23, 2025 16:08

feat: resolve job attachments and call llm with files

f41e418

fix: add unit tests

48551df

fix: linting errors

daa8b8a

andreitava-uip reviewed Dec 23, 2025

View reviewed changes

andreitava-uip approved these changes Dec 23, 2025

View reviewed changes

cristian-groza merged commit 316002d into main Dec 23, 2025
39 checks passed

cristian-groza deleted the feat/add-analyze-file-tool branch December 23, 2025 15:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add a new internal analyze_files tool with supporting infrastructure for file attachment handling #368

feat: add a new internal analyze_files tool with supporting infrastructure for file attachment handling #368

Uh oh!

cristian-groza commented Dec 19, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025

Uh oh!

cristian-groza Dec 23, 2025

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025 •

edited

Loading

Uh oh!

andreitava-uip left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		class AnalyzeFileTool(StructuredToolWithOutputType, ToolWrapperMixin):
		pass

feat: add a new internal analyze_files tool with supporting infrastructure for file attachment handling #368

feat: add a new internal analyze_files tool with supporting infrastructure for file attachment handling #368

Uh oh!

Conversation

cristian-groza commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

cristian-groza Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

andreitava-uip Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andreitava-uip left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cristian-groza commented Dec 19, 2025 •

edited

Loading

andreitava-uip Dec 22, 2025 •

edited

Loading

andreitava-uip left a comment •

edited

Loading