server/public_simplechat vision (basic ok), toolcall (done, with 0 setup clientside builtin tools+), reasoning(done) #17142

hanishkvc · 2025-11-10T11:05:00Z

This adds the initial basic skeleton for supporting vision models to the tools/server/public_simplechat alternate web client ui. Basic testing has been done with Gemma3 and Qwen3VL.

This builds on the previous PR in this series, i.e. #17038, which completed the current initial go at tool calling. That includes a set of bundled client side builtin tool calls for code execution, data storage, web access, web search, pdf and xml use, ..., as well as support for looking into the reasoning / chain of thought of the models which share the same.

The tool calls no longer need to worry about
* setting up the console.log related redirection to capture the generated outputs, nor about
* setting up a dynamic function for executing the needed tool call related code.

The web worker setup, which helps run tool calls in a relatively isolated environment independent of the main browser env, takes care of these. One only needs to worry about getting the handle to the web worker to use and in turn passing the needed code wrt the tool call to it.

tools manager/module
* set up the web worker that will help execute the tool call related code in a js environment that is isolated from the browser's main js environment
* pass the web worker to the tool call providers, for them to use
* don't wait for the result from the tool call, as it will be got later asynchronously through a message
* allow users of the tools manager to register a callback, which will be called whenever a message is got from the web worker containing the response wrt a previously requested tool call execution

simplechat
* decouple the toolcall response handling and toolcall requesting logic
* set up a timeout to take back control if a tool call takes up too much time. In turn help alert the ai model that the tool call took up too much time and so was aborted, by placing an appropriately tagged tool response into the user query area.
* register a callback that will be called when a response is got asynchronously wrt any requested tool call. In turn take care of updating the user query area with the response got wrt the tool call, along with the tool response tag around it.
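The decoupled request / response flow with a timeout, as listed above, could be sketched roughly as below. This is only a minimal illustration and not the PR's actual code: the class name ToolCallTracker, its methods and the message shape { id, name, data } are all assumptions made for this example, and the worker can be any object with a postMessage method.

```javascript
// Hypothetical sketch; names and message shape are assumptions, not the PR's identifiers.
class ToolCallTracker {
    /**
     * @param {number} timeoutMs - how long to wait before giving up on a tool call
     * @param {(id: string, name: string, data: string) => void} onResult - callback
     *        invoked with either the worker's response or a timeout notice
     */
    constructor(timeoutMs, onResult) {
        this.timeoutMs = timeoutMs;
        this.onResult = onResult;
        this.pending = new Map(); // tool call id -> timeout handle
    }

    // Hand the code to the worker and return immediately; the result arrives later.
    request(worker, id, name, code) {
        const handle = setTimeout(() => {
            this.pending.delete(id);
            this.onResult(id, name, `Tool call ${name} took too much time, aborted`);
        }, this.timeoutMs);
        this.pending.set(id, handle); // keep the handle so that it can be cleared
        worker.postMessage({ id, name, code });
    }

    // Call this from the worker's message handler with the received message data.
    handleMessage(msg) {
        const handle = this.pending.get(msg.id);
        if (handle === undefined) return false; // unknown, or already timed out
        clearTimeout(handle);
        this.pending.delete(msg.id);
        this.onResult(msg.id, msg.name, msg.data);
        return true;
    }
}
```

In a browser the worker would be created with something like new Worker(url, { type: "module" }), and its onmessage handler would forward event.data to handleMessage. The onResult callback is where the tagged tool response would be placed into the user query area.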

Had forgotten to specify the type as module wrt the web worker, which is needed to allow it to import the toolsconsole module.

Had forgotten to maintain the id of the timeout handler, which is needed to clear/stop the timeout handler from triggering if the tool call response is got well in time.

As I am currently reverting the console redirection at the end of handling a tool call code in the web worker message handler, I need to set up the redirection each time. Also I had forgotten to clear the console.log capture data space before a new tool call code is executed; this is also fixed by this change.

TODO: In future, if possible, abort the tool call code execution in the web worker if the client / browser side times out waiting for the tool call response, i.e. if the tool call code is taking up too much time.
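The per-call console.log handling described above might look something like the following inside the worker. It is a sketch under assumptions: runToolCode is a made-up name, and building the dynamic function with new Function is one possible approach rather than necessarily what the PR does.

```javascript
// Hypothetical sketch of per-call console.log capture; runToolCode is an illustrative name.
function runToolCode(code) {
    const captured = [];               // fresh capture space for every tool call
    const originalLog = console.log;
    // Set up the redirection each time, since it is reverted at the end of each call.
    console.log = (...args) => {
        captured.push(args.map(String).join(" "));
    };
    try {
        new Function(code)();          // dynamic function wrapping the tool call code
    } catch (err) {
        captured.push(`Error: ${err instanceof Error ? err.message : String(err)}`);
    } finally {
        console.log = originalLog;     // always revert the redirection
    }
    return captured.join("\n");
}
```

Because the capture array is created inside the function, output from an earlier tool call cannot leak into the result of a later one.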

Tool calling, if enabled, needs access to the last few user queries and ai assistant responses (which also include the tool call requests and the corresponding results), so that the model can build answers based on its tool call requests and the responses got. Given also that most models these days have sufficiently large context windows, the sliding window context implemented by the SimpleChat logic has been increased by default to include roughly the last 4 queries and their responses.
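A sliding window of this kind can be sketched as below. The function name recentChat, the recentUserCount parameter and the convention that a negative count means full history are assumptions for illustration; messages are assumed to be plain { role, content } objects.

```javascript
// Hypothetical sketch: keep the latest system prompt plus everything from the
// Nth most recent user message onwards (assistant and tool messages included).
function recentChat(messages, recentUserCount) {
    if (recentUserCount < 0) return messages.slice(); // assumed: negative means full history
    const system = messages.filter((m) => m.role === "system").slice(-1);
    let start = messages.length;
    let seen = 0;
    // Walk backwards until the required number of user messages has been seen.
    for (let i = messages.length - 1; i >= 0 && seen < recentUserCount; i--) {
        if (messages[i].role === "user") {
            seen += 1;
            start = i;
        }
    }
    const tail = messages.slice(start).filter((m) => m.role !== "system");
    return [...system, ...tail];
}
```

Counting user messages rather than raw messages keeps each query together with the tool call requests and results that followed it.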

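Update the descriptions to indicate the use of web workers, i.e. wrt the tool calls provided.

Move AssistantResponse into the chat message class. Modify the constructor, newFrom and clear towards this goal.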

Rename ChatMessage to ChatMessageEx. Add typedefs for NSToolCall and NSChatMessage; they represent the way the corresponding data is structured in the network handshake. Add logic to build the ChatMessageEx from data got over the network in streaming mode.

Update HasToolCalls and ContentEquiv to work with the new structure.

Remove ResponseExtractStream and use the equivalent update_stream added directly to ChatMessageEx. update_stream is also more generic to some extent.

The response_extract logic has been moved directly into ChatMessageEx as update_oneshot, with suitable adjustments. In turn use the same directly.

The ods load and system prompt related logic has been updated to work with ChatMessageEx to an extent.

GetSystemLatest and its users updated wrt ChatMessageEx. RecentChat updated wrt ChatMessageEx. Also now irrespective of whether full history is being retrieved or only a subset, both cases refer to the ChatMessageEx instances in SimpleChat.xchat without creating new instances of anything.

Simplify the Add semantic by expecting any validation before adding to be done by the callers of Add and not by Add itself. Also update it to expect a ChatMessageEx object. Update all users of Add to follow the new syntax and semantic. Remove the old and unused AddSysPromptOnlyAtBegin helper.

Users of recent_chat updated to work with ChatMessageEx. As part of the same, recent_chat_ns is also added, for the case where the array of chat messages can be passed as is, i.e. in the chat mode, provided it has only the network handshake representation of the messages.

Clean up the remaining code wrt the flow required by ChatMessageEx, as well as to avoid warnings.

Use the HTMLElement's dataset to maintain the tool call id along with the element which maintains the toolname. Pass it along to the tools manager, and in turn to the actual tool calls, and through them to the web worker handling the tool call related code, which in turn returns it as part of the obj used to return the tool call result.

Embed the tool call id, function name and function result into the content field of the chat message in terms of an xml structure. Also make use of the tool role to send back the tool call result. Do note that currently the id, name and content are all embedded into the content field of the tool role message sent to the ai engine on the server.

NOTE: The user query entry area is used for showing the tool call result in the above mentioned xml form, as well as for the user to enter their own queries. Based on the presence of the xml format data at the beginning, the logic will treat it as a tool result, and if not then as a normal user query.

The css has been updated to help show tool results/msgs with a lightyellow background.

Expand the xml format id, name and content in the content field of the tool result into the appropriate fields in the tool result message sent to the genai/llm engine on the server.
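The embedding of the tool result as xml in the user query area, and its later expansion into a tool role message, could look roughly like this. The tag names (tool_response, id, name, content), both function names and the regex based parsing are assumptions for this sketch; the tool_call_id field follows the openai style tool message, while passing name along is an assumption about what the server accepts.

```javascript
// Hypothetical sketch; tag names and helper names are illustrative only.
function buildToolResponseXml(id, name, content) {
    return [
        "<tool_response>",
        `<id>${id}</id>`,
        `<name>${name}</name>`,
        `<content>${content}</content>`,
        "</tool_response>",
    ].join("\n");
}

// If the text starts with the xml structure, expand it into a tool role message,
// else treat it as a normal user query.
function toolMessageFromXml(text) {
    const re = /^<tool_response>\s*<id>([\s\S]*?)<\/id>\s*<name>([\s\S]*?)<\/name>\s*<content>([\s\S]*)<\/content>\s*<\/tool_response>\s*$/;
    const m = text.match(re);
    if (!m) return { role: "user", content: text };
    return { role: "tool", tool_call_id: m[1], name: m[2], content: m[3] };
}
```

A real implementation would also need to decide how to handle result content which itself contains these tags; the sketch simply matches up to the last closing content tag.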

While at it, also add ns_delete. These common helpers avoid needing to add ignore tags for ts-check in places where valid and legal constructs have been used which go beyond the strict structured js handling that ts-check tries to achieve.

Also update the sliding window context size to the last 9 chat messages, so that there is a sufficiently large context for multi turn tool call based adjusting by ai and user, without needing to go full hog, which has the issue of overflowing the context window currently set for the loaded ai model.

Log the trapping of promises. Also a possible refinement wrt the trapping, if needed, is added as a comment. Whether to use all or allSettled is the question. Whether to wait for a round trip through the related event loop or not is also a question.

Don't forget to map members of the entity got from fetch to things from the saved original promise, because remember that what is got is a promise. Also add some comments around certain decisions and needed exploration.

Use css conditional attribute styling to change the background color of the user input textarea to match the tool role message block color when the user input textarea is in the TOOL.TEMP mode. With this the user can know that the user input area is currently showing and accepting tool result data for submission.

Currently the logic doesn't allow the user to send an empty message to the ai during their turn. Previously this path wasn't directly alerting the end user. Now it informs the end user using the placeholder property, so they can see the alert, while also ensuring that once the user enters something, the alert won't interfere. The logic takes care of saving any original placeholder, so that the same is restored when the user switches sessions.

Avoid directly accessing the content field from any place other than where it is absolutely required.

Add a bOverwrite field to the content_adj helper, so that one can overwrite instead of appending the passed content to what's already in.
* this is currently used only wrt
  * the promote_tooltemp helper
  * the trim garbage helper
* the oneshot could ideally use overwriting, but currently not doing so as this flow will occur only once per message

Add an image_url field for the image url, with the image data in dataurl format with base64 encoded image data.

If I can't control the look of the file type input, I may have to hide it and use a normal button, which chains into file selection or so.

Also rename the id/label of InFile+Btn to Image. Extra fields while Adding.

Add a new helper to create a file type input which includes a btn with an image. Use the same wrt the user image selection button. Update the button creation helper to show innerText only if the newly added innerHTML arg is undefined. Whenever the user makes an image selection, the image will be shown in the input-filetype-image-button. In turn when the same is submitted to the ai engine server, the image will be cleared.

There can be an issue with chat.add->chat.save, in that trying to store into localStorage or so can raise an exception, like quota exceeded. So now trap chat.add also, and in turn for now take care of clearing the image state, while also trapping and rethrowing a new error which identifies the above location as well as tracks the original err.

Move all dataUrl handling into helper functions, so that its manipulation is done in a controlled manner and, in future, changes to the semantic can be easily carried out by updating the helper functions suitably and in turn updating the callers as needed. For now avoid push and pop and work with the 0th index directly, given that currently the logic is set up for handling only a single image with the ai model. This keeps things simple. It can be changed easily if required in future.

If a caught error has chained in details about what triggered it in the 1st place, then show those also to the user.
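The trap, rethrow and show flow wrt chat.add->chat.save could be sketched as below, using the standard cause option of the Error constructor to chain the original error. The function names addAndSave and describeError, the callbacks passed in and the message texts are all assumptions for this example; the PR may chain the details in a different way.

```javascript
// Hypothetical sketch; helper names and message texts are illustrative only.
function addAndSave(saveFn, clearImageFn) {
    try {
        saveFn(); // may throw, e.g. when the storage quota is exceeded
    } catch (err) {
        clearImageFn(); // for now just clear the image state
        // Rethrow a new error which identifies this location and tracks the original.
        throw new Error("HandleUserSubmit: chat add/save failed", { cause: err });
    }
}

// Build the text shown to the user, including the chained details if any.
function describeError(err) {
    let text = `${err.name}: ${err.message}`;
    if (err.cause instanceof Error) {
        text += ` [caused by ${err.cause.name}: ${err.cause.message}]`;
    }
    return text;
}
```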

hanishkvc · 2025-11-10T19:05:49Z

A basic / simple vision flow should work now.

- the user can select an image to be loaded/passed to the ai (the same can be viewed directly in the image load button itself, once loaded from the filesystem)
  - they can change the image to pass, if required, by clicking the image button to load a new image
- the same is handshaked with the ai server using the openai http handshake format of an array of items, containing text and image_url items in the array as needed
- any reasoning generated by the ai model when analysing the image will be shown to the user (rather implicit in the flow from before), as well as the response generated
- the user query and the associated image are shown in the chat session view
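For reference, the mixed content form of a user message in the openai chat completions format looks like what the sketch below builds. The helper name buildUserMessage is an assumption for illustration; the type, text and image_url.url fields are the ones defined by that format.

```javascript
// Hypothetical helper; only the message structure follows the openai format.
function buildUserMessage(text, imageDataUrl) {
    // Without an image, the plain string form of content is sufficient.
    if (!imageDataUrl) return { role: "user", content: text };
    return {
        role: "user",
        content: [
            { type: "text", text: text },
            { type: "image_url", image_url: { url: imageDataUrl } },
        ],
    };
}
```

The image url here is a data url, e.g. one starting with data:image/png;base64, followed by the base64 encoded image data.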

Given the limit of around 5MB or so enforced by browsers wrt localStorage, the auto save and the option to restore a previous chat can fail, given that it will get filled fast, possibly even with a single large image of a few MBs or more. May change to indexedDB later.
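To get a rough feel for how quickly this limit is hit: base64 encodes every 3 bytes as 4 characters, so the data url is about a third larger than the image itself. The sketch below estimates the stored length; the helper names are made up, and treating the quota as a simple character count is an assumption, since how browsers account for the stored data varies.

```javascript
// Hypothetical helpers for a rough size estimate of an image stored as a data url.
function dataUrlLength(mimeType, byteCount) {
    const prefix = `data:${mimeType};base64,`;
    // Padded base64 output is always 4 characters per (started) group of 3 bytes.
    return prefix.length + 4 * Math.ceil(byteCount / 3);
}

function fitsInQuota(mimeType, byteCount, quotaChars) {
    return dataUrlLength(mimeType, byteCount) <= quotaChars;
}
```

Under these assumptions a single 4 MiB image already needs about 5.6 million characters, which is more than a 5 MiB budget, before counting any of the chat text.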

Limit scroll to the vertical direction, and in turn show the scrollbar only when needed. Move the Tool Call Edit UI's HR into its div. Add a bottom margin wrt the individual chat messages.

Show the current settings info in a details block rather than a div, so that we don't overload users with the details by default, but the user can open or close the block of current settings info details.

Distinguish between top level and remaining levels. More flexibility and also cleaner flow.

Add a gradient wrt the heading, and placeholders for a few other logical elements.

Move the core indexedDB helper logic into a module, i.e. opening the db as well as a transaction to access a store within the db.

Added put and get helpers wrt indexedDB. Updated save and load related logics in SimpleChatTCRV.

Allow passing the accept list to the file type input element helper. In turn, given that it is currently used for the image selection for vision models, set it to jpeg and png in the caller for the same.

hanishkvc · 2025-11-12T00:47:00Z

Changed the chat session auto save and user optional load to use indexedDB instead of localStorage, so that there is enough space available for saving chat sessions, given that images are in the mix now.

Cleaned up the ui a tiny little bit

hanishkvc added 30 commits November 6, 2025 15:32

SimpleChatTC: Update readme.md wrt latest updates. 2k maxtokens

06a5480

SimpleChatTC: update descs to indicate use of web workers

2899c84

SimpleChatTC:ChatMessage: AssistantResponse into chat message class

6e7c9f5

SimpleChatTC:ChatMessageEx:cleanup, HasToolCalls, ContentEquiv

0d8ce70

SimpleChatTC:ChatMessage: remove ResponseExtractStream

4ab4462

SimpleChatTC:ChatMessageEx: add update_oneshot

d59fa84

SimpleChatTC:ChatMessageEx: ods load, system prompt related

5c4ee51

SimpleChatTC:ChatMessageEx: Cleanup remaining stuff

e707d91

SimpleChatTC:Load allows old and new ChatMessage(Ex) formats

674312b

SimpleChatTC:ChatMessageEx: send tool_calls, only if needed

df5cfac

SimpleChatTC:ChatMessageEx: Build tool role result fully

9f234e3

SimpleChatTC:ChatMessageEx:While at it also ns_delete

42b3fe1

SimpleChatTC:ChatMessageEx: Better tool result extractor

8a28b33

SimpleChatTC:ChatMessageEx: 1st go at trying to track promises

e43342b

SimpleChatTC:TrapPromise: log the trapping

d0c3939

SimpleChatTC:Promises: trap normal fetch (dont care await or not)

84fd55b

SimpleChatTC: Allow await in generated code that will be evald

ada6fd1

SimpleChatTC:Ensure fetch's promise chain is also trapped

ae62f3a

SimpleChatTC: update readme wrt promise related trapping

1f685f1

SimpleChatTC:WebFetchThroughProxy:Initial go creating request

32bb28e

hanishkvc added 8 commits November 9, 2025 00:43

SimpleChatTC:Vision: take care of image_url wrt newFrom & load

c3cb0ce

SimpleChatTCRV:Vision: Create needed MixedContent b4 nw handshake

f3c7e3a

SimpleChatTCRV:Vision: Add file input to get hold of img file

16277e2

SimpleChatTCRV:Basic skeleton to load a dataUrl

d3a1d49

SimpleChatTCRV:Submit: Remember to include image, if available

4152f78

hanishkvc changed the title ~~server/public_simplechat vision (wip), toolcall (done, with 0 cost builtin tools+), reasoing(done)~~ server/public_simplechat vision (wip), toolcall (done, with 0 setup clientside builtin tools+), reasoing(done) Nov 10, 2025

github-actions bot added examples python python script changes server labels Nov 10, 2025

DajanaV mentioned this pull request Nov 10, 2025

UPSTREAM PR #17142: server/public_simplechat vision (wip), toolcall (done, with 0 setup clientside builtin tools+), reasoing(done) auroralabs-loci/llama.cpp#155

Open

hanishkvc added 5 commits November 10, 2025 18:09

SimpleChatTCRV:Vision:Show images as part of the message

d574263

SimpleChatTCRV:HandleUserSubmit:details of internaly caught exc

f1b36f7

hanishkvc added 8 commits November 11, 2025 12:17

SimpleChatTCRV:UI Cleanup: scrollOnlyIf, Tool HR, Msg Margin

7c0a315

SimpleChatTCRV:UI Cleanup: DetailsNotDiv Current settings info

902ae9f

SimpleChatTCRV:UICleanup: ObjInfo dClassNames

f74d42d

SimpleChatTCRV:UICleanup: gradient wrt heading

49de720

SimpleChatTC:IDBHelper: Move core indexedDB helper as a module

ef41564

SimpleChatTCRV:iDB:Add Put/Get; SimpleChat Save/Load using iDB

9269fb6

SimpleChatTCRV:iDB:GetKeys: helps decide whether restore btn shown

f832f06

SimpleChatTCRV:InputFileDialog:AcceptList:Images:Jpeg and Png

55d62aa

hanishkvc changed the title ~~server/public_simplechat vision (wip), toolcall (done, with 0 setup clientside builtin tools+), reasoing(done)~~ server/public_simplechat vision, toolcall (done, with 0 setup clientside builtin tools+), reasoing(done) Nov 12, 2025

hanishkvc changed the title ~~server/public_simplechat vision, toolcall (done, with 0 setup clientside builtin tools+), reasoing(done)~~ server/public_simplechat vision (basic ok), toolcall (done, with 0 setup clientside builtin tools+), reasoing(done) Nov 12, 2025
