End‑to‑end multimodal chat with document parsing, media uploads, audio recording, and streaming markdown rendering#1316
Draft
SignalRT wants to merge 2 commits intoSciSharp:masterfrom
Draft
End‑to‑end multimodal chat with document parsing, media uploads, audio recording, and streaming markdown rendering#1316SignalRT wants to merge 2 commits intoSciSharp:masterfrom
SignalRT wants to merge 2 commits intoSciSharp:masterfrom
Conversation
Initial version
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary:
This PR delivers a full multimodal chat pipeline in LLama.Web: PDF and Word document ingestion with text extraction, image and audio uploads, native in‑browser audio recording (preview/attach/discard), plus streaming response
rendering with Markdown support.
Key Features:
Implementation Highlights
Capability to upload images and ask about the images
Model auto-download + Capability to upload files and ask about the files
