Local rag by BenBritons · Pull Request #88 · lsfusion/plugin-idea

BenBritons · 2026-02-08T22:18:53Z

Summary

Added a local RAG pipeline for LSF sources with Lucene vector indexing and ONNX-based embeddings.
Wired startup indexing and VFS-based reindexing to keep embeddings up to date.
Added model download + native ONNX extraction to make runIde reliable on Windows.

What changed vs master

New files

src/com/lsfusion/mcp/LocalMcpRagService.java — local RAG service: indexing, vector storage, query scoring.
src/com/lsfusion/mcp/OnnxEmbeddingProvider.java — ONNX Runtime embedding inference.
src/com/lsfusion/mcp/EmbeddingProvider.java — embedding provider interface.
src/com/lsfusion/mcp/LSFMcpRagFileListener.java — VFS listener to reindex on file changes.

Updated

src/com/lsfusion/LSFBaseStartupActivity.java — startup indexing + listener registration.
src/com/lsfusion/mcp/McpServerService.java, src/com/lsfusion/mcp/McpToolset.kt, src/com/lsfusion/mcp/MCPSearchUtils.java — integrate local RAG search results.
build.gradle.kts — new dependencies + model download + native extraction + runIde JVM args.

How it works

Indexing

On startup, LocalMcpRagService scans all LSF files, extracts MCP declarations, builds a text payload and computes embeddings.
Each record is stored in Lucene with the vector saved as a binary field.
A VFS listener triggers reindexing on file change/delete.

Query

Input query is embedded with ONNX Runtime.
Results are scored via dot‑product against stored vectors and top‑K matches returned.

Model + native libs

downloadE5Model fetches model.onnx + tokenizer.json into .mcp-model.
ONNX native DLLs are extracted into build/onnxruntime-native.
runIde sets onnxruntime.native.path and a stable temp dir for reliable loading on Windows.

Technologies

ONNX Runtime (Java) — CPU embeddings.
DJL HuggingFace Tokenizers — tokenization from tokenizer.json.
Lucene — vector storage and scoring.
IntelliJ Platform APIs — startup activity, VFS listener, DumbService guard.

Test plan

runIde
Wait for initial indexing
Run MCP search and verify relevance
Modify an LSF file and verify search reflects the change

…l for search

Vaneez added 5 commits February 5, 2026 01:35

added local lucene vecor db, vecorization and query, improved mcp too…

e5c7e03

…l for search

fix

fc38897

fix2

370acb9

added model and tokinizer downloader and init task

d689a04

final fix, vector search works

d338dcf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Local rag#88

Local rag#88
BenBritons wants to merge 5 commits intolsfusion:masterfrom
BenBritons:local-rag

BenBritons commented Feb 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

BenBritons commented Feb 8, 2026

Summary

What changed vs master

How it works

Technologies

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants