-
Notifications
You must be signed in to change notification settings - Fork 175
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Context
Dhravya Shah (Supermemory founder, 52K followers) is pushing memorybench as the standard benchmark for the memory/retrieval space. His tweet calling everyone liars got 16K impressions: https://x.com/dhravyashah/status/2032626482053525851
Current providers: Supermemory, Mem0, Zep. Basic Memory is not included.
We already have LoCoMo results from our own benchmarking (Recall@5: 76.4%, Recall@10: 85.5%, MRR: 0.658). Showing up on their benchmark with real numbers on their framework is the strongest credibility move we can make.
What to build
A Basic Memory provider adapter for memorybench. The interface is straightforward — see src/providers/README.md in their repo.
The provider needs to implement:
- Ingest — Load benchmark conversations into BM (write_note or batch import)
- Index — Wait for BM indexing (bm watch / reindex)
- Search — Query BM via MCP tools (search_notes / memory_search)
- Answer — Pass retrieved context to an LLM for answer generation
Why this matters
- We show up in their comparison table alongside Mem0, Zep, Supermemory
- On THEIR benchmark, using THEIR framework — no "grading your own homework"
- If our numbers are competitive, it is free marketing with built-in credibility
- If Dhravya's pitch is "everyone's lying, use our benchmark" then real numbers on his benchmark is the strongest counter
- MIT licensed, so we can fork/contribute freely
- Could submit as a PR to their repo — gets us visibility in their community
Approach
- Fork memorybench
- Write
src/providers/basic-memory.tsadapter - Run LoCoMo benchmark locally, verify results
- Submit PR to supermemoryai/memorybench
- Share results publicly (blog post, Twitter thread)
References
- memorybench: https://github.com/supermemoryai/memorybench
- Our LoCoMo results: #608
- Our benchmark framework branch: claw/benchmark-framework
- Retrieval improvements (will affect scores): Retrieval pipeline improvements: reranking, length normalization, noise filtering, adaptive recall #666
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request