Flow Memory is an open-source autonomous AI agent operating system and local/testnet public-alpha preflight prototype.
Windows PowerShell:
git clone https://github.com/FlowmemoryAI/flow-memory.git
cd flow-memory
python -m venv .venv
.venv\Scripts\activate
pip install -e ".[dev]"
python -m flow_memory --json "Explore and report"Linux/macOS:
git clone https://github.com/FlowmemoryAI/flow-memory.git
cd flow-memory
python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"
python -m flow_memory --json "Explore and report"Launch paths:
python scripts/launch_local_agent.py --goal "Explore and report"
python scripts/launch_flowlang_agent.py examples/flowlang_agent.flow --goal "Run the declared agent"
pip install -e ".[dev,ml]"
python scripts/launch_neural_agent.py --backend tiny_torch --goal "Explore and report"
python -m flow_memory --neural tiny_torch --neural-live --json "Explore and report"
python -m flow_memory neural live step --backend tiny_torch --goal "Explore and report"
python -m flow_memory launch agent --template live-research --neural tiny_torch --ticks 5 --emit-visual --json
python -m flow_memory launch agent --flow examples/live_research_agent.flow --ticks 5 --emit-visual --json
python -m flow_memory launch runs list --json
python -m flow_memory launch runs replay <run_id> --json
python -m flow_memory launch runs export <run_id> --out artifacts/launch/bundles/<run_id>.json --json
python -m flow_memory launch bundle public-alpha --out artifacts/launch/bundles/public-alpha-local-demo.json --json
python -m flow_memory launch visual embodiment --run live-agent-supervisor --out dashboard/src/mock-data/live-neural-embodiment.json --json
python -m flow_memory launch finalize public-alpha --out release_evidence/public_alpha_launch_finalizer.json --json
python -m flow_memory cognition predict --goal "verify dashboard" --action "check mission-control route" --json
python -m flow_memory cognition tick --agent live-research --goal "verify dashboard is serving real Mission Control" --json
python -m flow_memory launch supervisor start --template live-research --neural tiny_torch --predictive-core --ticks 5 --emit-visual --json
python -m flow_memory cognition benchmark run --scenario dashboard-stale-server --trials 5 --json
python -m flow_memory cognition benchmark run --scenario all --trials 5 --json
python -m flow_memory cognition lessons consolidate --json
python -m flow_memory cognition metrics --json
python scripts/run_local_network.py --scenario all --json-out artifacts/network/local_network_report.json
python scripts/run_agent_learning_loop.py
python scripts/test_full_system.py --quick --json-out artifacts/full_system/quick_report.json
python scripts/run_local_network.py --scenario all --emit-visual-events --json-out artifacts/network/local_network_report.json
python scripts/export_visual_replay.py artifacts/network/local_network_report.json --out dashboard/src/mock-data/local-network-replay.json
python scripts/validate_visual_replay.py dashboard/src/mock-data/local-network-replay.json
python -m flow_memory compute plan --goal "Use budgeted local compute routing with dry-run settlement"Neural, neural-live, RL, and compute-market signals advise. Policy and approval gates remain authoritative.
Mission Control visual path:
python scripts/run_local_network.py --scenario all --emit-visual-events --json-out artifacts/network/local_network_report.json
python scripts/export_visual_replay.py artifacts/network/local_network_report.json --out dashboard/src/mock-data/local-network-replay.json
python scripts/run_local_api_server.py --host 127.0.0.1 --port 8765
python -m flow_memory launch visual embodiment --run live-agent-supervisor --out dashboard/src/mock-data/live-neural-embodiment.json --json
python -m flow_memory launch finalize public-alpha --out release_evidence/public_alpha_launch_finalizer.json --json
cd dashboard
npm run build
npm testMission Control is connected to local state/replay/API data, with mock fallback clearly labeled.
Live 3D Mode renders the neural embodiment as read-only local/replay telemetry; it is not an agent launcher, provider connector, settlement console, or policy bypass.
cd dashboard && npm run dev serves the real Mission Control replay UI at /mission-control with the run selector, neural embodiment panel, Live 3D Mode, GPU evidence status, and public-alpha finalizer status from local fixtures. It does not expose unsafe write/control endpoints.
Predictive Cognition is available in CLI/API/replay mode: agents encode current state, retrieve similar experience records, predict candidate outcomes, score counterfactuals, observe actual results, compute prediction error, and write lessons under artifacts/cognition/experiences/.
Predictive Learning Benchmark and memory consolidation are available in CLI/API/replay mode: repeated local scenarios write experience records, consolidate reusable lessons under artifacts/cognition/lessons/, reuse those lessons before later predictions, and export benchmark records under artifacts/cognition/benchmarks/.
The project now combines:
- FlowLang v0 agent declarations
- FlowIR manifests
- first-class AI agent profiles/state/goals/planning/execution
- layered memory and constitutional memory governance
- safe skill/tool execution seams
- local Economy V3 marketplace, escrow, settlement, disputes, slashing, reputation, receipts
- signed manifest/receipt/provenance prototypes
- SQLite durable storage
- internal API router and optional server seams
- Base Sepolia / ERC-4337 dry-run adapters
- sandbox hardening interfaces
- MCP/A2A/libp2p protocol seams
- dashboard scaffold and CI workflows
- Flow Memory Compute Market dry-run provider/route/quote/settlement simulation
- Live Agent Launchpad for one-command local neural-live agent runs and Mission Control replay artifacts
- Live Agent Operations registry for local run inspection, replay lookup, safe stop/no-op handling, and bundle export
- Bounded Live Agent Supervisor with heartbeat/status artifacts, continuation semantics, and Mission Control supervisor replay
- Mission Control run console for launchpad, operations, supervisor, and local-network replay fixture selection/status summaries
- Public-alpha local demo bundle export with replay references, docs, commands, release evidence, GPU status, and honest limitations
- Mission Control neural embodiment view for visible local neural runtime/session, loop phase, memory, learning, policy, supervisor heartbeat, and imported GPU evidence status
- Mission Control Live 3D Mode for read-only CSS 3D/WebGL-ready local neural embodiment telemetry with policy/approval authority intact
- Public Alpha Launch Finalizer evidence for local demo bundle, Live 3D mode, launch evidence, release decisions, and C:\tmp backup exclusion checks
- Predictive Cognitive Core for local deterministic world-state encoding, candidate action prediction, counterfactual scoring, prediction-error records, experience memory, FlowLang cognition blocks, read-only Mission Control cognition telemetry, and policy-gated learning metadata
- Predictive Learning Benchmark and memory consolidation for deterministic local scenario replay, before/after prediction-accuracy metrics, consolidated lessons, lesson reuse, repeated-mistake reduction, and policy-authoritative benchmark evidence
Public-alpha RC1 preflight adds clean-clone validation, an agent reliability gauntlet, asymmetric/DID signing seams, scoped API/auth/error contracts, typed dashboard mock API client, Base Sepolia dry-run artifacts, expanded contract security tests, optional Docker sandbox backend seam, storage replay scripts, adversarial economy simulation, and hashed release evidence. Flow Memory is production-shaped, not production-certified, not an AGI/sentience claim, not audited, and not mainnet-ready. It does not claim audited contracts, hardened sandboxing, production API authentication, safe real-funds custody, provider settlement, or trained production ML model performance.
PowerShell:
cd E:\FlowMemory\flow-memory
.\.venv\Scripts\python.exe -m pip install -e ".[dev]"If no virtual environment exists:
py -3 -m venv .venv
.\.venv\Scripts\python.exe -m pip install --upgrade pip
.\.venv\Scripts\python.exe -m pip install -e ".[dev]".\.venv\Scripts\python.exe -m pytest -q
.\.venv\Scripts\python.exe examples\flowlang_compile_demo.py
.\.venv\Scripts\python.exe examples\flowlang_runtime_demo.py
.\.venv\Scripts\python.exe examples\flowlang_economy_demo.py
.\.venv\Scripts\python.exe -m flow_memory --json "Explore and report"
.\.venv\Scripts\python.exe -m flow_memory --flow examples\flowlang_agent.flow --json "Run the declared agent"
bash scripts/verify.sh
.\.venv\Scripts\python.exe scripts\generate_deployment_plan.py
.\.venv\Scripts\python.exe scripts\base_sepolia_dry_run.py
docker compose config
forge build
forge test
git diff --check
.\.venv\Scripts\python.exe scripts\public_alpha_smoke.py --root .
.\.venv\Scripts\python.exe scripts\clean_clone_validation.py --root . --out release_evidence\clean_clone_validation.json
.\.venv\Scripts\python.exe scripts\validate_base_sepolia_artifacts.py --dir deployments\base-sepolia
.\.venv\Scripts\python.exe scripts\export_event_log.py
.\.venv\Scripts\python.exe scripts\replay_event_log.py
.\.venv\Scripts\python.exe scripts\verify_storage_integrity.py
.\.venv\Scripts\python.exe scripts\sandbox_smoke_test.py
.\.venv\Scripts\python.exe scripts\release_decision.py --target public-alphaObserved during the public-alpha RC1 preflight build:
- Python tests:
287 passed, 1 skipped - FlowLang compile demo: passed
- FlowLang runtime demo: passed
- FlowLang economy demo: passed
- CLI smoke: passed
- CLI
--flow: passed - deployment dry-run scripts: passed
- agent reliability gauntlet demo: passed
- adversarial economy simulation demo: passed
- clean clone validation: passed
- public-alpha release decision: passed
.\.venv\Scripts\python.exe -m flow_memory --flow examples\flowlang_agent.flow --json "Run the declared agent".\.venv\Scripts\python.exe examples\agent_profile_demo.py
.\.venv\Scripts\python.exe examples\agent_economy_v3_demo.py
.\.venv\Scripts\python.exe examples\agent_dispute_slashing_demo.py
.\.venv\Scripts\python.exe examples\signed_manifest_demo.py
.\.venv\Scripts\python.exe examples\storage_persistence_demo.pydocs/AI_AGENT_LAYER.mddocs/PUBLIC_ALPHA_QUICKSTART.mddocs/LIVE_AGENT_LAUNCHPAD.mddocs/NEURAL_LIVE_AGENTS.mddocs/PUBLIC_ALPHA_READINESS.mddocs/CLEAN_CLONE_VALIDATION.mddocs/TESTNET_PREFLIGHT.mddocs/RELEASE_GATES.mddocs/CONTRACT_SECURITY_TESTS.mddocs/DASHBOARD.mddocs/AUDIT_REPLAY.mddocs/ADVERSARIAL_ECONOMY_SIMULATION.mddocs/AGENT_ECONOMY_V3.mddocs/FLOWLANG_RUNTIME_INTEGRATION.mddocs/STORAGE.mddocs/SIGNED_MANIFESTS.mddocs/API_SERVER.mddocs/WEB3_ADAPTERS.mddocs/BASE_SEPOLIA_DEPLOYMENT.mddocs/SANDBOX_HARDENING.mddocs/PROTOCOL_GATEWAYS.mddocs/THREAT_MODEL.mddocs/SQUIRE_GOAL.mddocs/PRODUCTION_READINESS.mdBUILD_REPORT.mdFLOW_MEMORY_STATUS.mddocs/MISSION_CONTROL_QUICKSTART.mddocs/PREDICTIVE_COGNITIVE_CORE.mddocs/PREDICTIVE_LEARNING_BENCHMARK.md
- FlowLang remains v0/prototype.
- Economy V3 is local/testnet-ready architecture, not a live funds system.
- Contracts are unaudited.
- Signing uses local HMAC by default plus local deterministic asymmetric seams; production key custody is not implemented.
- Base Sepolia scripts produce dry-run payloads and artifacts only.
- Sandbox hardening includes profiles, receipts, policy checks, and an optional Docker backend seam; default local sandboxing is not hardened isolation.
- Protocol gateways are local/offline-safe seams, not production transports.
- Dashboard is a typed mock API scaffold, not a live operator console.
- Compute Market integration is local dry-run planning/routing only; it does not move funds, broadcast transactions, call providers, or imply settlement execution.
- Predictive learning benchmarks are scoped to deterministic local Flow Memory scenarios and do not provide broad external forecasting.
Flow Memory now includes an optional Neural Agent Layer v1 and a local neural-live runtime for public-alpha agents. The base install still has no PyTorch requirement. Install flow-memory[ml] to run tiny CPU-safe PyTorch prototypes for dual-stream perception, appearance-suppressed dorsal motion, tiny JEPA-style world modeling, advisory plan scoring, skill routing, risk scoring, and neural memory retrieval. Neural-live mode adds local runtime sessions, deterministic perception/prediction/plan/risk/learning telemetry, metadata-only checkpoints, and Mission Control replay signals. V-JEPA 2 and VideoMAE are adapter seams that require explicit local checkpoints; Flow Memory never downloads checkpoints automatically. Neural scores never override policy or approval gates.
Flow Memory now includes a dependency-free local HTTP API server for public-alpha operator testing. Run it with python scripts/run_local_api_server.py --host 127.0.0.1 --port 8765. Add --api-key dev-local-only --require-scopes to exercise local API-key and scope gates. This is not production internet authentication; it is a local server boundary for smoke tests, demos, and preflight tools.
This repo now includes Flow Arena, a dependency-free local RL environment layer for agent-economy decision training, plus GPU evidence import/release-gate seams. RL policies are advisory only; policy, approval, autonomy, and economy risk controls remain authoritative. Neural GPU validation evidence is stored as text/JSON metadata and hashes; raw checkpoint/model artifacts are not committed.
python -m flow_memory launch supervisor start --template live-research --neural tiny_torch --ticks 5 --tick-interval-ms 10 --emit-visual --json
python -m flow_memory launch supervisor status --json
python -m flow_memory launch supervisor heartbeat <run_id> --jsonThe supervisor is local-only, bounded, inspectable, and policy-gated. GPU-gated release targets use imported RunPod evidence and still require that evidence to remain verified.
Mission Control run console and demo bundle:
python -m flow_memory launch bundle public-alpha --out artifacts/launch/bundles/public-alpha-local-demo.json --json
python -m flow_memory launch finalize public-alpha --out release_evidence/public_alpha_launch_finalizer.json --jsonThe dashboard run selector can inspect Live Neural Agent Launch, Live Agent Operations, Live Agent Supervisor, Live Neural Embodiment, and Local Network Replay fixtures. The bundle is local-only and does not move funds, use private keys, broadcast transactions, or claim production ML certification.
Mission Control Live 3D Mode reads the same visible embodiment fixture and keeps it read-only/local-only; the finalizer records Live 3D readiness, public-alpha launch evidence, release decisions, demo bundle status, and the invariant that the C:\tmp backup is not tracked.
Predictive Learning Benchmark adds deterministic local scenarios for dashboard stale-server recovery, GPU evidence import, policy denial, compute-market dry-run, and git clean-commit behavior. It writes experience, lesson, and benchmark artifacts under artifacts/cognition/ and exposes Mission Control trend data through dashboard/src/mock-data/predictive-learning-benchmark.json.