SymbiOS
A multi-agent system for systemic therapy: every user gets their own Guardian AI, connected to partners through a consent-based knowledge graph, giving relationship insight no single shared chatbot can. It's not another chatbot wrapper. The architecture is the differentiation, and the architecture is the point.
SymbiOS is a frontier build on a deliberate, test-first track, the kind of system I hope becomes a clinical product over time. The reason it moves carefully is the architecture itself: scaling a multi-agent system, anonymising data before any model sees it, secure transfer of highly sensitive information across a consent boundary. Building this kind of system slowly and carefully is the responsible move, not a limitation.
The hard parts,
all at once.
Most AI products solve one hard thing. SymbiOS has to solve several at the same time: scaling a multi-agent system; anonymisation and strict data boundaries so the model never sees real identities; complex data flows between independent services; and the secure transfer of highly sensitive data across a consent boundary.
The moat is the combination, a Neo4j temporal graph, a per-user Guardian doing anonymised cross-referencing, and an asymmetric consent model, which competitors with a single shared AI can't copy quickly. That's the whole thesis: a technically advanced solution to genuinely hard problems, built hands-on.
The system, in parts
The architecture
Engineering honesty is part of the build: the measured ~3.2s time-to-first-token is acknowledged as too slow, with a planned move to faster inference and a semantic cache. Naming the gaps is how you keep a system trustworthy, especially one handling sensitive data.
Lean today. A clear plan to scale.
The whole multi-agent system runs on a single local stack (Postgres, Neo4j, Redis, API and client together), proving the architecture works and costs almost nothing while the hard problems are validated.
Around 70% of replies never touch a model. That template-first design is what keeps the AI bill near zero now and keeps it sane when the user count climbs.
The measured ~3.2s time-to-first-token is named as too slow on purpose, with a planned move to faster inference and a semantic cache so repeat-shaped queries skip the model entirely.
The five datastores are already separate concerns, so scaling means moving the graph, relational and cache layers onto their own infrastructure as load demands, not a rebuild. Spend tracks real usage.
Building something this hard?
SymbiOS is the kind of architecture that breaks teams in the handoffs: multi-agent, consent-gated, privacy-first, all at once. If that's the shape of your problem, let's talk.