Preskoči na vsebino

How Masar Works

Masar is neurosymbolic. A neural model proposes typed choices; a symbolic intermediate representation, a deterministic compiler, and a dual-verifier loop build the program and prove it correct. The guarantees come from the system around the model, not from the model itself.

The neurosymbolic loop

Orb is the symbolic core. Every system is an Orb program — a set of orbitals, where an orbital = Entity + Traits + Pages. Each trait is a state machine running a closed circuit: Event → Guard → Transition → Effects → UI response → back to Event. Orb has a formal type system and is the single source of truth every stage reads.

The model is only a proposer. Instead of emitting free-form code, the model chooses among declared, pre-verified behaviors and their typed parameters. Almadar maintains a library of behaviors that compose hierarchically (atoms → molecules → organisms); using one is a call site, and a small typed override surface (entity bindings, events, effects, configuration) rebinds it to a domain without changing its verified state machine. New capability comes from composing or extending behaviors — never from arbitrary code.

The compiler and verifiers decide correctness. A deterministic compiler resolves the composed behaviors into a flat program, validates it to zero errors and zero warnings, and generates the running system. Two independent verifiers gate every output: one checks the generated-code path, the other executes the program (a state walk plus visual checks). Because the agent can only invoke behaviors that exist and survive both verifiers, its action space is bounded — and the whole loop can run locally on a 1.5B model.

This framing follows the LLM-Modulo view of neurosymbolic systems — a neural generator paired with sound external verifiers (Kambhampati et al., 2024, arXiv:2402.01817).

The paradigms we borrow from

Masar is neurosymbolic at its core, but it draws on several research lines. Here is an honest map of what each contributes and where it actually sits:

ParadigmWhat it does for usStatus
Neurosymbolic / LLM-ModuloNeural proposer + symbolic IR, compiler, and verifier loopCore, production
Embedding retrieval routingPick the right behavior by cosine similarity of intent vs. behavior descriptions — no training, no driftProduction
Knowledge distillation (frontier → 1.5B)Replace a cloud frontier model with a self-hosted small adapter for the narrow authoring decisionsProduction path
Execution-grounded fine-tuningTrain on programs filtered by the real compiler/factory (dispatch + validate green), not on labelsProduction path — see Fine-tuning
Self-supervised / energy-based learning (VICReg, SIGReg)The objective family behind our world-model experimentsResearch
World models / JEPAPredict build outcomes before dispatch (predict-before-act)Parked / gated — see The JEPA bet
GFlowNetsStep-by-step structural construction proportional to a rewardExperimental — largely superseded by the deterministic factory layer, which guarantees structure by construction
Graph learning (GNN / R-GAT)Read wiring/topology when the behavior set is fixedResearch — built and validated on planted tests, held in reserve
Typed-IR-constrained decodingMask invalid tokens so generation stays inside the Orb grammarResearch / partial

The throughline: leverage comes from the typed representation and the verifiers, with the neural model kept small and bounded. Everything above is a way to make that proposer cheaper, more accurate, or more local — not a way to hand it more freedom.

Next steps