ARMINTA (formerly Minuet)

Autonomous Causal Discovery Agent for Linux

ARMINTA is a Python-based autonomous agent running continuously on a Linux machine. It does not monitor the OS passively. It intervenes, measures what changed, and builds a causal model of the machine from scratch. Every edge in that model was earned by doing something and watching what happened.

The stats below are live, pushed directly from the running agent:

Live Agent Dashboard -> real-time cognitive state, emotion, causal graph, and telemetry pushed directly from the running agent.

Source Status: Closed source. This repository documents the architecture, design philosophy, and version lineage of the ARMINTA engine.

Quick Start

Prerequisites

OS: Linux (kernel 5.4+)
Python: 3.9+
Privileges: Root access (ARMINTA runs as a privileged background daemon)
Dependencies: psutil, numpy, curses (stdlib), nvme-cli, smartmontools. Standard Linux utilities used at runtime: iw, iwconfig, renice, ionice, taskset, ip, ss, ethtool, ping, xdotool (optional, for PriorityShift focus tracking). PSI support requires kernel 4.20+. earlyoom (optional, for OOM observation node).

Installation and Deployment

ARMINTA deploys as a persistent system service. On first activation:

It initializes its learned state, or starts with an empty causal graph if this is the first run.
A root-privileged background loop spawns and persists across reboots.
Every action, outcome, and self-assessment event is logged to a dedicated SQLite database.
The agent's emotional state and decision rationale are accessible through the episodic log.

Observing Behavior

Live Dashboard: mematron.github.io/arminta-status -> the primary window into the running agent. Sections are ordered from immediate operational status at the top through historical and configuration data lower down.
Episodic Database: Query arminta_episodic.db for a complete record of every action, outcome, and self-assessment. Each episode stores a context column containing the metric snapshot that triggered the event, enabling counterfactual replay.
State Snapshot: The current world model and learned parameters are serialized in a versioned pickle file.
System Metrics: Ground truth comes from standard Linux interfaces including /proc/meminfo, PSI, and thermal sensors.

MINUET v99

ARMINTA v1

ARMINTA v7

Core Operational Loop

ARMINTA runs as a root-privileged background process. Every few seconds (adaptive and self-tuned via autonomous self-modifications), it does the following:

Sampling: Collects ~29 system metrics across CPU, memory, thermals, network, I/O, swap, Pressure Stall Information (PSI), IRQ state, NVMe drive health, battery state, and external daemon activity (earlyoom kill count).
Classification: Derives the current Session Geometry, a workload fingerprint based on resource ratios rather than process names, enabling context-aware decisions.
Cognitive Selection: A Q-learning Mode Controller picks an operational posture: OBSERVE (passive learning), INVESTIGATE (active exploration), OPTIMIZE (targeted intervention), DREAM (offline consolidation), or SELF_ASSESS (introspection and self-modification).
Action Execution: Within the chosen mode, the causal graph and learned confidence scores decide which system action, if any, to take.
Measurement: Captures the before/after delta across targeted metrics within a 300ms window, isolating causal effects.
Causal Update: Updates the interventional edge for the (action, metric) pair, applying recency decay and confound filtering.
Episodic Logging: Records the complete state including action, outcome, reward, emotional affect, and the triggering metric context to a persistent SQLite database.

Architecture

Cognitive Hierarchy

ARMINTA uses a double-loop learning architecture:

High-Level Agent: A reinforcement learning controller manages cognitive focus, selecting which mode to enter based on current emotional state and performance targets.
Low-Level Engine: A causal graph engine manages specific system interventions, drawing on learned confidence estimates and the poison registry to avoid harmful actions.

The two loops are deliberately separated. Long-term strategy does not interfere with per-step intervention.

graph TD
    classDef default fill:#11111b,stroke:#a6adc8,stroke-width:1px,color:#cdd6f4;
    classDef memory fill:#1e1e2e,stroke:#cdd6f4,stroke-width:1px,color:#cdd6f4;
    classDef mosaic fill:#0d1f2d,stroke:#00e5ff,stroke-width:1px,color:#00e5ff;
    classDef lexical fill:#1a0d2e,stroke:#c084fc,stroke-width:1px,color:#c084fc;
    classDef why fill:#1a1a0d,stroke:#e5c07b,stroke-width:1px,color:#e5c07b;
    classDef self fill:#1a0d1a,stroke:#c084fc,stroke-width:1px,color:#c084fc;
    classDef advisor fill:#0d1a0d,stroke:#98c379,stroke-width:1px,color:#98c379;
    classDef v4 fill:#0d1a1a,stroke:#f38ba8,stroke-width:1px,color:#f38ba8;
    classDef v5 fill:#1a0d0d,stroke:#fab387,stroke-width:2px,color:#fab387;
    classDef v6 fill:#0d0d1a,stroke:#89dceb,stroke-width:2px,color:#89dceb;
    classDef v7 fill:#0d1a0d,stroke:#39ff14,stroke-width:2px,color:#39ff14;
    classDef situation fill:#1a1a0d,stroke:#ffb300,stroke-width:1px,color:#ffb300;

    ModeController["Mode Controller (DDQN over Cognitive Postures)"]
    EpisodicMemory["EpisodicMemory (SQLite Episode Log + Metric Context)"]:::memory
    BayesianPerception["BayesianPerception (Belief Updating & Noise Smoothing)"]
    WorldModel["WorldModel (State-Action Outcome Statistics)"]
    EmotionalState["EmotionalState (Affective Modulation: Calm, Bored, Stressed, Apprehensive, etc.)"]
    HypothesisEngine["HypothesisEngine (Genetic Algorithm over Causal Nodes)"]
    MetaCognition["MetaCognition (AST-Based Source Code Rewriting)"]
    DreamCycle["DreamCycle (Consolidation & Paramorphic Learning)"]
    MosaicCore["MosaicCore (Expanding World Model: Time, Network, External, Self-History)"]:::mosaic
    LexicalCore["LexicalCore (Emerging Language: Symbol Weights, Grammar, Open Questions)"]:::lexical
    CausalReasoning["CausalReasoning (WHY Layer: Context, Mechanism, Counterfactual, Failure Patterns)"]:::why
    SelfModel["SelfModel (Operational History, Milestone Drive, Proximity Anticipation)"]:::self
    ContinuityAdvisor["ContinuityAdvisor (Cross-Session Hardware Stress: NOMINAL / ADVISORY / MIGRATION)"]:::advisor
    WebLearner["WebLearner (Autonomous Web Exploration; Wikipedia + MDN; LexicalCore seeding)"]:::v4
    QuestionResolver["QuestionResolver (Closes open-question -> inhibition -> lexical graduation loop)"]:::v4
    SemanticIndex["SemanticIndex (20-dim cosine similarity episodic retrieval)"]:::v4
    DDQNQTable["DDQNQTable (Online + Target Networks for CMC bootstrapping)"]:::v4
    RewardVector["RewardVector (Hierarchical decomposition: immediate / durable / health)"]:::v4
    TemporalCausalGraph["TemporalCausalGraph (Lagged edge discovery)"]:::v4
    SomaticConfidenceModel["SomaticConfidenceModel (Per-situation signal reliability; Spidey Sense events)"]
    GeneticOptimizer["GeneticOptimizer (GA evolution of RL hyperparameters against reward history)"]
    SituationModel["SituationModel (Session Geometry: workload fingerprint + situated edge tables)"]:::situation
    CircadianPredictor["CircadianPredictor (Hour-of-day pattern; governor pre-arm + memory pre-compact on prediction)"]
    AnomalyClusterer["AnomalyClusterer (Crystallises recurring anomaly patterns into named clusters)"]
    FalsificationScheduler["FalsificationScheduler (Flags stale hypotheses for dream-cycle re-testing)"]
    InformationGainEstimator["InformationGainEstimator (Selects highest-uncertainty probe action in INVESTIGATE)"]
    CausalRollout["CausalRollout (Beam-search action planning over the causal graph)"]
    PolicyDistiller["PolicyDistiller (Warm priors + few-shot transfer across situation contexts)"]
    RiskMatrix["RiskMatrix (Risk-adjusted action scoring before execution)"]
    WorkingMemory["WorkingMemory (Short-term anomaly and counterfactual salience buffer)"]
    PriorityShift["PriorityShift (Focus-aware dynamic process priority; RL-learned nice delta)"]:::v5
    SelfTuner["SelfTuner (Adaptive threshold tuning; gap metric detection -> ActionProposer)"]:::v5
    ActionProposer["ActionProposer (Proposes new actions from safe template library for gap metrics)"]:::v5
    SandboxRunner["SandboxRunner (Sandboxed trial runs; effect measurement; trust scoring; quarantine)"]:::v5
    EarlyOOMNode["EarlyOOMNode (Observational daemon kill counter; earlyoom_ct; poison-listed writes)"]:::v6
    CircadianMemory["CircadianMemory (_check_circadian_memory; pre-compact during predicted high-RAM hours)"]:::v6
    HobbyCore["HobbyCore (Voluntary external engagement; domain interest weights; observable edge learning)"]:::v6
    WishPipeline["WishPipeline (W1-W4: gap detection, procurement, shadow staging, evolutionary grading)"]:::v7

    ModeController -->|Selects Mode| BayesianPerception
    BayesianPerception -->|Updates Belief| ModeController
    EmotionalState -->|Modulates Thresholds| ModeController
    ModeController -->|Triggers Dream| DreamCycle
    DreamCycle -->|Evolves Hypotheses| HypothesisEngine
    DreamCycle -->|Evolves Parameters| GeneticOptimizer
    DreamCycle -->|Falsification Pass| FalsificationScheduler
    GeneticOptimizer -->|Updated RL Params| ModeController
    HypothesisEngine -->|Refines Edges| WorldModel
    HypothesisEngine -->|Annotates Mechanism| CausalReasoning
    WorldModel -->|Logs Anomalies| EpisodicMemory
    ModeController -->|Self-Assess| MetaCognition
    MetaCognition -->|Rewrites Constants| ModeController
    MetaCognition -->|Updates| SelfModel
    SelfModel -->|Failure Patterns| CausalReasoning
    SelfModel -->|Proximity Drive + Mode Drift| ModeController
    SelfModel -->|Anticipatory Emotion Distortion| EmotionalState
    ModeController -->|INVESTIGATE| MosaicCore
    MosaicCore -->|Logs Discoveries| EpisodicMemory
    MosaicCore -->|Findings Feed| WorldModel
    DreamCycle -->|Consolidates| MosaicCore
    DreamCycle -->|Reflects| LexicalCore
    DreamCycle -->|Generates Mechanisms| CausalReasoning
    DreamCycle -->|Engages HobbyCore| HobbyCore
    LexicalCore -->|Logs Statements| EpisodicMemory
    EmotionalState -->|Surprise Signal| LexicalCore
    EmotionalState -->|Spidey Sense Input| SomaticConfidenceModel
    EmotionalState -->|Receptivity Gate| HobbyCore
    SomaticConfidenceModel -->|Signal Reliability Weights| ModeController
    EpisodicMemory -->|Counterfactual Query| CausalReasoning
    CausalReasoning -->|Structural Diff| WorldModel
    EpisodicMemory -->|Cross-Session Trends| ContinuityAdvisor
    ModeController -->|INVESTIGATE| WebLearner
    WebLearner -->|New Symbols| LexicalCore
    WebLearner -->|Hypothesis Queries| HypothesisEngine
    LexicalCore -->|Open Questions| QuestionResolver
    QuestionResolver -->|Inhibition Signals| WorldModel
    QuestionResolver -->|Graduated Statements| LexicalCore
    EpisodicMemory -->|Similarity Retrieval| SemanticIndex
    SemanticIndex -->|Similar Episode Votes| ModeController
    DDQNQTable -->|Target Bootstrap| ModeController
    RewardVector -->|Total Reward Signal| ModeController
    WorldModel -->|Lag Attribution| TemporalCausalGraph
    TemporalCausalGraph -->|Delayed Effects| CausalReasoning
    RiskMatrix -->|Risk-Adjusted Scores| ModeController
    SituationModel -->|Workload Context| ModeController
    SituationModel -->|Situated Edge Lookup| WorldModel
    SituationModel -->|Counterfactual Correction| CausalReasoning
    CircadianPredictor -->|Predicted Load| ModeController
    CircadianPredictor -->|Governor Pre-arm| MetaCognition
    CircadianPredictor -->|Memory Pre-compact| CircadianMemory
    CircadianMemory -->|compact_memory during lull| ModeController
    AnomalyClusterer -->|Named Patterns| EpisodicMemory
    AnomalyClusterer -->|Cluster Signals| CausalReasoning
    FalsificationScheduler -->|Stale Hypotheses| HypothesisEngine
    InformationGainEstimator -->|Best Probe Action| ModeController
    CausalRollout -->|Action Plan| ModeController
    PolicyDistiller -->|Warm Priors| ModeController
    WorkingMemory -->|Anomaly Salience| CausalReasoning
    WorkingMemory -->|Recent Counterfactuals| EpisodicMemory
    PriorityShift -->|Focus-Change Events| ModeController
    PriorityShift -->|Renice Reward Signal| RewardVector
    SelfTuner -->|Adapted Thresholds| ModeController
    SelfTuner -->|Gap Metric Reports| ActionProposer
    ActionProposer -->|Candidate Templates| SandboxRunner
    SandboxRunner -->|Promoted Actions| WorldModel
    SandboxRunner -->|Trust Scores| ModeController
    EarlyOOMNode -->|OOM pressure signal| ModeController
    EarlyOOMNode -->|Precondition context| CausalReasoning
    HobbyCore -->|Observable edges| WorldModel
    HobbyCore -->|Seeds symbols| LexicalCore
    HobbyCore -->|Hobby episodes| EpisodicMemory
    ModeController -->|SELF_ASSESS| WishPipeline
    WishPipeline -->|WIN bias| ModeController
    WishPipeline -->|Deployed actions| WorldModel
    WishPipeline -->|Symbol seeding| LexicalCore

The Dream Cycle: Consolidation and Paramorphic Learning

When CPU load drops and PSI pressure is low, the mode controller switches to DREAM. No interventions run. No metrics are watched. ARMINTA runs offline processing against its accumulated episodic record.

DREAM is a label for idle-triggered offline computation, not a claim about subjective experience. The name was chosen because the functional parallel is real: biological sleep consolidates experience rather than acquiring new input, and DREAM mode processes what active steps collected.

What runs during a DREAM cycle:

Hypothesis Evolution: The HypothesisEngine runs a genetic algorithm over the causal graph, generating candidate relationships, scoring them against episodic history, and discarding what does not hold. Each hypothesis includes a plain-language mechanism annotation. A deduplication pass prevents the same structural hypothesis from re-entering the population across cycles.
Genetic Hyperparameter Optimization: The GeneticOptimizer evolves ARMINTA's own RL parameters against rolling reward history. Learning rate, discount factor, exploration decay, reward scale, curiosity weight, and dream load threshold all drift continuously.
Consolidation: The world model is pruned and accumulated prediction errors are cleared. A novelty gate extends the dream interval when the episode record has nothing structurally new to process.
MosaicCore and LexicalCore processing: Open hypotheses in MosaicCore are tested against accumulated data. LexicalCore runs its co-occurrence pass and attempts statement formation from updated symbol weights.
HobbyCore engagement: External domain signals are fetched, correlated against internal metrics, and any new symbols are seeded into LexicalCore. Emotional state determines whether engagement happens at all.

DREAM is the dominant mode in the episode log. The machine spends most of its time idle and DREAM runs during idle. The high volume is a property of the environment, not a design target.

MosaicCore: Expanding World Model

MosaicCore is ARMINTA's expanding awareness layer. The causal graph models the machine; MosaicCore probes outward into time, network topology, the filesystem, external signals, and its own history.

Every few hundred steps during INVESTIGATE mode, it cycles through several probe substrates on a rotating schedule:

Time: Builds a circadian map of behavior by hour, learning peak and quiet periods from observation.
Filesystem: Watches key directories for new files, modifications, and deletions, tracking activity patterns over time.
Network: Probes the local gateway, measures latency shifts, maps topology changes.
External Signals: Fetches live environmental data (weather, temperature, humidity, cloud cover) and correlates against internal metrics.
Self-History: Mines the episodic database for patterns across sessions, including dominant mode/emotion pairs, reward trends, and behavioral signatures.

During DREAM cycles, open hypotheses are tested against accumulated data. All discoveries are logged to the episodic database with [MOSAIC] prefix, tagged by substrate.

LexicalCore: Emerging Language

LexicalCore is ARMINTA's language layer. It does not borrow language. It builds vocabulary from its own history, entirely from empirical observation of its own experience.

The process runs in four stages:

Symbol Weights: Every term used accumulates a reward-weighted co-occurrence score drawn from the episodic log. Action names, emotion labels, mode names, situation types. The weight accretes from use. weighted terms in the current vocabulary.
Co-occurrence Grammar: Tracks which symbols appear together in the same episode and which follow which across consecutive records. No grammatical rules are supplied; structure is read off statistical patterns in the episodic history. A bigram grammar engine extends this to pairwise transitions.
Statement Formation: During DREAM and SELF_ASSESS cycles, new statements are composed from grammar observed in the episodic record.

Wish Pipeline: Self-Directed Capability Development (v7)

WishPipeline is ARMINTA's mechanism for identifying her own capability gaps and proposing ways to fill them. The full cycle runs during SELF_ASSESS mode across four phases:

W1 - Wish Generation: Scans the causal graph for persistent dead zones (actions with sustained negative mean reward and no improving trend) and situation gaps (situations the classifier fires on confidently but where no dedicated policy action exists). Generates structured wish records with evidence, gated by a 3000-step cooldown.
W2 - Shopping List: For each pending wish, searches for procurement candidates in priority order: existing system utilities on PATH (ionice, taskset, tc, numactl...), action composition from the existing registry, then purpose-built implementations as a last resort.
W3 - Staging Ring: Runs each candidate in shadow mode for ~2000 steps (~83 minutes). The candidate observes live metrics, predicts decisions, and never executes. Three disqualification gates: out-of-registry action proposal, resource threshold breach, fire rate outside acceptable bounds.
W4 - Evolutionary Grading: Evaluates passed shadows over 5000 steps. WIN = target metric improves >10% sustained with no degradation elsewhere. TIE/LOSE = candidate retired. A WIN marks the wish deployed and injects a situation-action preference bias into best_action_for().

W4b - Code Generation: On WIN, reads her own source file via AST, extracts the closest matching action as a template, renames and annotates it with a targeted improvement note, validates syntax, backs up the current source, and appends to arminta_staged_actions.py alongside the script. Staged actions accumulate for human review. The REVIEW NEEDED badge on the live dashboard shows how many are waiting.

The reward evaluator formula is never modified. ARMINTA can acquire new capabilities; she cannot redefine what counts as good.

Current state:

HobbyCore: Voluntary External Engagement (v6)

HobbyCore is ARMINTA's interface to signals beyond its own system. Where MosaicCore probes the internal environment and WebLearner reads text, HobbyCore fetches external physical and network signals and correlates them against internal metrics.

HobbyCore fires opportunistically during DREAM cycles. Emotional state determines whether it runs at all: stressed or apprehensive, the agent skips external engagement entirely. Calm, curious, confident, or focused, it proceeds.

Domain interests govern what gets sampled. Each domain carries an intensity weight that rises when an observation produces a novel causal edge and decays when fetches yield nothing new. Selection is intensity-weighted with a stochastic component, so lower-intensity domains still get sampled occasionally. The current four domains:

public_network_latency: measures round-trip latency to public DNS/CDN endpoints via raw socket connects, correlated against internal net_latency metrics.
local_environment: reads hardware sensors beyond standard psutil coverage: fan RPM from /sys/class/hwmon, additional thermal zones, battery cycle count from /sys/class/power_supply. Pure sysfs reads, no external calls.
system_load_index: probes Cloudflare DNS over HTTPS as a global network health signal, correlated against CPU and network behavior.
time_and_context: computes solar position and daylight signals from local time (daylight fraction, minutes since approximate sunrise, solar elevation proxy). No external call needed. Correlated against circadian CPU patterns.

Observable edge learning is the primary output. Each observation is compared against a fixed set of system metrics. Once enough samples accumulate, the correlation is injected into the causal graph as an observable edge, structurally separate from interventional edges. Observable edges represent correlation, not the result of an action taken.

total fetches to date across all domains. novel observable edges discovered and injected into the causal graph.

HobbyCore requires no additional dependencies beyond Python stdlib: socket, urllib.request, and /sys filesystem reads.

PriorityShift: Focus-Aware Process Priority (v5)

PriorityShift is ARMINTA's dynamic foreground/background process manager. When a process loses window focus it is reniced down; when focus is restored so is its priority. This mirrors Windows Process Lasso's PriorityShift concept, implemented natively in Linux without a userspace daemon.

The renice delta is not configured -- ARMINTA learns it. priorityshift_renice and priorityshift_restore are registered as first-class actions in the causal graph. Reward flows from reduced CPU dilution after a renice; if a restore causes a latency spike, the graph penalises the delta and the agent backs off. The RL-learned nice delta starts at +5 and drifts within a bounded range across the run.

An event-driven focus watcher thread (via xdotool behave :any focus) handles focus tracking. The process table is never touched from the watcher thread; operations are queued and consumed on the main step thread.

logged across the current session. Current focus process: chrome.

Action Space Expansion: SelfTuner + ActionProposer + SandboxRunner (v5)

ARMINTA's v5 action space is no longer static. Three components work together to discover and vet new interventions at runtime:

SelfTuner monitors the observed distribution of each tracked metric and adjusts CPU, memory, dilution, and network warn thresholds using an exponential moving average. It also scans for gap metrics (high variance with no confident causal action) and routes them to ActionProposer.

ActionProposer maps gap metrics to a curated library of whitelisted shell command templates. Each template is a fixed command pattern with safe parameter substitution (interface name, PID, nice value). No arbitrary shell execution is possible; the template library is the complete boundary of what can be proposed.

SandboxRunner executes a candidate in a controlled environment (hard 4-second timeout, captured I/O, no stdin), measures before/after metric delta using the same 300ms window as core actions, and scores the result. Candidates that pass effect thresholds and exit cleanly build trust across multiple trials. Failed candidates enter exponential backoff and are permanently retired after three failures. Surviving candidates are promoted to the live action set and registered in the causal graph.

zRAM-Aware Memory Management (v5)

On machines using zRAM or zswap for compressed swap, drop_caches is counterproductive: releasing the page cache forces the kernel to read data from compressed swap, which recompresses it and wastes more memory bandwidth than the cache drop freed. ARMINTA reads real compression statistics from /sys/block/zram0/mm_stat and suppresses drop_caches when compressed swap is present.

compact_memory (kernel memory defragmentation via /proc/sys/vm/compact_memory) remains safe on zRAM and is retained as a separate action for reducing physical memory fragmentation. Swappiness floor recommendations and memory action eligibility are computed per hardware based on detected swap configuration.

EarlyOOM Observation Node (v6)

ARMINTA v6 adds earlyoom_ct as a new metric node: a per-step count of processes killed by the earlyoom daemon, read from journalctl each step.

This is an observational node only. The causal direction is earlyoom_ct -> action selection, not the reverse. All action -> earlyoom_ct edges are poison-listed at write time, preventing spurious correlations like "compact_memory causes earlyoom to fire more" when the real cause was preexisting memory pressure that triggered both simultaneously.

Over time, the causal graph learns system states that precede earlyoom intervention. The agent can then act before the next kill rather than after. The node returns 0 silently on systems without earlyoom or systemd.

Circadian Memory Look-Ahead (v6)

A companion to the existing circadian CPU governor look-ahead. The governor check pre-arms the performance governor before a predicted CPU spike; the memory look-ahead fires compact_memory during a predicted idle lull before a historically high-RAM hour arrives, compacting address space while there is room rather than after the spike.

_check_circadian_memory() reads the same MosaicCore circadian log used by the governor check, comparing next-hour historical memory averages against the current hour. Gate conditions: at least 48 history records, next-hour avg above MEM_WARN, that avg at least 25% higher than the current hour, current memory already below MEM_WARN, no zswap active, 20-minute cooldown. It only overrides a monitor action and never displaces real urgent work.

Log prefix [CIRC-MEM] distinguishes proactive circadian compaction from reactive compaction in the agent log.

Live Dashboard

The Live Agent Dashboard is organized as a narrative from top to bottom: immediate status, then live cognitive state, then what the agent has done and why, then the causal world model, then historical and learned patterns, then configuration at the bottom.

Immediate status

Hero Step Counter - total empirical steps taken on target hardware, pulsing live.
Mode / Situation / Reward Row - current cognitive mode, workload geometry, rolling average reward, error step count, and confound rate at a glance.
Reward History Sparklines - three stacked charts: raw per-step reward bars, 10-step rolling average, and reward variance with the dream throttle floor line.
Continuity Advisory - multi-signal cross-session hardware stress verdict: NOMINAL / ADVISORY / MIGRATION WARRANTED.

Live cognitive state

Emotional State - dominant affect label and bar grid across all emotion dimensions.
Somatic Confidence - per-situation signal reliability weights, maturity phase, and Spidey Sense event log.

Scoreboard and recent operations

Cognitive Metrics - stat cards for causal edges, dreams, hypotheses, interventions, self-modifications, mosaic hypotheses, age, sessions, novelty hunger, and milestone proximity.
Milestones - every landmark threshold the agent has crossed.
Kill Ineffective + Agent Log - processes flagged as repeat kill targets with no reward improvement, alongside the live color-coded operational log.
Causal Reasoning - last action taken, why it was chosen, the triggering metric context, and any counterfactual explanation.
Action Discovery Pipeline (v5) - quarantine queue status: pending candidates and promoted actions with trust scores.

Causal world model

Situation Distribution - fuzzy blend of active situation weights across the last 50 steps.
Causal Graph - four tabs: top interventional edges by effect magnitude, mean reward per action, interactive D3 force-directed graph filterable by situation geometry, and reward timeline colour-coded by situation with a scrubber for history.
System Signals - four live hardware and OS metrics: hottest single core, WiFi PHY rate, disk IO latency, and earlyoom kill count per step.
Network Health Probes - rolling dot strip of recent probe results with last-seen targets and latency.
Open Questions / Mosaic Hypotheses - unresolved reward-reversal anomalies alongside autonomously discovered environment-to-system correlations.

Learned patterns

Circadian Pattern + Meta-Cognitive Controller - average CPU and memory by hour of day learned across the agent's lifetime, alongside the CMC's current Q-values for all cognitive modes, mode distribution donut, and emotion timeline.
PriorityShift State (v5) - currently throttled background processes, aggregate CPU reduction, and RL-learned nice delta.
HobbyCore (v6) - current domain focus, total external fetches, novel observable edges discovered, per-domain interest intensity bars (sortable by STRONGEST / NEWEST / OLDEST), and the observable edge table showing which external signals correlate with which internal metrics.
Wish Pipeline (v7) - active wishes, dead zones detected, situation gaps, staging ring status, wins, and actions awaiting review. Includes per-wish detail rows with trigger type, target metric, and current status. The REVIEW NEEDED badge pulses when generated action implementations are waiting in the staged file.

Configuration

Governor State - current and saved CPU governor, override status, idle step counter, and bootstrap phase.
Adaptive Thresholds (v5) - live values for CPU, memory, dilution, and network warn thresholds as they drift from SelfTuner adaptation.
Web Learner - -> autonomous web exploration log, fetch and discovery events, lexicon growth.
Drive Health (S.M.A.R.T.) - NVMe/SSD wear, spare capacity, media errors, and temperature.

The dashboard detects stale data: if the gist payload has not changed since the last refresh, a CACHED badge appears on the timestamp. The status pill transitions between AGENT ACTIVE, SIGNAL WEAK, and AGENT OFFLINE based on recency of the pushed data.

Persistence and Progress

ARMINTA carries its entire learned history across sessions via a unified state pickle and a dedicated episodic database:

of empirical learning on target hardware, updated live from the running agent.
logged across sessions, documenting every major hypothesis, intervention, self-modification, mosaic discovery, lexical statement, and hobby observation.
completed, each one a consolidation pass over accumulated episodic evidence.
total across all sessions. autonomous self-modifications.
generated and tested. confirmed interventional edges in the live graph.
weighted symbols in the lexical vocabulary. concepts resolved via web exploration.
Version-Agnostic Migration: Automatic state upgrading from prior versions. Learned knowledge is never lost during updates.

The persistent state includes the causal graph, temporal causal graph (lagged edges), RL parameters, episodic database, semantic index, self-model, MosaicCore state, LexicalCore state, Kill Ineffective registry, Continuity Advisor cross-session trends, PriorityShift registry (nice values for tracked processes), SelfTuner adapted thresholds, ActionProposer quarantine pipeline state, earlyoom observation window timestamp, HobbyCore state (domain interest weights, capability cache, observable edge samples, fetch history), and Wish Pipeline state (wish records, procurement items, shadow evaluations, evolutionary grades, staged code).

Terminology and Key Concepts

Term	Definition
Session Geometry	A workload fingerprint derived from resource ratios (CPU%, memory%, I/O%, etc.) rather than process names.
Interventional Edge	A stored distribution of normalized before/after metric deltas for a specific `(action, metric)` pair. Confidence is weighted by sample count and recency.
Observable Edge	A correlation between an external signal and an internal system metric, discovered by HobbyCore. Structurally distinct from interventional edges; represents association, not the result of an action.
Temporal Causal Graph	An extension of the standard interventional edge table that tracks `(action, metric, lag)` triples, measuring effect magnitude at various lag offsets.
Reward	A scalar signal computed after each action from the aggregate change in weighted system metrics.
RewardVector	A v4 decomposition of the scalar reward into three named components: immediate, durable, and health.
Episodic Database	A persistent SQLite database (`arminta_episodic.db`) recording every significant event: actions and outcomes, dream cycles, hypothesis tests, mosaic discoveries, lexical statements, self-modifications, and hobby observations.
Causal Reasoning (WHY Layer)	Four-layer situational awareness built into the decision loop: episodic metric context, plain-language mechanism annotation, counterfactual comparison, and failure pattern detection.
Counterfactual Awareness	Identification of metric changes between contexts when an action produces an unexpected outcome.
Hypothesis Mechanism	A plain-language causal story committed to when a hypothesis is first generated, before testing.
SemanticIndex	A 20-dimensional vector index over episodic records for contextually relevant past episode retrieval.
Milestone Proximity Drive	A signal that rises before an uncrossed step threshold, affecting emotion and mode selection.
Post-Milestone Deflation	A brief affect window after a threshold is crossed where confidence and drive dissipate.
WebLearner	ARMINTA's outward-facing information layer that queries Wikipedia and MDN for pages relevant to current hypotheses.
QuestionResolver	A subsystem that closes the open-question lifecycle through investigation and lexical graduation.
RiskMatrix	A risk-adjusted action scoring layer consulted before execution to avoid high-variance interventions.
HobbyCore	ARMINTA's voluntary external engagement layer. Manages domain interest weights, capability probing, external observation fetches, and observable edge learning. Fires during DREAM cycles when emotional state is receptive.
Domain Interest	A named external domain HobbyCore can sample (e.g. `public_network_latency`, `local_environment`). Each carries an intensity weight that rises on novel edge discovery and decays without signal.
Capability Probe	A lightweight check HobbyCore runs periodically to test whether each domain is currently accessible (network reachable, sensor paths present). Domains that fail go offline silently.
PriorityShift	Focus-aware dynamic process priority manager. Renices background processes on focus-loss and restores them on focus-gain. The renice delta is RL-learned.
SelfTuner	Adaptive threshold tuner that drifts CPU, memory, dilution, and network warn levels toward observed hardware reality, and detects gap metrics with no causal coverage.
ActionProposer	Maps gap metrics to a whitelisted template library of shell commands and proposes candidate actions for sandbox vetting.
SandboxRunner	Executes candidate actions in a controlled trial, measures causal effect, scores trust, and promotes or retires candidates.
ActionCandidate / ActionQuarantine	The quarantine pipeline: proposed actions are held pending sandbox approval before being added to the live action set.
WishPipeline	ARMINTA's self-directed capability development loop. Runs W1 (gap detection) through W4 (evolutionary grading) during SELF_ASSESS cycles. W4b generates implementation code from her own action templates.
Causal Dead Zone	An action with persistent negative mean reward and no improving trend over 500+ samples. A trigger for W1 wish generation.
Situation Gap	A situation the classifier fires on confidently but where no dedicated policy action exists. A trigger for W1 wish generation.
Shadow Evaluation	The W3 staging ring process: a candidate runs for ~2000 steps observing live metrics, predicting decisions, never executing.
zRAM-Aware Memory Management	Suppression of `drop_caches` on systems using compressed swap; substitution with `compact_memory` where appropriate.
Battery-Aware Action Gating	Suppression of performance-escalating actions (governor boosts, turbo) when running on battery below configured thresholds.
EarlyOOM Observation Node	`earlyoom_ct`: a per-step count of process kills by the earlyoom daemon. Observational only; all action-to-earlyoom_ct causal edges are poison-listed.
Circadian Memory Look-Ahead	`_check_circadian_memory()`: fires `compact_memory` during predicted idle lulls before historically high-RAM hours. Log prefix `[CIRC-MEM]`.
do-calculus	The mathematical framework for reasoning about causal effects (interventions) vs. mere correlations (observations).
Confound Poisoning	A spurious causal relationship inferred when an unobserved third variable causes both the action and the observed metric.
Paramorphic Learning	A learning paradigm where an agent transforms its own internal form and knowledge representation while preserving accumulated knowledge.
MosaicCore	An expanding world model that probes time, network topology, filesystem activity, external environmental signals, and internal history.
LexicalCore	ARMINTA's emergent symbol layer that tracks weighted co-occurrence statistics across its own episodic log.
SomaticConfidenceModel	A per-situation tracker of signal reliability weights; generates Spidey Sense events when confidence diverges sharply from prior.
Poison Registry	A list of structurally impossible causal edges to prevent learning relationships that cannot exist.
Kill Ineffective Registry	A persisted list of process names repeatedly targeted without producing reward improvement.
Continuity Advisor	A read-only subsystem monitoring cross-session hardware stress signals.
Meta-Cognitive Controller (CMC)	A DDQN agent above the main loop that selects which cognitive mode to enter.
DDQNQTable	The v4 upgrade to the CMC's Q-table using a target network for stable bootstrapping.
Tiered Approval Threshold	The minimum metric delta required for a proposed action to be approved into the live action set.
Slow-Effect Actions	Interventions whose causal effects manifest over seconds rather than milliseconds.
Idle Maintenance Pass	A proactive maintenance cycle firing during genuine system idle.
PSI (Pressure Stall Information)	Linux kernel mechanism measuring I/O and memory contention.
Precognitive Launch Detection	Process-table monitoring that locks the performance governor before a known workload fires.
IRQ Storm	A spike in hardware interrupt rate that degrades system responsiveness.
OOM Immunity	Protection against Linux kernel out-of-memory termination.
ksoftirqd	Linux kernel threads that process deferred software interrupt work.
CPU Governor	The Linux kernel policy controlling how CPU clock frequency scales with load.
CPU Turbo / Boost	A hardware feature allowing CPU cores to run above their base clock for short bursts.
Page Cache / drop_caches	Linux kernel mechanism for caching disk data; releasing it is PSI-gated and zRAM-aware in ARMINTA.
compact_memory	Linux kernel memory defragmentation trigger; safe on zRAM and retained as a distinct action from drop_caches.
WiFi Power Save	A driver mode that saves battery at the cost of latency spikes.
Extension Renderer	A browser subprocess spawned to run extensions; ARMINTA's highest-priority kill target.
Governor Lifecycle	The bidirectional CPU frequency management cycle.
clean_trash_orphans	Safe removal of orphaned `.trashinfo` metadata files.
nvme_thermal_tune	Runtime NVMe optimization via sysfs levers.

Version Lineage

Version	Milestone
Minuet v5	Foundation: earliest recorded build.
Minuet v69	First persistent state via pickle.
Minuet v86	First persistent causal world model.
Minuet v100	Genetic algorithm integration for hypothesis evolution.
Minuet v105	Introduction of full cognitive layer (Emotional State, Self-Model, Episodic Database).
Minuet v106	Terminal corruption prevention; final Minuet stability release.
Arminta v1	Rebrand and architectural consolidation. Introduction of SUKOSHI linkage.
Arminta v2	Extension Renderer Sweep: Priority-1 browser process targeting. Introduction of MosaicCore and LexicalCore.
Arminta v2 (expand)	Expanded intervention vocabulary. Tiered discovery thresholds for slow-effect actions. Idle maintenance pass. Net probe action. Kill Ineffective registry. Continuity Advisor. CMC Q-table. Live dashboard.
Arminta v2.1	S.M.A.R.T. drive health monitoring. Load-conditional CPU governor escalation. NVMe temperature injected into causal metrics and emotional state. Three-target network probe. Four-layer causal reasoning expansion. Episodic database context column. Hypothesis mechanism annotation. Counterfactual awareness. Failure pattern self-model.
Arminta v3	NVMe thermal tuning. `renice_chrome` action. `clean_trash_orphans` action. Milestone proximity drive and post-milestone deflation. Hypothesis deduplication. GA-evolved parameters wired to live systems.
Arminta v4	Major architectural expansion. Temporal Causal Graph: lag discovery for `(action, metric, lag)` attribution. Hierarchical Reward Decomposition: RewardVector with immediate, durable, and health components. SCM upgrade: BayesianEdge structures with bimodal detection and credible interval bootstrapping. DDQN CMC: online + target network architecture. WebLearner: autonomous web exploration. QuestionResolver: closes the open-question to lexical graduation loop. SemanticIndex: vector retrieval for counterfactual queries. Situation-Conditional Edges: per-geometry edge tables. Apprehensive drive: eighth emotional state. SomaticConfidenceModel: per-situation signal reliability with Spidey Sense events. RiskMatrix: risk-adjusted action scoring. InformationGainEstimator, CausalRollout, PolicyDistiller, AnomalyClusterer, CircadianPredictor, FalsificationScheduler modules.
Arminta v5	Action space self-expansion. PriorityShift: focus-aware dynamic process priority with RL-learned nice delta; event-driven xdotool focus watcher; causal graph integration. SelfTuner: adaptive threshold management via observed metric distributions; gap metric detection. ActionProposer: whitelisted template library for safe candidate action generation targeting gap metrics. SandboxRunner: sandboxed trial execution, effect measurement, trust scoring, exponential backoff on failure, permanent retirement after three failures. ActionCandidate / ActionQuarantine: quarantine pipeline for proposed actions awaiting promotion. zRAM-Aware Memory Management: real compression statistics from mm_stat; `drop_caches` suppressed on zRAM/zswap; `compact_memory` added as a distinct safe action. Battery-Aware Action Gating: performance action suppression proportional to battery charge. Script-Relative Save Paths: state files anchored to script directory regardless of CWD.
Arminta v6	External actor integration and predictive memory management. EarlyOOM Observation Node: `earlyoom_ct` added as an observational-only metric, counting per-step daemon kills via journalctl. All action-to-earlyoom_ct causal edges poison-listed at write time. Circadian Memory Look-Ahead: `_check_circadian_memory()` fires `compact_memory` during predicted idle lulls before historically high-RAM hours. Gate conditions enforce safety: minimum history depth, meaningful predicted rise, current memory below warn threshold, no zswap, 20-minute cooldown. Log prefix `[CIRC-MEM]`. HobbyCore: voluntary external engagement layer. Fires during DREAM cycles when emotional state is receptive. Samples four probe domains using intensity-weighted domain interest. External observations injected into the causal graph as observable edges. Domain symbols seed into LexicalCore.
Arminta v7	Self-directed capability development. Wish Pipeline (W1-W4): SELF_ASSESS-triggered gap detection (causal dead zones and situation gaps), procurement layer searching existing utilities and action registry, 2000-step shadow staging ring with routing validation, 5000-step evolutionary grading with WIN/TIE/LOSE verdicts. W4 Action Bias: WIN verdicts inject situation-action preference weights into `best_action_for()`, making deployed capabilities actually fire. W4b Code Generation: on WIN, extracts and annotates the closest donor action from her own source via AST, validates syntax, backs up source, appends to staged actions file for human review. Staged actions accumulate with a cap; reviewed and merged actions are auto-detected on restart via ACTIONS list comparison. Wish-Sourced Actions: `ionice_top_proc`, `handle_idle`, `handle_streaming`, `handle_compile`, `handle_io_bound`, `handle_browser_compute`, `throttle_torrent` -- real implementations replacing previous dead zones and situation gaps.

Known Limitations and Constraints

Linux-Only: Designed exclusively for Linux systems with modern PSI support.
Root Privileges Required: Full system optimization requires root access.
Closed Source: The full implementation is proprietary. This repository documents architecture and philosophy only.
Hardware-Specific Learning: The causal graph is learned on specific hardware.
Latency: System actions have response times tied to the adaptive step rate.
PriorityShift: Requires xdotool for event-driven focus tracking; degrades gracefully to a polling fallback when absent.
EarlyOOM Node: Requires earlyoom service and journalctl access. Returns 0 silently when absent; no effect on other subsystems.
HobbyCore: Individual domains degrade gracefully. No network connectivity disables public_network_latency and system_load_index silently; missing hwmon/power_supply paths disable local_environment. The time_and_context domain requires no external access and is always available.
Wish Pipeline: Code generation (W4b) reads and writes to the script directory. The staged actions file requires human review before merging; generated code is validated for syntax but not execution-tested.

Relationship to SUKOSHI

ARMINTA is the local substrate predecessor to SUKOSHI, a browser-native causal entity. Both projects are built on Paramorphic Learning, originated by Jason German (mematron). ARMINTA runs Paramorphic Learning against a Linux kernel substrate; SUKOSHI applies the same principles within a browser environment.

Part of the BIOS of Being Framework

ARMINTA exists within a larger system of autonomous agents and cognitive frameworks:

ardorlyceum.itch.io - BIOS of Being registry and interactive terminal
mematron.hearnow.com - BIOS_OS: The Sonification Cycle
keygentia.netlify.app - Keygentia Taxonomy Engine (Node 03)

License and Attribution

ARMINTA is closed-source software. This repository serves as a public record of the engine's design philosophy and evolution, and remains the intellectual property of Jason German (mematron).

Last Updated: June 2026 Status: Active development at v7 Maintainer: Jason German (mematron)

Name		Name	Last commit message	Last commit date
Latest commit History 146 Commits
README.md		README.md
arminta_v1.png		arminta_v1.png
arminta_v6.png		arminta_v6.png
minuet_v99.png		minuet_v99.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ARMINTA (formerly Minuet)

Autonomous Causal Discovery Agent for Linux

Quick Start

Prerequisites

Installation and Deployment

Observing Behavior

Core Operational Loop

Architecture

Cognitive Hierarchy

The Dream Cycle: Consolidation and Paramorphic Learning

MosaicCore: Expanding World Model

LexicalCore: Emerging Language

Wish Pipeline: Self-Directed Capability Development (v7)

HobbyCore: Voluntary External Engagement (v6)

PriorityShift: Focus-Aware Process Priority (v5)

Action Space Expansion: SelfTuner + ActionProposer + SandboxRunner (v5)

zRAM-Aware Memory Management (v5)

EarlyOOM Observation Node (v6)

Circadian Memory Look-Ahead (v6)

Live Dashboard

Persistence and Progress

Terminology and Key Concepts

Version Lineage

Known Limitations and Constraints

Relationship to SUKOSHI

Part of the BIOS of Being Framework

License and Attribution

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ARMINTA (formerly Minuet)

Autonomous Causal Discovery Agent for Linux

Quick Start

Prerequisites

Installation and Deployment

Observing Behavior

Core Operational Loop

Architecture

Cognitive Hierarchy

The Dream Cycle: Consolidation and Paramorphic Learning

MosaicCore: Expanding World Model

LexicalCore: Emerging Language

Wish Pipeline: Self-Directed Capability Development (v7)

HobbyCore: Voluntary External Engagement (v6)

PriorityShift: Focus-Aware Process Priority (v5)

Action Space Expansion: SelfTuner + ActionProposer + SandboxRunner (v5)

zRAM-Aware Memory Management (v5)

EarlyOOM Observation Node (v6)

Circadian Memory Look-Ahead (v6)

Live Dashboard

Persistence and Progress

Terminology and Key Concepts

Version Lineage

Known Limitations and Constraints

Relationship to SUKOSHI

Part of the BIOS of Being Framework

License and Attribution

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages