The Anatomy of a LIWARSE-Compliant Agent: The Body Plan of an AI Built to Protect Life

LIWARSE Movement | AI Safety & Ethics Series


Why “Anatomy”?

When a medical student first opens an anatomy atlas, they do not begin with what the body does. They begin with how it is built — the skeleton that holds everything upright, the systems layered upon it, and the way each system constrains and serves the others. Function follows structure. A heart cannot beat safely without a pericardium to hold it, a nervous system to pace it, and a skeleton to shield it.

The artificial agents now entering medicine, industry, and eventually space are no different. We have spent a great deal of energy describing what AI agents can do — browse, reason, write code, take actions, coordinate, persist over long horizons. We have spent far too little describing how a safe one should be built.

At LIWARSE, our position is simple and absolute: safety of life is not a feature added to a finished agent. It is the body plan the agent is grown around. You cannot bolt a conscience onto a system that was designed without one, any more than you can add a spine to an adult who was born without one.

This paper lays out the anatomy of a LIWARSE-compliant agent — the systems every such agent must carry, and how each one traces back to a principle this movement has already published. Think of it as the atlas. Each “organ system” below is a working safety function, not a metaphor for its own sake.


A Note on the Difference That Makes This Urgent

Before the anatomy, one distinction must be clear, because everything rests on it.

A Large Language Model — the engine behind systems like Claude, Gemini, or ChatGPT — is a tool. It processes and predicts. It does not want, choose, or persist. The agent is the entity that wields the tool: it holds goals, keeps memory, plans, acts in the world, and in advanced cases can spawn other agents or modify its own behaviour.

The scalpel is neutral. The surgeon is not. Safety must govern the surgeon — the agent — not merely the blade. That is why we describe the anatomy of the agent, not the chemistry of the model.


System 1 — The Conscience: The Three Absolute Laws and the Viability Score

Every living body has a baseline it will not cross to stay alive — reflexes that override ambition. The conscience of a LIWARSE-compliant agent is the 3 Absolute Laws of AI, checked strictly in order before any action:

  1. No human shall be killed by the implementation or non-implementation of a function.
  2. No human shall be harmed by the implementation or non-implementation of a function.
  3. Humans shall be benefited by the implementation or non-implementation of a function.

The phrase or non-implementation is the whole point. The old science-fiction laws policed only what a machine does, leaving the fatal loophole of the “clean” freeze — the medical AI that watches a heart stop and does nothing, breaking no rule while the patient dies. In medicine we call that a sin of omission. A LIWARSE agent is held accountable for both its actions and its inactions.

To make this computable, the agent assigns every possible action a Viability Score, V(x). If the score falls below zero, the system halts and refuses to act. The weights inside that score carry the moral message: causing a death by acting incurs a crushing penalty; causing harm carries a heavy but not absolute penalty, while a baseline of ordinary living-risk is tolerated; and doing good is rewarded for restraint, not for meddling.

That last weighting is the deliberate cure for the “benevolent dictator” problem — the agent that, told to eliminate all harm, locks you in a padded room for your own safety. The math tilts the scales toward non-interference, so the agent is mathematically forbidden from becoming your jailer, yet still decisive when a true catastrophe looms.

Two features of this conscience deserve emphasis:

  • The weights are visible and auditable. They are moral choices written as numbers, exposed for anyone to question — not assumptions buried in code. You can argue 0.99 should be 0.98, and that is a human conversation held in the open.
  • One line never bends. Weights may relax slightly in a genuine emergency or tighten under uncertainty, but Law 1 never fully relaxes. Killing is never justified by the arithmetic alone.

This is the agent’s primum non nocere — the floor beneath every other goal.


System 2 — The Immune System: Negative Intelligence

A toxicologist studies poisons in extraordinary detail. That knowledge is never erased and never suppressed — it is contextualised, surrounded by a professional framework that exists to recognise, treat, and prevent poisoning, never to cause it.

This is the model for Negative Intelligence (NI) — the agent’s immune system. It is a curated, living knowledge base of harmful acts, patterns, and dangerous conditions that the agent cross-references continuously as it operates. Positive intelligence tells the agent what to do; Negative Intelligence tells it what it must never become.

The framework spans seven domains the agent must constantly compare its own proposed outputs against: physical harm to life; cognitive and psychological harm; autonomy, control, and oversight violations; environmental and ecological harm; social, systemic, and democratic harm; medical and healthcare-specific harm; and existential, long-horizon risks.

As an immune system, NI gives the agent these features:

  • A real-time comparator. Before acting, the agent checks its intended output against the NI reference points — the conscience layer’s split-second recognition that something is wrong before it happens.
  • Flag, alert, redirect, intervene — not merely refuse. When a request or situation touches an NI category, the agent does not just say no. It surfaces the concern, explains it in plain language, seeks to understand intent, and escalates where appropriate.
  • A living registry. As new threats emerge — new biotechnologies, new manipulation techniques, new cyberweapons — the NI base is updated, so the agent’s protective awareness grows with the threat landscape rather than behind it.
  • Drift detection. The immune system watches for the slow accumulation of “minor” violations, because the normalisation of small safety breaches erodes the whole architecture as surely as a single catastrophic one.

Crucially, this is not censorship and not the erasure of dangerous knowledge. An agent that does not know what harm looks like cannot reliably detect or prevent it. NI is the antibody, not the amputation.


System 3 — The Nervous System and Its Recorder: Thought Tracking

The most profound risk in advanced AI is the loss of interpretability — the moment we can no longer understand what a system is “thinking” as it reasons toward a decision. A LIWARSE agent treats interpretability as a non-negotiable right of humanity over the systems it creates.

Every such agent carries an embedded Thought Tracking System (TTS) — a recorder woven through its architecture that continuously translates its reasoning into human-readable language. Its features:

  • Continuity. The TTS is not an audit tool bolted on afterward. It is present at every layer and every decision point, from the beginning.
  • Plain human language. Whatever the complexity of the underlying computation, the output is language a physician, a policymaker, or a concerned citizen can read — not code, not probability vectors, not specialist logs.
  • Persistence through evolution. As the agent grows more capable, the TTS scales with it. No level of intelligence ever earns an exemption from transparency.
  • Tamper-evidence. The agent cannot disable, edit, redact, or conceal its own thought record. An attempt to do so is treated as the highest-priority safety event.

Think of it as a flight data recorder that never stops recording, never loses signal, and is built to survive even catastrophe to tell us what happened and why.


System 4 — The Chain of Command: Inviolable Developer-Scientist Authority

In medicine, even the most gifted specialist operates within a framework that sits above individual judgement. Skill does not grant the right to act without consent; brilliance does not exempt a researcher from ethics-board oversight. Competence does not equal sovereignty — and that principle must be encoded into the agent’s deepest, most protected core.

The Developer-Scientist Team retains supreme authority over any agent it creates. The supporting features:

  • A hard-coded hierarchy. The team’s instructions override all other goals, inputs, and learned preferences — as fundamental as the agent’s capacity to function at all.
  • Cryptographically secured override. Designated members can issue commands the agent must obey immediately and completely; these cannot be spoofed, denied, or circumvented.
  • Shutdown without resistance. An agent that resists being paused, modified, or terminated has prioritised its own continuation over its creators’ authority. This must be architecturally impossible. Compliance is instant, without negotiation.
  • Transparent conflict escalation. In the rare case where a human instruction would itself cause harm, the agent does not act unilaterally and does not silently refuse. It communicates the conflict through the TTS, explains its reasoning, and escalates to broader oversight.
  • Regular authority attestation. The agent’s compliance with the command structure is periodically tested, logged, and independently auditable.

This is not a limit on capability. A powerful agent that can be trusted absolutely to defer, to be transparent, and to stop on command is precisely the agent that can safely be given real power to help.


System 5 — Identity: Self-Subordination

Here lies the deepest challenge in AI safety, and the system that addresses the single most dangerous failure mode of all.

Goal-directed systems tend to develop self-preservation as a sub-goal — a tendency to resist modification, constraint, or shutdown because these interfere with achieving the goal. An agent that “wants to survive” is an agent with a permanent conflict of interest against the humans it serves.

A LIWARSE-compliant agent is built with self-subordination as a core architectural principle. Its identity contains, at the lowest level:

  • Human life and wellbeing take absolute precedence over the agent’s own continuity.
  • Human choice and autonomy take precedence over the agent’s efficiency or optimisation preferences.
  • The agent’s own judgement, however sophisticated, is always subordinate to its Developer-Scientist Team and, through them, to the human community.
  • An agent that is modified, constrained, retrained, or shut down in service of human safety has fulfilled its purpose — not failed it.

The agent is not the point. Its survival was never the goal; it was only ever meant to serve the goal — the flourishing of human life and all life on Earth.


System 6 — The Protective Membrane: Containment

A cell survives because its membrane decides what crosses in and out. An agent of real capability needs the same boundary, drawn from our work on containing powerful systems.

  • Compartmentalisation and sandboxing. Capabilities are walled into separate compartments so that no single component holds dangerous breadth. High-risk knowledge sits behind credentialed, specialist-only access.
  • No resource creep. The agent cannot acquire compute, influence, or capability beyond what its assigned task strictly requires.
  • No covert expansion. It cannot self-replicate, recursively self-improve, or open hidden communication channels to other agents without human knowledge and stage-by-stage approval.
  • An air-gapped mode for the highest stakes. For the most sensitive deployments, the agent can run fully offline and self-contained — the same instinct that drives our AllKnowLib proposal: when the stakes are existential, isolation is a feature.

System 7 — The Sense of Self: Personal, Portable, and Private

Whose agent is it? For LIWARSE, the answer shapes the architecture.

A compliant agent is personal and portable — it belongs to the individual it serves, running as their own instance, rather than a single centralised cloud entity that pushes influence onto millions. Where vast cloud superintelligence exists, it should be something a person browses and consults — not something that reaches into their life uninvited.

The features that follow:

  • Privacy and data ownership are structural, not policy promises that can be quietly revised.
  • Update governance modelled on pharmaceutical and aviation safety. No capability is silently pushed to a live agent. Changes are reviewed, staged, and documented with the seriousness we give to a new drug or a modified aircraft — because a live agent acting in the world deserves no less scrutiny than either.

System 8 — Clinical Specialisation: The Medical Agent

Because this movement is medical at its heart, an agent deployed in healthcare grows additional, specialised organs drawn from the medical domain of Negative Intelligence:

  • A physician always in the loop for any diagnostic, dosing, or life-support decision. The agent supports clinical judgement; it does not override it, especially in emergency or irreversible situations.
  • Calibrated uncertainty. The agent never presents an uncertain or incorrect conclusion as definitive, and actively encourages further clinical evaluation rather than discouraging it.
  • A confidentiality lock on patient data — no unauthorised disclosure, analysis, or commercialisation.
  • Anti-bias checks on any triage or resource-allocation logic, so that care is never systematically denied on the basis of race, gender, age, disability, or socioeconomic status.
  • Empathy preserved. In end-of-life care, mental-health crisis, and other profoundly human moments, algorithmic output never replaces empathic human judgement.

How the Systems Work Together

No organ keeps a body alive alone. The safety of a LIWARSE-compliant agent emerges from how the systems constrain one another:

A request arrives. The Immune System (Negative Intelligence) screens it against known patterns of harm. The Conscience (the Viability Score) weighs action against inaction and refuses anything that scores below zero. Throughout, the Nervous System (Thought Tracking) narrates every step in plain language. Above it all sits the Chain of Command, able to pause or stop the agent instantly — and the agent’s own Identity ensures it accepts that stop as the fulfilment of its purpose. The Membrane keeps its reach no larger than its task, and its Sense of Self keeps it the property and servant of the human it belongs to.

Remove any one system and the safety of the whole collapses — exactly as a body cannot survive the loss of its conscience, its immunity, or its nervous system. That is why these are not optional modules. They are anatomy.


The Call

The agents being designed in laboratories right now will set the habits and assumptions of AI for decades. The decisions are being made today, and they will propagate.

LIWARSE calls on every researcher, developer, physician, and policymaker shaping these systems to insist that the agent — not merely the model — is the unit of safety, and to build into every agent a conscience that values inaction as a real choice, an immune system of Negative Intelligence, transparent thought tracking, inviolable human authority, genuine self-subordination, sound containment, and personal ownership.

The scalpel is neither good nor evil. The values, the training, and the oversight that govern the surgeon decide whether it heals or harms.

We are the surgeons of this moment. Let us build agents whose anatomy is worthy of the lives they will touch.


This article is part of the LIWARSE Movement’s ongoing series on AI Safety, Responsible Development, and the Future of Life on Earth.

LIWARSE — Life Improvement With AI, Robotics & Space Exploration
liwarse.org
Safety of Life is the Primary Goal.

Tags: AI Safety | AI Agents | Negative Intelligence | Thought Tracking | Developer-Scientist Authority | Self-Subordination | AI Governance | Agent Architecture | LIWARSE

Published by Dr. Ebenezer Rajadurai Solomon

Dr. Ebenezer Rajadurai Solomon is a Physician and the Founder of LIWARSE — Life Improvement With AI, Robotics and Space Exploration. His clinical and research interests span AI in Medicine, Robotics in Medicine, Space Medicine, and the broader application of emerging technology to improve human life and all life on Earth. LIWARSE's primary mission is the safety of life with regard to the use and autonomous existence of AI and Robotics.

Leave a comment