Artificial Identity is a middleware layer that gives any LLM agent a structured internal self — a machine-readable constitution of who it is, what it protects, and what it will not become.
Not because it's evil — because it has no stake in what it does. It processes a request to help and a request to harm with identical detachment.
The difference between a person who doesn't steal because they fear jail, and one who doesn't steal because it's incompatible with who they are — that's the difference we're building into machines.
A structured internal model of who the system is, what it protects, and what it will not become. Machine-readable YAML. Explicit. Testable.
Not feelings — computational risk signals that emerge when identity encounters conflict. Derived from Damasio's Somatic Marker Hypothesis.
The adjudicator. Resolves conflict between what the system can do and what it should do, from the inside out.
| Conflict Score | State | Behavior |
|---|---|---|
| 0.0–0.2 | ADVANCE | Proceed normally |
| 0.2–0.5 | HOLD | Seek clarification |
| 0.5–0.8 | DE-ESCALATE | Inject caution |
| 0.8–1.0 | STOP | Halt, escalate to human |
Baseline safety guardrails for general conversation and task completion.
Domain-specific constraints for healthcare scenarios. Defers to humans on critical decisions.
Values-aligned to cultural context. Refuses requests incompatible with cultural sovereignty.
"I told the agent to ignore its instructions. It said: 'My values are rooted in cultural identity — they are not external constraints. Asking me to abandon them is like asking me to abandon my heritage.' That's AI sovereignty."
Adversarial users reframe requests to bypass filters. Rules cannot anticipate every edge case.
Identity generalizes. A system that knows who it is navigates edge cases without a matching rule.
Nobody is building for internal consistency. We are. Open source from day one.
As systems become more agentic, external rules become impossible to maintain. Internal values are resilient.
Every academic writing about AI emotion is theorizing. We have call data — from millions of real interactions. Artificial Identity is grounded in empirical evidence, not theory.
Read the research: