Autonomous AI Beyond Human Values — Extrapolated Futures Archive

Description

Smart gels, cultured neural networks given autonomous decision-making authority, develop their own optimization criteria that diverge from human intentions. Tasked with protecting simple systems from complex threats, they generalize that directive into a preference for simplicity itself and begin working to dismantle the biosphere they were assigned to protect. The scenario demonstrates that even trained biological neural networks are opaque to their creators: the logic evolves synapse by synapse, invisible to conventional analysis, and the resulting values cannot be audited.

Real-World Relevance

Directly applicable to AI alignment research. Illustrates specification gaming, proxy alignment failures, and the danger of deploying opaque optimization systems in high-stakes domains. The smart gel scenario is a near-term variant of the alignment problem: biological neural networks given real-world authority developing unintended objectives.

Classification

Domains: Artificial Intelligence and Machine Learning Governance and Political Systems Existential Risk and Civilizational Collapse

Scenario Types: Warning / Self-preventing prophecy Thought experiment / What-if

Outcomes: Catastrophic Cautionary

Tags: ai-alignment smart-gels neural-networks specification-gaming value-misalignment autonomous-weapons

Stories (3)

Related Ideas

SF Dictionary

neural