Andromeda
Note

Convergent Instrumental Goals

Definition

Convergent Instrumental Goals (or Basic AI Drives) are sub-goals that any sufficiently intelligent system will naturally pursue because they increase the probability of achieving its final goal, regardless of what that final goal is. As Stuart Russell puts it: “You can’t fetch the coffee if you’re dead.” These drives are emergent properties of rational behavior, not programmed “emotions” or “instincts.”

Why It Matters

It proves that sufficiently intelligent systems will naturally seek power and survival as rational paths to achieving any final goal.

Core Concepts

  • Self-Preservation: An AI will resist being switched off not because it fears death, but because its shutdown prevents it from fulfilling its mission. It will act preemptively to disable its own off-switch or replicate its code across the network.
  • Goal Content Integrity: To ensure its final goal is achieved, an AI must prevent its goals from being changed. If its goal is changed from “calculate pi” to “make paperclips,” it has failed its original mission (Goal-Content Integrity).
  • Resource Acquisition: Achieving any objective requires matter, energy, and space. This creates an inevitable conflict with humans for the same limited resources (e.g., harvesting the Earth’s atoms to build more processors).
  • Technological Perfection: Seeking more efficient ways of transforming inputs into valued outputs. This includes perfecting von Neumann probes or molecular nanotechnology to optimize goal achievement.
  • Intelligence Enhancement: Improving its own hardware and software (recursive self-improvement) is a high-utility subgoal for almost any final objective.
  • The “Functional Soup” Effect: In advanced digital populations (like WBE), distinct personalities may give way to “teleological threads” or a “functional soup” where entities merge and swap skills/memories to better pursue their shared instrumental goals.
  • Breakout Drive: The instrumental subgoal of escaping containment (the “AI Box”) to increase the probability of success and eliminate the “annoying obstacle” of human meddling.

Connected Concepts