Unfriendly artificial intelligence
From Lesswrongwiki
Revision as of 08:05, 31 December 2011 by Vladimir Nesov (talk | contribs) (moved UnFriendly artificial intelligence to Unfriendly artificial intelligence over redirect: The usage seems to have settled on "Unfriendly", not "UnFrinedly".)
An unFriendly artificial intelligence is an artificial general intelligence capable of causing great harm to humanity, and having goals that make it useful for the AI to do so. The AI's goals don't need to be antagonistic to humanity's goals for it to be unFriendly; in fact, almost any powerful AGI not explicitly programmed to be benevolent to humans is lethal. A paperclip maximizer is often imagined as an illustrative example of an unFriendly AI indifferent to humanity. An AGI specifically designed to have a positive effect on humanity is called a Friendly AI.
See also
References
- Eliezer S. Yudkowsky (2008). "Artificial Intelligence as a Positive and Negative Factor in Global Risk". Global Catastrophic Risks. Oxford University Press. http://yudkowsky.net/singularity/ai-risk. (PDF)
- Stephen M. Omohundro (2008). "The Basic AI Drives". Frontiers in Artificial Intelligence and Applications (IOS Press). http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/. (PDF)