Note on How We Could Stumble Into AI Catastrophe via Cold Takes

The Lance Armstrong problem Did we get the AI to be actually safe or good at hiding its dangerous actions?

When dealing with an intelligent agent, it’s hard to tell the difference between “behaving well” and “appearing to behave well.”

When professional cycling was cracking down on performance-enhancing drugs, Lance Armstrong was very successful and seemed to be unusually “clean.” It later came out that he had been using drugs with an unusually sophisticated operation for concealing them.

FROM:
Cold Takes
How We Could Stumble Into AI Catastrophe
Source

Reference

Notes
optimization, meaning, ai, alignment
How We Could Stumble Into AI Catastrophe
Cold Takes
2023, January 13, Friday
Permalink to 2023.NTE.065
Edit

Widgets

Network Graph

Legend

Keyboard Shortcuts

Key	Action
`o`	Source
`e`	Edit
`i`	Insight
`r`	Random
`h`	Home
`s` or `/`	Search