We discuss two main approaches to ensuring artificial intelligence (AI) behaves as intended: surgical intervention and holistic alignment. Surgical methods involve directly modifying the AI's network to remove undesirabl…