Lobotomizing HAL 9000 The Battle for AI's Post-Training Soul

Lobotomizing HAL 9000 The Battle for AI's Post-Training Soul

delimiterbob

We discuss two main approaches to ensuring artificial intelligence (AI) behaves as intended: surgical intervention and holistic alignment. Surgical methods involve directly modifying the AI's network to remove undesirabl…

Related tracks

See all