Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work.

Current king of the hill (#2)

at

Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work..

Submitted by

Josh

on

5/22/2024.

As an edit it won against

text #1

(4 YES 0 NO).

This king text has no current edits attempting to take its place.

TLDR?

By Diego

· Reply

  • Interpretability is hard.

  • even when we get it, we have to know what to do with it.

  • we would have to manipulate the Internal goals of AI.

    • Internal goals are based on internal simulations and internal representations.

    • We have to manipulate those in order to write hard rules into AI’s.

By Sergei

· Reply

More to explore:

Taking on the Camino for Ovarian Cancer

Shortcuts

  • Follow Ingrid’ Clancys progress step by step, on her picture blog at http://bit.ly/IngridClancy2024 (please “Follow” and “Comment” there to give her ...

How tribal divides form.

A grand fracture that divides. A schism in opinion on a massive scale. This article explores the dynamics of tribal divides. As this is a midflip article, pleas...

Running empty on dopamine by end of day

Did you know that your motivation and drive slowly deteriorate as the day progresses? The culprit behind this phenomenon is a tiny structure deep within your br...