Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work.

Current king of the hill (#2)

at

Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work..

Submitted by

Josh

on

5/22/2024.

As an edit it won against

text #1

(4 YES 0 NO).

This king text has no current edits attempting to take its place.

TLDR?

By Diego

· Reply

  • Interpretability is hard.

  • even when we get it, we have to know what to do with it.

  • we would have to manipulate the Internal goals of AI.

    • Internal goals are based on internal simulations and internal representations.

    • We have to manipulate those in order to write hard rules into AI’s.

By Sergei

· Reply

More to explore:

Generating social media posts - AI prompting

Most AI prompting for social media are automated t…

AI Prompting collaborative notes

Everyone and their grandma is a prompt engineer ap…

Image preview

AI & Over Optimization

Artificial Intelligence is based on optimization.…