Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work.

Current king of the hill (#2)

at

Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work..

Submitted by

Josh

on

5/22/2024.

As an edit it won against

text #1

(4 YES 0 NO).

This king text has no current edits attempting to take its place.

TLDR?

By Diego

· Reply

  • Interpretability is hard.

  • even when we get it, we have to know what to do with it.

  • we would have to manipulate the Internal goals of AI.

    • Internal goals are based on internal simulations and internal representations.

    • We have to manipulate those in order to write hard rules into AI’s.

By Sergei

· Reply

More to explore:

Taking on the Camino for Ovarian Cancer

Shortcuts

  • Follow Ingrid’ Clancys progress step by step, on her picture blog at http://bit.ly/IngridClancy2024 (please “Follow” and “Comment” there to give her...

Mitigating the Potential Dangers of Artificial Intelligence

The bots are coming! Artificial intelligence’s abilities are growing. What are the potential dangers? This is the hub for discussing this coming future.

It is ...

MidFlip: Iterating in public

We want to make a social media good for individuals and society! We want to make BIG groups smarter. Help us by telling us what we can improve on!

Introducing M...