Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work.

Current king of the hill (#2)

at

Implementing Asimov’s Laws of Robotics (The first law) - How alignment could work..

Submitted by

Josh

on

5/22/2024.

As an edit it won against

text #1

(4 YES 0 NO).

This king text has no current edits attempting to take its place.

TLDR?

By Diego

· Reply

  • Interpretability is hard.

  • even when we get it, we have to know what to do with it.

  • we would have to manipulate the Internal goals of AI.

    • Internal goals are based on internal simulations and internal representations.

    • We have to manipulate those in order to write hard rules into AI’s.

By Sergei

· Reply

More to explore:

Taking on the Camino for Ovarian Cancer

Shortcuts

  • Follow Ingrid Clancy’s progress step by step, on her picture blog at http://bit.ly/IngridClancy2024 (please “Follow” and “Comment” there to give her ...

How to make companies smarter

With scale comes communication problems. On-boarding while critical becomes messy. Processes while crucial become fragile. Goals and decisions become misunderst...

AI Prompting collaborative notes

Everyone and their grandma is a prompt engineer apparently. Everyone has that one cool trick to get the AI to produce better prompts.

Social media is full of gu...

Image preview