That was really interesting. So you see technical alignment as possible once the AI has the ability to simulate and create plans; otherwise, the internal goal is not invariant to the input. That kind of makes sense.
I do want you to dig a bit deeper into the implementation, however. I know it's a long article, but I think it deserves a bit more.