What’s in the box?! – Towards interpretability by distinguishing niches of value within neural networks.

King of the Hill 1

Current king of the hill (#1) at What’s in the box?! – Towards interpretability by distinguishing niches of value within neural networks. Submitted by Josh on 2/27/2024.

This king text has no current edits attempting to take its place.

Man, it took me a while to get through it. The ideas are really interesting; I just think it's too much of a slog. You have (understandably) coined new words for various concepts (like inout, triggers, etc.), which makes it hard for the reader to follow.

In particular, this makes the intro really hard. It reads like gobbledygook until you have read the entire thing; then you look back and it all makes sense. If you want more people to read it (which you should; once again, it is a really interesting perspective on interpretability), I would rewrite the intro, and perhaps the abstract, to account for this.

By jake-denning
