#interpretability

📐 Research The Gradient 11 min read

Do text embeddings perfectly encode text?

'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.

#interpretability #llm #nlp

🕐 2 years ago

Read →

🔬 Research Distill.pub 9 min read

Multimodal Neurons in Artificial Neural Networks

We report the existence of multimodal neurons in artificial neural networks, similar to those found in the human brain.

#neural networks #interpretability #multimodal learning

🕐 5 years ago

Read →

🔬 Research Distill.pub 15 min read

Visualizing Weights

We present techniques for visualizing, contextualizing, and understanding neural network weights.

#neural networks #interpretability #visualization

🕐 5 years ago

Read →

🔬 Research Distill.pub 3 min read

Curve Circuits

Reverse engineering the curve detection algorithm from InceptionV1 and reimplementing it from scratch.

#neural networks #interpretability #computer vision

🕐 5 years ago

Read →

🔬 Research Distill.pub 18 min read

High-Low Frequency Detectors

A family of early-vision neurons reacting to directional transitions from high to low spatial frequency.

#neural networks #feature detection #computer vision

🕐 5 years ago

Read →

🔬 Research Distill.pub 40 min read

Understanding RL Vision

With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.

#reinforcement learning #interpretability #vision

🕐 5 years ago

Read →

🔬 Research Distill.pub 36 min read

Curve Detectors

Part one of a three part deep dive into the curve neuron family.

#neural networks #interpretability #curve detection

🕐 5 years ago

Read →

🔬 Research Distill.pub 17 min read

An Overview of Early Vision in InceptionV1

An overview of all the neurons in the first five layers of InceptionV1, organized into a taxonomy of 'neuron groups.'

#neural networks #computer vision #inceptionv1

🕐 6 years ago

Read →

🔬 Research Distill.pub 42 min read

Visualizing Neural Networks with the Grand Tour

By focusing on linear dimensionality reduction, we show how to visualize many dynamic phenomena in neural networks.

#neural networks #visualization #grand tour

🕐 6 years ago

Read →

🔬 Research Distill.pub 5 min read

Thread: Circuits

What can we learn if we invest heavily in reverse engineering a single neural network?

#neural networks #interpretability #deep learning

🕐 6 years ago

Read →

🔬 Research Distill.pub 42 min read

Zoom In: An Introduction to Circuits

By studying the connections between neurons, we can find meaningful algorithms in the weights of neural networks.

#neural networks #interpretability #visualization

🕐 6 years ago

Read →

🔬 Research Distill.pub 9 min read

A Discussion of 'Adversarial Examples Are Not Bugs, They Are Features'

Six comments from the community and responses from the original authors

#adversarial examples #neural networks #robustness

🕐 6 years ago

Read →

#interpretability — AI News & Research · DeepTrendLab