Who trained the AlexNet neural network in 2012?
Alex Krizhevsky trained the AlexNet neural network in 2012 on a computer in his bedroom at his parents' house. He used two Nvidia GTX 580 graphics cards to handle the model's 60 million parameters.
The AlexNet system used two Nvidia GTX 580 graphics cards, each with 3 GB of video memory and a US$500 launch price. The team split the eight-layer architecture across both devices because the network's full training footprint (its 60 million parameters plus the gradients and layer activations needed for backpropagation) exceeded what a single card could hold.
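A back-of-envelope calculation shows why the limit was the training footprint rather than the weights alone. The batch and activation figures here are illustrative assumptions, not measurements from the original paper:

```python
# Rough FP32 memory estimate for a network with ~60M parameters.
PARAMS = 60_000_000
BYTES_PER_FLOAT = 4  # single-precision float

weights_gb = PARAMS * BYTES_PER_FLOAT / 1e9   # the weights themselves
gradients_gb = weights_gb                     # one gradient per weight

# Training also caches every layer's activations for the backward pass;
# for a large batch of images this can dwarf the weights themselves,
# which is how a 0.24 GB model overflows a 3 GB card.
print(f"weights:   {weights_gb:.2f} GB")
print(f"gradients: {gradients_gb:.2f} GB")
```

The weights come to roughly 0.24 GB, so it is the per-batch activation storage and gradient state that pushes training past a single GTX 580's 3 GB.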
On September 30, 2012, the SuperVision team submitted their entry to the ImageNet Large Scale Visual Recognition Challenge. Their final system combined seven AlexNet models into an ensemble that achieved a top-5 error rate of 15.3 percent.
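A minimal sketch of how such an ensemble is scored: average the per-model class probabilities, then count an example as correct if the true label appears among the five highest-scoring classes. The shapes and random data below are toy assumptions (the real challenge used 1,000 classes and held-out test images):

```python
import numpy as np

def ensemble_predict(model_probs):
    """Average class probabilities across models, a common way to
    combine an ensemble's predictions."""
    return np.mean(model_probs, axis=0)

def top5_error(probs, labels):
    """Fraction of examples whose true label is NOT among the five
    highest-probability classes."""
    top5 = np.argsort(probs, axis=1)[:, -5:]
    hits = [label in row for row, label in zip(top5, labels)]
    return 1.0 - np.mean(hits)

# Toy setup: 7 "models", 4 examples, 10 classes.
rng = np.random.default_rng(0)
model_probs = rng.random((7, 4, 10))
model_probs /= model_probs.sum(axis=2, keepdims=True)  # normalize rows
labels = np.array([3, 1, 7, 2])

avg = ensemble_predict(model_probs)
err = top5_error(avg, labels)
```

Averaging smooths out each individual model's mistakes, which is why the seven-model ensemble scored better than any single network.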
As of early 2025, the original AlexNet paper has been cited over 184,000 times according to Google Scholar. Following this milestone, subsequent research focused on training increasingly deep CNNs to achieve higher performance on benchmarks.
Kunihiko Fukushima proposed the neocognitron in 1980 as an early form of convolutional neural network. Yann LeCun developed an early LeNet in 1989, trained with supervised backpropagation, and max pooling appeared in speech processing in 1990.