Summary by Marek Rei 7 years ago
The authors investigate the generalisation properties of several well-known image recognition networks.
https://i.imgur.com/km0mrVs.png
They show that these networks can fit the training set with 100% accuracy even when the image labels are random, or when the pixels themselves are randomly generated. Regularisation, such as weight decay and dropout, doesn’t stop this overfitting as much as expected: the networks still reach ~90% accuracy on random training data. The authors then argue that these models likely rely on massive memorisation, in combination with learning low-complexity patterns, in order to perform well on these tasks.
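To make the randomisation test concrete, here is a minimal sketch in plain Python. It is not the authors' setup: instead of a deep network it uses a 1-nearest-neighbour classifier as a stand-in for a model that purely memorises, and the helper names (`random_pixels`, `randomize_labels`, `predict_1nn`, `accuracy`), the dataset sizes and the seeds are all assumptions made for illustration.

```python
import random


def random_pixels(num_images, num_pixels, seed):
    """Generate images whose pixels are uniform noise (hypothetical helper)."""
    rng = random.Random(seed)
    return [[rng.random() for _ in range(num_pixels)] for _ in range(num_images)]


def randomize_labels(num_examples, num_classes, seed):
    """Draw every label uniformly at random, independent of the image."""
    rng = random.Random(seed)
    return [rng.randrange(num_classes) for _ in range(num_examples)]


def predict_1nn(train_x, train_y, x):
    """Return the label of the closest stored training image.

    This is a pure memoriser: it stores the training set and looks it up,
    standing in for the over-parameterised networks in the paper.
    """
    def sq_dist(i):
        return sum((a - b) ** 2 for a, b in zip(train_x[i], x))

    nearest = min(range(len(train_x)), key=sq_dist)
    return train_y[nearest]


def accuracy(train_x, train_y, xs, ys):
    """Fraction of examples in (xs, ys) that the memoriser labels correctly."""
    correct = sum(predict_1nn(train_x, train_y, x) == y for x, y in zip(xs, ys))
    return correct / len(xs)


# Assumed toy sizes: 200 noise images of 16 pixels each, 10 classes.
train_x = random_pixels(200, 16, seed=1)
train_y = randomize_labels(200, 10, seed=2)
test_x = random_pixels(200, 16, seed=3)
test_y = randomize_labels(200, 10, seed=4)

train_acc = accuracy(train_x, train_y, train_x, train_y)
test_acc = accuracy(train_x, train_y, test_x, test_y)
print(f"train accuracy: {train_acc:.2f}, test accuracy: {test_acc:.2f}")
```

Because each training image is its own nearest neighbour, the training accuracy is perfect by construction, while the test accuracy should sit near chance (about 1 in 10 here). That gap is the pattern the summary describes: a perfect fit to random data says nothing about generalisation.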