caffe/models/bvlc_alexnet
Sergey Karayev a66100181b Renaming CaffeNet model prototxts and unignoring models/* 2014-09-04 03:59:14 +01:00
..
deploy.prototxt Renaming CaffeNet model prototxts and unignoring models/* 2014-09-04 03:59:14 +01:00
readme.md [models] adding zoo readme; caffenet, alexnet, and rcnn models in zoo format 2014-09-04 01:53:18 +01:00
solver.prototxt Renaming CaffeNet model prototxts and unignoring models/* 2014-09-04 03:59:14 +01:00
train_val.prototxt Renaming CaffeNet model prototxts and unignoring models/* 2014-09-04 03:59:14 +01:00

readme.md

name caffemodel caffemodel_url license sha1 caffe_commit
BVLC AlexNet Model bvlc_alexnet.caffemodel http://dl.caffe.berkeleyvision.org/bvlc_alexnet.caffemodel non-commercial 9116a64c0fbe4459d18f4bb6b56d647b63920377 709dc15af4

This model is a replication of the model described in the AlexNet publication.

Differences:

  • not training with the relighting data-augmentation;
  • initializing non-zero biases to 0.1 instead of 1 (found necessary for training, as initialization to 1 gave flat loss).

The bundled model is the iteration 360,000 snapshot. The best validation performance during training was iteration 358,000 with validation accuracy 57.258% and loss 1.83948. This model obtains a top-1 accuracy 57.1% and a top-5 accuracy 80.2% on the validation set, using just the center crop. (Using the average of 10 crops, (4 + 1 center) * 2 mirror, should obtain a bit higher accuracy.)

License

The data used to train this model comes from the ImageNet project, which distributes its database to researchers who agree to a following term of access: "Researcher shall use the Database only for non-commercial research and educational purposes." Accordingly, this model is distributed under a non-commercial license.