Lecture Note
References:
Hinton Note on the same topic

About Nesterov’s Accelerated Momentum (NAG):
Advances in Optimizing Recurrent Networks
Ilya Sutskever’s thesis

L-BFGS vs. SGD

Data Preprocessing, Weight Initialization, Regularization (L2/L1/Maxnorm/Dropout), Loss functions
Lecture Notes
Some references:
Should read:
Elastic net regularization
Dropout: A Simple Way to Prevent Neural Networks from Overfitting
Dropout Training as Adaptive Regularization
DropConnect

Others:
Understanding the difficulty of training deep feedforward neural networks
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Hierarchical Softmax

import numpy as np
import scipy as scp

  • np.flatnonzero(a) # return indices that are non-zero in the flattened version of a.
    This is equivalent to a.ravel().nonzero()[0]. (Example after this list.)

    See also:
    np.nonzero: return the indices of the non-zero elements of the input array.
    np.ravel: return a 1-D array containing the elements of the input array.

  • np.random.choice(a, size=None, replace=True, p=None) # generates a random sample from a given 1-D array. (Example after this list.)

    a: 1-D array-like or int
    If an ndarray, a random sample is generated from its elements. If an int, the
    random sample is generated as if a were np.arange(a).

  • np.argsort(a, axis=-1, kind='quicksort', order=None) # returns the indices that would sort an array. (Example after this list.)

  • np.random.permutation # randomly permute a sequence, or return a permuted range.
  • np.array_split # split an array into multiple sub-arrays; the division does not have to be equal.
  • np.vstack # stack arrays in sequence vertically (row wise).
    (A combined example of these three follows below.)
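
A minimal sketch of np.flatnonzero and its documented equivalence, reusing the
numpy import above (the array values are made up for illustration):

a = np.array([[0, 2, 0],
              [3, 0, 4]])
idx = np.flatnonzero(a)         # indices into the flattened array: [1 3 5]
same = a.ravel().nonzero()[0]   # the documented equivalence: [1 3 5]
vals = a.ravel()[idx]           # recover the non-zero values: [2 3 4]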
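
A quick sketch of both argument forms of np.random.choice (the seed and
probabilities are arbitrary choices for the example):

np.random.seed(0)                           # arbitrary seed, for reproducibility
draws = np.random.choice(5, size=3)         # 3 draws, as if a were np.arange(5)
items = np.array(['a', 'b', 'c', 'd'])
picks = np.random.choice(items, size=2, replace=False,
                         p=[0.1, 0.1, 0.4, 0.4])  # weighted, without replacement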
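
np.argsort on a toy score vector, e.g. to rank classifier scores (numbers made up):

scores = np.array([0.1, 0.9, 0.4])
order = np.argsort(scores)      # indices in ascending order: [0 2 1]
ranked = scores[order]          # [0.1 0.4 0.9]
top_first = order[::-1]         # reversed for descending, e.g. top predictions first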
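
And the last three helpers in one sketch: shuffle toy data, split it into folds
(a common cross-validation pattern), and stack it back together; the shapes are
made up for illustration:

X = np.arange(10).reshape(5, 2)         # toy data: 5 samples, 2 features
shuffled = np.random.permutation(X)     # returns a copy with rows (axis 0) permuted
folds = np.array_split(shuffled, 3)     # 3 sub-arrays; unequal sizes allowed: 2, 2, 1 rows
restacked = np.vstack(folds)            # stack the folds back into a (5, 2) array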