| Sources: [1],[3],[4],[5],[6] |
Biological neuron and artificial neuron
| Source: [10] |
| Source: [10] |
| Source: [10] |
Easy? Difficult?
| Source: [12] |
What are the correct pixel values for a “bike” feature?
… a layer at a time
| Source: [12] |
Just a sec - let's meet a real neural network first!
Play around in the browser:
We need:
The loss (or cost) function indicates the cost incurred from false prediction / misclassification
Probably the best-known loss function in machine learning is mean squared error:
\( \frac{1}{n} \sum_n{(\hat{y} - y)^2} \)
Most of the time, in deep learning we use cross entropy:
\( - \sum_j{t_j log(y_j)} \)
This is the negative log probability of the right answer.
| Source: [12] |
| Source: [14] |
| Source: [13] |
| Source: [15] |
| Source: [13] |
| Source: [13] |
(Strictly, this is cross-correlation, but it doesn't matter)
| Source: [13] |
Blur: \( \begin{bmatrix}1 & 1 & 1\\1 & 1 & 1\\1 & 1 & 1\end{bmatrix} \), sharpen: \( \begin{bmatrix}0 & -1 & 0\\-1 & 5 & -1\\0 & -1 & 0\end{bmatrix} \), edge detect: \( \begin{bmatrix}0 & 1 & 0\\1 & -4 & 1\\0 & 1 & 0\end{bmatrix} \)
How do we handle sequences
?
Jane walked into the room. John walked in too. It was late in the day, and everyone was walking home after a long day at work. Jane said hi to ___
Source: [21]
| Sources: [22], [12] |
Sometimes we also need to forget!
The LSTM cell state is protected by three gates, the forget, input, and output gates:
| Source: [22] |
In translation, we have two sets of sequential data, one on the source and one on the target side!
Enter: sequence-to-sequence models
| Source: [23] |
| Source: [1] |
| Source: [1] |
If I get a reward many many actions later…
… how do I find out what concrete action I'm getting the reward for?
“Reinforcement learning + deep learning = AI” (David Silver, Google Deep Mind)
| Source: [29] |
For structured reading:
Just wanna have some cool fun?
[1] MIT 6.S094 Deep Learning for Self-Driving Cars Lecture Slides
[3] Esteva et al. Dermatologist-level classification of skin cancer with deep neural networks
[4] Wikipedia. AlphaGo versus Lee Sedol
[5] Yoshihara et al. Leveraging temporal properties of news events for stock market prediction
[6] The Weather Company. Seasonal forecast
[7] Neural Network Learns to Select Potential Anticancer Drugs
[8] Google Research Blog. AlphaGo: Mastering the ancient game of Go with Machine Learning
[10] Stergiou, C. and Siganos, D. Artificial neurons
[11] University of Waterloo Deep Learning Slides
[12] Goodfellow et al. Deep Learning
[13] Stanford CS231n Convolutional Neural Networks Lecture Notes
[14] Chris Olah. Calculus on Computational Graphs: Backpropagation
[15] Parkhi et al. Cats and Dogs
[16] Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks
[17] Erhan et al. Scalable Object Detection using Deep Neural Networks
[18] ImageNet Large Scale Visual Recognition Challenge 2014 (ILSVRC2014)
[19] Long et al. Fully Convolutional Networks for Semantic Segmentation
[20] Silberman et al. Instance Segmentation of Indoor Scenes using a Coverage Loss
[21] Stanford CS 224D Deep Learning for NLP Lecture Notes