
/sci/ - Science & Math



File: 860 KB, 150x200, 1367465216356.gif
No.7067994 [DELETED]

In the Wikipedia article on deep learning, the subsection discussing deep neural networks mentions that a DNN can be trained using backpropagation with stochastic gradient descent, updating weights using the equation

<span class="math">\Delta w_{ij}(t + 1) = \Delta w_{ij}(t) + \eta \frac{\partial C}{\partial w_{ij}}</span>.

Why does this equation use addition rather than subtraction?
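
To make the sign question concrete, here is a minimal sketch on a toy problem. Everything in it is an assumption for illustration: a single weight w, a made-up cost C(w) = (w - 3)^2, a learning rate eta = 0.1, and a plain update applied to w itself rather than the accumulated \Delta w form quoted above. It just runs the update with each sign and compares the resulting cost.

```python
# Toy comparison of "+ eta * dC/dw" versus "- eta * dC/dw".
# The cost function, learning rate, and starting point are hypothetical
# choices for illustration; they do not come from the Wikipedia article.

def cost(w):
    """Toy cost with its minimum at w = 3."""
    return (w - 3.0) ** 2

def grad(w):
    """Derivative dC/dw of the toy cost."""
    return 2.0 * (w - 3.0)

def run(sign, w0=0.0, eta=0.1, steps=20):
    """Apply w <- w + sign * eta * dC/dw for a fixed number of steps."""
    w = w0
    for _ in range(steps):
        w = w + sign * eta * grad(w)
    return w

w_minus = run(-1.0)  # step against the gradient
w_plus = run(+1.0)   # step along the gradient

print("start cost:     ", cost(0.0))
print("cost with minus:", cost(w_minus))
print("cost with plus: ", cost(w_plus))
```

On this toy cost the minus version moves w toward 3 and the cost shrinks, while the plus version moves w away and the cost grows. So if C is a cost to be minimized and the partial derivative is taken with the usual sign, I would expect a minus. A plus would only make sense under a different convention, for example if the quantity being differentiated is something to maximize or the sign is folded into the gradient term.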