Jun Hyuk Kim's Blog
junhyuk1229
« 2026/05 »
| 일 |
월 |
화 |
수 |
목 |
금 |
토 |
| |
|
|
|
|
1 |
2 |
| 3 |
4 |
5 |
6 |
7 |
8 |
9 |
| 10 |
11 |
12 |
13 |
14 |
15 |
16 |
| 17 |
18 |
19 |
20 |
21 |
22 |
23 |
| 24 |
25 |
26 |
27 |
28 |
29 |
30 |
| 31 |
|
|
|
|
|
|
관리 메뉴
Jun Hyuk Kim's Blog
[Terms] Types of gradient descent 본문
Momentum
- Use the moving average to move the data.
- Due to having the previous momentum as a value it reduces large changes.
- Think of a ball rolling down a hill into a flat surface.
- Creates cases where the ball moves in a different direction from the slope.
Nesterov accelerated gradient
- Uses the gradient from the expected direction.
- Fixes the problem of moving in a different direction from the minimum.
- Using the expected gradient will increase responsiveness.
Adagrad
- Sets the learn rate for each individual parameters.
- Frequently occurring features gets a lower learn rates, while un-frequent occurring features gets a higher learn rate.
- Works well on sparce data.