Link to the video on Gradient Based Optimization.

Introduction

What is optimization?

Optimization is the selection of the best element according to some criterion. An optimization problem consists of maximizing or minimizing a function by choosing input values.1

Real-life Optimization Examples

  • Optimizing a design to maximize performance.
  • Optimizing a recommendation algorithm to show the best personalized advertisements.
  • Optimizing a factory to spend less time and money.

Mathematical Optimization

This is just finding the $x$ that maximizes or minimizes some function $f(x)$. It is easy for some simple functions, e.g. quadratic functions.

To find the $x$ that minimizes the function $f(x)$, we simply differentiate $f(x)$.

$f(x)$ is minimized when $f'(x) = 0$. However, in practice we want to find parameter values that minimize complex functions, which are harder to differentiate symbolically.
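
As a minimal worked example (the generic quadratic below is my own illustration, assuming a positive leading coefficient so the function is convex):

```latex
% Minimizing a convex quadratic analytically.
\begin{align*}
f(x)  &= ax^2 + bx + c, \qquad a > 0,\\
f'(x) &= 2ax + b,\\
f'(x) &= 0 \;\Longrightarrow\; x^* = -\frac{b}{2a}.
\end{align*}
```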

Gradient Based Optimization

Observations

Assume that the global minimum is the only local minimum, and $x^*$ is the value that minimizes $f(x)$. Then the slope of the function at a point $x$ indicates on which side the minimum lies:

  • If the slope at $x$ is positive, it means $x > x^*$.
  • If the slope at $x$ is negative, it means $x < x^*$.

Therefore, whatever the value of $x_n$ is, the update $x_{n+1} = x_n - f'(x_n)$ moves the value toward $x^*$. We can apply this iteratively and expect that the sequence $\{x_n\}$ will converge to $x^*$.
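
A minimal sketch of this naive iteration, using $f(x) = x^2/4$ as a stand-in function of my own choosing (the text does not fix a concrete $f$):

```python
# Naive iteration x_{n+1} = x_n - f'(x_n), with f(x) = x**2 / 4 as a
# gentle illustrative example (f'(x) = x / 2, minimizer x* = 0).

def f_prime(x):
    return x / 2  # derivative of f(x) = x**2 / 4

x = 8.0  # arbitrary starting point
for _ in range(20):
    x = x - f_prime(x)  # step against the slope
print(x)  # ~7.6e-06, i.e. the sequence approaches x* = 0
```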

Problem: Too steep a slope

If the function is too steep, the sequence can diverge. Therefore, we add one hyperparameter, the learning rate $\eta$, and scale the step: $x_{n+1} = x_n - \eta f'(x_n)$. A hyperparameter is a parameter whose value is set before the optimization algorithm runs.2
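
A sketch of why the learning rate matters; the steep function $f(x) = 10x^2$ and the specific $\eta$ values are my own illustrative choices:

```python
# With a steep function the bare step x - f'(x) overshoots and diverges;
# scaling by a small learning rate eta restores convergence.
# Illustrative function: f(x) = 10 * x**2, so f'(x) = 20 * x.

def f_prime(x):
    return 20 * x

def descend(eta, x=1.0, steps=20):
    for _ in range(steps):
        x = x - eta * f_prime(x)
    return x

print(descend(eta=1.0))   # each step multiplies x by -19: the sequence diverges
print(descend(eta=0.01))  # each step multiplies x by 0.8: converges toward 0
```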

Problem: Local minimum

We assumed that the global minimum is the only local minimum. However, general functions can have local minima that are not the global minimum.

Definition

A local minimum is the smallest value of the function within some given range around it.
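
Stated a bit more formally (a standard textbook definition, added here for precision):

```latex
% x^* is a local minimum of f if it attains the smallest value of f
% within some surrounding range (neighborhood):
\[
\exists\, \varepsilon > 0 : \quad
f(x^*) \le f(x) \quad \text{for all } x \text{ with } |x - x^*| < \varepsilon .
\]
```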

With the method we built in the previous chapter, the sequence can easily get trapped in a local minimum.

Another Observation

Picture a ball rolling down the graph of the function: in this example, the ball passes through the local minimum and reaches the global minimum due to inertia. Therefore, we can apply inertia to our formula.
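
The text does not give the exact formula for adding inertia; below is a sketch of one common momentum-style update, where the velocity term $v$ and the coefficient $\beta$ are my assumptions rather than the author's notation:

```python
# Gradient descent with momentum ("inertia"): each step keeps part of the
# previous velocity, which can carry x past shallow local minima.
# The update form below is a common choice, not necessarily the exact
# formula intended by the original text.

def f_prime(x):
    return 2 * x  # illustrative derivative of f(x) = x**2; swap in your own f'

def momentum_descent(x=5.0, eta=0.1, beta=0.9, steps=200):
    v = 0.0                              # accumulated "inertia"
    for _ in range(steps):
        v = beta * v - eta * f_prime(x)  # blend old velocity with the new slope
        x = x + v                        # move by the velocity, not the raw gradient
    return x

print(momentum_descent())  # approaches the minimizer x* = 0
```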


Footnotes

  1. https://en.wikipedia.org/wiki/Mathematical_optimization

  2. https://en.wikipedia.org/wiki/Hyperparameter_(machine_learning)