
Convex optimization notes

2022-06-23 19:07:00 【Bachuan Xiaoxiaosheng】

Simple convex optimization notes

Convex functions and convex sets

The region above the graph of a convex function (its epigraph) is a convex set.
Conversely, if the region above the graph of a function is a convex set, then the function is convex.

Convex set

A set $C$ is called convex if the line segment between any two points of $C$ lies in $C$:
$\forall x_{1},x_{2}\in C,\ \theta\in[0,1]:\quad \theta x_{1}+(1-\theta)x_{2}\in C$
More generally, every convex combination of points of $C$ lies in $C$:
$\forall x_{1},...,x_{k}\in C,\ \theta_{i}\in[0,1],\ \sum\theta_{i}=1:\quad \sum\theta_{i}x_{i}\in C$
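The definition can be spot-checked numerically. The sketch below is only an illustration under assumptions not in the notes: it takes the closed unit disk in $R^{2}$ as the convex set, and the helper names (`in_unit_disk`, `convex_combination`, `random_disk_point`) are made up for this example. A random test like this can expose non-convexity but does not prove convexity.

```python
import random

def in_unit_disk(p, tol=1e-12):
    """Membership test for the closed unit disk, a convex set in R^2."""
    return p[0] ** 2 + p[1] ** 2 <= 1.0 + tol

def convex_combination(points, thetas):
    """Return sum_i theta_i * x_i; thetas must be nonnegative and sum to 1."""
    assert all(t >= 0 for t in thetas)
    assert abs(sum(thetas) - 1.0) < 1e-9
    x = sum(t * p[0] for t, p in zip(thetas, points))
    y = sum(t * p[1] for t, p in zip(thetas, points))
    return (x, y)

def random_disk_point(rng):
    """Rejection-sample a point of the unit disk from the enclosing square."""
    while True:
        p = (rng.uniform(-1, 1), rng.uniform(-1, 1))
        if in_unit_disk(p, tol=0.0):
            return p

rng = random.Random(0)
all_inside = True
for _ in range(1000):
    k = rng.randint(2, 5)                       # number of points to combine
    points = [random_disk_point(rng) for _ in range(k)]
    raw = [rng.random() + 1e-9 for _ in range(k)]
    total = sum(raw)
    thetas = [r / total for r in raw]           # normalise so the weights sum to 1
    all_inside = all_inside and in_unit_disk(convex_combination(points, thetas))

print(all_inside)
```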

A convex polygon together with its boundary is a convex set; the same region with part of its boundary missing is in general not a convex set

Hyperplane and half space

Hyperplane

$\{x|a^{T}x=b\}$

Half space

$\{x|a^{T}x\leq b\}\\ \{x|a^{T}x\geq b\}$


Polyhedron

A polyhedron is the intersection of finitely many half-spaces and hyperplanes
$P=\{x|a_{j}^{T}x\leq b_{j},\ j=1...m,\ c_{i}^{T}x=d_{i},\ i=1...p\}$
Affine sets (such as hyperplanes and straight lines), rays, line segments, and half-spaces are all polyhedra

A polyhedron is a convex set

A bounded polyhedron is called a polytope
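A polyhedron given by finitely many inequalities and equalities is easy to test for membership. The following is a minimal sketch; the function name `in_polyhedron`, the tolerance, and the example sets (the unit square and its diagonal) are assumptions made for illustration.

```python
def dot(u, v):
    """Inner product of two equally long sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))

def in_polyhedron(x, ineqs, eqs=(), tol=1e-9):
    """Check a_j^T x <= b_j for every (a_j, b_j) in ineqs
    and c_i^T x = d_i for every (c_i, d_i) in eqs."""
    return (all(dot(a, x) <= b + tol for a, b in ineqs)
            and all(abs(dot(c, x) - d) <= tol for c, d in eqs))

# The unit square [0, 1]^2 as the intersection of four half-spaces (bounded).
unit_square = [((1, 0), 1), ((-1, 0), 0), ((0, 1), 1), ((0, -1), 0)]
# Adding the hyperplane x1 = x2 leaves the diagonal line segment.
diagonal = [((1, -1), 0)]

inside = in_polyhedron((0.5, 0.5), unit_square)
outside = in_polyhedron((1.5, 0.5), unit_square)
on_segment = in_polyhedron((0.25, 0.25), unit_square, diagonal)
off_segment = in_polyhedron((0.25, 0.75), unit_square, diagonal)
print(inside, outside, on_segment, off_segment)
```

The second example shows why a line segment counts as a polyhedron: it is a bounded intersection of half-spaces and one hyperplane.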


Operations that preserve convexity

Intersection


The intersection of convex sets is convex; the proof follows directly from the definition

Affine transformation
- Scaling
- Translation
- Projection

$f(x)=Ax+b,A\in R^{m\times n},b\in R^{m}\\ f:R^{n}\rightarrow R^{m} \quad f(S)=\{f(x)|x\in S\}\\ S\subseteq R^{n} \text{ convex} \Rightarrow f(S) \text{ convex} \\ C\subseteq R^{m} \text{ convex} \Rightarrow f^{-1}(C)=\{x|f(x)\in C\} \text{ convex}$

Perspective transformation

The perspective function scales (normalizes) a vector so that its last component equals 1, and then discards that last component
$P:R^{n+1}\rightarrow R^{n},\ P(z,t)=z/t,\ dom P=\{(z,t)|t>0\}$

Projective transformation (linear-fractional transformation)

The composition of the perspective function with an affine function
$g:R^{n}\rightarrow R^{m+1}\\ g(x)=\begin{bmatrix} A\\ c^{T} \end{bmatrix}x+ \begin{bmatrix} b \\ d \end{bmatrix}\\ A\in R^{m\times n},b\in R^{m},c\in R^{n},d\in R$
Define $f=P\circ g$, a linear-fractional function
$f:R^{n}\rightarrow R^{m},\ f(x)=\frac{Ax+b}{c^{T}x+d}\\ dom f=\{x|c^{T}x+d>0\}$
If $c = 0$ and $d > 0$, then $f$ is an ordinary affine function
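The composition can be written out directly. This is a small sketch, with the function names `perspective` and `linear_fractional` and the numeric data chosen only for illustration; matrices are plain nested lists.

```python
def dot(u, v):
    """Inner product of two equally long sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))

def perspective(z, t):
    """P(z, t) = z / t, defined only for t > 0."""
    if t <= 0:
        raise ValueError("the perspective function requires t > 0")
    return [zi / t for zi in z]

def linear_fractional(A, b, c, d, x):
    """f(x) = (A x + b) / (c^T x + d): perspective applied to the affine map g."""
    numer = [dot(row, x) + bi for row, bi in zip(A, b)]   # first m components of g(x)
    denom = dot(c, x) + d                                 # last component of g(x)
    return perspective(numer, denom)

A = [[1.0, 2.0], [0.0, 1.0]]
b = [1.0, -1.0]
x = [3.0, 4.0]

# c = 0, d > 0: the denominator is the constant d, so f is affine, (A x + b) / d.
affine_case = linear_fractional(A, b, [0.0, 0.0], 2.0, x)
# A general case with c != 0; x lies in dom f because c^T x + d = 8 > 0.
general_case = linear_fractional(A, b, [1.0, 1.0], 1.0, x)
print(affine_case, general_case)
```

Here $Ax+b=(12,3)$, so the two results are $(12,3)/2$ and $(12,3)/8$.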

Separating hyperplane

Let $C$ and $D$ be two disjoint convex sets. Then there exists a hyperplane $P=\{x|a^{T}x=b\}$, $a\neq 0$, that separates $C$ and $D$
$\forall x\in C,\ a^{T}x\leq b \text{ and } \forall x\in D,\ a^{T}x\geq b$

Note that the inequalities are not strict: equality is allowed


Converse proposition

The converse, "if a separating hyperplane exists for two convex sets $C$ and $D$, then $C$ and $D$ are disjoint", is false in general

Strengthened conditions

If at least one of the two convex sets is open, then they are disjoint if and only if a separating hyperplane exists

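Constructing a separating hyperplane

Distance

The distance between two sets is the infimum of the distances between a point of one set and a point of the other
$dist(C,D)=\inf\{||u-v||_{2}\ |\ u\in C,v\in D\}$

Construction

Suppose the distance is positive and is attained at points $c\in C$ and $d\in D$. The perpendicular bisector of the segment from $c$ to $d$ is a hyperplane that separates $C$ and $D$
$a=d-c,\quad b=\frac{||d||_{2}^{2}-||c||_{2}^{2}}{2}$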
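The perpendicular-bisector construction can be tried on a concrete pair of sets. The sketch below assumes two disjoint closed disks in $R^{2}$, for which the closest points lie on the line through the centres; it uses $a=d-c$ and $b=(||d||^{2}-||c||^{2})/2$, where $c$ and $d$ are the closest points. All function names and numbers are illustrative assumptions.

```python
import math
import random

def closest_points_of_disks(c0, r0, c1, r1):
    """Closest pair (c, d) between two disjoint closed disks in R^2."""
    dx, dy = c1[0] - c0[0], c1[1] - c0[1]
    dist = math.hypot(dx, dy)
    if dist <= r0 + r1:
        raise ValueError("the disks must be disjoint")
    ux, uy = dx / dist, dy / dist          # unit vector from centre 0 to centre 1
    c = (c0[0] + r0 * ux, c0[1] + r0 * uy)
    d = (c1[0] - r1 * ux, c1[1] - r1 * uy)
    return c, d

def separating_hyperplane(c, d):
    """Perpendicular bisector of the segment cd, as {x | a^T x = b}."""
    a = (d[0] - c[0], d[1] - c[1])
    b = (d[0] ** 2 + d[1] ** 2 - c[0] ** 2 - c[1] ** 2) / 2.0
    return a, b

def sample_disk(center, radius, rng):
    """Uniform random point of a disk, via polar coordinates."""
    r = radius * math.sqrt(rng.random())
    phi = rng.uniform(0.0, 2.0 * math.pi)
    return (center[0] + r * math.cos(phi), center[1] + r * math.sin(phi))

rng = random.Random(1)
center_c, center_d = (0.0, 0.0), (4.0, 0.0)
c, d = closest_points_of_disks(center_c, 1.0, center_d, 1.0)
a, b = separating_hyperplane(c, d)

points_c = [sample_disk(center_c, 1.0, rng) for _ in range(500)]
points_d = [sample_disk(center_d, 1.0, rng) for _ in range(500)]
side_c = all(a[0] * x + a[1] * y <= b + 1e-9 for x, y in points_c)   # a^T x <= b on C
side_d = all(a[0] * x + a[1] * y >= b - 1e-9 for x, y in points_d)   # a^T x >= b on D
print(a, b, side_c, side_d)
```

For these two disks the closest points are $(1,0)$ and $(3,0)$, so the hyperplane is $2x_{1}=4$, that is, the vertical line $x_{1}=2$.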

Supporting hyperplanes

Let $C$ be a set and $x_{0}$ a point on the boundary of $C$. If there exists $a\neq 0$ such that $a^{T}x\leq a^{T}x_{0}$ holds for every $x\in C$, then the hyperplane $\{x|a^{T}x=a^{T}x_{0}\}$ is called a supporting hyperplane of $C$ at the point $x_{0}$

There is a supporting hyperplane at any point on the boundary of a convex set

Conversely, if a closed set with nonempty interior has a supporting hyperplane at every point of its boundary, then the set is convex

Convex function

A function $f$ is called convex if its domain $dom f$ is a convex set and
$\forall x,y\in dom f,\ 0\leq\theta\leq1:\quad f(\theta x+(1-\theta)y)\leq \theta f(x)+(1-\theta)f(y)$

First-order condition

If $f$ is differentiable, then $f$ is convex if and only if its domain $dom f$ is a convex set and
$\forall x,y\in dom f,\ f(y)\geq f(x)+\nabla f(x)^{T}(y-x)$

For a convex function, the first-order Taylor approximation at any point is a global underestimator of the function

Conversely, if the first-order Taylor approximation of a function is always a global underestimator, then the function is convex
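The first-order condition $f(y)\geq f(x)+f'(x)(y-x)$ can be checked on a grid for functions of one variable. This is a rough numerical sketch: the helper name `is_global_underestimator`, the grid, and the two test functions ($e^{x}$, which is convex, and $\sin x$, which is not) are assumptions for illustration, and passing on a finite grid is evidence rather than proof.

```python
import math

def is_global_underestimator(f, df, grid, tol=1e-9):
    """True if f(y) >= f(x) + f'(x) * (y - x) for all x, y in the grid."""
    return all(f(y) >= f(x) + df(x) * (y - x) - tol
               for x in grid for y in grid)

grid = [-3 + 0.25 * i for i in range(25)]   # points -3, -2.75, ..., 3

exp_ok = is_global_underestimator(math.exp, math.exp, grid)   # e^x with derivative e^x
sin_ok = is_global_underestimator(math.sin, math.cos, grid)   # sin x with derivative cos x
print(exp_ok, sin_ok)
```

For $\sin x$ the tangent at $x=0$ is the line $y$, and $\sin 1<1$, so the tangent is not an underestimator there.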

Second-order condition

If $f$ is twice differentiable, then $f$ is convex if and only if $dom f$ is a convex set and
$\nabla^{2}f(x)\succeq 0,\ \forall x\in dom f$
If $f$ is a function of one variable, this says that the second derivative is greater than or equal to $0$

If $f$ is a function of several variables, this says that the Hessian matrix (the matrix of second derivatives) is positive semidefinite
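For a function of two variables the Hessian is a symmetric $2\times 2$ matrix, and positive semidefiniteness can be read off its smallest eigenvalue. The sketch below uses the closed-form eigenvalues of a symmetric $2\times 2$ matrix; the function names and the two quadratic examples are assumptions chosen for illustration.

```python
import math

def eigenvalues_sym2(h):
    """Eigenvalues (smallest first) of a symmetric 2x2 matrix [[p, q], [q, r]]."""
    p, q, r = h[0][0], h[0][1], h[1][1]
    mean = (p + r) / 2.0
    radius = math.hypot((p - r) / 2.0, q)
    return mean - radius, mean + radius

def is_psd_2x2(h, tol=1e-12):
    """A symmetric matrix is positive semidefinite iff its smallest eigenvalue is >= 0."""
    return eigenvalues_sym2(h)[0] >= -tol

# f(x) = x1^2 + x1*x2 + x2^2 has the constant Hessian [[2, 1], [1, 2]].
hessian_convex = [[2.0, 1.0], [1.0, 2.0]]
# f(x) = x1^2 - x2^2 (a saddle) has the constant Hessian [[2, 0], [0, -2]].
hessian_saddle = [[2.0, 0.0], [0.0, -2.0]]

print(eigenvalues_sym2(hessian_convex), is_psd_2x2(hessian_convex))
print(eigenvalues_sym2(hessian_saddle), is_psd_2x2(hessian_saddle))
```

Both examples are quadratics, so the Hessian does not depend on $x$; for a general function the check would have to hold at every point of the domain.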

Examples

- $e^{ax}$
- $x^{a},x\in R_{++},a\geq1 \text{ or } a\leq 0$
- $-\log x$
- $x\log x$
- $||x||_{p},p\geq 1$
- $max(x_{1},...,x_{n})$
- $x^{2}/a,a>0$
- $log(e^{x_{1}}+...+e^{x_{n}})$

Epigraph

The graph of a function $f$ is defined as $\{(x,f(x))|x\in dom f\}$
The epigraph of a function $f$ is defined as
$epi f=\{(x,t)|x\in dom f,f(x)\leq t\}$

Convex functions and convex sets

A function is convex if and only if its epigraph is a convex set
A function is concave if and only if its hypograph is a convex set
$hypo f=\{(x,t)|x\in dom f,t\leq f(x)\}$

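Jensen's inequality

If $f$ is a convex function, then
$f(\theta x+(1-\theta)y)\leq\theta f(x)+(1-\theta)f(y),\ 0\leq\theta\leq 1$
This extends to convex combinations of more than two points
$\theta_{1},...,\theta_{k}\geq 0,\ \theta_{1}+...+\theta_{k}=1\\ \Rightarrow f(\theta_{1}x_{1}+...+\theta_{k}x_{k})\leq\theta_{1}f(x_{1})+...+\theta_{k}f(x_{k})$
and to integrals and expectations
$p(x)\geq 0 \text{ on } S\subseteq dom f,\ \int_{S}p(x)dx=1 \\ \Rightarrow f(\int_{S}p(x)xdx)\leq \int_{S}f(x)p(x)dx\\ \text{i.e. } f(Ex)\leq E(f(x))$
Jensen's inequality can be used to prove, for example, that the KL divergence is nonnegative
$D(p||q)=\sum p(x)log \frac{p(x)}{q(x)}=E_{p(x)}log\frac{p(x)}{q(x)}\geq 0$
and similar results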
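The nonnegativity of $D(p||q)$ follows from Jensen's inequality applied to the convex function $-\log$: $D(p||q)=E_{p}[-\log\frac{q(x)}{p(x)}]\geq-\log E_{p}[\frac{q(x)}{p(x)}]=-\log\sum q(x)=0$. The short sketch below checks this numerically; the function name `kl_divergence` and the two distributions are assumptions for illustration, and the code assumes $q(x)>0$ wherever $p(x)>0$.

```python
import math

def kl_divergence(p, q):
    """D(p || q) = sum_x p(x) * log(p(x) / q(x)) for discrete distributions.

    Terms with p(x) = 0 contribute 0; q(x) must be positive where p(x) > 0.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

p = [0.5, 0.3, 0.2]
q = [0.2, 0.5, 0.3]

d_pq = kl_divergence(p, q)   # strictly positive because p != q
d_qp = kl_divergence(q, p)   # also nonnegative, but generally different from d_pq
d_pp = kl_divergence(p, p)   # zero when the two distributions coincide
print(d_pq, d_qp, d_pp)
```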

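Operations that preserve convexity of functions

Nonnegative weighted sum of convex functions $f_{i}$
$f(x)=w_{1}f_{1}(x)+...+w_{n}f_{n}(x),\ w_{i}\geq 0$
Composition of a convex function $f$ with an affine function
$g (x) = f (A x + b)$
Pointwise maximum and pointwise supremum
$f(x)=max(f_{1}(x),...,f_{n}(x))\\ f(x)=\sup_{y\in A}g(x,y)$
where each $f_{i}$ is convex and $g(x,y)$ is convex in $x$ for every $y\in A$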

The epigraph of the pointwise supremum of a family of functions is the intersection of the epigraphs of those functions
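A quick numerical illustration of the pointwise maximum rule, as a sketch with made-up affine pieces and helper names: the maximum of affine (hence convex) functions passes the convexity inequality on a grid, while the pointwise minimum of the convex functions $-x$ and $x$, which equals $-|x|$, fails it.

```python
def pointwise_max(fs):
    """Return the function x -> max_i f_i(x)."""
    return lambda x: max(f(x) for f in fs)

def pointwise_min(fs):
    """Return the function x -> min_i f_i(x) (not convexity-preserving)."""
    return lambda x: min(f(x) for f in fs)

def satisfies_convexity(f, grid, thetas, tol=1e-9):
    """Check f(t*x + (1-t)*y) <= t*f(x) + (1-t)*f(y) on all grid pairs."""
    return all(f(t * x + (1 - t) * y) <= t * f(x) + (1 - t) * f(y) + tol
               for x in grid for y in grid for t in thetas)

grid = [-4 + 0.5 * i for i in range(17)]    # points -4, -3.5, ..., 4
thetas = [0.0, 0.25, 0.5, 0.75, 1.0]

pieces = [lambda x: -x, lambda x: 0.5 * x + 1, lambda x: 2 * x - 3]
max_is_convex = satisfies_convexity(pointwise_max(pieces), grid, thetas)
min_is_convex = satisfies_convexity(
    pointwise_min([lambda x: -x, lambda x: x]), grid, thetas)
print(max_is_convex, min_is_convex)
```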

Convex optimization

The basic form of an optimization problem

$\min\ f_{0}(x),\ x\in R^{n}\\ \text{s.t. } f_{i}(x)\leq 0,\ i=1...m \quad \text{(inequality constraints)}\\ h_{j}(x)=0,\ j=1...p \quad \text{(equality constraints)}\\ \text{unconstrained optimization: } m=p=0$
The domain of the problem is $D=\bigcap_{i=0}^{m} dom f_{i} \cap \bigcap_{j=1}^{p}dom h_{j}$. A point $x\in D$ that satisfies the constraints is called feasible (a feasible solution). The feasible region is the set of all feasible points.
The optimal value is $p^{*}=inf\{f_{0}(x)|f_{i}(x)\leq0,i=1...m,h_{j}(x)=0,j=1...p\}$. An optimal solution $x^{*}$ is a feasible point with $f_{0}(x^{*})=p^{*}$

The basic form of a convex optimization problem

$f_{i}(x),\ i=0...m \text{ are convex functions} \\ h_{j}(x),\ j=1...p \text{ are affine functions}$
Important properties

The feasible region is a convex set
Every locally optimal solution is a globally optimal solution
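
A short argument for the second property, given as a sketch that relies only on the convexity of $f_{0}$ and of the feasible region. Suppose $x$ is locally optimal but some feasible $y$ has $f_{0}(y)<f_{0}(x)$. For $\theta\in(0,1]$ let $z=\theta y+(1-\theta)x$. Because the feasible region is convex, $z$ is feasible, and because $f_{0}$ is convex
$f_{0}(z)\leq\theta f_{0}(y)+(1-\theta)f_{0}(x)<f_{0}(x)$
As $\theta\rightarrow 0$ the point $z$ comes arbitrarily close to $x$ while keeping a strictly smaller objective value, which contradicts the local optimality of $x$. Hence no such $y$ exists, and $x$ is globally optimal.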

The dual problem

Lagrange function

$L(x,\lambda,\upsilon)=f_{0}(x)+\sum \lambda_{i}f_{i}(x)+\sum\upsilon _{j}h_{j}(x)$
For fixed $x$, the Lagrange function $L(x,\lambda,\upsilon )$ is an affine function of $\lambda$ and $\upsilon$

Lagrange dual function

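$g(\lambda,\upsilon)=\inf_{x\in D}L(x,\lambda,\upsilon)=\inf_{x\in D}(f_{0}(x)+\sum \lambda_{i}f_{i}(x)+\sum\upsilon _{j}h_{j}(x))$
If the Lagrange function is unbounded below in $x$, define $g(\lambda,\upsilon)=-\infty$
Let $p^{*}$ be the optimal value of the original problem. For any feasible $x$ and any $\lambda\geq 0$ we have $\sum\lambda_{i}f_{i}(x)\leq 0$ and $\sum\upsilon_{j}h_{j}(x)=0$, so $L(x,\lambda,\upsilon)\leq f_{0}(x)$. Hence, for all $\lambda\geq 0$ and all $\upsilon$
$g(\lambda,\upsilon)\leq p^{*}$
Furthermore, the Lagrange dual function is concave, because it is the pointwise infimum of a family of affine functions of $(\lambda,\upsilon)$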
Consider a problem with inequality constraints only, so that $L(x,\lambda)=f_{0}(x)+\sum\lambda_{i}f_{i}(x)$.
Suppose $x$ is infeasible, so that $f_{i}(x)>0$ for some $i$. Letting $\lambda_{i}\rightarrow\infty$ and setting the other multipliers $\lambda_{j}=0,j\neq i$ makes $L(x,\lambda)\rightarrow\infty$
Suppose $x$ is feasible, so that $f_{i}(x)\leq 0,i=1...m$. Then the supremum is attained at $\lambda_{i}=0,i=1...m$
Hence
$\sup_{\lambda\geq 0}L(x,\lambda)=\sup_{\lambda\geq 0}(f_{0}(x)+\sum\lambda_{i}f_{i}(x))= \left\{\begin{matrix} f_{0}(x), & f_{i}(x)\leq 0,\ i=1...m \\ \infty, & \text{otherwise} \end{matrix}\right.$

The original problem, minimizing $f_{0}(x)$ subject to the constraints, can therefore be written as $\inf_{x} \sup_{\lambda\geq 0}L(x,\lambda)$
The dual problem is to maximize the dual function, namely
$\sup_{\lambda\geq0}\inf_{x}L(x,\lambda)$
and weak duality always holds
$\sup_{\lambda\geq0}\inf_{x}L(x,\lambda)\leq\inf_{x}\sup_{\lambda\geq0}L(x,\lambda)$
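The inequality can be seen on a one-dimensional toy problem. The example below is an assumption chosen for illustration, not taken from the notes: minimize $f_{0}(x)=x^{2}$ subject to $f_{1}(x)=1-x\leq 0$, whose optimal value is $p^{*}=1$ at $x^{*}=1$. Setting $\partial L/\partial x=2x-\lambda=0$ gives the minimizer $x=\lambda/2$, so $g(\lambda)=\lambda-\lambda^{2}/4$.

```python
def f0(x):
    """Objective of the toy problem: minimise x^2."""
    return x * x

def f1(x):
    """Inequality constraint f1(x) <= 0, i.e. x >= 1."""
    return 1.0 - x

def lagrangian(x, lam):
    return f0(x) + lam * f1(x)

def dual_function(lam):
    """g(lam) = inf_x L(x, lam); the minimiser x = lam / 2 solves 2x - lam = 0."""
    return lagrangian(lam / 2.0, lam)

p_star = f0(1.0)                                  # primal optimum, attained at x = 1

lams = [i / 10.0 for i in range(61)]              # lam = 0.0, 0.1, ..., 6.0
weak_duality_holds = all(dual_function(lam) <= p_star + 1e-12 for lam in lams)
best_lam = max(lams, key=dual_function)           # maximiser of g on the grid
d_star = dual_function(best_lam)                  # best lower bound found
print(weak_duality_holds, best_lam, d_star, p_star)
```

On this problem the best lower bound equals $p^{*}$, attained at $\lambda=2$, so the duality gap is zero; in general only the inequality is guaranteed.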

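Strong duality conditions

Strong duality means that the maximum value of the dual function equals the minimum value of the original problem. In that case
$f_{0}(x^{*})=g(\lambda^{*},\upsilon^{*})\\ =\inf_{x}(f_{0}(x)+\sum \lambda_{i}^{*}f_{i}(x)+\sum\upsilon _{j}^{*}h_{j}(x))\\ \leq f_{0}(x^{*})+\sum \lambda_{i}^{*}f_{i}(x^{*})+\sum\upsilon _{j}^{*}h_{j}(x^{*})\\ \leq f_{0}(x^{*})$
The chain starts and ends with $f_{0}(x^{*})$, so both inequalities hold with equality: $x^{*}$ minimizes $L(x,\lambda^{*},\upsilon^{*})$ and $\sum\lambda_{i}^{*}f_{i}(x^{*})=0$. For differentiable $f_{i}$ and $h_{j}$ this gives the following conditions, known as the KKT conditions
$f_{i}(x^{*})\leq 0,\ i=1...m\\ h_{j}(x^{*})= 0,\ j=1...p\\ \lambda_{i}^{*}\geq 0,\ i=1...m\\ \lambda_{i}^{*}f_{i}(x^{*})= 0,\ i=1...m\\ \nabla f_{0}(x^{*})+\sum \lambda_{i}^{*}\nabla f_{i}(x^{*})+\sum\upsilon _{j}^{*}\nabla h_{j}(x^{*})=0$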
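
As a small worked check (the toy problem is an assumption chosen for illustration, not taken from the notes), consider minimizing $f_{0}(x)=x^{2}$ subject to $f_{1}(x)=1-x\leq 0$, with no equality constraints. The candidate pair $x^{*}=1,\lambda^{*}=2$ satisfies every condition
$f_{1}(x^{*})=1-1=0\leq 0\\ \lambda^{*}=2\geq 0\\ \lambda^{*}f_{1}(x^{*})=2\cdot 0=0\\ \nabla f_{0}(x^{*})+\lambda^{*}\nabla f_{1}(x^{*})=2x^{*}-\lambda^{*}=2-2=0$
The two optimal values agree: $f_{0}(x^{*})=1$ and $g(\lambda^{*})=\inf_{x}(x^{2}+2(1-x))=\inf_{x}((x-1)^{2}+1)=1$, so strong duality holds for this problem.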