当前位置：网站首页>CV learning notes - image filter

CV learning notes - image filter

2022-07-03 10:07:00 【Moresweet cat】

Image filter

1. Image filtering

1. Image filtering & filter

Image filtering ： That is to suppress the noise of the target image while preserving the details of the image as much as possible , It is an indispensable operation in image preprocessing , The quality of its processing effect will directly affect the effectiveness and reliability of subsequent image processing and analysis .

filter ： Think of the filter as a lens with weighting coefficients , When using this filter to smooth the image , Is to put this lens on the image , Look at the image we get through the lens .

2. The purpose of image filtering is & requirement

Filtering purpose ：

Eliminate the noise in the image .
Extract image features for image recognition .

Filtering requirements ：

Do not damage the image outline and edges .
The visual effect of the image should be better .

3. Filter type

Linear filtering

Linear filtering is the most basic method of image processing , Directly use the filter to process the image , According to the filter matrix elements , Can produce many different effects .

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-gYYTGJqx-1638773250584)(./imgs/image-20211127161452417.png)]

Mean filtering

The ideal mean filtering is to replace each pixel in the image with the average value calculated by each pixel and its surrounding pixels . It is very commonly used in image processing , From the point of view of frequency domain, mean filtering is a low-pass filter , High frequency signals will be filtered and eliminated after processing , Because of this feature , Mean filtering is often used to eliminate image sharp noise , Achieve image smoothing 、 Fuzzy and other functions .
$g(x,y)=\frac{1}{M}\sum_{f\in{s}}f(x,y)$
Algorithm flow ：

1.  Each pixel in the image is calculated from left to right and from top to bottom , Finally, the processed image .
2.  Two parameters can be added to the mean filter , The number of iterations ,Kernel data size .
3.  One of the same Kernel, But many iterations will get better and better .
4.  Again , Same number of iterations ,Kernel The larger the matrix , The more obvious the effect of mean filtering .

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-IoWxOwFA-1638773250586)(./imgs/image-20211127163240533.png)]

advantage ： Method is simple , Fast calculation .

shortcoming ： Reduce noise and blur the image , Especially the edges and details of the scene .

median filtering

The median filter replaces the central pixel with the median after sorting the surrounding pixel and the central pixel , The effect of median filtering is often better than that of mean filtering , Especially to eliminate salt and pepper noise .

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-Q53hOyU1-1638773250586)(./imgs/image-20211127163848801.png)]

advantage ： The inhibition effect is very good , The clarity of the picture is basically maintained ;

shortcoming ： The effect of suppressing Gaussian noise is not very good .

Maximum and minimum filtering

The maximum and minimum filter uses the sorted maximum value of surrounding pixels to compare with the minimum value and the central pixel to determine the central replacement pixel , If the central pixel is larger than the maximum , Replace the center pixel with the maximum ; If the central pixel is smaller than the minimum , Then the center pixel is replaced with the minimum value . It is a conservative image processing method .

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-wJ61S6gR-1638773250587)(./imgs/image-20211127164426552.png)]

Guided filtering

（ lead ） Local linear model

A point on a function has a linear relationship with the points of its adjacent parts , A complex function can be represented by many local linear functions , When you need to find the value of a point on the function , Just calculate and average the values of all linear functions containing this point . Local linear models are very helpful in representing non analytic functions .

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-Jc2zw6Hq-1638773250589)(./imgs/image-20211128114959667.png)]

The two-dimensional image can be regarded as a two-dimensional linear function without an exact analytical expression , Suppose the output and input of the function are in a two-dimensional window Satisfy linear relationship ：
$q_i=a_kI_i+b_k, For all i\in\omega_k$

among ,q Is the value of the output pixel ,I Is the value of the input image ,i and k Is the pixel index ,a and b When the center of the window is k The coefficient of the linear function . Actually , The input image is not necessarily the image to be filtered, but also the guide image of other images , That's why it's called guided filtering . Take the gradient on both sides of the above formula , You can get ：
$\nabla q=a\nabla I$
That is, when inputting an image I With gradient , Output q There are similar gradients , This is also the reason why guided filtering has edge preserving characteristics .

The next step is to find the sparsity of the linear function , That's linear regression , That is, you want to fit the output value with the real value p The gap between them is the smallest , That is to say, let the following formula be the minimum ：
$E(a_k,b_k)=\sum_{i\in \omega_k}((a_kI_i+b_k-p_i)^2+\epsilon a^2_k)$
there p Only images to be filtered , It's not like I That can be other images , meanwhile a The previous coefficient （ Write it down as e） Used to prevent getting a Too big , It is also an important parameter to adjust the filtering effect of the filter . By the least square method , We can get ：
$a_k=\frac{\frac{1}{|\omega|}\sum_{i\in \omega_k}I_ip_I-\mu_k\overline{p}_k}{\sigma^2_k+\epsilon}\\ b_k=\overline{p}_k-a_k\mu_k$
among , $\mu_k$ yes I At the window $\omega_k$ The average of , $\sigma^2_k$ yes I At the window $\omega_k$ Variance in , $|\omega|$ It's the window $\omega _k$ Number of pixels in , $\overline{p}_k$ Is the image to be filtered p At the window $\omega _k$ Mean in .

When calculating the linear coefficient of each window , We can see that a pixel is contained in multiple windows , in other words , Each pixel is described by several linear functions . therefore , As I said before , When you want to specify the output value of a certain point , Just average the values of all linear functions containing the point ：
$\begin{aligned} q_i&=\frac{1}{|\omega|}\sum_{k:i\in\omega _k}a_kI_k+b_k\\&=\overline{a}_iI_i+\overline{b}_i \end{aligned}$
here , $\omega_k$ Include all pixels when i The window of ,k It's the center of it .

When the guided filter is used as an edge preserving filter , There is often $I = p$ , If $e = 0$ , obviously $a = 1$ , $b = 0$ yes $E (a, b)$ Is the solution of the minimum , It can be seen from the above formula , The filter doesn't work at all , The input will be the same as the output . If e>0, In areas where pixel intensity changes little （ Or monochromatic areas ）, Yes $a$ Approximate to （ Or equal to ）0, and $b$ Approximate to （ Or equal to ）, That is to say, we do a weighted mean filter ; And in areas of great change , $a$ Approximate to 1, $b$ Approximate to 0, The filtering effect on the image is very weak , It helps to keep the edges . and $e$ The role of big change is to define what big change is , What is small change . With the same window size , With $e$ The increase of , The more obvious the filtering effect is .

2. Image noise

1. summary

Image noise is the random signal interference in the process of image acquisition or transmission , Signals that hinder people's understanding, analysis and processing of images .

The generation of image noise comes from the environmental conditions in image acquisition and the quality of the sensor itself , The main factor of image noise in the transmission process is that the transmission channel used is polluted by noise .

2. Gaussian noise

Definition ： Gaussian noise （Gaussian noise） It refers to a kind of noise whose probability density obeys Gaussian distribution .

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-HJfrScLh-1638773250591)(./imgs/image-20211206135347911.png)]

white Gaussian noise ： Noise whose amplitude obeys Gaussian distribution is uncorrelated between any two sampled samples .

White noise ： Uncorrelated noise between any two sampled samples .

Cause of occurrence ：

The image sensor is not bright enough when shooting 、 Uneven brightness ;
The noise and mutual influence of each component of the circuit ;
The image sensor works for a long time , The temperature is too high ;

Add Gaussian distribution ：

Input parameters sigma and mean
Generate Gaussian random number
The output pixel is calculated from the input pixel
Zoom all pixel values back to [0~255] Between
Cycle through all pixels
Output image

A normal Gaussian sampling distribution formula , Get the output pixels Pout
Pout = Pin + random.gauss
among random.gauss It's through sigma and mean To generate random numbers that conform to the Gaussian distribution .

3. Salt and pepper noise

Definition ： Salt and pepper noise is also called impulse noise , It is a random white dot or black dot .

Pepper noise & Salt noise ： Pepper noise （pepper noise）+ Salt noise （salt noise）. The value of salt and pepper noise is 0( Any of several hot spice plants ) perhaps 255( salt ). The former is low gray noise , The latter belongs to high gray noise . Generally, two kinds of noise appear at the same time , Black and white specks appear on the image . For color images , It may also appear in a single pixel BGR Three channels appear randomly 255 or 0.

The reasons causing ： If there is an error in communication , Some pixel values are lost during transmission , This noise will occur . The cause of salt and pepper noise may be the sudden strong interference of the image signal . For example, a failed sensor results in a minimum pixel value , Saturated sensors result in maximum pixel values .

Add salt and pepper noise ：

Specify the signal-to-noise ratio SNR（ The proportion of signal and noise ） , Its value range is [0, 1] Between
Calculate the total number of pixels SP, Get the number of pixels to noise NP = SP * SNR
Randomly get every pixel position to be noisy P（i, j）
Specify the pixel value as 255 perhaps 0.
repeat 3, 4 Complete all... In two steps NP Pixel noise

4. Other noise

Poisson noise ： A noise model with Poisson distribution , Poisson distribution is suitable to describe the probability distribution of the number of random events in unit time . For example, the number of service requests received by a service facility in a certain period of time , The number of calls received by the telephone exchange 、 The number of people waiting at the bus platform 、 The number of machine failures 、 The number of natural disasters 、DNA The number of variations in the sequence 、 Decay number of radioactive nuclei, etc .

characteristic ：

The longer the time , The more likely an event is to occur , And the probability of this event occurring at different times is independent of each other
For a very short period of time , The probability of this time occurring twice is almost zero
At the beginning, the incident did not happen

$P\{(N(t+\tau)-N(t))\}=\frac{e^{-\lambda\tau}(\lambda\tau)^k}{k!}\quad k=0,1...$

Multiplicative noise ： It is generally caused by the imperfect channel , The relationship between them and signals is multiplication , The signal is there. It's there , If the signal is not there, he will not be there .

rayleigh noise ： Compared with Gaussian noise , Its shape is skewed to the right , This is useful for fitting some skew histogram noise . Rayleigh noise can be realized by average noise .

Gamma Noise ： Its distribution obeys the distribution of gamma curve . Realization of gamma noise , Need to use b A noise subject to exponential distribution is superimposed . Exponentially distributed noise , Uniform distribution can be used to achieve .（b=1 Is exponential noise ,b>1 Through the superposition of several exponential noises , Get gamma noise ）

3. Image enhancement

1. summary

Purposefully emphasize the overall or local characteristics of the image , Make the original unclear image clear or emphasize some interesting features
sign , Expand the difference between different object features in the image , Suppress features that are not of interest , Make it improve the image quality 、 Enrich
The amount of information , Enhance the image interpretation and recognition effect , Meet the needs of some special analysis .

There are two types of image enhancement ：

Point processing technology . Only a single pixel is processed .
Neighborhood processing technology . Processing pixel points and points around them , That is, using convolution kernel .

2. Do something about it

1. linear transformation

effect ： Image enhancement linear transformation is mainly to adjust the contrast and brightness of the image ：
$y = a x + b$
Parameters $a$ Affect the contrast of the image , Parameters b Affect the brightness of the image , It can be divided into the following situations ：

$a > 1$ ： Enhanced the contrast of the image , The image looks clearer .
$a < 1$ ： Reduces the contrast of the image , The image looks blurred
$a=1\&\&b\neq0$ ： The gray value of the whole image moves up or down , That is to say, the whole image becomes bright or dark , Does not change the contrast of the image , $b > 0$ Image lightens when , $b < 0$ The image darkens when .

2. Piecewise linear transformation

effect ： In a given interval x Inside , Make a linear transformation , Adjust the contrast coefficient , Increase or decrease the contrast in the area .
$\left\{ \begin{aligned} y&=a_1x+b\quad x<x_1\\ y&=a_2x+b\quad x_1<x<x_2\\ y&=a_1x+b\quad x_2<x \end{aligned} \right.$

3. Logarithmic transformation

effect ： Logarithmic transformation expands the low gray value part of the image , Compress its high gray value part , In order to emphasize the low gray part of the image ; At the same time, it can well compress the dynamic range of the image with large changes in pixel values , The purpose is to highlight the details we need .
$y = c * l o g (1 + x)$
[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-zI7uxBph-1638773250592)(./imgs/image-20211206143045087.png)]

4. Power law transformation / Gamma transform

effect ： Power law transform is mainly used for image correction , Correct the bleached picture or too black picture .
$y=c*x^\gamma$
according to $\gamma$ Size , It can be divided into the following two situations ：

$\gamma >1$ ： Deal with bleached pictures , Gray level compression
$\gamma<1$ ： Processed black pictures , Contrast enhancement , Make the details more clear

[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-ABEyvveU-1638773250593)(./imgs/image-20211206143358536.png)]

3. Neighborhood processing

The output pixel value is determined by the pixel values of several pixels in a neighborhood including the current pixel . In the convoluted image (x,y) Pixels at g(x,y) Is the original image (x,y) Pixels at f(x,y) A neighborhood of $\Omega$ The median pixel value is based on a linear combination of certain convolution operator templates
$g(x,y)=\sum_{(m,n)\in\Omega}\sum Mask(x-m,y-n)f(m,n)$
Neighborhood $\Omega$ The contribution of medium pixels to the output value is represented by a two-dimensional convolution template matrix Mask[][] To weighted

** application ：** Histogram equalization 、 Image filtering

4. Common methods

Purposefully emphasize the overall or local characteristics of the image , Make the original unclear image clear or emphasize some interesting features , Expand the difference between different object features in the image , Suppress features that are not of interest , Make it improve the image quality 、 Rich information , Enhance the image interpretation and recognition effect , Meet the needs of some special analysis .