当前位置：网站首页>HOG feature study notes

HOG feature study notes

2022-08-05 01:52:00 【Wsyoneself】

The foregoing knowledge:
1. Feature descriptors simplify image representation by extracting useful information about the image and discarding irrelevant information
2. Determine what features are useful according to the task: detect lane lines on the road: edge detection, edge information is useful, color information is useless
Definition:
1. Orientation gradient histogram, which can be used to represent the object characteristics of the image, so that such objects can be detected.
2. is a feature descriptor similar to canny edge detector sift (scale invariant and feature transform), used in cv and image processing for object detection
3. Essential: Count the gradient directions that occur in local parts of the image.Use the magnitude and angle of the gradient to compute features (generate histogram)
4. HOG descriptor concerns the structure or shape of an object
5. HOG feature descriptor converts a 3-channel color image into a feature vector of a certain length
6. In the HOG feature descriptor, the distribution of gradient directions, that is, the histogram of gradient directions, is regarded as a feature.
7. The angle (the derivative of x and y) of the image, the gradients around the edges and corners (regions of sudden intensity) are large and contain more information about the shape of the object.
8. Using the image gradient, you can only focus on the corner information to outline the appearance of a person
HOG is often used in conjunction with svm to train high-precision target classifiers.HOG (similar to sift) + SVM workflow:
1. Preprocess the input image:
  1. Cut
  2. Scale to fixed size
  3. Grayscale processing (optional, for color images, the gradients are calculated relative to the three-channel color values, and the largest gradient value is taken as the gradient of the pixel)
  4. Gamma correction (the output image is a power function of the input image, the exponent is γ, the larger the γ, the darker the image): adjust the image contrast (reduce the influence of lighting (uneven lighting, partial shadows) on the image), so thatThe image is closer to what the human eye sees
2. Calculate the gradient value of each pixel point (calculated according to the 3X3 area), and get the magnitude matrix and angle matrix:
  1. For each pixel, compute the horizontal and vertical gradients (the same result can be obtained using the sobel operator with a kernel size of 1)
  2. Calculate the total gradient strength value and gradient direction (absolute value, so the range is [0-180])
3. Form gradient histogram:
  1. Calculated in an 8X8 cell, there are a total of 8X8X2=128 values (2 includes the gradient strength and gradient direction), and the gradient histogram is formed through statistics, 128 values will become 9 values (here9 is only because it is assumed that 0-180 is divided into 9 bins to count the pixel points m, the histogram of 9 columns, the angle range of each column is 20 degrees), while reducing the amount of calculation, it is also suitable for lighting and other environments.Changes are more robust.
  2. When the gradient direction tends to 0 degrees and 160 degrees, it indicates that the gradient direction of these points is upward or downward, indicating that there is a relatively obvious lateral edge at this position of the image.
4. Normalize blocks
  1. Each value of the vector is divided by the modulo length of the vector
  2. Specifically:
    1. A region of 8X8 is regarded as a cell, and 2X2 cells are used as a group, which is called a block.Since each cell has 9 values (because the 8X8X2 step of building the histogram has become 9 values (bin)), 2X2 cells have 36 values, hog gets the block by sliding the window.
    2. Normalize the gradient histogram of the block. It can be seen from the above that a block has 4 histograms. The 4 histograms are spliced into a vector of length 36, and then the entire vector is normalized
    3. Sliding window, the sliding step is 8 pixels (that is, a cell), and the window size is 2X2. Each time it slides, a feature vector with a length of 36 is obtained.
  3. Reason: Reduce the impact of lighting (the gradient of the image is sensitive to the overall lighting, but it is hoped that the feature descriptor will not be affected by lighting changes. So the histogram needs to be normalized. Because of the fold change of the original pixel, the normalizationThe result is the same after unification.)
5. Calculate HOG feature vector: splicing the feature vector calculated by sliding on the entire image to obtain the feature descriptor of the entire image
6. Collect HOG feature (a row of high-dimensional vector) and put it in svm for supervised learning.
7. All practice brings out true knowledge and shows the result of practice:

原网站

版权声明
本文为[Wsyoneself]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/217/202208050150599934.html

当前位置：网站首页>HOG feature study notes

HOG feature study notes

边栏推荐

猜你喜欢

随机推荐