We have built an intelligent retail settlement platform
2022-07-03 18:57:00 【Datawhale】
Datawhale Share
Author: Yan Xin, Datawhale member
In day-to-day retail operations, labor costs are substantial: shopping guides, cleaning, checkout, and so on. Identifying products and settling their prices is especially demanding in staff and time, and customers have to wait in line while it happens. The result is high labor cost and low efficiency for the retailer, and a worse shopping experience for the customer.

With advances in computer vision and the rise of unmanned, automated supermarket operation, there is a pressing demand for automatic checkout (ACO) systems that use image recognition and object detection to identify products and settle payment automatically. A computer-vision-based automatic checkout system can effectively reduce retail operating costs and speed up customer checkout, further improving the shopping experience.
Image recognition is the core of AI checkout, and its accuracy determines whether AI checkout is feasible. At present, AI checkout faces the following pain points:
1. Similar product packaging: products of the same kind differ in flavor and price, while products from different categories can have similar outer packaging; both place high demands on recognition accuracy;
2. Many interference factors: during recognition, products are prone to deformation caused by viewing angle, folding, occlusion, and similar interference, which affects the recognition result;
3. Very fast category updates: retail products are typically updated on an hourly basis. If recognition relies on a single model, the model has to be retrained every time a new product is added, so training cost and time cost are extremely high;
4. High system performance requirements: detection and recognition must be solved at the same time, so model selection and optimization have to trade off accuracy against speed.
ColugoMum is committed to providing small and medium-sized offline retail experience stores with a vision-based smart retail checkout solution, open-sourced simultaneously on GitHub and in the Qizhi community.

To address these pain points, the ColugoMum team adopted PP-ShiTu [2], the image recognition system proposed by the PaddlePaddle PaddleClas [1] team. A product recognition scheme built on PP-ShiTu offers a new approach to the retail setting, where products span many categories, have few samples each, look highly similar, and change frequently. It can accurately recognize many product categories while still meeting strict requirements on prediction efficiency. Its most practical feature is this: when a new product category appears in production, there is no need to retrain the model. Adding the image features of that category to the retrieval library is enough to recognize the new product.
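The "add features instead of retraining" idea can be illustrated with a small, self-contained sketch. This is not PP-ShiTu's implementation: the `Gallery` class, the three-dimensional feature vectors, the product labels, and the 0.5 similarity threshold are all made up for illustration. In a real system the vectors would come from the feature extraction model.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


class Gallery:
    """Toy retrieval library: stores (label, unit feature vector) pairs."""

    def __init__(self):
        self.labels = []
        self.features = []

    def add(self, label, feature):
        # Registering a product only appends to the index; no model weights change.
        self.labels.append(label)
        self.features.append(l2_normalize(feature))

    def search(self, query, threshold=0.5):
        """Return (label, score) of the closest entry, or (None, score) if too dissimilar."""
        q = l2_normalize(query)
        best_label, best_score = None, -1.0
        for label, feat in zip(self.labels, self.features):
            score = sum(a * b for a, b in zip(q, feat))
            if score > best_score:
                best_label, best_score = label, score
        if best_score < threshold:
            return None, best_score
        return best_label, best_score


gallery = Gallery()
gallery.add("cola_330ml", [0.9, 0.1, 0.0])       # hypothetical feature vectors
gallery.add("chips_original", [0.1, 0.9, 0.1])

new_item_feature = [0.0, 0.1, 0.95]              # a product not yet registered
before, _ = gallery.search(new_item_feature)     # nothing similar enough yet
gallery.add("green_tea_500ml", [0.0, 0.2, 0.9])  # register it: one append, no training
after, _ = gallery.search(new_item_feature)
print(before, after)
```

The design point is that the feature extractor stays fixed; only the gallery grows, which is why an hourly product update becomes an index update rather than a training job.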

PP-ShiTu is a practical, lightweight, general-purpose image recognition system made up of three main modules: mainbody detection, feature learning, and vector retrieval. The system applies a range of strategies to optimize the model in each module across 8 aspects, including backbone network selection and tuning, loss function selection, data augmentation, learning rate schedule, regularization parameter selection, use of pretrained models, and model pruning and quantization. Trained on data covering more than 100,000 categories, the resulting multi-scenario general image recognition system needs only 0.2 s per prediction on a CPU.
Put simply, using PP-ShiTu involves three steps:
1. Use the mainbody detection model to locate each object in the image;
2. Extract features from each candidate region;
3. Look up each extracted feature vector in the retrieval library, complete the match, and return the recognition result.
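The three steps above can be sketched as a small function that wires the stages together. This is a minimal sketch and not PP-ShiTu's API: `recognize`, the `toy_*` stand-ins, `LIBRARY`, and the fake "image" (a dict from box to a precomputed feature) are all hypothetical, chosen so the example runs without any model.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]   # (x1, y1, x2, y2) in pixels
Feature = Sequence[float]


def recognize(
    image,
    detect: Callable[[object], List[Box]],
    extract: Callable[[object, Box], Feature],
    retrieve: Callable[[Feature], Tuple[str, float]],
) -> List[dict]:
    """Run detection, then per-region feature extraction, then retrieval."""
    results = []
    for box in detect(image):             # step 1: find candidate regions
        feature = extract(image, box)     # step 2: embed each region
        label, score = retrieve(feature)  # step 3: match against the library
        results.append({"box": box, "label": label, "score": score})
    return results


# Toy stand-ins (assumptions) so the sketch runs without a detector or network.
LIBRARY = {"cola_330ml": (1.0, 0.0), "chips_original": (0.0, 1.0)}


def toy_detect(image):
    return sorted(image)                  # the fake image's keys are its boxes


def toy_extract(image, box):
    return image[box]                     # the fake image stores features directly


def toy_retrieve(feature):
    scores = {
        label: sum(a * b for a, b in zip(feature, ref))
        for label, ref in LIBRARY.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


fake_image = {(0, 0, 50, 80): (0.9, 0.1), (60, 0, 120, 80): (0.2, 0.8)}
results = recognize(fake_image, toy_detect, toy_extract, toy_retrieve)
labels = [r["label"] for r in results]
print(labels)
```

Passing the three stages in as callables keeps them independently replaceable, which mirrors how the detector, feature extractor, and retrieval index are chosen separately.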
Because real retail scenarios demand both high accuracy and fast prediction, the ColugoMum team chose the PicoDet model as the mainbody detection algorithm, the lightweight PPLCNet_x2_5_ssld model for feature extraction, and HNSW32 from the Faiss vector search library as the retrieval algorithm, balancing speed against accuracy.
With this setup, the ColugoMum team reached a best prediction accuracy of **96.91%** on the RP2K dataset.
The RP2K dataset [3] contains more than 500,000 images of products on retail shelves, covering more than 2,000 product categories, which makes it the retail dataset with the largest number of product categories to date. Unlike datasets that generally focus on new products, all RP2K images were collected manually in real scenes, and Pinlan provides very detailed annotations for each product.
In addition, ColugoMum has collected and organized SKU-level product image datasets from the industry, and hopes to work with developers to open-source datasets that are influential in the industry and meet the needs of real application scenarios.
https://github.com/ColugoMum/Dataset
The ColugoMum team has also open-sourced high-accuracy training and inference models based on the RP2K dataset. Developers can fine-tune the provided training model on their own data, or try out predictions directly with the provided inference model. ColugoMum has also launched an RP2K-based leaderboard challenge, and developers are welcome to take part.
https://github.com/ColugoMum/Exprements
For deployment, the team uses Paddle Serving [4], PaddlePaddle's service deployment framework, which meets the requirements of batch prediction, strong data security, and low latency. Prediction takes only 0.2 seconds on a CPU, balancing prediction speed against accuracy.

To help developers understand PP-ShiTu and make better use of its strengths in image recognition, the ColugoMum team has open-sourced a tutorial on smart retail product recognition based on image recognition. Developers can build on it to integrate PP-ShiTu into their own business quickly.
https://github.com/ColugoMum/Goods_Recognition
Building on the core technology above, the ColugoMum team has used PP-ShiTu to open-source Smart_container, a cloud-edge integrated product recognition system that meets the needs of real application scenarios. It accurately locates the goods a customer has purchased and settles their price intelligently and automatically.
https://github.com/ColugoMum/Smart_container
When customers place their purchases in the designated area, Smart_container accurately locates and recognizes each product, then returns the complete shopping list and the total amount the customer should pay. When new products are added to the system, only the retrieval library needs to be updated; the model does not need to be retrained. Smart_container covers a hardware checkout counter, a mini-program management platform, and a big data visualization platform, giving unified multi-terminal access and smart management.
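As an illustration of the last step, turning recognized labels into a shopping list and a total, here is a minimal sketch. It is not Smart_container's code: the `PRICES` table, the SKU labels, and `build_receipt` are hypothetical, and prices use `Decimal` to avoid floating-point rounding in currency arithmetic.

```python
from collections import Counter
from decimal import Decimal

# Hypothetical price table keyed by SKU label (an assumption for illustration).
PRICES = {
    "cola_330ml": Decimal("3.50"),
    "chips_original": Decimal("6.00"),
}


def build_receipt(labels, prices):
    """Aggregate recognized labels into line items, a total, and unpriced labels."""
    counts = Counter(labels)
    lines = []
    unknown = []
    total = Decimal("0")
    for label, qty in sorted(counts.items()):
        if label not in prices:
            # Recognized but not priced: flag it instead of silently charging zero.
            unknown.append(label)
            continue
        subtotal = prices[label] * qty
        lines.append((label, qty, subtotal))
        total += subtotal
    return lines, total, unknown


recognized = ["cola_330ml", "chips_original", "cola_330ml", "mystery_snack"]
lines, total, unknown = build_receipt(recognized, PRICES)
for label, qty, subtotal in lines:
    print(f"{label} x{qty} = {subtotal}")
print("total:", total, "needs review:", unknown)
```

Surfacing unpriced labels separately is one possible design choice; a checkout counter could use that list to ask staff to confirm the item rather than guess a price.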

Core development team
Yan Xin is a junior majoring in automation at East China University of Science and Technology, with research on cooperative control and decision-making for multi-robot systems and main interests in computer vision, reinforcement learning, and inference deployment. Yan Xin is a PaddlePaddle developer technical expert, a Datawhale member, a Huawei Cloud sharing expert, and the lead of a National Undergraduate Innovation and Entrepreneurship project, and has won the Shanghai bronze medal in the 13th "Challenge Cup" College Students' Entrepreneurship Plan Competition and the gold medal in the 10th East China University of Science and Technology "Endeavour Cup" College Students' Entrepreneurship Plan Competition. Yan Xin holds two software copyrights and has one international conference paper.
Shen Chen is a junior majoring in information engineering at East China University of Science and Technology. He has won the second prize in the Shanghai division of the CRAIC China Robot and Artificial Intelligence Competition, the second prize in the Shanghai University Students' Computer Application Ability Design Competition, and the gold medal in the 10th East China University of Science and Technology "Endeavour Cup" College Students' Entrepreneurship Plan Competition. He has contributed to several large open source projects, holds two software copyrights, has published a paper at an IEEE international conference, and has a utility model patent pending. He previously served as deputy director of the Organization Department of the Youth League Committee of the School of Information, and has received an outstanding student scholarship and an outstanding student title.
Du Xudong is a junior majoring in information engineering at East China University of Science and Technology and is proficient in C, C++, Python, Java, Matlab, Verilog, and other programming languages. He has won the second prize in the Shanghai University Students' Computer Application Ability Design Competition, has contributed to several large open source projects, holds two software copyrights, and has a utility model patent pending. He previously served as deputy director of the community management department of the School of Information, and has received the university's outstanding student scholarship.
Here we would like to thank everyone who took part in the research and development of ColugoMum: Huang Xiaoyue, Wang Xin, Zhao Yian, Zhou Tianyi, Shen Jiachuan, and others. We also thank the Datawhale organization, the PaddlePaddle community, and the Qizhi community for their help. We welcome more developers to join the open source co-building of the retail product recognition datasets, product recognition, and Smart Container, to advance an open AI ecosystem together and help China's physical retail move toward digital and intelligent transformation.
In the future, the ColugoMum team will keep pushing product and technology boundaries and, relying on the open source community, release more and better open source projects that can genuinely empower physical retail, helping China's physical retail transform toward intelligence and digitalization and fulfilling ColugoMum's mission of "Authors efficiency, Enabling retail".
References:
1. https://github.com/PaddlePaddle/PaddleClas
2. S. Wei et al., "PP-ShiTu: A Practical Lightweight Image Recognition System," arXiv preprint arXiv:2111.00775, 2021.
3. J. Peng, C. Xiao, and Y. Li, "RP2K: A large-scale retail product dataset for fine-grained image classification," arXiv preprint arXiv:2006.12634, 2020.
4. https://github.com/PaddlePaddle/Serving
