当前位置:网站首页>We have built an intelligent retail settlement platform
We have built an intelligent retail settlement platform
2022-07-03 18:57:00 【Datawhale】
Datawhale Share
author : Yan Xin ,Datawhale member
At present, in the actual operation of the retail industry , There will be huge labor costs , For example, shopping guide 、 Procter & Gamble 、 Settlement, etc , And one of , In particular, it needs to spend a lot of manpower and time costs in the process of identifying goods and settling their prices , And in the process , Customers also need to wait in line . In this way, the labor cost of the retail industry is large 、 Very inefficient , Secondly, it also reduces the shopping experience of customers .

With the development of computer vision technology , And unmanned 、 The concept of automated supermarket operation , The demand of using image recognition technology and target detection technology to realize automatic product recognition and automatic settlement is imminent , Automatic closing system (Automatic checkout, ACO). The automatic checkout system based on computer vision can effectively reduce the operating costs of the retail industry , Improve customer checkout efficiency , So as to further improve users' sense of experience and happiness in the shopping process .
AI The core of settlement is image recognition . The accuracy of image recognition determines AI Feasibility of settlement . at present ,AI Settlement faces the following pain points :
1. Similar commodity packaging : Similar commodities have different tastes and prices , The outer packaging of different categories of goods is similar , Both have high requirements for image recognition accuracy ;
2. There are many interference factors : Similar commodities are prone to deformation due to angle problems during identification 、 Fold 、 Interference such as occlusion , Impact on identification results ;
3. Category update is very fast : Retail items are usually updated at an hourly rate , If the first mock exam is added to a new product, the training model is needed to rely on a single model. , Model training cost and time cost are extremely high ;
4. The system requires high performance : It is necessary to solve the two tasks of detection and identification at the same time , When choosing the model and optimizing, we should weigh the accuracy and speed .
Sack slide hemp ColugoMum Committed to Small and medium-sized offline retail experience stores Provide Vision based Smart retail settlement scheme , And in Github Community and Qizhi community open source synchronously .

Based on the above pain points ,ColugoMum The team uses oars PaddleClas[1] The team put forward PP-ShiTu[2] Image recognition system . be based on PP-ShiTu The realized product identification scheme is multi category products in the retail scene 、 Small sample 、 High similarity and frequent updates provide new ideas , It can not only accurately identify multiple categories of goods , It can also meet the ultimate pursuit of prediction efficiency . The most practical function is : When actually used online , Encounter new product categories that need to be identified , No need to retrain the model , Only the image features of this category need to be added to the retrieval library , Can realize the identification of new products !

PP-ShiTu It is a practical lightweight general image recognition system , Mainly detected by the main body 、 There are three modules: retrieval and feature vector learning . The system selects and adjusts from the backbone network 、 The choice of loss function 、 Data to enhance 、 Learning rate transformation strategy 、 Regularization parameter selection 、 Pre training model usage and model tailoring quantification 8 In terms of , Adopt a variety of strategies , Optimize the model of each module , And pass by 10w+ Category data for training , Finally get in CPU The last forecast time is only 0.2s Multi scene general image recognition system .
Simply speaking ,PP-ShiTu There are three steps to the use of :
1. Detect the model through the main body , Recognize the objects in the picture one by one ;
2. Feature extraction for each candidate region ;
3. The vector after feature extraction is retrieved in the retrieval library , Complete the match , Return recognition results .
Considering the extreme pursuit of accuracy and prediction speed in the actual retail scenario ,ColugoMum The team selected PicoDet Model as agent detection algorithm , Selected lightweight PPLCNet_x2_5_ssld The model is used for feature extraction , Finally, use the vector search module Faiss Medium HNSW32 As a retrieval algorithm , Achieve the ultimate balance between speed and accuracy .
Based on this ,ColugoMum The team is based on RP2K The dataset has achieved the highest **96.91%** The prediction accuracy of .
RP2K Data sets [3] Included 50 ten thousand + Pictures of retail shelves , Commodity category exceeds 2,000 Kind of , It is currently the data set with the largest number of product categories in the retail data set . Unlike data sets that generally focus on new products ,RP2K Included more than 50 10000 pictures of retail goods shelves , Commodity category exceeds 2000 Kind of , This data set is the number of product categories in the current retail data set TOP1, At the same time, all the pictures are collected manually in the real scene , For each product , Pinlan provides a very detailed annotation .
Besides ,ColugoMum Also collected and sorted out the industry SKU Level product image dataset , And look forward to working with developers , Open source can be influential in the industry 、 Data sets that meet the application requirements of actual scenarios .
https://github.com/ColugoMum/Datasetalso ,ColugoMum The team opened source based on RP2K High precision training model and prediction model of data set . Developers can fine tune the training model provided based on their own data , You can also use the provided prediction model to directly predict the experience . meanwhile ,ColugoMum Also opened based on RP2K List making activities , Developers are welcome to participate .
https://github.com/ColugoMum/ExprementsIn terms of deployment, the propeller service deployment framework is used Paddle Serving[4] Deployment , Meet user batch forecast 、 High data security 、 Delay low demand , stay CPU All you need to do is 0.2 Seconds to achieve the prediction effect , Truly achieve the ultimate balance between prediction speed and accuracy .

In order to facilitate developers to better understand PP-ShiTu、 Make better use of its advantages in the field of image recognition ,ColugoMum The team open source the smart retail product recognition tutorial based on image recognition , Developers can use on this basis PP-ShiTu Fast docking business .
https://github.com/ColugoMum/Goods_Recognition
Based on the above core technology , at present ColugoMum The team uses PP-ShiTu technology , Open source cloud edge integration 、 Product identification that meets the application requirements of the actual scenario Smart_container. It can accurately locate the goods purchased by customers , And intellectualize 、 Automated price settlement .
https://github.com/ColugoMum/Smart_container
When customers place their purchased goods in the designated area ,Smart_container Be able to accurately locate and identify each commodity , And it can return the complete shopping list and the total price of the actual goods that the customer should pay . When new products are added in the system , Just update the search library , No need to retrain the model .Smart_container Cover the hardware settlement desk 、 Applet management platform 、 Big data visualization platform , Realize multi terminal unification , Smart management .

Core development team
Yan Xin , A junior majoring in automation in East China University of science and Technology , The research direction is cooperative control and decision-making of multiple robots , The main point of interest is computer vision 、 Reinforcement learning 、 Reasoning deployment . Propeller developers, technologists 、Datawhale member 、 Huawei cloud sharing expert 、 The first host of the National Undergraduate Innovation and entrepreneurship project , Won the 13th “ Challenge Cup ” Shanghai bronze medal in the college students' Entrepreneurship Plan Competition 、 The 10th East China University of science and technology “ Endeavour Cup ” Gold medal in college students' Entrepreneurship Plan Competition 、 Have two software copyrights 、 An international conference paper .
Shen Chen , A junior majoring in Information Engineering at East China University of science and Technology , Have obtained CRAIC The second prize of China robot and artificial intelligence competition in Shanghai 、 The second prize of Shanghai University Students' computer application ability Design Competition , The 10th East China University of science and technology “ Endeavour Cup ” Gold medal in college students' Entrepreneurship Plan Competition , Participate in many large-scale excellent open source projects , Have two software copyrights , I was in IEEE An international conference published a paper , Another utility model patent is pending . He used to be the deputy director of the Organization Department of the Youth League Committee of the Information Institute , Won the excellent scholarship and the advanced title of excellent students .
Du Xudong , A junior majoring in Information Engineering at East China University of science and Technology , Have a good command of C/C++/Python/Jave/Matlab/verilog And many other programming languages , The second prize of Shanghai University Students' computer application ability Design Competition , Participate in many large-scale excellent open source projects , Have two software copyrights , A utility model patent is pending . He used to be the deputy director of the community management department of the school of information , Won the excellent scholarship of the University .
ad locum , We have participated in ColugoMum Thank you for your research and development : Huang Xiaoyue 、 Wang Xin 、 Zhao Yian 、 Zhou Tianyi 、 Shenjiachuan et al , for ColugoMum Helpful Datawhale organization 、 The paddle community and Qizhi community Thank you ! We welcome more developers to participate in the retail product identification data set 、 Product identification and Smart Container Open source co construction activities , To promote together AI Open source and open ecosystem construction , Promote China's physical retail to digitalization 、 Intelligent direction Transformation Development .
future ,ColugoMum The team will continue to break product and technology boundaries , Relying on the open source community , Open source produces more and better 、 Open source projects that can truly empower physical retail , We should really promote the real retail in China to be intelligent 、 Digital transformation , Realization ColugoMum“ Authors efficiency 、 Enabling retail ” The mission of .
reference :
1.https://github.com/PaddlePaddle/PaddleClas
2.S. Wei et al., "PP-ShiTu: A Practical Lightweight Image Recognition System," arXiv preprint arXiv:2111.00775, 2021.
3.J. Peng, C. Xiao, and Y. Li, "RP2K: A large-scale retail product dataset for fine-grained image classification," arXiv preprint arXiv:2006.12634, 2020.
4.https://github.com/PaddlePaddle/Serving

Open source sharing , give the thumbs-up Three even ↓
边栏推荐
- What does a really excellent CTO look like in my eyes
- Why can deeplab v3+ be a God? (the explanation of the paper includes super detailed notes + Chinese English comparison + pictures)
- [Yu Yue education] world reference materials of Microbiology in Shanghai Jiaotong University
- Max of PHP FPM_ Some misunderstandings of children
- Suffix derivation based on query object fields
- cipher
- High concurrency Architecture - distributed search engine (ES)
- Le changement est un thème éternel
- Flutter网络和数据存储框架搭建 -b1
- Dart JSON编码器和解码器剖析
猜你喜欢

【学术相关】顶级论文创新点怎么找?中国高校首次获CVPR最佳学生论文奖有感...

A green plug-in that allows you to stay focused, live and work hard

Record: MySQL changes the time zone

MySQL duplicate check

【水质预测】基于matlab模糊神经网络水质预测【含Matlab源码 1923期】

leetcode:11. 盛最多水的容器【雙指針 + 貪心 + 去除最短板】

leetcode:556. 下一个更大元素 III【模拟 + 尽可能少变更】

SSM integration - joint debugging of front and rear protocols (list function, add function, add function status processing, modify function, delete function)

Xception for deeplab v3+ (including super detailed code comments and original drawing of the paper)

Opencv learning notes (continuously updated)
随机推荐
东数西算拉动千亿产业,敢啃“硬骨头”的存储厂商才更有机会
Understanding of database architecture
leetcode:11. 盛最多水的容器【雙指針 + 貪心 + 去除最短板】
leetcode:556. 下一个更大元素 III【模拟 + 尽可能少变更】
Record: pymysql is used in pycharm to connect to the database
编程中常见的 Foo 是什么意思?
235. 二叉搜索树的最近公共祖先【lca模板 + 找路径相同】
Le changement est un thème éternel
4. Load balancing and dynamic static separation
leetcode:11. 盛最多水的容器【双指针 + 贪心 + 去除最短板】
论文阅读 GloDyNE Global Topology Preserving Dynamic Network Embedding
Suffix derivation based on query object fields
SSM integration - joint debugging of front and rear protocols (list function, add function, add function status processing, modify function, delete function)
Record: MySQL changes the time zone
Torch learning notes (7) -- take lenet as an example for dataload operation (detailed explanation + reserve knowledge supplement)
High concurrency Architecture - read write separation
【水质预测】基于matlab模糊神经网络水质预测【含Matlab源码 1923期】
[combinatorics] exponential generating function (example of exponential generating function solving multiple set arrangement)
SQL: special update operation
The online customer service system developed by PHP is fully open source without encryption, and supports wechat customer service docking