当前位置:网站首页>Tensorflow serving high performance machine learning model service system
Tensorflow serving high performance machine learning model service system
2022-07-28 22:16:00 【AI Zeng Xiaojian】
framework
TensorFlow Serving It's a flexible one 、 High performance Machine learning model service system , Designed for the production environment . TensorFlow Serving New algorithms and experiments can be easily deployed , Keep the same server architecture and API. TensorFlow Serving Provide with TensorFlow Out of the box integration of models , But it can be easily extended to serve other types of models .
Key concepts
To understand TensorFlow Serving The architecture of , You need to understand the following key concepts :
Serviceable Servables
Servables yes TensorFlow Serving The core abstraction in . Servables yes The client is used to perform calculations ( for example , Find or reason ) Of Underlying object .
Servable The size and granularity of are flexible . Single Servable May include from Lookup table Single slice to Single model Until then Inference model tuple Any content of . Servables It can be Any type and interface , To achieve flexibility and future improvements , for example :
Streaming results
experimental API
Asynchronous operation mode
Servables Don't manage your life cycle .
Typical services include the following :
One TensorFlow SavedModelBundle (tensorflow::Session)
Lookup table for embedding or vocabulary lookup
Serviceable version
TensorFlow Serving One or more versions of can be processed during the life cycle of a single server instance servable. This allows new algorithm configurations to be loaded over time 、 Weights and other data . Version can load multiple versions at the same time servable, Support the gradual introduction and testing . When serving , The client can request the latest version of a specific model or a specific version ID.
Serviceable flow
Serviceable flow Is a serviceable version sequence , Sort by version number .
model
TensorFlow Serving Represent the model as one or more serviceable objects . Machine learning models may include one or more algorithms ( Including the weight of learning ) And finding or embedding tables .
You can express the composite model as any of the following :
Multiple independent serviceable objects
A single composite can serve (single composite servable)
A serviceable object may also correspond to a part of the model . for example , A large lookup table can span multiple TensorFlow Serving The examples are divided .
Loaders
Loader management servable Life cycle of . Loader API Support independent of the specific learning algorithm involved 、 Common infrastructure for data or product use cases . say concretely ,Loaders Standardized for loading and unloading servable Of API.
边栏推荐
猜你喜欢
随机推荐
KubeEdge发布云原生边缘计算威胁模型及安全防护技术白皮书
记录Flutter解决A RenderFlex overflowed by 7.3 pixels on the bottom溢出问题
90. 子集 II
Future trend of defi in bear market
熊市下 DeFi 的未来趋势
What is the difference between inline elements and block level elements? Semantic function
[LiteratureReview]Object Detection and Mapping with Bounding Box Constraints
DHCP and PPPoE protocols and packet capture analysis
04.toRef 默认值
Chapter 7: drawing rotating cubes
Lin Xiaobin, head of Tencent cloud database, borrowed 100 million yuan to speculate in stocks? Insider: the amount is not true
Intranet penetration learning (III) horizontal movement of domain - planning tasks
静态路由和缺省路由实验
JS DOM编程之平平无奇小练习
The applet listens for the target node to appear on the screen
The difference between get and post
Introduction to C language [detailed]
How to establish a decentralized community in Web3
[NLP] generate word cloud
Use pl/sql

![[LiteratureReview]Object Detection and Mapping with Bounding Box Constraints](/img/37/7cb5fa3a9078a5f5947485147c819d.png)







