TensorFlow Serving: a high-performance serving system for machine learning models
2022-07-28 22:16:00 【AI Zeng Xiaojian】
Architecture
TensorFlow Serving is a flexible, high-performance serving system for machine learning models, designed for production environments. It makes it easy to deploy new algorithms and experiments while keeping the same server architecture and APIs. TensorFlow Serving provides out-of-the-box integration with TensorFlow models, but it can be easily extended to serve other types of models.
Key concepts
To understand the architecture of TensorFlow Serving, you need to understand the following key concepts:
Servables
Servables are the central abstraction in TensorFlow Serving. They are the underlying objects that clients use to perform computation (for example, a lookup or inference).
The size and granularity of a servable is flexible. A single servable might include anything from a single shard of a lookup table, to a single model, to a tuple of inference models. Servables can be of any type and interface, which allows for flexibility and future improvements such as:
Streaming results
Experimental APIs
Asynchronous modes of operation
Servables do not manage their own life cycle.
Typical servables include the following:
A TensorFlow SavedModelBundle (tensorflow::Session)
A lookup table for embedding or vocabulary lookups
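To make the idea concrete, here is a minimal Python sketch of a servable as an opaque object identified by a name and a version. The classes `ServableId` and `Servable`, the `lookup` helper, and the table contents are hypothetical illustrations, not part of the TensorFlow Serving API.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class ServableId:
    """Hypothetical identifier: a servable name plus a version number."""
    name: str
    version: int


@dataclass
class Servable:
    """Wraps any underlying object. It has no load/unload logic of its own,
    mirroring the point that servables do not manage their own life cycle."""
    id: ServableId
    handle: Any  # e.g. a lookup table, a model, or a tuple of models


# A vocabulary lookup table exposed as a servable (illustrative data).
vocab = Servable(ServableId("vocab_lookup", 1), {"cat": 0, "dog": 1})


def lookup(servable: Servable, token: str) -> int:
    """Client computation against the underlying object; -1 for unknown tokens."""
    return servable.handle.get(token, -1)
```

Because `handle` is typed as `Any`, the same wrapper could hold a model object instead of a dictionary, which is the flexibility the text describes.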
Servable versions
TensorFlow Serving can handle one or more versions of a servable over the lifetime of a single server instance. This allows new algorithm configurations, weights, and other data to be loaded over time. Versions also allow more than one version of a servable to be loaded concurrently, which supports gradual rollout and experimentation. At serving time, clients may request either the latest version of a particular model or a specific version ID.
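As an illustration of requesting the latest version versus a specific one, the sketch below builds request URLs following the path layout of the TensorFlow Serving REST API, where the `/versions/<id>` segment is optional and the latest version is used when it is omitted. The helper name `predict_url`, the host `localhost:8501`, and the model name `my_model` are placeholders chosen for this example.

```python
from typing import Optional


def predict_url(host: str, model_name: str, version: Optional[int] = None) -> str:
    """Build a REST predict URL; leave `version` unset to target the latest version."""
    path = f"/v1/models/{model_name}"
    if version is not None:
        # Pin the request to one specific version ID.
        path += f"/versions/{version}"
    return f"http://{host}{path}:predict"


latest = predict_url("localhost:8501", "my_model")
pinned = predict_url("localhost:8501", "my_model", version=3)
```

A client would send a POST request with a JSON body to either URL; the sketch stops at URL construction so that it runs without a server.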
Servable streams
A servable stream is the sequence of versions of a servable, sorted by increasing version number.
Models
TensorFlow Serving represents a model as one or more servables. A machine-learned model may include one or more algorithms (including learned weights) and lookup or embedding tables.
You can represent a composite model as either of the following:
Multiple independent servables
A single composite servable
A servable may also correspond to a fraction of a model. For example, a large lookup table could be sharded across many TensorFlow Serving instances.
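The text does not say how such a table would be split. One possible approach, shown here purely as an assumption, is hash partitioning: each key is routed to a shard by a stable hash, and each shard could then be served as its own servable. The shard count and all function names below are hypothetical.

```python
import zlib
from typing import Dict, List, Optional

NUM_SHARDS = 3  # hypothetical number of serving instances


def shard_for(key: str, num_shards: int = NUM_SHARDS) -> int:
    """Map a key to a shard index with CRC32, which is stable across processes
    (unlike Python's built-in hash() for strings)."""
    return zlib.crc32(key.encode("utf-8")) % num_shards


def build_shards(table: Dict[str, int], num_shards: int = NUM_SHARDS) -> List[Dict[str, int]]:
    """Split one large table into per-shard tables."""
    shards: List[Dict[str, int]] = [{} for _ in range(num_shards)]
    for key, value in table.items():
        shards[shard_for(key, num_shards)][key] = value
    return shards


def sharded_lookup(shards: List[Dict[str, int]], key: str) -> Optional[int]:
    """Route the lookup to the single shard that can hold the key."""
    return shards[shard_for(key, len(shards))].get(key)


table = {"cat": 0, "dog": 1, "fish": 2, "bird": 3}
shards = build_shards(table)
```

Every key lands in exactly one shard, so a lookup touches a single instance rather than all of them.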
Loaders
Loaders manage a servable's life cycle. The Loader API enables common infrastructure that is independent of the specific learning algorithms, data, or product use cases involved. Specifically, Loaders standardize the APIs for loading and unloading a servable.
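The division of responsibility can be sketched in Python as follows. This is a hypothetical analogue of the idea, not the actual TensorFlow Serving Loader API (which is C++): the abstract `Loader` class and the `DictTableLoader` example are invented for illustration.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, Optional


class Loader(ABC):
    """Hypothetical loader interface: owns the life cycle of one servable."""

    @abstractmethod
    def load(self) -> None:
        """Bring the servable into memory."""

    @abstractmethod
    def unload(self) -> None:
        """Release the servable's resources."""

    @abstractmethod
    def servable(self) -> Any:
        """Return the loaded underlying object."""


class DictTableLoader(Loader):
    """Example loader for an in-memory lookup table."""

    def __init__(self, entries: Dict[str, int]) -> None:
        self._entries = entries
        self._table: Optional[Dict[str, int]] = None

    def load(self) -> None:
        # Copy so the served table is independent of the source data.
        self._table = dict(self._entries)

    def unload(self) -> None:
        self._table = None

    def servable(self) -> Any:
        if self._table is None:
            raise RuntimeError("servable is not loaded")
        return self._table
```

Code that manages servables only needs the three methods on `Loader`, so the same management logic would work whether the servable is a lookup table or a model.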