当前位置:网站首页>On fedlearner, the latest open source federated machine learning platform of byte
On fedlearner, the latest open source federated machine learning platform of byte
2020-11-10 07:37:00 【osc_odp8kgup】
Recently, federal machine learning is becoming more and more popular , Byte also officially publicized the open source federal machine learning platform Fedlearner. This time the headlines are open source Fedlearner With Huawei, which I have analyzed before 、 What's the difference between micro crowd's federated machine learning platform ? Mainly reflected in the following aspects :
-
Commercialization :Fedlearner There's a lot of js、Html modular , It is also the first time that we can intuitively see what the federated machine learning platform looks like , If you make a product, you need to grow into something .
-
Business diversification : Before Huawei 、 Weizhong pays more attention to the implementation of federal machine learning in risk control business . The headlines start to emphasize that federal learning is recommending 、 Advertising and other business landing , And it gives very clear data , In an education business sector, the effect of advertising is increased 209%
-
Exportability : If the previous federal machine learning platform introduced more from the theoretical level , This time byte of Fedlearner It emphasizes the exportability , For example, in order to keep the environment consistency of both sides of Federated modeling , adopt K8S The deployment mode of rapid pull up and management cluster . This is for ToB Technical preparation for export services
Let's talk about Fedlearner Some work in these three areas .
Fedlearner Product work
Take recommendation advertising as an example , The advertiser and platform side of Federated machine learning platform should manage a set of model display service and model training service respectively .
Two sets of protocols are needed to guarantee the federated modeling of customers , One is data consistency . For example, in the context of vertical federal learning , The user clicks on an ad on the page , The platform and the advertiser will capture a part of the log respectively . How to ensure the consistency of the logs captured by these two parts in real time , And stitched together into training samples , Need a set of real-time data sample splicing protocol .
Another protocol is multi-party data security protocol . such as AB Two business parties ,A Yes 4 Billion users ,B Yes 3 Billion users , How to find... In some way A and B Cross users of , And don't let A and B Guess each other's data , Need to have a set of multi-party data security protocol .
Based on the above two sets of agreements , In the process of joint modeling by both parties , Use GRPC signal communication , utilize TensorFlow Do the gradient exchange of both sides for joint modeling .
Business diversity
The biggest business scenario for federated machine learning is recommendation advertising , I predicted this in an article a year ago . Sure enough, the headline highlights the application of recommended scenarios . He mentioned that recommendation services are more suitable for neural network algorithms , Risk control business is suitable for tree algorithm . The author also agrees with this statement , Because risk control needs high interpretability , Tree algorithm naturally meets this requirement . The recommendation business does not require high interpretability of the model , The complexity of neural network algorithm can fully guarantee the accuracy of the recommended sorting algorithm .
Fedlearner The person in charge of the business gave a set of numbers to prove the effect of Federated machine learning in the recommendation business .
This array is still very convincing . In fact, for new technologies , Most of the time, the barriers we face are not technical problems , It's about proving business value , Need the first crab eater , In order to promote the landing of new technologies in the industry . Federal machine learning has a bright future in the recommendation advertising business .
Exportability
Fedlearner It adopts a cloud native deployment scheme . Data stored in HDFS, use MySQL Storage system data . adopt Kubernetes Manage and pull up the task . Every Fedlearner The training tasks of the two sides need to pull up at the same time K8S Mission , adopt Master Unified management of nodes ,Worker Build communication .
This scheme fully considers the data warehouse compatibility of users who are currently doing recommendation business , Because most of the customers' warehouse system is still Hadoop ecology , The data is stored in HDFS. Simultaneous use K8S At the same time, it ensures the consistency of the computing engine environment of both sides of the joint modeling .
summary
With more and more manufacturers coming in , Federal machine learning is bound to be an inflection point in the product competition of machine learning platform .
Reference resources :
[1]https://www.jiqizhixin.com/articles/2020-11-03-9
[2]https://github.com/bytedance/fedlearner
版权声明
本文为[osc_odp8kgup]所创,转载请带上原文链接,感谢
边栏推荐
- Fear of reconstruction? I'm too late to tell you how to refactor. Now I'm here
- lodash.js Source code flatten
- Coding style: SSM environment in MVC mode, code hierarchical management
- 极验无感验证破解
- 走进C# abstract,了解抽象类与接口的异同
- Python prompt attributeerror or depreciation warning: This module was degraded solution
- Baishan cloud technology is selected as the top 100 Internet enterprises in China in 2020
- Guest interview: Wang Jian
- After seven years of pursuing, nearly one billion US dollars of bitcoin was eventually confiscated and confiscated by the US government
- proxy 的不完整polyfill
猜你喜欢
Top 5 Chinese cloud manufacturers in 2018: Alibaba cloud, Tencent cloud, AWS, telecom, Unicom
Use call, apply and bind to solve the annoying this in JS, this in event binding and parameter passing
Raspberry pie drum set WiFi
Visit 2020 PG Technology Conference
Mongodb index management of distributed document storage database
“wget: 无法解析主机地址”的解决方法
Unity使用transform.Rotate进行三维旋转角度出现偏差
About CentOS start error: the solution of failed to start crash recovery kernel arming
实验2
Python cookbook 3rd note (2.1): using multiple qualifiers to split strings
随机推荐
Prometheus安装配置
Simple use of JMeter
“wget: 无法解析主机地址”的解决方法
Hengxun Technology: the way to deal with server downtime
Fire knowledge online answer activity small program
jmeter接口测试--带有token的解决方法
表单验证,为避免全局污染,少定义全局变量写法
Coding style: SSM environment in MVC mode, code hierarchical management
Error running app: default activity not found solution
CUDA常用概念及注意点
Assign the corresponding key and value in the map to the object
竞争性编程的思考:那些神话和令人震惊的事实[图]
Centos7 operating system security hardening series (2)
What is the SRM system? SRM supplier management system functions
Experiment 2
树莓派鼓捣记 - 设置 wifi
If you need a million objects
jt-day10
Use call, apply and bind to solve the annoying this in JS, this in event binding and parameter passing
JMeter的简单使用