当前位置:网站首页>On fedlearner, the latest open source federated machine learning platform of byte
On fedlearner, the latest open source federated machine learning platform of byte
2020-11-10 07:37:00 【osc_odp8kgup】
Recently, federal machine learning is becoming more and more popular , Byte also officially publicized the open source federal machine learning platform Fedlearner. This time the headlines are open source Fedlearner With Huawei, which I have analyzed before 、 What's the difference between micro crowd's federated machine learning platform ? Mainly reflected in the following aspects :
-
Commercialization :Fedlearner There's a lot of js、Html modular , It is also the first time that we can intuitively see what the federated machine learning platform looks like , If you make a product, you need to grow into something .
-
Business diversification : Before Huawei 、 Weizhong pays more attention to the implementation of federal machine learning in risk control business . The headlines start to emphasize that federal learning is recommending 、 Advertising and other business landing , And it gives very clear data , In an education business sector, the effect of advertising is increased 209%
-
Exportability : If the previous federal machine learning platform introduced more from the theoretical level , This time byte of Fedlearner It emphasizes the exportability , For example, in order to keep the environment consistency of both sides of Federated modeling , adopt K8S The deployment mode of rapid pull up and management cluster . This is for ToB Technical preparation for export services
Let's talk about Fedlearner Some work in these three areas .
Fedlearner Product work

Take recommendation advertising as an example , The advertiser and platform side of Federated machine learning platform should manage a set of model display service and model training service respectively .

Two sets of protocols are needed to guarantee the federated modeling of customers , One is data consistency . For example, in the context of vertical federal learning , The user clicks on an ad on the page , The platform and the advertiser will capture a part of the log respectively . How to ensure the consistency of the logs captured by these two parts in real time , And stitched together into training samples , Need a set of real-time data sample splicing protocol .
Another protocol is multi-party data security protocol . such as AB Two business parties ,A Yes 4 Billion users ,B Yes 3 Billion users , How to find... In some way A and B Cross users of , And don't let A and B Guess each other's data , Need to have a set of multi-party data security protocol .
Based on the above two sets of agreements , In the process of joint modeling by both parties , Use GRPC signal communication , utilize TensorFlow Do the gradient exchange of both sides for joint modeling .

Business diversity
The biggest business scenario for federated machine learning is recommendation advertising , I predicted this in an article a year ago . Sure enough, the headline highlights the application of recommended scenarios . He mentioned that recommendation services are more suitable for neural network algorithms , Risk control business is suitable for tree algorithm . The author also agrees with this statement , Because risk control needs high interpretability , Tree algorithm naturally meets this requirement . The recommendation business does not require high interpretability of the model , The complexity of neural network algorithm can fully guarantee the accuracy of the recommended sorting algorithm .
Fedlearner The person in charge of the business gave a set of numbers to prove the effect of Federated machine learning in the recommendation business .

This array is still very convincing . In fact, for new technologies , Most of the time, the barriers we face are not technical problems , It's about proving business value , Need the first crab eater , In order to promote the landing of new technologies in the industry . Federal machine learning has a bright future in the recommendation advertising business .
Exportability
Fedlearner It adopts a cloud native deployment scheme . Data stored in HDFS, use MySQL Storage system data . adopt Kubernetes Manage and pull up the task . Every Fedlearner The training tasks of the two sides need to pull up at the same time K8S Mission , adopt Master Unified management of nodes ,Worker Build communication .
This scheme fully considers the data warehouse compatibility of users who are currently doing recommendation business , Because most of the customers' warehouse system is still Hadoop ecology , The data is stored in HDFS. Simultaneous use K8S At the same time, it ensures the consistency of the computing engine environment of both sides of the joint modeling .
summary
With more and more manufacturers coming in , Federal machine learning is bound to be an inflection point in the product competition of machine learning platform .
Reference resources :
[1]https://www.jiqizhixin.com/articles/2020-11-03-9
[2]https://github.com/bytedance/fedlearner
版权声明
本文为[osc_odp8kgup]所创,转载请带上原文链接,感谢
边栏推荐
- 编码风格:Mvc模式下SSM环境,代码分层管理
- Overview of the most complete anomaly detection algorithm in history
- CUDA_ Register and local memory
- Bifrost 位点管理 之 异构中间件实现难点(1)
- CUDA_ Memory model
- SQL case conversion, remove the space before and after
- Mongodb index management of distributed document storage database
- Prometheus安装配置
- C++异常实现机制
- 推动中国制造升级,汽车装配车间生产流水线 3D 可视化
猜你喜欢

js解决浏览器打印自动分页的问题

飞鸽传书局域网找不到其他人的问题解决

How much is the cost of CRM system?

CUDA_ Shared memory, memory access mechanism, access optimization

If you need a million objects

OSChina 周二乱弹 —— 我养的绿植分别为土豆,生姜,蒜

Problems and solutions in configuring FTP server with FileZilla server

解决Coursera视频无法观看的三种方法(亲测有效)

Difficulties in heterogeneous middleware implementation of Bifrost site management (1)

selenium webdriver使用click一直失效问题的几种解决方法
随机推荐
CUDA_ Get the specified device
自己上手写性能测试工具(二)
Fear of reconstruction? I'm too late to tell you how to refactor. Now I'm here
C++ exception implementation mechanism
CUDA_常量内存
JMeter的简单使用
The kth smallest node in the print binary search tree of offer
Solution of MAC terminal iterm2 supporting RZ and sz
Prometheus安装配置
使用call、apply和bind解决js中烦人的this,事件绑定时的this和传参问题
CUDA_存储器模型
So what should investors do with the current market? Now a new investment outlet is coming!
How much is the cost of CRM system?
“wget: 无法解析主机地址”的解决方法
Explanation of Z-index attribute
[elixir! 0073] beam built-in memory database ETS
If you need a million objects
Simple use of JMeter
一个名为不安全的类Unsafe
Incomplete Polyfill of proxy