当前位置:网站首页>Flume learning III
Flume learning III
2022-06-30 09:52:00 【Keep-upup】
Flume Business
flume Streaming processing has two transactions Put Business and Take Business
Put The flow of a transaction is ,source And channel Communicating yes ,doput Write data to temporary buffer putlist, It's triggering doCommit Submitted to the channel, if channel Full or otherwise abnormal , It will trigger doRollback Roll back to source Waiting for resubmission .
Take Business :sink And channel communicate , Put the data into Takelist Inside ,sink Again from Talelist Pull data from it , If an exception occurs during data transmission , The trigger rollback take takelist Data rollback to channel Memory queue .
Flume Agent internals
Important components :
ChannelSelector The purpose of this is to select Event Which one will be sent to Channel. There are two types of them , Namely Replicating( Copy ) and Multiplexing( Multiplexing ).
ReplicatingSelector Will be the same Event To all Channel,Multiplexing According to the phase Principle of due , Will be different Event To different Channel.
SinkProcessor common Yes 3、 ... and Kind of class type , branch other yes DefaultSinkProcessor 、 LoadBalancingSinkProcessor and FailoverSinkProcessor DefaultSinkProcessor Yes Should be Of yes single individual Of Sink , LoadBalancingSinkProcessor and FailoverSinkProcessor The corresponding is Sink Group,LoadBalancingSinkProcessor Negative... Can be achieved Load balancing function ,FailoverSinkProcessor Can realize the function of fail over .
Flume topology
Simple series connection :
This pattern is to put more than one flume The sequence is connected , From the initial source From the beginning to the end sink The transfer of Destination storage system . This mode is not recommended to bridge too many flume Number ,flume Too much will not only affect the transmission rate , And once a node is in transit flume Downtime , It will affect the whole transmission system .
Replication and multiplexing :
Flume Support the flow of events to one or more destinations . This mode can copy the same data to multiple channel in , Or distribute different data to different channel in ,sink You can choose to send to different destinations The earth .
Load balancing and fail over :
Flume Support the use of multiple sink Logically, there is a sink Group ,sink Group with different SinkProcessor Can achieve load balancing and error recovery function .
polymerization :
This pattern is our most common , It's also very practical , daily web Applications are usually distributed on hundreds of servers , The great Even thousands of 、 Tens of thousands of servers . Generated log , It's also very troublesome to deal with . use flume This combination of It can solve this problem very well , One for each server flume Collect logs , Transfer to a centralized log collector flume, And so on flume Upload to hdfs、hive、hbase etc. , Log analysis .
边栏推荐
- Notes on masking and padding in tensorflow keras
- OCX child thread cannot trigger event event (forward)
- Create thread pool demo
- MySQL index and data storage structure foundation
- Idea setting automatic package Guide
- Object detection yolov5 open source project debugging
- JVM garbage collector G1 & ZGC details
- Financial private cloud infrastructure scheme evaluation (Architecture and storage)
- P. Summary of NP, NPC, NP hard and other issues
- 小程序开发踩坑之旅
猜你喜欢
近期学习遇到的比较问题
Xlnet (generalized autorefressive trainingfor language understanding) paper notes
NER – Named Entity Recognition Summary
Deberta (decoding enhanced Bert with distinguished attention)
How to build a private cloud and create a hybrid cloud ecosystem?
MySQL internal component structure
Follow the wechat oauth2.0 access scheme
抽象类和接口
【新书推荐】Cleaning Data for Effective Data Science
oracle跨数据库复制数据表-dblink
随机推荐
Shenhe thermomagnetic: Super fusion dual active cluster solution for MES system
Solution to pychart's failure in importing torch package
Mysq database remote connection error, remote connection is not allowed
工作小记: sendto失败 errno 22
Differences and relationships among hyper convergence, software defined storage (SDS), distributed storage and server San
Forrester senior analyst: five important trends in the development of the hyper convergence market
目标检测yolov5开源项目调试
11. customize hooks
Cronexpression expression explanation and cases
Framework program of browser self-service terminal based on IE kernel
thrift简单使用
直播带货源码开发中,如何降低直播中的延迟?
Self service terminal development process
11.自定义hooks
Initialize static resource demo
Dart development skills
Tclistener server and tcpclient client use -- socket listening server and socketclient use
Thrift easy to use
Why won't gold depreciate???
银河麒麟server-V10配置镜像源