Flume learning III
2022-06-30 09:52:00 【Keep-upup】
Flume transactions

Flume's streaming pipeline uses two kinds of transactions: the Put transaction and the Take transaction.
Put transaction: when a source communicates with a channel, doPut first writes the data to a temporary buffer, putList. doCommit then submits the buffered data to the channel. If the channel is full or another exception occurs, doRollback is triggered and the data is rolled back to the source to await resubmission.
Take transaction: when a sink communicates with a channel, the data is first placed in takeList, and the sink pulls the data from takeList. If an exception occurs during transmission, a rollback is triggered and the data in takeList is returned to the channel's memory queue.
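The two transactions can be illustrated with a small simulation. This is only a sketch in Python, not Flume's actual Java implementation: the class names (MemoryChannel, PutTransaction, TakeTransaction, ChannelFullError) and the snake_case method names are hypothetical stand-ins for the doPut / doCommit / doRollback steps described above.

```python
from collections import deque


class ChannelFullError(Exception):
    """Raised when a commit would exceed the channel's capacity."""


class MemoryChannel:
    """Toy stand-in for a memory channel: a bounded FIFO queue."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.queue = deque()


class PutTransaction:
    """Source side: stage events in put_list, then commit them to the channel."""

    def __init__(self, channel):
        self.channel = channel
        self.put_list = []

    def do_put(self, event):
        # Events go to the temporary buffer first, not to the channel.
        self.put_list.append(event)

    def do_commit(self):
        # Commit is all-or-nothing: refuse if the batch does not fit.
        if len(self.channel.queue) + len(self.put_list) > self.channel.capacity:
            raise ChannelFullError("not enough room in the channel")
        self.channel.queue.extend(self.put_list)
        self.put_list.clear()

    def do_rollback(self):
        # Discard the staged batch; the source is expected to resubmit it.
        self.put_list.clear()


class TakeTransaction:
    """Sink side: move events into take_list until delivery is confirmed."""

    def __init__(self, channel):
        self.channel = channel
        self.take_list = []

    def do_take(self):
        event = self.channel.queue.popleft()
        self.take_list.append(event)
        return event

    def do_commit(self):
        # Delivery succeeded, so the taken events can be forgotten.
        self.take_list.clear()

    def do_rollback(self):
        # Return the events to the head of the queue in their original order.
        self.channel.queue.extendleft(reversed(self.take_list))
        self.take_list.clear()
```

In this sketch a failed Put leaves the channel untouched, and a failed Take puts the events back where they were, which is why a sink failure may cause duplicates downstream but not data loss inside the channel.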
Flume Agent internals

Important components:
ChannelSelector: decides which channel an event is sent to. There are two types: Replicating and Multiplexing.
The replicating selector sends the same event to every channel. The multiplexing selector sends different events to different channels according to the configured rules.
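The difference between the two selectors can be sketched as two small functions. This is a simplified model, not Flume's API: events are represented as plain dicts with a "headers" entry, and the function names, channel names, and the "type" header are assumptions chosen for illustration. It assumes multiplexing routes on the value of one event header, with a default for unmatched values.

```python
def replicating_select(event, channels):
    """Replicating: every channel receives the same event."""
    return list(channels)


def multiplexing_select(event, header, mapping, default):
    """Multiplexing: choose channels by the value of one event header.

    mapping maps a header value to a list of channel names;
    default is used when the header is missing or has no mapping.
    """
    value = event["headers"].get(header)
    return mapping.get(value, default)


if __name__ == "__main__":
    event = {"headers": {"type": "error"}, "body": b"disk failure"}
    print(replicating_select(event, ["c1", "c2"]))
    print(multiplexing_select(event, "type", {"error": ["c1"], "info": ["c2"]}, ["c2"]))
```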
SinkProcessor: there are three types, DefaultSinkProcessor, LoadBalancingSinkProcessor and FailoverSinkProcessor. DefaultSinkProcessor works with a single sink, while LoadBalancingSinkProcessor and FailoverSinkProcessor work with a sink group. LoadBalancingSinkProcessor provides load balancing, and FailoverSinkProcessor provides failover.
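The two sink-group behaviours can be sketched as follows. This is a deliberately reduced model rather than Flume's implementation: sinks are plain callables, the names failover_process and RoundRobinBalancer are hypothetical, and details of a real deployment such as penalising failed sinks or retrying on another sink during load balancing are left out.

```python
import itertools


def failover_process(sinks, priorities, event):
    """Failover: try sinks from highest to lowest priority.

    sinks maps a sink name to a callable that delivers one event;
    returns the name of the sink that accepted the event.
    """
    for name in sorted(sinks, key=lambda n: priorities[n], reverse=True):
        try:
            sinks[name](event)
            return name
        except Exception:
            # This sink is down; fall through to the next priority.
            continue
    raise RuntimeError("all sinks in the group failed")


class RoundRobinBalancer:
    """Load balancing: hand successive events to sinks in rotation."""

    def __init__(self, sinks):
        self.sinks = sinks
        self._names = itertools.cycle(sorted(sinks))

    def process(self, event):
        name = next(self._names)
        self.sinks[name](event)
        return name
```

With failover, only the highest-priority healthy sink is active at any time; with load balancing, all sinks in the group share the traffic.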
Flume topology
Simple chaining:

This pattern connects multiple Flume agents in sequence, from the initial source to the final sink, which delivers to the destination storage system. Chaining too many Flume agents is not recommended: it reduces the transmission rate, and if any one agent in the chain goes down, the whole transmission system is affected.
Replication and multiplexing:
Flume supports sending an event flow to one or more destinations. In this mode, the same data can be replicated to multiple channels, or different data can be distributed to different channels, and each sink can deliver to a different destination.
Load balancing and failover:
Flume supports logically grouping multiple sinks into a sink group. Paired with different SinkProcessors, a sink group can provide load balancing and failover.
Aggregation:

This is the most common pattern and a very practical one. Everyday web applications are usually distributed across hundreds of servers, and large ones across thousands or even tens of thousands. The logs they generate are troublesome to process. This Flume arrangement handles the problem well: each server runs a Flume agent that collects logs and forwards them to a central collector Flume, which then uploads them to HDFS, Hive, HBase and so on for log analysis.