当前位置:网站首页>30: Kakfa simulates JSON data generation and transmission
30: Kakfa simulates JSON data generation and transmission
2022-06-13 01:36:00 【Python's path to becoming a God】
In the calculation PV and UV A key step in the process of is to clean the log data . In fact, in other businesses , For example, in order data statistics , We also need to filter out some “ Dirty data ”.
So-called “ Dirty data ” It is inconsistent with the standard data structure defined by us , Or unwanted data . Because in data cleaning ETL Data deserialization, parsing and Java Class mapping , In this mapping process “ Dirty data ” Will cause deserialization failure , This causes the task to fail and restart . In some big assignments , Restarting will cause the task to become unstable , And too much “ Dirty data ” This will cause our task to report errors frequently , Finally, it failed completely .
framework

Mentioned the whole PV and UV Data processing architecture in computing process , It uses Flume Collect business data and send it to Kafka in , So calculating PV、UV Need to consume before Kafka Data in , And will “ Dirty data ” To filter out .
In real business , We consume primitive Kafka After processing the log data , The detailed data will also be written to similar Elasticsearch Query in such an engine ; The summary data will also be written into HBase perhaps Redis And other databases for front-end query and display . meanwhile , And write the data again Kafka For other businesses .
边栏推荐
- MySQL download and installation
- Mysql database listening -canal
- 工作与生活
- MySQL performance optimization
- Leetcode-19- delete the penultimate node of the linked list (medium)
- Stack stack LIFO
- Lecture on Compilation Principles
- ES6 deconstruction assignment
- My crawler learning notes
- Machine learning basic SVM (support vector machine)
猜你喜欢

C language implementation of the classic eight queens problem

Happy string

csdn涨薪技术之Jmeter接口测试数据库断言的实现与设计

DFS and BFS notes (II): depth first search (implemented in C language)

Run Presto under docker to access redis and Bi presentation

Argparse command line passes list type parameter

Leetcode question brushing 04 string

Leetcode 01 array

MySQL connection query

pycharm add configutions
随机推荐
Simple operation of MySQL database
Leetcode question brushing 04 string
[WSL2]限制WSL2可访问的硬件资源(CPU/内存)
Rasa dialogue robot helpdesk (III)
Exercise 5.14 input n strings, arrange them in alphabetical order and output them.
Database query user mailbox
Docker install MySQL
兴趣相似的受众群体
Anims of phaser3
ES6解构赋值
Polymorphism and virtual function
Getting started with phaser 3
Stack stack LIFO
Stone from another mountain: Web3 investment territory of a16z
4K sea bottom and water surface fabrication method and ocean bump displacement texture Download
Answer to matrix theory of Nanjing University of Aeronautics and Astronautics
Wildcard usage of go standard library FMT
About the proposed signature file migration to industry standard format pkcs12
他山之石:a16z 的 Web3 投资版图
Leetcode-19- delete the penultimate node of the linked list (medium)