DataX JSON configuration explained
2022-06-30 10:36:00 【feifeidata】
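{
    "job": {
        "content": [
            {
                "reader": {
                    "name": "mysqlreader",
                    "parameter": {
                        "querySql": "", # Custom SQL; supports multi-table joins. When querySql is set, the table, column, and where settings are ignored.
                        "fetchSize": "", # Default 1024. Number of rows fetched per batch between the plugin and the database server. It determines how many network round trips DataX makes to the server and can noticeably improve extraction performance. Note: a value that is too large (>2048) may cause the DataX process to run out of memory (OOM).
                        "splitPk": "db_id", # Only integer columns can be used for splitting. If splitPk is set, the data is sharded on that column; if it is empty, no splitting is done and a single channel is used for extraction.
                        "column": [], # ["*"] selects all columns; column pruning and reordering are supported.
                        "connection": [
                            {
                                "jdbcUrl": ["jdbc:mysql://IP:3306/database?useUnicode=true&characterEncoding=utf8"],
                                "table": [] # Multiple tables can be extracted at the same time.
                            }
                        ],
                        "password": "",
                        "username": "",
                        "where": "" # The extraction SQL is assembled from the configured column, table, and where values. Use it for incremental sync, e.g. a condition on a date column. If it is empty, the whole table is synchronized. Note: limit 10 is not a valid WHERE clause, and the official MysqlReader documentation warns against putting it here.
                    }
                },
                "writer": {
                    "name": "hdfswriter",
                    "parameter": {
                        "column": [], # Required. List every column of the table with its name and type as {"name": "", "type": ""}, where name is the column name and type is the column type.
                        "compress": "", # HDFS file compression type; leaving it empty (the default) means no compression. text files support gzip and bzip2; orc files support NONE and SNAPPY (SNAPPY requires the user to install SnappyCodec).
                        "defaultFS": "", # NameNode address of the Hadoop HDFS file system.
                        "fieldDelimiter": "", # Must match the field delimiter of the Hive table you created.
                        "fileName": "", # File name used by HdfsWriter when writing.
                        "fileType": "", # Currently only "text" or "orc" is supported.
                        "path": "", # Storage path on the Hadoop HDFS file system, i.e. where the Hive table is stored on HDFS.
                        "hadoopConfig": {}, # Advanced Hadoop-related settings, such as HA configuration.
                        "writeMode": "" # append: no processing before writing, file names do not conflict; nonConflict: fail with an error if the directory already contains files with the fileName prefix.
                    }
                }
            }
        ],
        "setting": {
            "speed": { # Flow control
                "byte": 1048576, # Transfer rate limit in bytes/s. DataX tries to reach this rate without exceeding it.
                "channel": "" # Concurrency (number of channels) during synchronization.
            },
            "errorLimit": { # Dirty-data control
                "record": 0 # Threshold on the maximum number of dirty records (record) or on the dirty-record ratio (percentage). When the count or the ratio exceeds the threshold, the DataX job fails and exits.
            }
        }
    }
}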
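The annotated template above is not valid JSON as written, because JSON has no `#` comments. For reference, here is a minimal sketch of what a filled-in, comment-free job file might look like. Every concrete value is a hypothetical placeholder chosen for illustration and does not come from the original post: the user `etl_user`, the table `orders` and its three columns, the `where` date, the NameNode address `hdfs://namenode:8020`, the Hive warehouse path, and the channel count. `IP` and `database` in the JDBC URL are left as placeholders, as in the template. `querySql` is omitted on purpose, since setting it would cause `table`, `column`, and `where` to be ignored, and `speed` sets only `channel` to keep the sketch small.

```json
{
  "job": {
    "content": [
      {
        "reader": {
          "name": "mysqlreader",
          "parameter": {
            "username": "etl_user",
            "password": "CHANGE_ME",
            "column": ["id", "name", "created_at"],
            "splitPk": "id",
            "where": "created_at >= '2022-06-29'",
            "connection": [
              {
                "jdbcUrl": ["jdbc:mysql://IP:3306/database?useUnicode=true&characterEncoding=utf8"],
                "table": ["orders"]
              }
            ]
          }
        },
        "writer": {
          "name": "hdfswriter",
          "parameter": {
            "defaultFS": "hdfs://namenode:8020",
            "fileType": "text",
            "path": "/user/hive/warehouse/demo.db/orders",
            "fileName": "orders",
            "column": [
              {"name": "id", "type": "BIGINT"},
              {"name": "name", "type": "STRING"},
              {"name": "created_at", "type": "TIMESTAMP"}
            ],
            "writeMode": "append",
            "fieldDelimiter": "\t",
            "compress": "gzip"
          }
        }
      }
    ],
    "setting": {
      "speed": {
        "channel": 3
      },
      "errorLimit": {
        "record": 0
      }
    }
  }
}
```

The writer's `column` list mirrors the reader's column order, and `fieldDelimiter` and `path` would need to match the delimiter and location of the target Hive table.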