DataX安装环境准备:
jdk1.6以上
python2.0
DataX安装:
1.官网下载DataX:https://github.com/gkbattle13/DataX 点击Download下载地址
2.解压DataX:tar -zxvf datax.tar.gz
3.根据mysql库版本替换对应的驱动包:
替换该目录下mysql驱动包:datax/plugin/writer/mysqlwriter/
DataX使用:
1. 准备源数据文件,记录文件所在路径
2.新建DataX的json配置文件
例如 XX.json
复制如下数据同步模板修改标红的相应参数内容:
{
"job": {
"setting": {
"speed": {
"channel": 1
}
},
"content": [{
"reader": {
"name": "txtfilereader",
"parameter": {
"path": ["/opt/dataX/data/testdata"],
"encoding": "utf-8",
"column": [
{
"index": 0,
"type": "string"
},{
"index": 1,
"type": "string"
}, {
"index": 2,
"type": "string"
}, {
"index": 3,
"type": "date"
}],
"fieldDelimiter": ","
}
},
"writer": {
"name": "mysqlwriter",
"parameter": {
"writeMode": "insert",
"username": "uu",
"password": "pp",
"column": ["dateid","userid","last_order_time","create_time"],
"session": [
"set session sql_mode='ANSI'"
],
"connection": [{
"jdbcUrl": "jdbc:mysql://ip:port/db?serverTimezone=Asia/Shanghai&useUnicode=true&characterEncoding=utf-8&useLegacyDatetimeCode=false&allowMultiQueries=true",
"table": [
"table_name"
]
}]
}
}
}]
}
}
3. 运行脚本
python /opt/dataX/datax/bin/datax.py ./XX.json