如下为生产环境的sqoop配置php
一.登陆Mysql,建立数据库和表,以及插入数据。mysql
1.1建立数据库sql
create database mydb;
1.2建立表数据库
use mydb; CREATE TABLE emp( id INT NOT NULL, name VARCHAR(100), deg VARCHAR(100), salary BIGINT, dept VARCHAR(50) ); CREATE TABLE empp( id INT NOT NULL, name VARCHAR(100), deg VARCHAR(100), salary BIGINT, dept VARCHAR(50) );
1.3插入数据bash
insert into emp values(1201,'gopal','manager','50000','TP'); insert into emp values(1202,'manisha','Proof reader','50000','TP'); insert into emp values(1203,'khalil','php dev','30000','AC'); insert into emp values(1204,'prasanth','php dev','30000','AC'); insert into emp values(1205,'kranthi','admin','20000','TP'); insert into empp values(12011,'gopal','manager','50000','TP'); insert into empp values(12022,'manisha','Proof reader','50000','TP'); insert into empp values(12033,'khalil','php dev','30000','AC'); insert into empp values(12044,'prasanth','php dev','30000','AC'); insert into empp values(12055,'kranthi','admin','20000','TP');
二.######sqoop import 从数据库导入到Hive并发
通常RDBMS的导出速度控制在60~80MB/s,每一个 map 任务的处理速度5~10MB/s 估算,即 -m 参数通常设置4~8,表示启动 4~8 个map 任务并发抽取 ##最好不要超过6个ide
#sqoop 首先要kinit 才能够 若是是hive 要有hive的权限 若是是hdfs 要有acl的权限 否则没法写入和读取 kint user/user -kt /opt/user.keytab username='admin' psd='123456' table=('emp' 'empp') for mysql_table in ${table[@]} do sqoop import \ --connect jdbc:mysql://192.168.10.41:3307/mydb \ --username ${username} \ --password ${psd} \ --table ${mysql_table} \ --hive-import \ --hive-drop-import-delims \ --hive-database emp \ --hive-table mysql_${mysql_table} \ --fields-terminated-by '\t' \ --m 6 \ --delete-target-dir \ --null-string '\\N' \ --null-non-string '\\N' done
三.######sqoop export 从Hive导入到大数据
oop
kint user/user -kt /opt/user.keytab username='admin' psd='123456' table=('emp' 'empp') for mysql_table in ${table[@]} do sqoop export \ --connect jdbc:mysql://192.168.10.40:3307/mydb \ --username ${username} \ --password ${psd} \ --table ${mysql_table} \ --export-dir '/user/hive/warehouse/mydb.db/'mysql_${mysql_table} \ --input-null-string '\\N' \ --input-null-non-string '\\N' \ --fields-terminated-by '\t' \ --input-fields-terminated-by '\t' \ --m 6 done
四.Sqoop参数大全大数据
如下截图引用自(感谢博主 本身截图方便本身查看):spa
https://blog.csdn.net/young_0609/article/details/107376183?utm_medium=distribute.pc_relevant.none-task-blog-baidujs_baidulandingword-0&spm=1001.2101.3001.4242