004.hive命令的3种调用方式 | ApacheCN(apache中文网)

时间 2019-11-20

标签 004.hive hive 命令调用方式 apachecn apache 中文网栏目 Hadoop 繁體版

原文原文链接

ApacheCN | apache中文网linux

hive命令的3种调用方式 sql

官网地址：https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Cli（可参考）shell

方式1：hive –f /root/shell/hive-script.sql（适合多语句）数据库

hive-script.sql相似于script同样，直接写查询命令就行apache

例如：bash

[root@cloud4 shell]# vi hive_script3.sqloop

select * from t1;性能

select count(*) from t1;ui

不进入交互模式，执行一个hive script spa

这里能够和静音模式-S联合使用,经过第三方程序调用，第三方程序经过hive的标准输出获取结果集。

$HIVE_HOME/bin/hive -S -f /home/my/hive-script.sql （不会显示mapreduct的操做过程）

那么问题来了：如何传递参数呢？

demo以下：

start_hql.sh 内容：

#!/bin/bash
# -S 打印输出mapreduce日志
hive \
-hivevar id=1 \
-hivevar col2=2 \
-S -f test.sql

test.sql 内容：
-- 数据库
use tmp;
-- 表名
select *
from tmp_jzl_20140725_test11
where
id='${hivevar:id}' and col2='${hivevar:col2}';

方式2：hive -e 'sql语句'（适合短语句）

直接执行sql语句

例如：
[root@cloud4 shell]# hive -e 'select * from t1'
静音模式：

[root@cloud4 shell]# hive -S -e 'select * from t1' (用法与第一种方式的静音模式同样，不会显示mapreduce的操做过程)
此处还有一亮点，用于导出数据到linux本地目录下
例如：

[root@cloud4 shell]# hive -e 'select * from t1' > test.txt
有点相似pig导出分析结果同样，都挺方便的

方式3：hive （直接使用hive交互式模式）

都挺方便的
介绍一种有意思的用法：
1.sql的语法

#hive 启动

hive>quit; 退出hive

hive> show databases; 查看数据库

hive> create database test; 建立数据库

hive> use default; 使用哪一个数据库

hive>create table t1 (key string); 建立表
对于建立表咱们能够选择读取文件字段按照什么字符进行分割
例如：
hive>create table t1(id ) '/wlan'
partitioned by (log_date string) 表示经过log_date进行分区
row format delimited fields terminated by '\t' 表示表明用‘\t’进行分割来读取字段
stored as textfile/sequencefile/rcfile/; 表示文件的存储的格式

存储格式的参考地址：http://blog.csdn.net/yfkiss/article/details/7787742
textfile 默认格式，数据不作压缩，磁盘开销大，数据解析开销大。
可结合Gzip、Bzip2使用（系统自动检查，执行查询时自动解压），但使用这种方式，hive不会对数据进行切分，从而没法对数据进行并行操做。
实例：

[plain] view plain copy