第一个Hive程序

之前已经体验过MapReduce完成word count,现在我们来看一下如何用hive来完成这个功能,首先同过hive命令进入hive cli,之后执行以下步骤

创建表

CREATE TABLE words (line STRING);

导入数据

LOAD DATA [LOCAL] INPATH 'words.txt' OVERWRITE INTO TABLE words;

执行单词统计

CREATE TABLE word_counts AS
SELECT word, count(1) AS count FROM
(SELECT explode(split(line, ' ')) AS word FROM words) w
GROUP BY word
ORDER BY word;

words.txt 文件在 res 目录下

results matching ""

    No results matching ""