I. Define the agent and run it

1. Configuration file

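# Plan
## Collect Hive's log and store it in HDFS
/opt/cdh-5.3.6/hive-0.13.1-cdh5.3.6/logs/hive.log
Command: tail -f
source: Exec source, which runs a given Unix command at startup and expects that process to keep producing data on standard output
channel: memory
sink: hdfs    # /user/root/flume/hive-logs/

## Prepare the agent configuration file, flume-tail.conf, as follows: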
# The configuration file needs to define the sources,
# the channels and the sinks.

### define agent
a2.sources = r2
a2.channels = c2
a2.sinks = k2

### define sources
a2.sources.r2.type = exec
a2.sources.r2.command = tail -f /opt/cdh-5.3.6/hive-0.13.1-cdh5.3.6/logs/hive.log
a2.sources.r2.shell = /bin/bash -c

### define channel
a2.channels.c2.type = memory
a2.channels.c2.capacity = 1000
a2.channels.c2.transactionCapacity = 100

### define sink
a2.sinks.k2.type = hdfs
a2.sinks.k2.hdfs.path = hdfs://hadoop-senior.ibeifeng.com:8020/user/root/flume/hive-logs/
a2.sinks.k2.hdfs.fileType = DataStream
a2.sinks.k2.hdfs.writeFormat = Text
a2.sinks.k2.hdfs.batchSize = 10

### bind the sources and sink to the channel
a2.sources.r2.channels = c2
a2.sinks.k2.channel = c2
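
One caveat about the exec source command: `tail -f` keeps following the original file even after it has been renamed, so if hive.log gets rotated the agent would stop seeing new entries. A minimal sketch of a variant, assuming rotation happens on this machine (the original post does not say so), changes only the command line:

```properties
# Hypothetical variant of the source definition; only the command changes.
# tail -F follows the file by name and reopens it after rotation,
# whereas tail -f stays attached to the old, renamed file.
a2.sources.r2.command = tail -F /opt/cdh-5.3.6/hive-0.13.1-cdh5.3.6/logs/hive.log
```

Either way, an exec source with a memory channel offers no delivery guarantee: events still in the channel are lost if the agent stops.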

2. Run

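## Flume starts listening and extracting in real time
[root@hadoop-senior flume-1.5.0-cdh5.3.6]# bin/flume-ng agent -c conf -n a2 -f conf/flume-tail.conf -Dflume.root.logger=DEBUG,console

## At this point, run a few commands in Hive to generate log entries

## Check HDFS: many files have already been written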
[root@hadoop-senior hadoop-2.5.0-cdh5.3.6]# bin/hdfs dfs -ls -R /user/root/flume/hive-logs/
-rw-r--r-- 3 root supergroup 1133 2019-05-08 13:43 /user/root/flume/hive-logs/FlumeData.1557294191838
-rw-r--r-- 3 root supergroup 534 2019-05-08 13:43 /user/root/flume/hive-logs/FlumeData.1557294191839
-rw-r--r-- 3 root supergroup 1056 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160087
-rw-r--r-- 3 root supergroup 408 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160088
-rw-r--r-- 3 root supergroup 1319 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160089
-rw-r--r-- 3 root supergroup 240 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160090
-rw-r--r-- 3 root supergroup 1083 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160091
-rw-r--r-- 3 root supergroup 255 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160092
-rw-r--r-- 3 root supergroup 122 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160093
-rw-r--r-- 3 root supergroup 956 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160094
-rw-r--r-- 3 root supergroup 515 2019-05-08 13:59 /user/root/flume/hive-logs/FlumeData.1557295160095.tmp
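
The files in this listing are all around 1 KB or smaller. That is consistent with the HDFS sink's default roll settings, which close the current file after 30 seconds, 1024 bytes, or 10 events, whichever comes first. If fewer, larger files are preferred, the roll triggers can be set explicitly. The values below are illustrative assumptions and not part of the original configuration:

```properties
# Hypothetical additions to the k2 sink; tune the values to your workload.
# Roll a file every 10 minutes ...
a2.sinks.k2.hdfs.rollInterval = 600
# ... or once it reaches 128 MB (value is in bytes) ...
a2.sinks.k2.hdfs.rollSize = 134217728
# ... and never roll based on event count (0 disables the trigger).
a2.sinks.k2.hdfs.rollCount = 0
```

The file being written keeps its `.tmp` suffix until it is rolled, so with longer intervals the newest data stays in a `.tmp` file for longer.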

3. When HDFS is deployed in HA mode

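## When HDFS is deployed in HA mode, define the sink as follows:
a2.sinks.k2.type = hdfs
a2.sinks.k2.hdfs.path = hdfs://<nameservice>:8020/user/root/flume/hive-logs/

Replace the host name with the HA nameservice name (the logical name the cluster defines for the NameNode pair). Then copy core-site.xml and hdfs-site.xml into Flume's conf directory so that Flume can read them and resolve the nameservice.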
