https://www2.eecs.berkeley.edu/Pubs/TechRpts/2012/EECS-2012-259.pdf

Discretized Streams: A Fault-Tolerant Model for Scalable Stream Processing

Matei Zaharia, Tathagata Das, Haoyuan Li, Timothy Hunter, Scott Shenker, Ion Stoica
University of California, Berkeley

Many “big data” applications need to act on data arriving in real time. However, current programming models for distributed stream processing are relatively low-level, often leaving the user to worry about consistency of state across the system and fault recovery. Furthermore, the models that provide fault recovery do so in an expensive manner, requiring either hot replication or long recovery times. We propose a new programming model, discretized streams (D-Streams), that offers a high-level functional API, strong consistency, and efficient fault recovery. D-Streams support a new recovery mechanism, parallel recovery of lost state, that improves efficiency over the traditional replication and upstream backup schemes in streaming databases. Unlike previous systems, D-Streams also mitigate stragglers. We implement D-Streams as an extension to the Spark cluster computing engine that lets users seamlessly intermix streaming, batch and interactive queries. Our system can process over 60 million records/second at sub-second latency on 100 nodes.
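
To make the idea in the abstract concrete, here is a minimal single-process sketch in Python. It is not the Spark implementation or its API. It assumes that a stream is cut into fixed-length intervals and that each interval's state is a deterministic function of the previous state and that interval's records, so a lost state can be recomputed rather than replicated. The record shape, the function names (`discretize`, `update`, `running_counts`, `recover`) and the word-count workload are all hypothetical, and timestamps are assumed to be non-negative. The parallelism of the recovery mechanism is not shown.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

# Hypothetical record shape: (timestamp in seconds, word).
Record = Tuple[float, str]


def discretize(records: Iterable[Record], interval: float) -> List[List[str]]:
    """Group timestamped records into consecutive fixed-length intervals."""
    batches: Dict[int, List[str]] = {}
    for ts, word in records:
        batches.setdefault(int(ts // interval), []).append(word)
    if not batches:
        return []
    # Emit every interval up to the last one seen, including empty ones.
    return [batches.get(i, []) for i in range(max(batches) + 1)]


def update(state: Counter, batch: List[str]) -> Counter:
    """Deterministic step: the new state depends only on the old state and the batch."""
    return state + Counter(batch)


def running_counts(batches: List[List[str]]) -> List[Counter]:
    """Compute the running word count after each interval."""
    states: List[Counter] = []
    state: Counter = Counter()
    for batch in batches:
        state = update(state, batch)
        states.append(state)
    return states


def recover(states: List[Counter], batches: List[List[str]], lost: int) -> Counter:
    """Rebuild the state of interval `lost` from the previous state and its batch."""
    previous = states[lost - 1] if lost > 0 else Counter()
    return update(previous, batches[lost])


if __name__ == "__main__":
    records = [(0.1, "a"), (0.4, "b"), (1.2, "a"), (2.7, "a"), (2.9, "c")]
    batches = discretize(records, interval=1.0)
    states = running_counts(batches)
    # Pretend the state for interval 1 was lost, then recompute it.
    print(recover(states, batches, lost=1) == states[1])
```

Because `update` is deterministic, the recomputed state is identical to the lost one. In a distributed setting, the same property would allow different lost pieces to be rebuilt on different nodes at once, which is the kind of recovery the abstract calls parallel.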
