PacificA: Replication in Log-Based Distributed Storage Systems - Microsoft Research https://www.microsoft.com/en-us/research/publication/pacifica-replication-in-log-based-distributed-storage-systems/

Wei Lin, Mao Yang, Lintao Zhang, Lidong Zhou

MSR-TR-2008-25 | February 2008

Large-scale distributed storage systems have gained popularity for storing and processing ever-increasing amounts of data. Replication mechanisms are often key to achieving high availability and high throughput in such systems. Research on fundamental problems such as consensus has laid out a solid foundation for replication protocols. Yet, both the architectural design and engineering issues of practical replication mechanisms remain an art. This paper describes our experience in designing and implementing replication for commonly used log-based storage systems. We advocate a general replication framework that is simple, practical, and strongly consistent. We show that the framework is flexible enough to accommodate a variety of different design choices that we explore. Using a prototype system called PacificA, we implemented three different replication strategies, all using the same replication framework. The paper reports detailed performance evaluation results, especially on system behavior during failure, reconciliation, and recovery.

Reading and Writing documents | Elasticsearch Reference [6.5] | Elastic https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-replication.html

Reading and Writing documents

Introduction

Each index in Elasticsearch is divided into shards and each shard can have multiple copies. These copies are known as a replication group and must be kept in sync when documents are added or removed. If we fail to do so, reading from one copy will result in very different results than reading from another. The process of keeping the shard copies in sync and serving reads from them is what we call the data replication model.

Elasticsearch’s data replication model is based on the primary-backup model and is described very well in the PacificA paper of Microsoft Research. That model is based on having a single copy from the replication group that acts as the primary shard. The other copies are called replica shards. The primary serves as the main entry point for all indexing operations. It is in charge of validating them and making sure they are correct. Once an index operation has been accepted by the primary, the primary is also responsible for replicating the operation to the other copies.
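The primary-backup flow described above can be illustrated with a small in-memory sketch. This is not Elasticsearch code: the names `ShardCopy`, `ReplicationGroup`, and `write`, and the tuple shape of an operation, are assumptions made purely for illustration, and the sketch ignores failures, acknowledgements, and concurrency.

```python
from dataclasses import dataclass, field


@dataclass
class ShardCopy:
    """One copy of a shard, holding documents keyed by id (hypothetical model)."""
    name: str
    docs: dict = field(default_factory=dict)

    def apply(self, op):
        # An operation is assumed to be a (kind, doc_id, body) tuple.
        kind, doc_id, body = op
        if kind == "index":
            self.docs[doc_id] = body
        elif kind == "delete":
            self.docs.pop(doc_id, None)


class ReplicationGroup:
    """A primary plus its replicas; all writes enter through the primary."""

    def __init__(self, primary, replicas):
        self.primary = primary
        self.replicas = list(replicas)

    def write(self, op):
        kind, _doc_id, body = op
        # Step 1: the primary validates the operation before accepting it.
        if kind not in ("index", "delete"):
            raise ValueError(f"unsupported operation: {kind!r}")
        if kind == "index" and not isinstance(body, dict):
            raise ValueError("an index operation needs a dict body")
        # Step 2: the primary applies the operation locally.
        self.primary.apply(op)
        # Step 3: the primary forwards the accepted operation to every replica.
        for replica in self.replicas:
            replica.apply(op)

    def copies(self):
        return [self.primary, *self.replicas]


group = ReplicationGroup(
    ShardCopy("primary"),
    [ShardCopy("replica-1"), ShardCopy("replica-2")],
)
group.write(("index", "1", {"title": "PacificA"}))
group.write(("index", "2", {"title": "draft"}))
group.write(("delete", "2", None))
```

Because every operation passes through the primary and is forwarded in the same order, all copies in this sketch end up with identical contents, which is the property the replication group is meant to preserve.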

The purpose of this section is to give a high-level overview of the Elasticsearch replication model and discuss the implications it has for various interactions between write and read operations.
