Paper

In Search of an Understandable Consensus Algorithm

Raft more understandable than Paxos and also provides a better foundation for building practical systems

Novel features

Strong leader: Raft uses a stronger form of leadership than other consensus algorithms. For example, log entries only flow from the leader to other servers. This simplifies the management of the replicated log and makes Raft easier to understand. 只有leader对外提供服务，log只能从leader流向其他角色。
Leader election: Raft uses randomized timers to elect leaders. This adds only a small amount of mechanism to the heartbeats already required for any consensus algorithm, while resolving conflicts simply and rapidly. 这个特点很有意思，每次选举都增加随机时间，防止多个候选者同时发起选举，分割选票，导致无法正常结束选举。
Membership changes: Raft’s mechanism for changing the set of servers in the cluster uses a new joint consensus approach where the majorities of two different configurations overlap during transitions. This allows the cluster to continue operating normally during configuration changes. 集群配置改变时，仍然可以提供服务。

副本状态机常用于解决分布式系统中的容错问题，比如管理选举leader以及存储配置信息防止leader雪崩。

接收来自客户端的命令，保存至log；与其他的一致性模块通信保证log的正确性

每个服务都会保存一份相同的操作顺序日志，这样每个服务都会执行相同的命令，产生相同的结果。

根据log，计算状态

Raft通过leader解决一致性问题，leader全权管理log副本。leader接收客户端请求，然后复制给其他服务，最后通知其他服务提交，改变各自的状态机。

提高理解性（paxos难以理解），Raft将一致性问题分解成三个子问题：

Raft集群有三种角色：leader（领导者）, follower（追随者）, or candidate（候选者）。一般情况下，集群中只有一个leader以及若干个follower。

Figure 4 - states and transitions

Raft将时间切分成任意长度的term。

term用连续的整数进行编号。
每个term开始前都有一次选举，如果一个candidate赢得选举，就将成为leader。
每个服务都保存自己的当前term编号（CTN），服务通信时交换CTN
- follower发现自己过期，就会更新自己的CTN
- leader/候选人发现自己过期，就会立马变成follower
- 服务拒绝过期请求

Figure 5 - terms

服务启动的时候都是follower，如果follower能够从leader/candidate定期收到合法的RPC，就会一直保持状态。如果follower超过一段时间没有收到RPC（超时），就会开始选举流程。

follower增长自己的CTN，并且转变成candidate
对集群中其他服务发起RequestVote RPC，争取选票，直到发生以下情况：
- win（在一个term中，收到一半以上选票，变成leader，发送AppendEntries RPC）
- other wins（接收其他服务的AppendEntries RPC，如果RPC中的CTN大于等于自己的CTN，就变成follower，否则拒绝，继续candidate）
- time out without winner（随机timeout + 重试）

Noted:

针对timeout，分割选票无法选举问题，Raft采用随机选举超时解决。