Byzantine Fault Tolerance

Like many good stories, ours also begins with war. In the fifth century, the titular city-state of Byzantium was expanding. In one particular excursion, several Byzantine legions all surrounded the same fortress. Knowing their only chance to topple its impregnable walls is if all generals work together, they need to somehow agree among themselves whether to attack or retreat. Most crucially, they must all reach the same decision. But here is the crux: due to their importance, it is never allowed that more than two generals confer. They cannot all convene in one tent and hash it out. The entire decision must only rely on one-on-one two-party communication. This is not just quite a problem, it is the problem.

The Byzantine Generals Problem (BGP)

While the historical novel style has its charm, we will shed it for a more colloquial jargon. (After all, it would be weird if three chapters from now we would refer to miners as generals).

The BGT problem describes a network of nodes communicating with each other, having to agree on one of a given list of options. (e.g. a bunch of generals having to decide whether to attack or retreat). The nodes communicate with each other in rounds, where in each round, any node can send a message to any other node. However, nodes only start to receive messages after all nodes are done sending their messages. Their goal is to devise a protocol that guarantees that, if all nodes follow the protocol they will all reach the same decision after a known number of rounds.

We've already mentioned one such protocol: majority voting. In this protocol, each node tells all other nodes its preference, and nodes agree to accept the popular choice. This protocol only takes one round.

Majority voting is a very simple protocol. Unfortunately, it also turns out to be a very bad protocol. Recall the story of the generals, and assume there are only three of them: Nicephorus, who wants to fight; Martin, who wants to retreat; and Euphemius, who is secretly a traitor to Byzantium.

Nicephorus and Martin are loyal to Byzantium and its protocols, so they broadcast their intentions. However, Euphemius the treacherous had other plans: he told Martin that he wanted to fight, and Nicephorus that he wanted to retreat! Consequentially, Nicephorus the fierce was coerced to retreat, while Martin the calm-minded was coerced to fight alone, leading to a great defeat!

That example shows where majority voting lacks as a protocol: in its fault tolerance. We call a node faulty if it deviates from the protocol for any reason. Adversarial generals and hackers are a reason for concern, but in real-world scenarios, nodes that are simply malfunctioning are also a serious concern.

A byzantine fault tolerance (BFT) protocol is a protocol that can guarantee that the nodes will reach a mutual decision in the prescribed amount of time even if some of the nodes are faulty. The level of security such a protocol provides is usually described as a number $\alpha$ between $0$ and $1$ that describes the fraction of faulty nodes a protocol can tolerate.

In this terms, we can say that PSL created a protocol with $1/3$ fault tolerance, and proved that this is the maximal fault tolerance possible. In the other corner, majority voting has a fault tolerance of zero.

Note that for a BFT protocol to truly have $\alpha$ fault tolerance, two requirements must be met: one is that any minority of less than $\alpha$ cannot coerce a decision, and the other is that it cannot delay it (in the sense that it could make it so that consensus is not reached within the prescribed amount of rounds). These two properties (assuming the remaining $1-\alpha$ of the nodes are honest) are called safety and liveness, and together they comprise the form of security we call fault tolerance.

As we will see, liveness is trivially true for the PSL protocol: the number of rounds is known in advance and the proof shows that it is sufficient. Nothing that participants do can trigger additional rounds. This is the exception to the rule. More modern BFT protocols improve performance by adapting the number of rounds according to how the protocol proceeds. For such protocols, it is plausible that an attacker could exploit this mechanism to perform a liveness attack (forcing the network to remain indefinitely split, or take a very long time to converge).

We will revisit safety and liveness more thoroughly in the next chapter.

PSL's Protocol

We describe the PSL protocol for four nodes called John, Paul, George and Ringo who are trying to decide whether they should call their new album Revolver or Rubber Soul.

To easily track the protocol, we will assume the point of view of one of John.

The first round of the protocol is very simple: each Beetle just tells all other Beetles how they would like to call the album. After the first round, John should have three pieces of information, that could look e.g. like this:

Paul wants to call the album Revolver
George wants to call the album Revolver
Ringo wants to call the album Rubber Soul

This is a good start, but we already know it is not quite enough. What could Paul do with this information? In particular, how could he know that the remaining Beetles were consistent, and that, say, Paul didn't tell George they wanted to call the album Rubber Soul?

To verify that, all Beetles now tell all other Beetles all they have learned in the first round. Confusing? Let's see an example. After the round is over, John should have six more pieces of information. If anyone were honest, they should look like this:

Paul said that George wants to call the album Revolver
Paul said that Ringo wants to call the album Rubber Soul
George said that Paul wants to call the album Revolver
George said that Ringo wants to call the album Rubber Soul
Ringo said that Paul wants to call the album Revolver
Ringo said that George wants to call the album Revolver

This is enough information for John to deduce that (discounting his own vote) there is a two-to-one agreement for Revolver. If John also wants to call the album Revolver, then we are done here: all Beetles will see Revolver as preferred by most of the band, and agree on the name.

But what if John wants to call the album Rubber Soul? How do we handle the ensuing tie? That's an interesting question I am going to defer for when we discuss tie-breaking rules.

So the protocol above works nicely if

This works nicely if everyone is playing nicely, but what if someone tries to cheat? Say that Ringo really wants to call the album Rubber Soul, and he knows John is on his side. What if he lies to John about what George and Paul had to say? Then John would see the following messages:

Paul wants to call the album Revolver
George wants to call the album Revolver
Ringo wants to call the album Rubber Soul
Paul said that George wants to call the album Revolver
Paul said that Ringo wants to call the album Rubber Soul
George said that Paul wants to call the album Revolver
George said that Ringo wants to call the album Rubber Soul
Ringo said that Paul wants to call the album Rubber Soul
Ringo said that George wants to call the album Rubber Soul

Here John will note a contradiction: Paul told him they want to call the album Revolver, George also told him that Paul wants to call the album Revolver, only Ringo for some reason claims that Paul wants to call the album Rubber Soul.

How should John resolve this contradiction? Essentially, the answer is second-order majority voting: John tallies how all Beetles voted on how Paul wants to call the album. Note that even though Paul is the subject of discussion, his vote counts exactly as much as George and Ringo's.

It is not hard to complete this line of thought, and convince ourselves that no matter how Ringo tries to cheat, as long the rest of the Beetles follow the protocol there is no way for him to coerce a decision that is not backed by a strict majority.

But what if Ringo was innocent, and it was Paul and George who colluded to enact the name Revolver, despite half of the Beetles being against it? Now John will see the following messages:

Paul wants to call the album Revolver
George wants to call the album Revolver
Ringo wants to call the album Rubber Soul
Paul said that George wants to call the album Revolver
Paul said that Ringo wants to call the album Revolver
George said that Paul wants to call the album Revolver
George said that Ringo wants to call the album Revolver
Ringo said that Paul wants to call the album Revolver
Ringo said that George wants to call the album Revolver

Upon the contradictions regarding Ringo's position, Paul will again carry out a second-order voting, and see that two Beetles claim Ringo is pro Revolver, but only one Beetle claims Ringo wants to call the album Rubber Soul.

But wait, you might ask, the Beetle who thinks Ringo prefers Rubber Soul is Ringo, shouldn't that account for something? Maybe, but the ability to override Ringo is exactly what protects us from Ringo playing a double game. He can no longer coerce the protocol by telling each Beetle they have a different preference in an attempt to split the consensus because he would lose to the majority voting. This is the crux of the protocol's security.

This protocol can be extended to $n=3f+1$ Beetles (or any other coleopterans) by adding more rounds. So by the end of the say fourth round John will hold messages such as

George said that Ringo said that said that Paul said that Yoko wants to call the album Let it Be

One can prove by way of induction that if at most $f$ of the nodes are faulty, then the protocol is guaranteed to provide consensus within $f$ rounds.

Practical BFT*

A BFT protocol is not only measured by its security. Fault tolerance is an important feature, but a fault-tolerant protocol isn't going to help anyone if it requires a trillion rounds or an exabyte of RAM. The communication complexity is the number of rounds the protocol requires, the space complexity is how much data it has to keep track of (or more accurately, what is the largest amount of data it has to hold on to simultaneously), and the time complexity is the amount of processing required to compute the result from the received messages. One might wonder whether the length of the messages should play a part, and it does: it manifests itself in the time complexity, as we consider reading the messages a part of the work.

In the PSL protocol, the number of rounds is a third of the nodes. That's already less than ideal. For example, the Ethereum network has more than 750,000 validators, so using the PSL protocol on this network would require 250,000 rounds per block.

However, it is the runtime complexity where the real problem shows. To see that, it suffices just to look at how many times a Beetle is named in each message.

Let us assume there are $n=3f+1$ Beetles and consider the messages Paul sends John. In the first round, Paul just sends his messages, so he names no Beetles. In the second round, Paul lists all Beetles besides Paul and John (and their alleged preferences), making him name $n-2$ Beetles.

What happens in the third round? Well, consider Ringo. For any Beetle X, Paul's message will include a line starting with "Ringo says that X wants", where there are $n-3$ options for X, as it could be any Beetle besides John, Paul, or Ringo. This means that we have $n-3$ lines starting with "Ringo said that". However, we don't need to name Ringo in every line, as we can only write "Ringo says" once, and then list everything he told us. The total number of Beetle names in Ringo's statement is $n-2$ . But wait, that's just Ringo! We have a similar list for all $n-2$ Beetles besides Paul and John, making us name a total of $(n-2)^2$ Beetles.

By the time we reach round $f=\frac{n-1}{3}$ we find that each message contains $(n-2)^{(n-1)/3}$ Beetle names, which is a lot. Even if we assume storing the name of Beetle only requires one bit, we get that if there are $20$ Beetles, each message in the last round is more than 10 megabytes. Doesn't sound like a lot? Well, for $30$ Beetles the message size would surpass 10 terabytes, for $40$ it will succeed 344 exabyte, and $50$ Beetles will topple two brontobyte, a unit of storage so large I never heard of it before writing this paragraph, and even my spell checker does not recognize. And what about 750,000 Ethereum validators? Well, the number is so large you would need about 170 kilobytes just to write down how many digits it has.

Fortunately, PSL was just the first in a royal lineage of BFT protocols. In 1999, Miguel Castro and Barbara Liskov introduced the Parctical Byzantine Fault Tolerance (PBFT) protocol. PBFT's most significant contribution is actually in terms we have not defined here: synchronicity model. Roughly, a synchronicity model models message latency. In the PSL analysis, there is an implicit assumption of full synchronicity. That is, all participants know when each round starts and when it ends. Full synchronicity is not how networks such as the internet naturally behave, and imposing it incurs large overheads. The internet is best modeled as admitting partial synchronicity, where we don't know how long it takes messages to transmit, but we do have an upper bound. Castro and Liskov's protocol is the first to provide $1/3$ fault-tolerance in the partially synchronous model. But more than that, it reduces the complexity to $O(n^2)$ . Several improvements of various sorts succeeded PBFT, such as Zyzzyva and ABsTRACTs which provide improved performance, Aaardvark which provides improved robustness, and Adapt which switches between protocols to respond to changing conditions. More recent protocols such as HotStuff and Marlin further reduce the complexity to linear, and we should probably also mention BFT protocols crafted specifically for proof-of-stake such as Alphabet and Tendermint.

PreviousChapter 1: From BFT to PoW NextProof-of-Work (PoW)

Last updated 4 months ago