Proto commits in lalithsuresh/rapid

These 47 commits are when the Protocol Buffers files have changed:

2020-06-10

Commit:	74d414b
Author:	Lalith Suresh	2020-06-09 22:42:29 -0700
Committer:	GitHub	2020-06-09 22:42:29 -0700

Revert "Simple anti-entropy mechanism (#24)" (#28) This reverts commit 0a1788f671ef6690b5bd3d97951578ede9cf693a.

The documentation is generated from this commit.

Commit:	51bedce
Author:	Lalith Suresh	2020-06-09 22:35:34 -0700
Committer:	GitHub	2020-06-09 22:35:34 -0700

Revert "Simple anti-entropy mechanism (#24)" This reverts commit 0a1788f671ef6690b5bd3d97951578ede9cf693a.

The documentation is generated from this commit.

2020-06-05

Commit:	0a1788f
Author:	Manuel Bernhardt	2020-06-05 19:49:13 +0200
Committer:	GitHub	2020-06-05 10:49:13 -0700

Simple anti-entropy mechanism (#24) * Anti-entropy mechanism It can happen that a node misses a part of the consensus messages whilst still being able to send out its own vote (unidirectional network partition, message overload, ...). In this case, the rest of the group will see this node as being part of the group and the monitoring mechanism will still be working as expected, but the stale node will run an old configuration. In order to enforce consistency in this case, the following new anti-entropy mechanism is used: - each node maintains a set of configurations it has been part of - probe messages now contain the configuration ID of the observer - when a node receives a probe message with a configuration ID it does not know, it will start a background task to check again after a configured timeout (1 minute by default) - if the configuration ID is still unknown after the timeout has reached, the node leaves (using the LEAVE protocol) * Allows a node to catch up if it misses a consensus round * Removing leave when out-of-sync strategy, improve implementation

2020-04-09

Commit:	941aeb3
Author:	Manuel Bernhardt	2020-04-09 17:20:22 +0200
Committer:	GitHub	2020-04-09 08:20:22 -0700

Endpoint performance and memory pressure optimization (#19) This is a bit of a controversial change from the view point of the API, yet makes a lot of sense from the view point of performance and memory utilization for very large clusters. The issue here is that the hostname of an Endpoint is modelled as a protobuf "string" type. This type carries with it the overhead of encoding to or decoding from UTF-8 every time a message is sent or received (and the field accessed). From the view point of the algorithms in place, there's no added value in having the endpoint host data be encoded as byte array or utf-8 encoded string. It is just data, what matters is that the ordering of the endpoints can be established. Having a string only matters at the interfaces: when configuring a hostname, when sending a message to one and when printing log statements (most of which at DEBUG/TRACE level). Yet at the moment, when adding a new endpoint to the membership ring(s), the following code runs: ``` public java.lang.String getHostname() { java.lang.Object ref = hostname_; if (ref instanceof java.lang.String) { return (java.lang.String) ref; } else { com.google.protobuf.ByteString bs = (com.google.protobuf.ByteString) ref; java.lang.String s = bs.toStringUtf8(); hostname_ = s; return s; } } ``` For freshly received messages containing Endpoints, this means running `toStringUtf8()`, which when there are many is quite expensive in terms of CPU and memory usage. This PR does the following: - use `bytes` rather than `string` to encode the hostname in protobuf - adjust all interfaces - the Cluster APIs are actually (almost) not affected since they use the `HostAndPort` construct - use the existing underlying / existing byte array when computing the hashcode of an Endpoint in `Utils.AddressComparator` - getting rid of the mapping between `Map<String, Metadata>` and `Map<Endpoint, Metadata>` by representing the map as two lists in protobuf (keys and values) On local tests with 1000 concurrent nodes joining, there's a 10% improvement in memory allocation and a 20% improvement in CPU usage of the stack starting at the `TreeSet.add` method (39% vs 58%).

The documentation is generated from this commit.

2020-02-27

Commit:	b04d666
Author:	Manuel Bernhardt	2020-02-27 16:57:25 +0100
Committer:	GitHub	2020-02-27 07:57:25 -0800

Proactively informing observers when shutting down (#15) Rather than waiting for edge failure detection to kick in when a cluster has been shut down, this change proactively informs the observers of a node with a new Leaving message. In turn the observers the trigger edge failure alerting immediately. Adds Cluster.leaveGracefully() and Cluster.shutdown() APIs for graceful and forced shutdowns respectfully. Accessing membership state after either of these APIs are invoked is illegal and will result in a thrown exception. * Proactively informs observer nodes that the node is leaving when the cluster is shut down * Leave notifications delivered in parallel, call to leave() protected by try/finally * Fixing parallel leave message sending - tolerating failure in delivering the messages, i.e. not cancelling other notifications - adjusting test intervals in order to reach agreement faster * Throw exceptions when trying to access membership state after shutting down

2018-06-17

Commit:	bce1e27
Author:	Lalith Suresh	2018-06-17 12:00:34 -0700
Committer:	GitHub	2018-06-17 12:00:34 -0700

Terminology edits (#11) * Rename APIs to match observer -> subject terminology * WatermarkBuffer -> almost-everywhere agreement filter * Rename monitoring links -> monitoring edges * Use cut detection terminology

2017-12-07

Commit:	11f4b73
Author:	lalithsuresh	2017-12-07 11:32:28 -0800

Endpoint is now tagged with metadata

2017-12-01

Commit:	bfed8d4
Author:	lalithsuresh	2017-11-21 21:47:23 -0800
Committer:	lalithsuresh	2017-12-01 11:35:48 -0800

Endpoint protobuf type now represents each node to avoid back-and-forth conversions between strings and Guava HostAndPort

2017-11-20

Commit:	80738c9
Author:	lalithsuresh	2017-11-17 10:53:41 -0800
Committer:	lalithsuresh	2017-11-19 17:20:57 -0800

Add Classic Paxos implementation for recovering from Fast Paxos conflicts

Commit:	e452ed0
Author:	lalithsuresh	2017-11-16 20:22:29 -0800
Committer:	lalithsuresh	2017-11-19 17:20:06 -0800

Refactor messaging interfaces to decouple Rapid from the messaging implementation

2017-10-24

Commit:	5271b37
Author:	lalithsuresh	2017-10-24 10:24:25 -0700

Cleanup interface boundaries for messaging

2017-08-19

Commit:	c119ef4
Author:	lalithsuresh	2017-08-19 11:18:29 -0700

Netty tests

2017-07-13

Commit:	0045ce4
Author:	lalithsuresh	2017-07-12 17:44:45 -0700

Metadata values are now ByteStrings

2017-07-01

Commit:	80e4acd
Author:	lalithsuresh	2017-07-01 12:46:12 -0700
Committer:	lalithsuresh	2017-07-01 13:19:31 -0700

Remove back-and-forth conversions for UUIDs

2017-06-10

Commit:	67c9bdf
Author:	lalithsuresh	2017-06-09 13:27:01 -0700
Committer:	lalithsuresh	2017-06-09 19:45:32 -0700

Use best effort broadcast and re-organize executor usage.

2017-06-09

Commit:	afd9b50
Author:	lalithsuresh	2017-06-08 22:05:26 -0700

Avoid creating redundant copies of link-update-messages

2017-05-26

Commit:	538e894
Author:	lalithsuresh	2017-05-26 11:19:12 -0700
Committer:	lalithsuresh	2017-05-26 11:23:46 -0700

Changes to the metadata API to avoid sending strings around

2017-04-05

Commit:	e6e4b81
Author:	lalithsuresh	2017-04-04 23:01:13 -0700

Batch join-messages for multiple rings that are directed to the same monitor

2017-04-04

Commit:	252b907
Author:	lalithsuresh	2017-04-04 14:36:32 -0700
Committer:	lalithsuresh	2017-04-04 14:52:01 -0700

Support informing ProbeMessage-based failure detectors about whether a monitoree is bootstrapping

Commit:	5184c26
Author:	lalithsuresh	2017-04-03 20:58:00 -0700
Committer:	lalithsuresh	2017-04-04 10:07:00 -0700

Supply executors to prevent grpc's usage of a cachedThreadPool

2017-04-03

Commit:	856c3e4
Author:	lalithsuresh	2017-04-03 15:10:23 -0700

Revert changes to receiving join-confirmations

2017-03-28

Commit:	4cc8f8c
Author:	lalithsuresh	2017-03-27 18:41:05 -0700

Refactor join protocol to be retry friendly

2017-03-27

Commit:	7f89246
Author:	lalithsuresh	2017-03-27 09:08:49 -0700

Metadata manager now maintains a set of key-value pairs per-node

2017-03-26

Commit:	5b7f43b
Author:	lalithsuresh	2017-03-26 14:45:06 -0700

Cluster can now track metadata per-node. Confined to features like roles for now.

2017-03-24

Commit:	d4a1c07
Author:	lalithsuresh	2017-03-24 13:36:36 -0700

Refactor repository into a parent project with modules

Commit:	3798363
Author:	lalithsuresh	2017-03-23 17:08:06 -0700
Committer:	lalithsuresh	2017-03-23 20:57:42 -0700

Consensus implementation

2017-03-09

Commit:	cf1f4e7
Author:	lalithsuresh	2017-03-08 21:49:10 -0800

Implement monitoring support

2017-03-07

Commit:	9c28c32
Author:	lalithsuresh	2017-03-06 16:25:40 -0800

Avoid proposal logging by default + nits

Commit:	4423070
Author:	lalithsuresh	2017-03-06 16:16:06 -0800

Cleanup protobuf descriptions

2017-03-06

Commit:	72c122d
Author:	lalithsuresh	2017-03-06 15:29:38 -0800

Cleanup protobuf descriptions

Commit:	91d82a4
Author:	lalithsuresh	2017-03-06 13:25:12 -0800
Committer:	lalithsuresh	2017-03-06 13:31:34 -0800

Refactor out redundant LinkUpdateMessage class. We only use the protobuf definition now.

2017-03-05

Commit:	5305e39
Author:	lalithsuresh	2017-03-04 17:15:40 -0800

Implement update batching

2017-03-02

Commit:	fe196f2
Author:	lalithsuresh	2017-03-02 11:00:51 -0800

Use InProcessChannel for tests.

2017-02-28

Commit:	af88219
Author:	lalithsuresh	2017-02-28 15:10:15 -0800

Join protocol works until a configuration change. Need to stream back configuration.

Commit:	731faaa
Author:	lalithsuresh	2017-02-28 10:14:56 -0800

Refactor code to accommodate changes to bootstrap procedure

Commit:	a13daf1
Author:	lalithsuresh	2017-02-27 20:48:22 -0800

Checkpoint before re-working MembershipView

2017-02-27

Commit:	1edb1e2
Author:	lalithsuresh	2017-02-26 16:20:19 -0800

Checkpoint before gossip implementation

2017-02-26

Commit:	9c9abbc
Author:	lalithsuresh	2017-02-25 20:06:19 -0800

Checkpoint before async implementation

2017-02-25

Commit:	00a1c2d
Author:	lalithsuresh	2017-02-25 13:54:26 -0800

Test bootstrap

Commit:	c87a041
Author:	lalithsuresh	2017-02-24 23:32:57 -0800

Improve tests and hashing stability

Commit:	da5dd04
Author:	lalithsuresh	2017-02-24 18:35:11 -0800

Part 1 of join protocol

2017-02-24

Commit:	2da809a
Author:	lalithsuresh	2017-02-24 15:30:53 -0800

Prepare to implement join protocol

Commit:	5b73d2c
Author:	lalithsuresh	2017-02-24 14:40:19 -0800

Performance improvements to MembershipView

Commit:	d86641a
Author:	lalithsuresh	2017-02-24 08:57:24 -0800

Introduce node-id maintenance

2017-02-18

Commit:	b6c2c62
Author:	lalithsuresh	2017-02-17 22:46:41 -0800

First take at messaging tests with a simple broadcaster

2017-02-10

Commit:	13a4bd5
Author:	lalithsuresh	2017-02-10 08:49:50 -0800

Split protobuf generated definitions into multiple files

Commit:	1cf205a
Author:	lalithsuresh	2017-02-09 17:19:56 -0800

Transition to gRPC and remove checker framework