futriix

Author	SHA1	Message	Date
antirez	1560b70889	Cluster: cluster stuff moved from redis.h to cluster.h.	2013-10-09 15:38:05 +02:00
antirez	d2cbc8fee3	Merge branch 'bettercluster' into unstable	2013-10-08 13:04:33 +02:00
antirez	0f079966c7	Cluster: masters don't vote for a slave with stale config. When a slave requests our vote, the configEpoch he claims for its master and the set of served slots must be greater or equal to the configEpoch of the nodes serving these slots in the current configuraiton of the master granting its vote. In other terms, masters don't vote for slaves having a stale configuration for the slots they want to serve.	2013-10-08 12:45:35 +02:00
antirez	26ea55b7f5	Cluster: fix slave data age computation when master is still connected.	2013-10-07 16:07:13 +02:00
antirez	acd9ec222e	Cluster: log message improved when FAIL is cleared from a slave node.	2013-10-07 15:44:58 +02:00
antirez	e9b8b30c81	Cluster: slave nodes advertise master slots bitmap and configEpoch.	2013-10-07 11:31:12 +02:00
antirez	8adeb2b2e3	Replication: install the write handler when reusing a cached master. Sometimes when we resurrect a cached master after a successful partial resynchronization attempt, there is pending data in the output buffers of the client structure representing the master (likely REPLCONF ACK commands). If we don't reinstall the write handler, it will never be installed again by addReply*() family functions as they'll assume that if there is already data pending, the write handler is already installed. This bug caused some slaves after a successful partial sync to never send REPLCONF ACK, and continuously being detected as timing out by the master, with a disconnection / reconnection loop.	2013-10-04 16:14:54 +02:00
antirez	8432ddcedb	Replication: install the write handler when reusing a cached master. Sometimes when we resurrect a cached master after a successful partial resynchronization attempt, there is pending data in the output buffers of the client structure representing the master (likely REPLCONF ACK commands). If we don't reinstall the write handler, it will never be installed again by addReply*() family functions as they'll assume that if there is already data pending, the write handler is already installed. This bug caused some slaves after a successful partial sync to never send REPLCONF ACK, and continuously being detected as timing out by the master, with a disconnection / reconnection loop.	2013-10-04 16:12:25 +02:00
antirez	4cddbc8ad4	Replication: fix master timeout. Since we started sending REPLCONF ACK from slaves to masters, the lastinteraction field of the client structure is always refreshed as soon as there is room in the socket output buffer, so masters in timeout are detected with too much delay (the socket buffer takes a lot of time to be filled by small REPLCONF ACK <number> entries). This commit only counts data received as interactions with a master, solving the issue.	2013-10-04 13:01:45 +02:00
antirez	cca9f8c432	Replication: fix master timeout. Since we started sending REPLCONF ACK from slaves to masters, the lastinteraction field of the client structure is always refreshed as soon as there is room in the socket output buffer, so masters in timeout are detected with too much delay (the socket buffer takes a lot of time to be filled by small REPLCONF ACK <number> entries). This commit only counts data received as interactions with a master, solving the issue.	2013-10-04 12:59:24 +02:00
antirez	e2e4c81d9d	PSYNC: safer handling of PSYNC requests. There was a bug that over-esteemed the amount of backlog available, however this could only happen when a slave was asking for an offset that was in the "future" compared to the master replication backlog. Now this case is handled well and logged as an incident in the master log file.	2013-10-04 12:27:30 +02:00
antirez	de86e24ba6	Add REWRITE to CONFIG subcommands help message.	2013-10-04 12:27:26 +02:00
antirez	cd73a69c18	PSYNC: safer handling of PSYNC requests. There was a bug that over-esteemed the amount of backlog available, however this could only happen when a slave was asking for an offset that was in the "future" compared to the master replication backlog. Now this case is handled well and logged as an incident in the master log file.	2013-10-04 12:25:09 +02:00
antirez	dbf6c85d5e	Cluster: new clusterDoBeforeSleep() API. The new API is able to remember operations to perform before returning to the event loop, such as checking if there is the failover quorum for a slave, save and fsync the configuraiton file, and so forth. Because this operations are performed before returning on the event loop we are sure that messages that are sent in the same event loop run will be delivered after the configuration is already saved, that is a requirement sometimes. For instance we want to publish a new epoch only when it is already stored in nodes.conf in order to avoid returning back in the logical clock when a node is restarted. This new API provides a big performance advantage compared to saving and possibly fsyncing the configuration file multiple times in the same event loop run, especially in the case of big clusters with tens or hundreds of nodes.	2013-10-03 09:58:06 +02:00
antirez	43f3df99c8	Cluster: update cluster config when slave changes master.	2013-10-02 12:27:12 +02:00
antirez	5cbb913994	Cluster: bus messages stats in CLUSTER info.	2013-10-02 10:10:08 +02:00
antirez	90b06ab7b5	Cluster: FAIL messages from unknown senders are handled better. Previously the event was not logged but instead the node reported an unknown packet type received.	2013-10-02 09:42:45 +02:00
antirez	3be5010adb	Cluster: senderCurrentEpoch == node currentEpoch was too strict. We can accept a vote as long as its epoch is >= the epoch at which we started the voting process. There is no need for it to be exactly the same.	2013-10-01 17:21:28 +02:00
antirez	0000cfbf38	Cluster: fix typo in clusterProcessPacket() comment.	2013-10-01 15:40:20 +02:00
antirez	6ed0dee927	Cluster: time field removed from cluster messages header. The new algorithm does not check replies time as checking for the currentEpoch in the reply ensures that the reply is about the current election process.	2013-09-30 16:19:44 +02:00
antirez	0b3a8f2072	Add REWRITE to CONFIG subcommands help message.	2013-09-30 11:53:18 +02:00
antirez	60d4ae49be	Cluster: log message shortened.	2013-09-30 11:51:58 +02:00
antirez	ec3bd0695b	Make clear that runids are not cluster node IDs.	2013-09-30 11:48:09 +02:00
antirez	1239f49065	Cluster: detect cluster reconfiguration when master slots drop to 0. The old algorithm used a PROMOTED flag and explicitly checks about slave->master convertions. Wit the new cluster meta-data propagation algorithm we just look at the configEpoch to check if we need to reconfigure slots, then: 1) If a node is a master but it reaches zero served slots becuase of reconfiguration. 2) If a node is a slave but the master reaches zero served slots because of a reconfiguration. We switch as a replica of the new slots owner.	2013-09-30 11:45:26 +02:00
antirez	2a391b8bac	Cluster: re-order failover operations to make it safer. We need to: 1) Increment the configEpoch. 2) Save it to disk and fsync the file. 3) Broadcast the PONG with the new configuration. If other nodes will receive the updated configuration we need to be sure to restart with this new config in the event of a crash.	2013-09-30 10:16:48 +02:00
antirez	0b63dc2841	Cluster: when upading the configEpoch for a node, save config on disk ASAP.	2013-09-30 10:16:25 +02:00
antirez	5d393adeac	Cluster: fsync data when saving the cluster config.	2013-09-30 10:13:07 +02:00
antirez	8fa4e7817a	Cluster: update the node configEpoch when newer is detected.	2013-09-27 09:55:41 +02:00
antirez	c8d6bc94e4	Cluster: react faster when a slave wins an election.	2013-09-26 16:54:43 +02:00
antirez	7dfa4c5981	Cluster: removed an old source of delay to start the slave failover.	2013-09-26 13:28:19 +02:00
antirez	3bd69bcdf1	Cluster: master node now uses new protocol to vote.	2013-09-26 13:00:41 +02:00
antirez	f941650091	Cluster: slave node now uses the new protocol to get elected.	2013-09-26 11:13:17 +02:00
Michel Martens	17ce9a8da0	Document the redis-cli --csv option.	2013-09-26 10:12:46 +02:00
antirez	d392f33abb	Cluster: fix redis-trib node config fingerprinting for new nodes format.	2013-09-25 12:58:06 +02:00
antirez	2cac667a8b	Cluster: fix redis-trib for added configEpoch field in CLUSTER NODES.	2013-09-25 12:44:56 +02:00
antirez	24b2894194	Cluster: add currentEpoch to CLUSTER INFO.	2013-09-25 12:38:36 +02:00
antirez	6dbd939a24	Cluster: update our currentEpoch when a greater one is seen.	2013-09-25 12:36:29 +02:00
antirez	1adf457b5b	Cluster: broadcast currentEpoch and configEpoch in packets header.	2013-09-25 11:53:35 +02:00
antirez	cdf4eede58	Cluster: configEpoch added in cluster nodes description.	2013-09-25 11:47:13 +02:00
antirez	98d1253053	htonu64() and ntohu64 added to endianconv.h.	2013-09-25 09:26:36 +02:00
antirez	3a9bf5e618	Cluster: PFAIL -> FAIL transition allowed for slaves. First change: now there is no need to be a master in order to detect a failure, however the majority of masters signaling PFAIL or FAIL is needed. This change is important because it allows slaves rejoining the cluster after a partition to sense the FAIL condition so that eventually all the nodes agree on failures.	2013-09-20 11:26:44 +02:00
antirez	3f5034d1d7	Cluster: added time field in cluster bus messages. The time is sent in requests, and copied back in reply packets. This way the receiver can compare the time field in a reply with its local clock and check the age of the request associated with this reply. This is an easy way to discard delayed replies. Note that only a clock is used here, that is the one of the node sending the packet. The receiver only copies the field back into the reply, so no synchronization is needed between clocks of different hosts.	2013-09-20 09:22:21 +02:00
antirez	90e1829ec4	Allow AUTH / PING when disconnected from slave and serve-stale-data is no.	2013-09-17 09:46:06 +02:00
antirez	c7cb80c8bb	Cluster: don't add an handshake node for the same ip:port pair multiple times.	2013-09-04 15:52:16 +02:00
antirez	79a1deac28	Cluster: free HANDSHAKE nodes after node_timeout. Handshake nodes should turn into normal nodes or be freed in a reasonable amount of time, otherwise they'll keep accumulating if the address they are associated with is not reachable for some reason.	2013-09-04 12:41:21 +02:00
antirez	ee99df2d59	redis-cli: fix big keys search when the key no longer exist. The code freed a reply object that was never created, resulting in a segfault every time randomkey returned a key that was deleted before we queried it for size.	2013-09-04 10:35:53 +02:00
antirez	232f84ec2d	Cluster: CLUSTER SAVECONFIG command added.	2013-09-04 10:33:00 +02:00
antirez	61eb16c4da	Cluster: don't save HANDSHAKE nodes in nodes.conf.	2013-09-04 10:25:26 +02:00
antirez	6e460eac58	Cluster: always use safe iteartors to iterate server.cluster->nodes.	2013-09-04 10:07:50 +02:00
Maxim Zakharov	ff18243fce	mistype fixed	2013-09-03 15:15:51 +02:00

1 2 3 4 5 ...

3600 Commits