futriix

Author	SHA1	Message	Date
antirez	1cf532dc37	redis-cli --help output improved with --scan and periods.	2014-01-22 12:07:42 +01:00
antirez	d97e8ccd5c	redis-cli: support for --scan option.	2014-01-22 12:04:08 +01:00
antirez	994c5b26dd	redis-cli: support for --scan option.	2014-01-22 12:04:08 +01:00
antirez	54dcf13614	Use fflush() before fsync() in rio.c. Incremental flushing in rio.c is only used to avoid huge kernel buffers synched to slow disks creating big latency spikes, so this fix has no durability implications, however it is certainly more correct to make sure that the FILE buffers are flushed to the kernel before calling fsync on the file descriptor. Thanks to Li Shao Kai for reporting this issue in the Redis mailing list.	2014-01-22 09:54:55 +01:00
antirez	172f14d48c	Use fflush() before fsync() in rio.c. Incremental flushing in rio.c is only used to avoid huge kernel buffers synched to slow disks creating big latency spikes, so this fix has no durability implications, however it is certainly more correct to make sure that the FILE buffers are flushed to the kernel before calling fsync on the file descriptor. Thanks to Li Shao Kai for reporting this issue in the Redis mailing list.	2014-01-22 09:54:55 +01:00
antirez	f57d70571f	Cluster: master nodes wait before rejoining the cluster after reboot. One of the simple heuristics used by Redis Cluster in order to avoid losing data in the typical failure modes created by the asynchronous replication with the slaves (a master is unable, when accepting a write, to immediately tell if it should be really accepted or refused because of a configuration change), is to wait some time before to rejoin the cluster after being partitioned away from the majority of instances. A similar condition happens when a master is restarted. It does not know if it was already failed over, nor if all the clients have already an updated configuration about the cluster map, so it is possible that clients will try to write to stale masters that were restarted. In a similar way this commit changes masters behavior so they wait 2000 milliseconds before accepting writes after a reboot. There is nothing special about 2 seconds if not to be a value supposedly larger a few orders of magnitude compared to the cluster bus communication latencies.	2014-01-20 11:52:52 +01:00
antirez	80e80668f4	Cluster: master nodes wait before rejoining the cluster after reboot. One of the simple heuristics used by Redis Cluster in order to avoid losing data in the typical failure modes created by the asynchronous replication with the slaves (a master is unable, when accepting a write, to immediately tell if it should be really accepted or refused because of a configuration change), is to wait some time before to rejoin the cluster after being partitioned away from the majority of instances. A similar condition happens when a master is restarted. It does not know if it was already failed over, nor if all the clients have already an updated configuration about the cluster map, so it is possible that clients will try to write to stale masters that were restarted. In a similar way this commit changes masters behavior so they wait 2000 milliseconds before accepting writes after a reboot. There is nothing special about 2 seconds if not to be a value supposedly larger a few orders of magnitude compared to the cluster bus communication latencies.	2014-01-20 11:52:52 +01:00
antirez	c7de167cb1	Cluster: debug printf statemets removed. These were committed for error after being inserted in order to fix an issue.	2014-01-20 11:19:04 +01:00
antirez	e6970e204f	Cluster: debug printf statemets removed. These were committed for error after being inserted in order to fix an issue.	2014-01-20 11:19:04 +01:00
antirez	26c1bf2a61	Cluster: don't rewrite slaveof config directive in cluster mode.	2014-01-20 11:10:42 +01:00
antirez	e4a5605c9a	Cluster: don't rewrite slaveof config directive in cluster mode.	2014-01-20 11:10:42 +01:00
antirez	4dff037c2b	Cluster: fix error reporting when slaveof is found in config.	2014-01-20 11:08:14 +01:00
antirez	437fc2cb56	Cluster: fix error reporting when slaveof is found in config.	2014-01-20 11:08:14 +01:00
antirez	aa79af76fc	Cluster: allow CLUSTER REPLICATE to switch master. The code was doing checks for slaves that should be done only when the instance is currently a master. Switching a slave from a master to another one should just work.	2014-01-17 18:22:35 +01:00
antirez	ac3850cabd	Cluster: allow CLUSTER REPLICATE to switch master. The code was doing checks for slaves that should be done only when the instance is currently a master. Switching a slave from a master to another one should just work.	2014-01-17 18:22:35 +01:00
antirez	1caae15fdd	Set server.repl_down_since to 0 when changing master. When an instance is potentially set to replicate with another master, it is conceptually disconnected forever, since we have no old copy of the dataset for this master in memory.	2014-01-17 18:20:31 +01:00
antirez	abd6308d27	Set server.repl_down_since to 0 when changing master. When an instance is potentially set to replicate with another master, it is conceptually disconnected forever, since we have no old copy of the dataset for this master in memory.	2014-01-17 18:20:31 +01:00
antirez	c40ca44941	Cluster: redis-trib shows number of replicas of masters.	2014-01-17 17:56:45 +01:00
antirez	36c24bcca0	Cluster: redis-trib shows number of replicas of masters.	2014-01-17 17:56:45 +01:00
antirez	383b2d63ea	Cluster: redis-trib help output format modified.	2014-01-17 12:32:49 +01:00
antirez	27ed9da383	Cluster: redis-trib help output format modified.	2014-01-17 12:32:49 +01:00
antirez	9fe6577942	Cluster: redis-trib shows what a slave replicates + fixes. Also the :replicates info field in the node object is now correctly populated. This also fixes the :replicas field computation.	2014-01-17 12:06:18 +01:00
antirez	a68c9ba97e	Cluster: redis-trib shows what a slave replicates + fixes. Also the :replicates info field in the node object is now correctly populated. This also fixes the :replicas field computation.	2014-01-17 12:06:18 +01:00
antirez	82859e547d	Cluster: redis-trib addnode is now able to add replicas.	2014-01-17 11:48:42 +01:00
antirez	b451176734	Cluster: redis-trib addnode is now able to add replicas.	2014-01-17 11:48:42 +01:00
antirez	a422dceef7	Cluster: fix redis-trib help subcommand.	2014-01-17 10:29:40 +01:00
antirez	30d9c1dc32	Cluster: fix redis-trib help subcommand.	2014-01-17 10:29:40 +01:00
antirez	ef73ddbca9	Cluster: redis-trib delnode implementation.	2014-01-16 18:22:03 +01:00
antirez	17d0c3e85a	Cluster: redis-trib delnode implementation.	2014-01-16 18:22:03 +01:00
antirez	3c581e39ce	Cluster: don't let a node forget its own master. redis-trib should make sure to reconfigure slaves of a node to remove from the cluster to replicate with other nodes before sending CLUSTER FORGET.	2014-01-16 17:49:35 +01:00
antirez	3d455393a6	Cluster: don't let a node forget its own master. redis-trib should make sure to reconfigure slaves of a node to remove from the cluster to replicate with other nodes before sending CLUSTER FORGET.	2014-01-16 17:49:35 +01:00
antirez	2c9f8fc22b	Cluster: redis-trib help output improved. Show options if any. Clarify that for some command any node address is ok.	2014-01-16 16:23:33 +01:00
antirez	9531c84807	Cluster: redis-trib help output improved. Show options if any. Clarify that for some command any node address is ok.	2014-01-16 16:23:33 +01:00
antirez	595ab5f26b	Cluster: don't forget yourself with CLUSTER FORGET.	2014-01-16 09:46:23 +01:00
antirez	0c373207fa	Cluster: don't forget yourself with CLUSTER FORGET.	2014-01-16 09:46:23 +01:00
antirez	ed0cfc31f3	Cluster: use the node blacklist in CLUSTER FORGET. CLUSTER FORGET is not useful if we can't remove a node from all the nodes of our cluster because of the Gossip protocol that keeps adding a given node to nodes where we already tried to remove it. So now CLUSTER FORGET implements a nodes blacklist that is set and checked by the Gossip section processing function. This way before a node is re-added at least 60 seconds must elapse since the FORGET execution. This means that redis-trib has some time to remove a node from a whole cluster. It is possible that in the future it will be uesful to raise the 60 sec figure to something bigger.	2014-01-15 16:50:45 +01:00
antirez	3e948970fe	Cluster: use the node blacklist in CLUSTER FORGET. CLUSTER FORGET is not useful if we can't remove a node from all the nodes of our cluster because of the Gossip protocol that keeps adding a given node to nodes where we already tried to remove it. So now CLUSTER FORGET implements a nodes blacklist that is set and checked by the Gossip section processing function. This way before a node is re-added at least 60 seconds must elapse since the FORGET execution. This means that redis-trib has some time to remove a node from a whole cluster. It is possible that in the future it will be uesful to raise the 60 sec figure to something bigger.	2014-01-15 16:50:45 +01:00
antirez	7f48ad0629	Cluster: fix clusterBlacklistAddNode() by setting right expire time. The hash table value should be set to now + 60 seconds otherwise it expires immediately.	2014-01-15 16:49:31 +01:00
antirez	ccf268fa17	Cluster: fix clusterBlacklistAddNode() by setting right expire time. The hash table value should be set to now + 60 seconds otherwise it expires immediately.	2014-01-15 16:49:31 +01:00
antirez	8602cbc8b4	Cluster: clusterBlacklistAddNode() key lookup fixed. We can't lookup by node->name that's not an SDS string but a plain C array in the node structure.	2014-01-15 16:45:07 +01:00
antirez	4e1861155f	Cluster: clusterBlacklistAddNode() key lookup fixed. We can't lookup by node->name that's not an SDS string but a plain C array in the node structure.	2014-01-15 16:45:07 +01:00
antirez	90197bd012	Cluster: clusterBlacklistExists() requires blacklist cleanup before lookup.	2014-01-15 16:06:54 +01:00
antirez	b51be7b34f	Cluster: clusterBlacklistExists() requires blacklist cleanup before lookup.	2014-01-15 16:06:54 +01:00
antirez	25cfa0817e	Cluster: set a minimum rejoin delay if node_timeout is too small. The rejoin delay usually is the node timeout. However if the node timeout is too small, we set it to 500 milliseconds, that is a value chosen to be greater than most setups RTT / instances latency figures so that likely communication with other nodes happen before rejoining.	2014-01-15 12:34:33 +01:00
antirez	a81340abaf	Cluster: set a minimum rejoin delay if node_timeout is too small. The rejoin delay usually is the node timeout. However if the node timeout is too small, we set it to 500 milliseconds, that is a value chosen to be greater than most setups RTT / instances latency figures so that likely communication with other nodes happen before rejoining.	2014-01-15 12:34:33 +01:00
antirez	58459f96cd	Cluster: periodically call clusterUpdateState() when cluster is down. Usually we update the cluster state (to understand if we should accept queries or reply with an error) only when there is a change in the state of the nodes. However for the "delayed rejoin" feature to work, that is, for a master to wait some time before accepting queries again after it rejoins the majority, we need to periodically update the last time when the node was partitioned away from the majority. With this commit if the cluster is down we update the state ten times per second.	2014-01-15 12:26:12 +01:00
antirez	a687cbc19c	Cluster: periodically call clusterUpdateState() when cluster is down. Usually we update the cluster state (to understand if we should accept queries or reply with an error) only when there is a change in the state of the nodes. However for the "delayed rejoin" feature to work, that is, for a master to wait some time before accepting queries again after it rejoins the majority, we need to periodically update the last time when the node was partitioned away from the majority. With this commit if the cluster is down we update the state ten times per second.	2014-01-15 12:26:12 +01:00
antirez	d5fdcc269f	Cluster: range checking in getSlotOrReply() fixed. See issue #1426 on Github.	2014-01-15 11:33:46 +01:00
antirez	25ddefdea3	Cluster: range checking in getSlotOrReply() fixed. See issue #1426 on Github.	2014-01-15 11:33:46 +01:00
antirez	adc613f456	Cluster: ignore empty lines in nodes.conf. Even without the user messing manually with the file, it is still possible to have blank lines (just a single "\n" per line) because of how the nodes.conf update/write process works.	2014-01-15 11:23:41 +01:00

... 332 333 334 335 336 ...

20963 Commits