futriix

Author	SHA1	Message	Date
antirez	45a900e448	Cluster: manual failover initial implementation.	2014-02-05 13:01:24 +01:00
antirez	c94826dfdf	CLIENT PAUSE and related API implemented. The API is one of the bulding blocks of CLUSTER FAILOVER command that executes a manual failover in Redis Cluster. However exposed as a command that the user can call directly, it makes much simpler to upgrade a standalone Redis instance using a slave in a safer way. The commands works like that: CLIENT PAUSE <milliesconds> All the clients that are not slaves and not in MONITOR state are paused for the specified number of milliesconds. This means that slaves are normally served in the meantime. At the end of the specified amount of time all the clients are unblocked and will continue operations normally. This command has no effects on the population of the slow log, since clients are not blocked in the middle of operations but only when there is to process new data. Note that while the clients are unblocked, still new commands are accepted and queued in the client buffer, so clients will likely not block while writing to the server while the pause is active.	2014-02-04 16:16:09 +01:00
antirez	c7426670a0	Scripting: expire keys in scripts only at first access. Keys expiring in the middle of the execution of Lua scripts are to create inconsistencies in masters and / or AOF files. See the following example: if redis.call("exists",KEYS[1]) == 1 then redis.call("incr","mycounter") end if redis.call("exists",KEYS[1]) == 1 then return redis.call("incr","mycounter") end The script executes two times the same if key exists then incrementcounter logic. However the two executions will work differently in the master and the slaves, provided some unlucky timing happens. In the master the first time the key may still exist, while the second time the key may no longer exist. This will result in the key incremented just one time. However as a side effect the master will generate a synthetic `DEL` command in the replication channel in order to force the slaves to expire the key (given that key expiration is master-driven). When the same script will run in the slave, the key will no longer be there, so the script will not increment the key. The key idea used to implement the expire-at-first-lookup semantics was provided by Marc Gravell.	2014-02-03 16:15:53 +01:00
antirez	6ed232ebfd	Allow CONFIG and SHUTDOWN while in stale-slave state.	2014-02-03 15:51:03 +01:00
antirez	d0db6b9fcf	Scripting: use mstime() and mstime_t for lua_time_start. server.lua_time_start is expressed in milliseconds. Use mstime_t instead of long long, and populate it with mstime() instead of ustime()/1000. Functionally identical but more natural.	2014-02-03 15:45:40 +01:00
Salvatore Sanfilippo	7751d90b7d	Merge pull request #1534 from gdi2290/patch-1 update copyright year	2014-02-03 02:18:24 -08:00
PatrickJS	4400b317ab	update copyright year	2014-02-03 02:10:54 -08:00
antirez	3c7a5b29aa	Test: fixed osx msg passing issue in testing framework. The Redis test uses a server-clients model in order to parallelize the execution of different tests. However in recent versions of osx not setting the channel to a binary encoding caused issues even if AFAIK no binary data is really sent via this channel. However now the channels are deliberately set to a binary encoding and this solves the issue. The exact issue was the test not terminating and giving the impression of running forever, since test clients or servers were unable to exchange the messages to continue.	2014-01-31 16:27:03 +01:00
antirez	3d54628d55	Redis.conf comment about tcp-backlog option improved.	2014-01-31 14:59:50 +01:00
antirez	2d253d1543	Option "backlog" renamed "tcp-backlog". This is especially important since we already have a concept of backlog (the replication backlog).	2014-01-31 14:56:10 +01:00
Nenad Merdanovic	ca81272ea4	Add support for listen(2) backlog definition In high RPS environments, the default listen backlog is not sufficient, so giving users the power to configure it is the right approach, especially since it requires only minor modifications to the code.	2014-01-31 14:52:10 +01:00
antirez	1c93894ace	Cluster: fix an error in migration-barrier comment in redis.conf.	2014-01-31 11:31:50 +01:00
antirez	244b6263db	Cluster: barrer -> barrier in redis.conf.	2014-01-31 11:30:49 +01:00
antirez	bc300b22af	Cluster: configurable replicas migration barrier. It is possible to configure the min number of additional working slaves a master should be left with, for a slave to migrate to an orphaned master.	2014-01-31 11:26:36 +01:00
antirez	b9c51f518b	Sentinel: check arity for SENTINEL MASTER command. This fixes issue #1530.	2014-01-31 10:13:38 +01:00
antirez	1d804400f5	Cluster: perform orphaned masters check before continue statements. The check was placed in a way that conflicted with the continue statements used by the node hearth beat code later that needs to skip the current node sometimes. Moved at the start of the function so that's always executed.	2014-01-30 18:23:31 +01:00
antirez	4f1fdc2c64	Cluster: replica migration implementation. This feature allows slaves to migrate to orphaned masters (masters without working slaves), as long as a set of conditions are met, including the fact that the migrating slave needs to be in a master-slaves ring with at least another slave working.	2014-01-30 18:05:11 +01:00
antirez	ef82c59716	Cluster: swap two code blocks to have a more obvious flow.	2014-01-30 16:34:23 +01:00
antirez	b582a91335	Cluster: remove not needed return statement breaking failover.	2014-01-29 17:28:46 +01:00
antirez	378e9745db	Cluster: broadcast pong to other slaves in the same ring. When we schedule a failover, broadcast a PONG to the slaves. The other slaves that plan to get elected will do the same too, this way it is likely that every slave will have a good picture of its own rank. Note that this is N*N messages where N is the number of slaves for the failing master, however usually even large clusters have many master nodes but a limited number of replicas per node, so this is harmless.	2014-01-29 17:19:55 +01:00
antirez	6546c6ff3b	Cluster: log offset when announcing the failover election delay.	2014-01-29 17:16:10 +01:00
antirez	f5c5e2707c	Cluster: added progressive election delay according to slave rank. Note that when we compute the initial delay, there are probably still more up to date information to receive from slaves with new offsets, so the delay is recomputed when new data is available.	2014-01-29 16:53:45 +01:00
antirez	6ece04b1fc	Cluster: function clusterGetSlaveRank() added. Return the number of slaves for the same master having a better replication offset of the current slave, that is, the slave "rank" used to pick a delay before the request for election.	2014-01-29 16:39:04 +01:00
antirez	e529f813ae	Cluster: update node replication offset from bus packets headers.	2014-01-29 16:01:00 +01:00
antirez	0c36b17916	Cluster: refactoring: new macros to check node flags.	2014-01-29 12:17:16 +01:00
antirez	604fe17883	Cluster: use myself instead of server->cluster.myself.	2014-01-29 11:38:14 +01:00
antirez	9e91cc8b28	Cluster: added a global myself pointer in cluster.c. Accessing to the 'myself' node, the node representing the currently running instance, is handy without the need to type server.cluster->myself every time.	2014-01-29 11:22:22 +01:00
antirez	5124365a8f	Cluster: clusterBroadcastPong() improved with target selection. Now we can broadcast a pong to all the instances or just the local slaves (that is useful for replication offset propagation).	2014-01-29 11:08:52 +01:00
antirez	b8a0220498	Cluster: broadcast master/slave replication offset in bus header.	2014-01-28 16:51:50 +01:00
antirez	0ae1d07b01	Cluster: limit cluster.h to 80 cols.	2014-01-28 16:34:23 +01:00
antirez	91afff8fee	Cluster: introduced repl_offset fields in clusterNode. The two fields are used in order to remember the latest known replication offset and the time we received it from other slave nodes. This will be used by slaves in order to start the election procedure with a delay that is proportional to the rank of the slave among the other slaves for this master, when sorted for replication offset. Usually this allows the slave with the most updated offset to win the election and replace the failing master in the cluster.	2014-01-28 16:28:07 +01:00
antirez	68fdf31201	Fixed inverted if condition in MISCONF error code path.	2014-01-28 10:11:12 +01:00
antirez	e13076867f	Don't log MONITOR clients as disconnecting slaves.	2014-01-25 11:53:53 +01:00
antirez	505e3836a7	Cluster: redis-trib set-timeout implemented.	2014-01-24 15:06:01 +01:00
antirez	ee6f19d7cb	Cluster: update slaves lists in clusterSetMaster().	2014-01-22 18:46:53 +01:00
antirez	423e5e601b	Cluster: CLUSTER SLAVES subcommand added.	2014-01-22 18:38:42 +01:00
antirez	ae66039bb1	Cluster: clusterGenNodesDescription() refactored into two functions.	2014-01-22 18:36:12 +01:00
antirez	a166f49794	redis-cli --help output improved with --scan and periods.	2014-01-22 12:07:42 +01:00
antirez	d97e8ccd5c	redis-cli: support for --scan option.	2014-01-22 12:04:08 +01:00
antirez	54dcf13614	Use fflush() before fsync() in rio.c. Incremental flushing in rio.c is only used to avoid huge kernel buffers synched to slow disks creating big latency spikes, so this fix has no durability implications, however it is certainly more correct to make sure that the FILE buffers are flushed to the kernel before calling fsync on the file descriptor. Thanks to Li Shao Kai for reporting this issue in the Redis mailing list.	2014-01-22 09:54:55 +01:00
antirez	f57d70571f	Cluster: master nodes wait before rejoining the cluster after reboot. One of the simple heuristics used by Redis Cluster in order to avoid losing data in the typical failure modes created by the asynchronous replication with the slaves (a master is unable, when accepting a write, to immediately tell if it should be really accepted or refused because of a configuration change), is to wait some time before to rejoin the cluster after being partitioned away from the majority of instances. A similar condition happens when a master is restarted. It does not know if it was already failed over, nor if all the clients have already an updated configuration about the cluster map, so it is possible that clients will try to write to stale masters that were restarted. In a similar way this commit changes masters behavior so they wait 2000 milliseconds before accepting writes after a reboot. There is nothing special about 2 seconds if not to be a value supposedly larger a few orders of magnitude compared to the cluster bus communication latencies.	2014-01-20 11:52:52 +01:00
antirez	c7de167cb1	Cluster: debug printf statemets removed. These were committed for error after being inserted in order to fix an issue.	2014-01-20 11:19:04 +01:00
antirez	26c1bf2a61	Cluster: don't rewrite slaveof config directive in cluster mode.	2014-01-20 11:10:42 +01:00
antirez	4dff037c2b	Cluster: fix error reporting when slaveof is found in config.	2014-01-20 11:08:14 +01:00
antirez	aa79af76fc	Cluster: allow CLUSTER REPLICATE to switch master. The code was doing checks for slaves that should be done only when the instance is currently a master. Switching a slave from a master to another one should just work.	2014-01-17 18:22:35 +01:00
antirez	1caae15fdd	Set server.repl_down_since to 0 when changing master. When an instance is potentially set to replicate with another master, it is conceptually disconnected forever, since we have no old copy of the dataset for this master in memory.	2014-01-17 18:20:31 +01:00
antirez	c40ca44941	Cluster: redis-trib shows number of replicas of masters.	2014-01-17 17:56:45 +01:00
antirez	383b2d63ea	Cluster: redis-trib help output format modified.	2014-01-17 12:32:49 +01:00
antirez	9fe6577942	Cluster: redis-trib shows what a slave replicates + fixes. Also the :replicates info field in the node object is now correctly populated. This also fixes the :replicas field computation.	2014-01-17 12:06:18 +01:00
antirez	82859e547d	Cluster: redis-trib addnode is now able to add replicas.	2014-01-17 11:48:42 +01:00

1 2 3 4 5 ...

3938 Commits