futriix

Author	SHA1	Message	Date
antirez	a67c20a067	Replication: disconnect blocked clients when switching to slave role. Bug as old as Redis and blocking operations. It's hard to trigger since it only happens on instance role switch, but the results are quite bad since an inconsistency between master and slave is created. How to trigger the bug is a good description of the bug itself. 1. Client does "BLPOP mylist 0" in master. 2. Master is turned into slave, that replicates from New-Master. 3. Client does "LPUSH mylist foo" in New-Master. 4. New-Master propagates write to slave. 5. Slave receives the LPUSH, the blocked client gets served. Now Master "mylist" key has "foo", Slave "mylist" key is empty. Highlights: * At step "2" above, the client remains attached, basically escaping any check performed during command dispatch: read only slave, in that case. * At step "5" the slave (that was the master), serves the blocked client consuming a list element, which is not consumed on the master side. This scenario is technically likely to happen during failovers, however since Redis Sentinel already disconnects clients using the CLIENT command when changing the role of the instance, the bug is avoided in Sentinel deployments. Closes #2473.	2015-03-24 16:00:09 +01:00
antirez	d8d3c7acf4	Replication: put server.master client creation into separated function.	2015-02-04 11:26:20 +01:00
antirez	3d476bf2b6	AnetFormatIP(): renamed, commented, now sticks to IP:port format. A few code style changes + consistent format: not nice for humans but better for parsers.	2014-12-11 18:20:30 +01:00
Matt Stancliff	f7a98bdf4d	Cleanup all IP formatting code Instead of manually checking for strchr(n,':') everywhere, we can use our new centralized IP formatting functions.	2014-12-11 10:12:18 -05:00
antirez	cf30e64410	Network bandwidth tracking + refactoring. Track bandwidth used by clients and replication (but diskless replication is not tracked since the actual transfer happens in the child process). This includes a refactoring that makes tracking new instantaneous metrics simpler.	2014-12-03 12:16:25 +01:00
antirez	7197f03ef2	Diskless SYNC: fix RDB EOF detection. RDB EOF detection was relying on the final part of the RDB transfer to be a magic 40 bytes EOF marker. However as the slave is put online immediately, and because of sockets timeouts, the replication stream is actually contiguous with the RDB file. This means that to detect the EOF correctly we should either: 1) Scan all the stream searching for the mark. Sucks CPU-wise. 2) Start to send the replication stream only after an acknowledge. 3) Implement a proper chunked encoding. For now solution "2" was picked, so the master does not start to send ASAP the stream of commands in the case of diskless replication. We wait for the first REPLCONF ACK command from the slave, that certifies us that the slave correctly loaded the RDB file and is ready to get more data.	2014-11-11 17:12:12 +01:00
antirez	e996a7edca	Disconnect timedout slave: regression introduced with diskless repl.	2014-11-11 15:10:58 +01:00
Matt Stancliff	2dbdfa708d	Networking: add more outbound IP binding fixes Same as the original bind fixes (we just missed these the first time around). This helps Redis not automatically send connections from the first IP on an interface if we are bound to a specific IP address (e.g. with multiple IP aliases on one interface, you want to send from _your_ IP, not from the first IP on the interface).	2014-10-29 15:09:09 -04:00
antirez	60e3e155b2	Diskless replication: missing listRewind() added. This caused BGSAVE to be triggered a second time without any need when we switch from socket to disk target via the command CONFIG SET repl-diskless-sync no and there is already a slave waiting for the BGSAVE to start. Also comments clarified about what is happening.	2014-10-29 12:48:22 +01:00
antirez	a9313254bb	Log slave ip:port in more log messages.	2014-10-27 12:30:07 +01:00
antirez	c1b26827d0	Added a function to get slave name for logs.	2014-10-27 11:58:20 +01:00
antirez	d4eb64993e	Diskless replication: log BGSAVE delay only when it is non-zero.	2014-10-27 10:48:39 +01:00
antirez	7265428d8e	Diskless sync delay is now configurable.	2014-10-27 10:36:30 +01:00
antirez	951a2f75e4	Remove duplicated log message about starting BGSAVE.	2014-10-24 10:38:42 +02:00
antirez	084a55ab30	Diskless replication: less debugging printfs around.	2014-10-17 17:11:48 +02:00
antirez	ff228efb5c	rio fdset target: handle short writes. While the socket is set in blocking mode, we still can get short writes writing to a socket.	2014-10-17 16:45:53 +02:00
antirez	45df99fdb0	Diskless replication: don't send "\n" pings to slaves. This is useful for normal replication in order to refresh the slave when we are persisting on disk, but for diskless replication the child is already receiving data while in WAIT_BGSAVE_END state.	2014-10-17 10:23:44 +02:00
antirez	9e1e1e0efb	Diskless replication: remove 40 bytes EOF mark from end of RDB file.	2014-10-17 10:23:11 +02:00
antirez	ef2a05e346	Diskless replication: swap inverted branches to compute read len.	2014-10-17 10:22:29 +02:00
antirez	cb80a8d652	Diskless replication: don't enter the read-payload branch forever.	2014-10-17 10:21:18 +02:00
antirez	f470772f8b	Diskless replication: EOF:<mark> streaming support slave side.	2014-10-16 17:09:35 +02:00
antirez	d7a9be4319	Diskless replication: redis.conf and CONFIG SET/GET support.	2014-10-16 10:22:02 +02:00
antirez	20b0165730	Diskless replication: trigger a BGSAVE after a config change. If we turn from diskless to disk-based replication via CONFIG SET, we need a way to start a BGSAVE if there are slaves already waiting for a BGSAVE to start. Normally with disk-based replication we do it as soon as the previous child exits, but when there is a configuration change via CONFIG SET, we may have slaves in WAIT_BGSAVE_START state without an RDB background process currently active.	2014-10-16 10:15:18 +02:00
antirez	6f3aa18617	Diskless replication flag renamed repl_diskless -> repl_diskless_sync.	2014-10-16 10:00:50 +02:00
antirez	df6b4c8f64	Diskless replication: trigger diskless RDB transfer if needed.	2014-10-16 09:03:52 +02:00
antirez	e3645e1809	Diskless replication: handle putting the slave online.	2014-10-15 15:31:19 +02:00
antirez	1900d091d7	Diskless replication: RDB -> slaves transfer draft implementation.	2014-10-14 10:11:29 +02:00
antirez	766cd4bc15	Add some comments in syncCommand() to clarify RDB target.	2014-10-10 16:25:58 +02:00
Aaron Rutkovsky	bd82bd65c0	Fix typos Closes #1513	2014-09-29 06:49:07 -04:00
Jan-Erik Rediger	ebffd515f6	Fix typo: ad -> and Closes #1537	2014-09-29 06:49:06 -04:00
antirez	1c94889182	No more trailing spaces in Redis source code.	2014-06-26 18:48:40 +02:00
antirez	53014b4a9c	ROLE command: array len fixed for slave output.	2014-06-21 11:17:18 +02:00
antirez	0c4f31c53d	ROLE output improved for slaves. Info about the replication state with the master added.	2014-06-07 17:38:20 +02:00
antirez	fbdff35f11	ROLE command added. The new ROLE command is designed to provide a client with information about the replication in a fast and easy to use way compared to the INFO command where the same information is also available.	2014-06-07 17:27:49 +02:00
antirez	d4a180bbc1	CLIENT LIST speedup via peerid caching + smart allocation. This commit adds peer ID caching in the client structure plus an API change and the use of sdsMakeRoomFor() in order to improve the reallocation pattern to generate the CLIENT LIST output. Both the changes account for a very significant speedup.	2014-04-28 17:36:57 +02:00
antirez	f64c5c67ce	Check for EAGAIN in sendBulkToSlave(). Sometimes an osx master with a Linux server over a slow link caused a strange error where osx called the writable function for the socket but actually apparently there was no room in the socket buffer to accept the write: write(2) call returned an EAGAIN error, that was not checked, so we considered write(2) == 0 always as a connection reset, which was unfortunate since the bulk transfer has to start again. Also more errors are logged with the WARNING level in the same code path now.	2014-02-05 16:38:10 +01:00
antirez	6ece04b1fc	Cluster: function clusterGetSlaveRank() added. Return the number of slaves for the same master having a better replication offset of the current slave, that is, the slave "rank" used to pick a delay before the request for election.	2014-01-29 16:39:04 +01:00
antirez	1caae15fdd	Set server.repl_down_since to 0 when changing master. When an instance is potentially set to replicate with another master, it is conceptually disconnected forever, since we have no old copy of the dataset for this master in memory.	2014-01-17 18:20:31 +01:00
antirez	c0cdcaf373	Don't send REPLCONF ACK to old masters. Masters not understanding REPLCONF ACK will reply with errors to our requests causing a number of possible issues. This commit detects a global replication offset set to -1 at the end of the replication, and marks the client representing the master with the REDIS_PRE_PSYNC flag. Note that this flag was called REDIS_PRE_PSYNC_SLAVE but now it is just REDIS_PRE_PSYNC as it is used for both slaves and masters starting with this commit. This commit fixes issue #1488.	2014-01-08 14:28:16 +01:00
antirez	c1a042fda9	Clarify a comment in slaveTryPartialResynchronization().	2014-01-08 14:28:13 +01:00
antirez	c123005f8c	Make new masters inherit replication offsets. Currently replication offsets could be used in a limited way in order to understand, out of a set of slaves, what is the one with the most updated data. For example this comparison is possible if N slaves were replicating all with the same master. However the replication offset was not transferred from master to slaves (that are later promoted as masters) in any way, so for instance if there were three instances A, B, C, with A master and B and C replicating from A, the following could happen: C disconnects from A. B is turned into master. A is switched to master of B. B receives some write. In this context there was no way to compare the offset of A and C, because B would use its own local master replication offset as replication offset to initialize the replication with A. With this commit what happens is that when B is turned into master it inherits the replication offset from A, making A and C comparable. In the above case assuming no inconsistencies are created during the disconnection and failover process, A will show to have a replication offset greater than C. Note that this does not mean offsets are always comparable to understand what is, in a set of instances, since in more complex examples the replica with the higher replication offset could be partitioned away when picking the instance to elect as new master. However this in general improves the ability of a system to try to pick a good replica to promote to master.	2013-12-22 11:43:25 +01:00
antirez	ccd6ccc7dd	Slaves heartbeats during sync improved. The previous fix for false positive timeout detected by master was not complete. There is another blocking stage while loading data for the first synchronization with the master, that is, flushing away the current data from the DB memory. This commit uses the newly introduced dict.c callback in order to make some incremental work (to send "\n" heartbeats to the master) while flushing the old data from memory. It is hard to write a regression test for this issue unfortunately. More support for debugging in the Redis core would be needed in terms of functionalities to simulate a slow DB loading / deletion.	2013-12-10 18:47:31 +01:00
antirez	247a311317	dict.c: added optional callback to dictEmpty(). Redis hash table implementation has many non-blocking features like incremental rehashing, however while deleting a large hash table there was no way to have a callback called to do some incremental work. This commit adds this support, as an optional callback argument to dictEmpty() that is currently called at a fixed interval (one time every 65k deletions).	2013-12-10 18:46:24 +01:00
antirez	2860b5e234	Log empty DB + Loading data into two separated messages.	2013-12-10 18:43:25 +01:00
antirez	7a5a646df9	Fixed grammar: before H the article is a, not an.	2013-12-05 16:35:32 +01:00
antirez	a7ebb0c7bf	WAIT command: synchronous replication for Redis.	2013-12-04 16:20:03 +01:00
antirez	c46f655c90	Log to what master a slave is going to connect to.	2013-11-11 09:25:36 +01:00
antirez	8432ddcedb	Replication: install the write handler when reusing a cached master. Sometimes when we resurrect a cached master after a successful partial resynchronization attempt, there is pending data in the output buffers of the client structure representing the master (likely REPLCONF ACK commands). If we don't reinstall the write handler, it will never be installed again by addReply*() family functions as they'll assume that if there is already data pending, the write handler is already installed. This bug caused some slaves after a successful partial sync to never send REPLCONF ACK, and continuously being detected as timing out by the master, with a disconnection / reconnection loop.	2013-10-04 16:12:25 +02:00
antirez	cd73a69c18	PSYNC: safer handling of PSYNC requests. There was a bug that overestimated the amount of backlog available, however this could only happen when a slave was asking for an offset that was in the "future" compared to the master replication backlog. Now this case is handled well and logged as an incident in the master log file.	2013-10-04 12:25:09 +02:00
antirez	ec3bd0695b	Make clear that runids are not cluster node IDs.	2013-09-30 11:48:09 +02:00

145 Commits