futriix

Author	SHA1	Message	Date
antirez	d4a180bbc1	CLIENT LIST speedup via peerid caching + smart allocation. This commit adds peer ID caching in the client structure plus an API change and the use of sdsMakeRoomFor() in order to improve the reallocation pattern to generate the CLIENT LIST output. Both the changes account for a very significant speedup.	2014-04-28 17:36:57 +02:00
antirez	f64c5c67ce	Check for EAGAIN in sendBulkToSlave(). Sometime an osx master with a Linux server over a slow link caused a strange error where osx called the writable function for the socket but actually apparently there was no room in the socket buffer to accept the write: write(2) call returned an EAGAIN error, that was not checked, so we considered write(2) == 0 always as a connection reset, which was unfortunate since the bulk transfer has to start again. Also more errors are logged with the WARNING level in the same code path now.	2014-02-05 16:38:10 +01:00
antirez	6ece04b1fc	Cluster: function clusterGetSlaveRank() added. Return the number of slaves for the same master having a better replication offset of the current slave, that is, the slave "rank" used to pick a delay before the request for election.	2014-01-29 16:39:04 +01:00
antirez	1caae15fdd	Set server.repl_down_since to 0 when changing master. When an instance is potentially set to replicate with another master, it is conceptually disconnected forever, since we have no old copy of the dataset for this master in memory.	2014-01-17 18:20:31 +01:00
antirez	c0cdcaf373	Don't send REPLCONF ACK to old masters. Masters not understanding REPLCONF ACK will reply with errors to our requests causing a number of possible issues. This commit detects a global replication offest set to -1 at the end of the replication, and marks the client representing the master with the REDIS_PRE_PSYNC flag. Note that this flag was called REDIS_PRE_PSYNC_SLAVE but now it is just REDIS_PRE_PSYNC as it is used for both slaves and masters starting with this commit. This commit fixes issue #1488.	2014-01-08 14:28:16 +01:00
antirez	c1a042fda9	Clarify a comment in slaveTryPartialResynchronization().	2014-01-08 14:28:13 +01:00
antirez	c123005f8c	Make new masters inherit replication offsets. Currently replication offsets could be used into a limited way in order to understand, out of a set of slaves, what is the one with the most updated data. For example this comparison is possible of N slaves were replicating all with the same master. However the replication offset was not transferred from master to slaves (that are later promoted as masters) in any way, so for instance if there were three instances A, B, C, with A master and B and C replication from A, the following could happen: C disconnects from A. B is turned into master. A is switched to master of B. B receives some write. In this context there was no way to compare the offset of A and C, because B would use its own local master replication offset as replication offset to initialize the replication with A. With this commit what happens is that when B is turned into master it inherits the replication offset from A, making A and C comparable. In the above case assuming no inconsistencies are created during the disconnection and failover process, A will show to have a replication offset greater than C. Note that this does not mean offsets are always comparable to understand what is, in a set of instances, since in more complex examples the replica with the higher replication offset could be partitioned away when picking the instance to elect as new master. However this in general improves the ability of a system to try to pick a good replica to promote to master.	2013-12-22 11:43:25 +01:00
antirez	ccd6ccc7dd	Slaves heartbeats during sync improved. The previous fix for false positive timeout detected by master was not complete. There is another blocking stage while loading data for the first synchronization with the master, that is, flushing away the current data from the DB memory. This commit uses the newly introduced dict.c callback in order to make some incremental work (to send "\n" heartbeats to the master) while flushing the old data from memory. It is hard to write a regression test for this issue unfortunately. More support for debugging in the Redis core would be needed in terms of functionalities to simulate a slow DB loading / deletion.	2013-12-10 18:47:31 +01:00
antirez	247a311317	dict.c: added optional callback to dictEmpty(). Redis hash table implementation has many non-blocking features like incremental rehashing, however while deleting a large hash table there was no way to have a callback called to do some incremental work. This commit adds this support, as an optiona callback argument to dictEmpty() that is currently called at a fixed interval (one time every 65k deletions).	2013-12-10 18:46:24 +01:00
antirez	2860b5e234	Log empty DB + Loading data into two separated messages.	2013-12-10 18:43:25 +01:00
antirez	7a5a646df9	Fixed grammar: before H the article is a, not an.	2013-12-05 16:35:32 +01:00
antirez	a7ebb0c7bf	WAIT command: synchronous replication for Redis.	2013-12-04 16:20:03 +01:00
antirez	c46f655c90	Log to what master a slave is going to connect to.	2013-11-11 09:25:36 +01:00
antirez	8432ddcedb	Replication: install the write handler when reusing a cached master. Sometimes when we resurrect a cached master after a successful partial resynchronization attempt, there is pending data in the output buffers of the client structure representing the master (likely REPLCONF ACK commands). If we don't reinstall the write handler, it will never be installed again by addReply*() family functions as they'll assume that if there is already data pending, the write handler is already installed. This bug caused some slaves after a successful partial sync to never send REPLCONF ACK, and continuously being detected as timing out by the master, with a disconnection / reconnection loop.	2013-10-04 16:12:25 +02:00
antirez	cd73a69c18	PSYNC: safer handling of PSYNC requests. There was a bug that over-esteemed the amount of backlog available, however this could only happen when a slave was asking for an offset that was in the "future" compared to the master replication backlog. Now this case is handled well and logged as an incident in the master log file.	2013-10-04 12:25:09 +02:00
antirez	ec3bd0695b	Make clear that runids are not cluster node IDs.	2013-09-30 11:48:09 +02:00
Maxim Zakharov	1885c6bada	A mistype fixed	2013-09-03 15:15:48 +02:00
antirez	a33c9fb250	replicationFeedSlaves() func name typo: feedReplicationBacklogWithObject -> feedReplicationBacklog.	2013-08-12 12:50:45 +02:00
antirez	6268dbdd94	replicationFeedSlave() reworked for correctness and speed. The previous code using a static buffer as an optimization was lame: 1) Premature optimization, actually it was slower than naive code because resulted into the creation / destruction of the object encapsulating the output buffer. 2) The code was very hard to test, since it was needed to have specific tests for command lines exceeding the size of the static buffer. 3) As a result of "2" the code was bugged as the current tests were not able to stress specific corner cases. It was replaced with easy to understand code that is safer and faster.	2013-08-12 12:50:29 +02:00
antirez	21cde6ecb7	Fix a PSYNC bug caused by a variable name typo.	2013-08-12 11:51:35 +02:00
antirez	4b8b7cb964	Replication: better way to send a preamble before RDB payload. During the replication full resynchronization process, the RDB file is transfered from the master to the slave. However there is a short preamble to send, that is currently just the bulk payload length of the file in the usual Redis form $..length..<CR><LF>. This preamble used to be sent with a direct write call, assuming that there was alway room in the socket output buffer to hold the few bytes needed, however this does not scale in case we'll need to send more stuff, and is not very robust code in general. This commit introduces a more general mechanism to send a preamble up to 2GB in size (the max length of an sds string) in a non blocking way.	2013-08-12 10:29:14 +02:00
antirez	9efbe0dca0	Fix replicationFeedSlaves() off-by-one bug. This fixes issue #1221.	2013-07-28 12:49:34 +02:00
antirez	06a0f621d6	Fix replicationFeedSlaves() to use sdsEncodedObject() macro.	2013-07-22 10:36:27 +02:00
Ted Nyman	efaa9d0bc4	Make sure the log standardizes on 'timeout'	2013-07-12 14:06:27 -07:00
antirez	80993d9892	Use getClientPeerId() for MONITOR implementation.	2013-07-09 16:21:21 +02:00
antirez	2fa66d5e76	Fix old anetPeerToString() API call in replication.c	2013-07-08 16:11:52 +02:00
Geoff Garside	dc7e8ec27f	Update calls to anetPeerToString to include ip_len.	2013-07-08 15:57:22 +02:00
antirez	cdaacf03aa	Don't disconnect pre PSYNC replication clients for timeout. Clients using SYNC to replicate are older implementations, such as redis-cli --slave, and are not designed to acknowledge the master with REPLCONF ACK commands, so we don't have any feedback and should not disconnect them on timeout.	2013-06-26 10:11:20 +02:00
antirez	eaebabe564	Use the RSC to replicate EVALSHA unmodified. This commit uses the Replication Script Cache in order to avoid translating EVALSHA into EVAL whenever possible for both the AOF and slaves.	2013-06-24 18:57:31 +02:00
antirez	a9e1c46f40	Replication of scripts as EVALSHA: sha1 caching implemented. This code is only responsible to take an LRU-evicted fixed length cache of SHA1 that we are sure all the slaves received. In this commit only the implementation is provided, but the Redis core does not use it to actually send EVALSHA to slaves when possible.	2013-06-24 10:26:04 +02:00
antirez	e802b22dfb	Refresh good slaves count when setting slave state as online.	2013-05-30 12:13:25 +02:00
antirez	cb76f29230	min-slaves-to-write: don't accept writes with less than N replicas. This feature allows the user to specify the minimum number of connected replicas having a lag less or equal than the specified amount of seconds for writes to be accepted.	2013-05-30 11:30:04 +02:00
antirez	8752adc059	Close connection with timedout slaves. Now masters, using the time at which the last REPLCONF ACK was received, are able to explicitly disconnect slaves that are no longer responding. Previously the only chance was to see a very long output buffer, that was highly suboptimal.	2013-05-27 11:42:42 +02:00
antirez	8304ed9a06	Send ACK to master once every second. ACKs can be also used as a base for synchronous replication. However in that case they'll be explicitly requested by the master when the client sends a request that needs to be replicated synchronously.	2013-05-27 11:42:38 +02:00
antirez	c9af89d8cd	Don't ACK the master after every command. Sending an ACK is now moved into the replicationSendAck() function.	2013-05-27 11:42:35 +02:00
antirez	dfc2575703	Make sure that REPLCONF ACK really has no return value.	2013-05-27 11:42:30 +02:00
antirez	6d2b8f5845	REPLCONF ACK command. This special command is used by the slave to inform the master the amount of replication stream it currently consumed. it does not return anything so that we not need to consume additional bandwidth needed by the master to reply something. The master can do a number of things knowing the amount of stream processed, such as understanding the "lag" in bytes of the slave, verify if a given command was already processed by the slave, and so forth.	2013-05-27 11:42:17 +02:00
antirez	39c5f8a615	Cluster: SLAVEOF command not allowed in cluster mode.	2013-03-05 12:39:41 +01:00
antirez	7cf96d66ef	Make sure replicationSetMaster() works when ip argument is not an sds.	2013-03-04 15:39:55 +01:00
antirez	06fa5f82d7	SLAVEOF command refactored into a proper API. We now have replicationSetMaster() and replicationUnsetMaster() that can be called in other contexts (for instance Redis Cluster).	2013-03-04 13:22:21 +01:00
antirez	646785ae48	Use GCC printf format attribute for redisLog(). This commit also fixes redisLog() statements producing warnings.	2013-02-27 12:27:15 +01:00
antirez	c72be04d12	PSYNC: another change to unexpected reply from PSYNC.	2013-02-13 18:43:40 +01:00
antirez	67ef554e2e	PSYNC: More robust handling of unexpected reply to PSYNC.	2013-02-13 18:33:33 +01:00
antirez	f8e3cd19ad	Replication: more strict error checking for master PING reply.	2013-02-12 16:53:27 +01:00
antirez	12a3bf6245	Replication: added new stats counting full and partial resynchronizations.	2013-02-12 15:33:54 +01:00
antirez	d5bed58b08	PSYNC: debugging printf() calls are now logs at DEBUG level.	2013-02-12 12:52:22 +01:00
antirez	33a7ca234d	Remove harmless warning in slaveTryPartialResynchronization().	2013-02-12 12:52:21 +01:00
antirez	10756f5c4a	PSYNC: don't use the client buffer to send +CONTINUE and +FULLRESYNC. When we are preparing an handshake with the slave we can't touch the connection buffer as it'll be used to accumulate differences between the sent RDB file and what arrives next from clients. So in short we can't use addReply() family functions. However we just use write(2) because we know that the socket buffer is empty, since a prerequisite for SYNC to work is that the static buffer and the output list are empty, and in general it is not expected that a client SYNCs after doing some heavy I/O with the master. However a short write connection is explicitly handled to avoid fragility (we simply close the connection and the slave will retry).	2013-02-12 12:52:21 +01:00
antirez	b5ddb829b5	SYNC not allowed with pending data on the static output buffer.	2013-02-12 12:52:21 +01:00
antirez	65c0a0eb2b	Log the unexpected string received in place of the SYNC payload length.	2013-02-12 12:52:21 +01:00

... 3 4 5 6 7

311 Commits