futriix

Author	SHA1	Message	Date
antirez	c8d6bc94e4	Cluster: react faster when a slave wins an election.	2013-09-26 16:54:43 +02:00
antirez	7dfa4c5981	Cluster: removed an old source of delay to start the slave failover.	2013-09-26 13:28:19 +02:00
antirez	3bd69bcdf1	Cluster: master node now uses new protocol to vote.	2013-09-26 13:00:41 +02:00
antirez	f941650091	Cluster: slave node now uses the new protocol to get elected.	2013-09-26 11:13:17 +02:00
antirez	24b2894194	Cluster: add currentEpoch to CLUSTER INFO.	2013-09-25 12:38:36 +02:00
antirez	6dbd939a24	Cluster: update our currentEpoch when a greater one is seen.	2013-09-25 12:36:29 +02:00
antirez	1adf457b5b	Cluster: broadcast currentEpoch and configEpoch in packets header.	2013-09-25 11:53:35 +02:00
antirez	cdf4eede58	Cluster: configEpoch added in cluster nodes description.	2013-09-25 11:47:13 +02:00
antirez	3a9bf5e618	Cluster: PFAIL -> FAIL transition allowed for slaves. First change: now there is no need to be a master in order to detect a failure, however the majority of masters signaling PFAIL or FAIL is needed. This change is important because it allows slaves rejoining the cluster after a partition to sense the FAIL condition so that eventually all the nodes agree on failures.	2013-09-20 11:26:44 +02:00
antirez	3f5034d1d7	Cluster: added time field in cluster bus messages. The time is sent in requests, and copied back in reply packets. This way the receiver can compare the time field in a reply with its local clock and check the age of the request associated with this reply. This is an easy way to discard delayed replies. Note that only a clock is used here, that is the one of the node sending the packet. The receiver only copies the field back into the reply, so no synchronization is needed between clocks of different hosts.	2013-09-20 09:22:21 +02:00
antirez	c7cb80c8bb	Cluster: don't add an handshake node for the same ip:port pair multiple times.	2013-09-04 15:52:16 +02:00
antirez	79a1deac28	Cluster: free HANDSHAKE nodes after node_timeout. Handshake nodes should turn into normal nodes or be freed in a reasonable amount of time, otherwise they'll keep accumulating if the address they are associated with is not reachable for some reason.	2013-09-04 12:41:21 +02:00
antirez	232f84ec2d	Cluster: CLUSTER SAVECONFIG command added.	2013-09-04 10:33:00 +02:00
antirez	61eb16c4da	Cluster: don't save HANDSHAKE nodes in nodes.conf.	2013-09-04 10:25:26 +02:00
antirez	6e460eac58	Cluster: always use safe iteartors to iterate server.cluster->nodes.	2013-09-04 10:07:50 +02:00
antirez	853defe071	Cluster: clusterReadHandler() reworked to be more correct and simpler to follow.	2013-09-03 11:43:52 +02:00
antirez	e307150c21	Cluster: use non-blocking I/O for the cluster bus.	2013-09-03 11:43:52 +02:00
antirez	d3726385c2	Cluster: fixed a bug in clusterSendPublish() due to inverted statements. The code used to copy the header after the 'hdr' pointer was already switched to the new buffer. Of course we need to do the reverse.	2013-09-03 11:43:43 +02:00
antirez	33b286bf68	Don't update node pong time via gossip. This feature was implemented in the initial days of the Redis Cluster implementaiton but is not a good idea at all. 1) It depends on clocks to be synchronized, that is already very bad. 2) Moreover it adds a bug where the pong time is updated via gossip so no new PING is ever sent by the current node, with the effect of no PONG received, no update of tables, no clearing of PFAIL flag. In general to trust other nodes about the reachability of other nodes is a broken distributed programming model.	2013-08-26 16:16:25 +02:00
antirez	d7d90442f5	Cluster: set event handler in cluster bus listening socket. The commit using listenToPort() introduced this bug by no longer creating the event handler to handle incoming messages from the cluster bus.	2013-08-22 14:53:53 +02:00
antirez	a8dc4ecd21	Use listenToPort() in cluster.c as well.	2013-08-22 14:05:07 +02:00
antirez	f45f05531d	Cluster: fix CLUSTER MEET ip address validation. This was broken by the IPv6 support patches.	2013-08-22 11:54:28 +02:00
antirez	77c71b2046	Cluster: process MEET packets as PING packets. Somewhat a previous commit broken this so CLUSTER MEET was no longer working.	2013-08-22 11:53:28 +02:00
antirez	487951c9b4	Use a safe dict.c iterator in clusterCron().	2013-08-21 15:51:15 +02:00
antirez	fc11a99390	sdsrange() does not need to return a value. Actaully the string is modified in-place and a reallocation is never needed, so there is no need to return the new sds string pointer as return value of the function, that is now just "void".	2013-07-24 11:21:39 +02:00
antirez	aa32f92338	Introduction of a new string encoding: EMBSTR Previously two string encodings were used for string objects: 1) REDIS_ENCODING_RAW: a string object with obj->ptr pointing to an sds stirng. 2) REDIS_ENCODING_INT: a string object where the obj->ptr void pointer is casted to a long. This commit introduces a experimental new encoding called REDIS_ENCODING_EMBSTR that implements an object represented by an sds string that is not modifiable but allocated in the same memory chunk as the robj structure itself. The chunk looks like the following: +--------------+-----------+------------+--------+----+ \| robj data... \| robj->ptr \| sds header \| string \| \0 \| +--------------+-----+-----+------------+--------+----+ \| ^ +-----------------------+ The robj->ptr points to the contiguous sds string data, so the object can be manipulated with the same functions used to manipulate plan string objects, however we need just on malloc and one free in order to allocate or release this kind of objects. Moreover it has better cache locality. This new allocation strategy should benefit both the memory usage and the performances. A performance gain between 60 and 70% was observed during micro-benchmarks, however there is more work to do to evaluate the performance impact and the memory usage behavior.	2013-07-22 10:31:38 +02:00
antirez	e4d2e6fc9d	All IP string repr buffers are now REDIS_IP_STR_LEN bytes.	2013-07-09 11:32:52 +02:00
Geoff Garside	5d702e012e	Mark places that might want changing for IPv6. Any places which I feel might want to be updated to work differently with IPv6 have been marked with a comment starting "IPV6:". Currently the only comments address places where an IP address is combined with a port using the standard : separated form. These may want to be changed when printing IPv6 addresses to wrap the address in [] such as [2001:db8::c0:ffee]:6379 instead of 2001:db8::c0:ffee:6379 as the latter format is a technically valid IPv6 address and it is hard to distinguish the IPv6 address component from the port unless you know the port is supposed to be there.	2013-07-08 15:58:14 +02:00
Geoff Garside	15e37522ff	Mark ip string buffers which could be reduced. In two places buffers have been created with a size of 128 bytes which could be reduced to INET6_ADDRSTRLEN to still hold a full IP address. These places have been marked as they are presently big enough to handle the needs of storing a printable IPv6 address.	2013-07-08 15:57:23 +02:00
Geoff Garside	9c994de435	Update clusterCommand to handle AF_INET6 addresses Changes the sockaddr_in to a sockaddr_storage. Attempts to convert the IP address into an AF_INET or AF_INET6 before returning an "Invalid IP address" error. Handles converting the sockaddr from either AF_INET or AF_INET6 back into a string for storage in the clusterNode ip field.	2013-07-08 15:57:23 +02:00
Geoff Garside	241e41a527	Update node2IpString to handle AF_INET6 addresses. Change the sockaddr_in to sockaddr_storage which is capable of storing both AF_INET and AF_INET6 sockets. Uses the sockaddr_storage ss_family to correctly return the printable IP address and port. Function makes the assumption that the buffer is of at least REDIS_CLUSTER_IPLEN bytes in size.	2013-07-08 15:57:23 +02:00
Geoff Garside	cc9c474c60	Add missing includes for getpeername. getpeername(2) requires <sys/socket.h> which on some systems also requires <sys/types.h>. Include both to avoid compilation warnings.	2013-07-08 15:55:39 +02:00
Geoff Garside	a6c9ad267c	Add macro to define clusterNode.ip buffer size. Add REDIS_CLUSTER_IPLEN macro to define the size of the clusterNode ip character array. Additionally use this macro in inet_ntop(3) calls where the size of the array was being defined manually. The REDIS_CLUSTER_IPLEN is defined as INET_ADDRSTRLEN which defines the correct size of a buffer to store an IPv4 address in. The INET_ADDRSTRLEN macro itself is defined in the <netinet/in.h> header file and should be portable across the majority of systems.	2013-07-08 15:55:39 +02:00
Geoff Garside	74b7731781	Fix cluster.c inet_ntop use of sizeof(n->ip). Using sizeof with an array will only return expected results if the array is created in the scope of the function where sizeof is used. This commit changes the inet_ntop calls so that they use the fixed buffer value as defined in redis.h which is 16.	2013-07-08 15:51:37 +02:00
Geoff Garside	9ddaff53a9	Use inet_pton(3) in clusterCommand. Replace inet_aton(3) call with the more future proof inet_pton(3) function which is capable of handling additional address families.	2013-07-08 15:51:37 +02:00
Geoff Garside	c87105431c	Use inet_ntop(3) in nodeIp2String & clusterCommand Replace inet_ntoa(3) calls with the more future proof inet_ntop(3) function which is capable of handling additional address families.	2013-07-08 15:51:37 +02:00
Geoff Garside	8b2e90acec	Update anetTcpAccept & anetPeerToString calls. Add the additional ip buffer length argument to function calls of anetTcpAccept and anetPeerToString in network.c and cluster.c	2013-07-08 15:51:37 +02:00
antirez	d3cde09645	Binding multiple IPs done properly with multiple sockets.	2013-07-05 11:47:20 +02:00
antirez	8ea3b1e79d	Revert "Cluster: use new anet.c listening socket creation API." This reverts commit c5e87a13de69dba4f9d9017271ba59fe36231144.	2013-07-05 11:08:44 +02:00
antirez	c5e87a13de	Cluster: use new anet.c listening socket creation API.	2013-07-04 18:49:49 +02:00
antirez	811c5a2cdf	Cluster: detect nodes address change.	2013-06-12 10:50:07 -07:00
antirez	fffc5f809e	clusterProcessPacket() comments improved for correctness.	2013-06-11 21:34:34 +02:00
antirez	4f96bde1e2	Cluster: link reconnection on delayed PONG reply. When the PONG delay is half the cluster node timeout, the link gets disconnected (and later automatically reconnected) in order to ensure that it's not just a dead connection issue. However this operation is only performed if the link is old enough, in order to avoid to disconnect the same link again and again (and among the other problems, never receive the PONG because of that). Note: when the link is reconnected, the 'ping_sent' field is not updated even if a new ping is sent using the new connection, so we can still reliably detect a node ping timeout.	2013-05-03 15:43:03 +02:00
antirez	9a3532afa8	Cluster: restore PING sent time on reconnections.	2013-05-03 15:42:59 +02:00
antirez	bff532a1f1	Cluster: PING/PONG handling redesigned.	2013-05-03 15:42:38 +02:00
antirez	431642e8f6	Cluster: process config from PING packets as we do for PONG. Also clusterBroadcastPing() was renamed into clusterBroadcastPong() that's what the function is actually doing.	2013-05-03 15:41:34 +02:00
antirez	88cb0faa7a	Cluster: createClusterLink() comment fixed for grammar.	2013-05-03 15:41:29 +02:00
xiaost7	d284570deb	Cluster: fix clusterNode.name print format on debug message. It was %40s instead of %.40s, and since the string is not null terminated it caused random garbage to be displayed, and possibly a crash.	2013-04-19 09:53:43 +02:00
antirez	352e1a86a8	Cluster: reconfigure additonal slaves on failover.	2013-04-09 12:13:26 +02:00
antirez	442af928f9	Cluster: use server.cluster_node_timeout directly. We used to copy this value into the server.cluster structure, however this was not necessary. The reason why we don't directly use server.cluster->node_timeout is that things that can be configured via redis.conf need to be directly available in the server structure as server.cluster is allocated later only if needed in order to reduce the memory footprint of non-cluster instances.	2013-04-09 11:24:18 +02:00

... 7 8 9 10 11 ...

632 Commits