futriix

Author	SHA1	Message	Date
antirez	5781712458	Improve the PSYNC2 test reliability.	2020-05-17 18:24:34 +02:00
Salvatore Sanfilippo	3eac39fa96	Merge pull request #7264 from oranagra/test_for_7205 add regression test for the race in #7205	2020-05-17 17:59:14 +02:00
Oran Agra	ba6f40ea94	add regression test for the race in #7205 with the original version of 6.0.0, this test detects an excessive full sync. with the fix in 146201c69, this test detects memory corruption, especially when using libc allocator with or without valgrind.	2020-05-17 18:26:02 +03:00
antirez	8ca98ae414	Redis 6.0.3.	2020-05-16 18:11:58 +02:00
antirez	ae73a26f14	Remove the client from CLOSE_ASAP list before caching the master. This was broken in 146201c: we identified a crash in the CI, what was happening before the fix should be like that: 1. The client gets in the async free list. 2. However freeClient() gets called again against the same client which is a master. 3. The client arrived in freeClient() with the CLOSE_ASAP flag set. 4. The master gets cached, but NOT removed from the CLOSE_ASAP linked list. 5. The master client that was cached was immediately removed since it was still in the list. 6. Redis accessed a freed cached master. This is how the crash looked like: === REDIS BUG REPORT START: Cut & paste starting from here === 1092:S 16 May 2020 11:44:09.731 # Redis 999.999.999 crashed by signal: 11 1092:S 16 May 2020 11:44:09.731 # Crashed running the instruction at: 0x447e18 1092:S 16 May 2020 11:44:09.731 # Accessing address: 0xffffffffffffffff 1092:S 16 May 2020 11:44:09.731 # Failed assertion: (:0) ------ STACK TRACE ------ EIP: src/redis-server 127.0.0.1:21300(readQueryFromClient+0x48)[0x447e18] And the 0xffff address access likely comes from accessing an SDS that is set to NULL (we go -1 offset to read the header).	2020-05-16 18:04:17 +02:00
antirez	492bbefdb0	Remove the client from CLOSE_ASAP list before caching the master. This was broken in 146201c: we identified a crash in the CI, what was happening before the fix should be like that: 1. The client gets in the async free list. 2. However freeClient() gets called again against the same client which is a master. 3. The client arrived in freeClient() with the CLOSE_ASAP flag set. 4. The master gets cached, but NOT removed from the CLOSE_ASAP linked list. 5. The master client that was cached was immediately removed since it was still in the list. 6. Redis accessed a freed cached master. This is how the crash looked like: === REDIS BUG REPORT START: Cut & paste starting from here === 1092:S 16 May 2020 11:44:09.731 # Redis 999.999.999 crashed by signal: 11 1092:S 16 May 2020 11:44:09.731 # Crashed running the instruction at: 0x447e18 1092:S 16 May 2020 11:44:09.731 # Accessing address: 0xffffffffffffffff 1092:S 16 May 2020 11:44:09.731 # Failed assertion: (:0) ------ STACK TRACE ------ EIP: src/redis-server 127.0.0.1:21300(readQueryFromClient+0x48)[0x447e18] And the 0xffff address access likely comes from accessing an SDS that is set to NULL (we go -1 offset to read the header).	2020-05-16 17:15:35 +02:00
antirez	38cc6cbbb4	Redis 6.0.2.	2020-05-15 22:29:52 +02:00
Yossi Gottlieb	114d06ea1f	TLS: Fix test failures on recent Debian/Ubuntu. Seems like on some systems choosing specific TLS v1/v1.1 versions no longer works as expected. Test is reduced for v1.2 now which is still good enough to test the mechansim, and matters most anyway.	2020-05-15 22:23:24 +02:00
Yossi Gottlieb	c8b490b216	TLS: Add crypto locks for older OpenSSL support. This is really required only for older OpenSSL versions. Also, at the moment Redis does not use OpenSSL from multiple threads so this will only be useful if modules end up doing that.	2020-05-15 22:23:24 +02:00
David Carlier	b1e21f4672	NetBSD build update. This platform supports CPU affinity (but not OpenBSD).	2020-05-15 22:23:24 +02:00
Madelyn Olson	a7ba7a2e9e	Added a refcount on timer events to prevent deletion of recursive timer calls	2020-05-15 22:23:24 +02:00
antirez	22a6d152f3	Cache master without checking of deferred close flags. The context is issue #7205: since the introduction of threaded I/O we close clients asynchronously by default from readQueryFromClient(). So we should no longer prevent the caching of the master client, to later PSYNC incrementally, if such flags are set. However we also don't want the master client to be cached with such flags (would be closed immediately after being restored). And yet we want a way to understand if a master was closed because of a protocol error, and in that case prevent the caching.	2020-05-15 22:23:24 +02:00
antirez	bf6bf4e8de	Merge branch 'unstable' of github.com:/antirez/redis into unstable	2020-05-15 10:21:51 +02:00
antirez	146201c694	Cache master without checking of deferred close flags. The context is issue #7205: since the introduction of threaded I/O we close clients asynchronously by default from readQueryFromClient(). So we should no longer prevent the caching of the master client, to later PSYNC incrementally, if such flags are set. However we also don't want the master client to be cached with such flags (would be closed immediately after being restored). And yet we want a way to understand if a master was closed because of a protocol error, and in that case prevent the caching.	2020-05-15 10:19:13 +02:00
John Sully	fe8acd0b30	Remove debugging logs Former-commit-id: 407375887429ebd3efd6e21ae58766397d1324d3	2020-05-14 22:18:29 -04:00
Benjamin Sergeant	4bbf07dbbb	do not handle --cluster-yes for cluster fix mode	2020-05-14 15:29:06 -07:00
Madelyn Olson	4e8d141cba	Converge hash validation for adding and removing	2020-05-14 11:07:51 -07:00
Salvatore Sanfilippo	3c8824aa54	Merge pull request #7229 from yossigo/tls-fails-on-recent-debian TLS: Fix test failures on recent Debian/Ubuntu.	2020-05-14 18:15:17 +02:00
Salvatore Sanfilippo	3a52c1c0ec	Merge pull request #7230 from yossigo/tls-crypto-locks TLS: Add crypto locks for older OpenSSL support.	2020-05-14 18:14:49 +02:00
Salvatore Sanfilippo	cff2fcb7c6	Merge pull request #7244 from devnexen/netbsd_build_fix NetBSD build update.	2020-05-14 18:14:06 +02:00
Salvatore Sanfilippo	4932dd7fc3	Merge pull request #7253 from madolson/unstable-ae-recursion Added a refcount on timer events to prevent deletion of recursive timer events	2020-05-14 18:10:57 +02:00
Madelyn Olson	1bc06c7e85	Added a refcount on timer events to prevent deletion of recursive timer calls	2020-05-14 08:49:11 -07:00
antirez	9d7e2fef3d	Track events processed while blocked globally. Related to #7234.	2020-05-14 11:29:43 +02:00
antirez	e1b134e972	Some rework of #7234 .	2020-05-14 11:29:43 +02:00
Oran Agra	a3dd04410d	fix redis 6.0 not freeing closed connections during loading. This bug was introduced by a recent change in which readQueryFromClient is using freeClientAsync, and despite the fact that now freeClientsInAsyncFreeQueue is in beforeSleep, that's not enough since it's not called during loading in processEventsWhileBlocked. furthermore, afterSleep was called in that case but beforeSleep wasn't. This bug also caused slowness sine the level-triggered mode of epoll kept signaling these connections as readable causing us to keep doing connRead again and again for ll of these, which keep accumulating. now both before and after sleep are called, but not all of their actions are performed during loading, some are only reserved for the main loop. fixes issue #7215	2020-05-14 11:29:43 +02:00
antirez	60a6f16ced	Regression test for #7249 .	2020-05-14 11:29:43 +02:00
antirez	9ad227dca8	rax.c updated from upstream antirez/rax.	2020-05-14 11:29:43 +02:00
antirez	0b21426a45	Tracking: send eviction messages when evicting entries. A fix for #7249.	2020-05-14 11:29:43 +02:00
Oran Agra	5258341880	fix unstable replication test this test which has coverage for varoius flows of diskless master was failing randomly from time to time. the failure was: [err]: diskless all replicas drop during rdb pipe in tests/integration/replication.tcl log message of 'Diskless rdb transfer, last replica dropped, killing fork child' not found what seemed to have happened is that the master didn't detect that all replicas dropped by the time the replication ended, it thought that one replica is still connected. now the test takes a few seconds longer but it seems stable.	2020-05-14 11:29:43 +02:00
antirez	17546e831e	Merge branch 'free_clients_during_loading' into unstable	2020-05-14 11:28:08 +02:00
antirez	94e78d22fd	Regression test for #7249 .	2020-05-14 11:27:31 +02:00
antirez	949521deb2	rax.c updated from upstream antirez/rax.	2020-05-14 11:17:47 +02:00
antirez	afcea23e1a	Tracking: send eviction messages when evicting entries. A fix for #7249.	2020-05-14 11:17:04 +02:00
antirez	e3166461ed	Track events processed while blocked globally. Related to #7234.	2020-05-14 10:06:27 +02:00
Benjamin Sergeant	ca5aaa961a	fix typo ...	2020-05-13 09:32:27 -07:00
Benjamin Sergeant	c00b8e959a	Redis-cli 6.0.1 `--cluster-yes` doesn't work (fix #7246 ) This make it so that all prompts for all redis-cli --cluster commands are automatically answered with a yes.	2020-05-13 09:24:51 -07:00
David Carlier	f75ab4fcb5	NetBSD build update. This platform supports CPU affinity (but not OpenBSD).	2020-05-12 21:21:22 +01:00
antirez	7a3005da90	Some rework of #7234 .	2020-05-12 13:07:44 +02:00
Salvatore Sanfilippo	8ab703f73d	Merge pull request #7240 from oranagra/fix_replication_test fix unstable replication test	2020-05-12 08:44:18 +02:00
Oran Agra	00323f342d	fix unstable replication test this test which has coverage for varoius flows of diskless master was failing randomly from time to time. the failure was: [err]: diskless all replicas drop during rdb pipe in tests/integration/replication.tcl log message of 'Diskless rdb transfer, last replica dropped, killing fork child' not found what seemed to have happened is that the master didn't detect that all replicas dropped by the time the replication ended, it thought that one replica is still connected. now the test takes a few seconds longer but it seems stable.	2020-05-12 08:59:09 +03:00
John Sully	14e32ff682	Update ci.yml Former-commit-id: d4b1d3f755f7c1f6ca4642b5258b4a5157eacb5a	2020-05-12 01:06:08 -04:00
John Sully	27e605293e	Update ci.yml Former-commit-id: d06c6c67ea36a21e515390405a0f947a23ec4b42	2020-05-11 22:09:27 -04:00
John Sully	35ad3b53fc	Reduce test load for more reliability Former-commit-id: 3f8f9743c1f6d75647cfbdbc1544a8c8a5142df3	2020-05-11 22:04:52 -04:00
John Sully	464326eb07	limit clients to make tests more reliable Former-commit-id: 2c130a816647bacaba5d99d76c19e09b9c9875da	2020-05-11 21:56:17 -04:00
John Sully	84ce8b3ca8	Update cached time only in main cron Former-commit-id: 5396dd9587599d6672e8d80967aa1fd8ba16e7ec	2020-05-11 20:31:47 -04:00
John	063672dbdb	more reliability fixes for multimaster Former-commit-id: 3543a3c763de91a4d76bca89659fec9bf6b7a1c8	2020-05-11 05:38:21 -04:00
John	b03c4ccc50	more reliability fixes for multimaster Former-commit-id: fd5b541260908423c35227ff9e42a83f96ace6c0	2020-05-11 09:37:42 +00:00
John	0e6add2e84	Make multimaster tests more reliable Former-commit-id: 3122912920973cb433d625a09b183c3f538e2523	2020-05-11 05:23:47 -04:00
John	680a6ac90f	Make multimaster tests more reliable Former-commit-id: 4fe59ba11b720864ea0124885b358cb72127cc2d	2020-05-11 09:22:27 +00:00
John Sully	c48c9a57a9	too many clients makes tests unreliable Former-commit-id: 89b39629980ab3e2930663c2922141e18e2fc38c	2020-05-11 04:54:17 -04:00

... 49 50 51 52 53 ...

13087 Commits