futriix

Author	SHA1	Message	Date
Yossi Gottlieb	8d79702d8a	Tests: fix redis-cli with remote hosts. (#7693 ) (cherry picked from commit f80f3f492a0ca56e163899eeca7ad40d67d903be)	2020-09-01 09:27:58 +03:00
Oran Agra	916b215fc5	fix new rdb test failing on timing issues (#7604 ) apparenlty on github actions sometimes 500ms is not enough (cherry picked from commit 824bd2ac11472b7a3fce9fcf3189a8e6c6048115)	2020-09-01 09:27:58 +03:00
Oran Agra	67750ce3b3	Fix failing tests due to issues with wait_for_log_message (#7572 ) - the test now waits for specific set of log messages rather than wait for timeout looking for just one message. - we don't wanna sample the current length of the log after an action, due to a race, we need to start the search from the line number of the last message we where waiting for. - when attempting to trigger a full sync, use multi-exec to avoid a race where the replica manages to re-connect before we completed the set of actions that should force a full sync. - fix verify_log_message which was broken and unused (cherry picked from commit 109b5ccdcd6e6b8cecdaeb13a246bc49ce7a61f4)	2020-09-01 09:27:58 +03:00
Oran Agra	6daa8b9adb	Stabilize bgsave test that sometimes fails with valgrind (#7559 ) on ci.redis.io the test fails a lot, reporting that bgsave didn't end. increaseing the timeout we wait for that bgsave to get aborted. in addition to that, i also verify that it indeed got aborted by checking that the save counter wasn't reset. add another test to verify that a successful bgsave indeed resets the change counter. (cherry picked from commit 8a57969fd75db01b881d438200911d95bdead293)	2020-09-01 09:27:58 +03:00
Yossi Gottlieb	f1d5d5d28e	Tests: drop TCL 8.6 dependency. (#7548 ) This re-implements the redis-cli --pipe test so it no longer depends on a close feature available only in TCL 8.6. Basically what this test does is run redis-cli --pipe, generates a bunch of commands and pipes them through redis-cli, and inspects the result in both Redis and the redis-cli output. To do that, we need to close stdin for redis-cli to indicate we're done so it can flush its buffers and exit. TCL has bi-directional channels can only offers a way to "one-way close" a channel with TCL 8.6. To work around that, we now generate the commands into a file and feed that file to redis-cli directly. As we're writing to an actual file, the number of commands is now reduced. (cherry picked from commit f57e844b2edbb86a5df2f3436045814812c0a3ae)	2020-09-01 09:27:58 +03:00
Yossi Gottlieb	f80f3f492a	Tests: fix redis-cli with remote hosts. (#7693 )	2020-08-23 10:17:43 +03:00
Oran Agra	c17e597d05	Accelerate diskless master connections, and general re-connections (#6271 ) Diskless master has some inherent latencies. 1) fork starts with delay from cron rather than immediately 2) replica is put online only after an ACK. but the ACK was sent only once a second. 3) but even if it would arrive immediately, it will not register in case cron didn't yet detect that the fork is done. Besides that, when a replica disconnects, it doesn't immediately attempts to re-connect, it waits for replication cron (one per second). in case it was already online, it may be important to try to re-connect as soon as possible, so that the backlog at the master doesn't vanish. In case it disconnected during rdb transfer, one can argue that it's not very important to re-connect immediately, but this is needed for the "diskless loading short read" test to be able to run 100 iterations in 5 seconds, rather than 3 (waiting for replication cron re-connection) changes in this commit: 1) sync command starts a fork immediately if no sync_delay is configured 2) replica sends REPLCONF ACK when done reading the rdb (rather than on 1s cron) 3) when a replica unexpectedly disconnets, it immediately tries to re-connect rather than waiting 1s 4) when when a child exits, if there is another replica waiting, we spawn a new one right away, instead of waiting for 1s replicationCron. 5) added a call to connectWithMaster from replicationSetMaster. which is called from the REPLICAOF command but also in 3 places in cluster.c, in all of these the connection attempt will now be immediate instead of delayed by 1 second. side note: we can add a call to rdbPipeReadHandler in replconfCommand when getting a REPLCONF ACK from the replica to solve a race where the replica got the entire rdb and EOF marker before we detected that the pipe was closed. in the test i did see this race happens in one about of some 300 runs, but i concluded that this race is unlikely in real life (where the replica is on another host and we're more likely to first detect the pipe was closed. the test runs 100 iterations in 3 seconds, so in some cases it'll take 4 seconds instead (waiting for another REPLCONF ACK). Removing unneeded startBgsaveForReplication from updateSlavesWaitingForBgsave Now that CheckChildrenDone is calling the new replicationStartPendingFork (extracted from serverCron) there's actually no need to call startBgsaveForReplication from updateSlavesWaitingForBgsave anymore, since as soon as updateSlavesWaitingForBgsave returns, CheckChildrenDone is calling replicationStartPendingFork that handles that anyway. The code in updateSlavesWaitingForBgsave had a bug in which it ignored repl-diskless-sync-delay, but removing that code shows that this bug was hiding another bug, which is that the max_idle should have used >= and not >, this one second delay has a big impact on my new test.	2020-08-06 16:53:06 +03:00
Oran Agra	824bd2ac11	fix new rdb test failing on timing issues (#7604 ) apparenlty on github actions sometimes 500ms is not enough	2020-08-04 08:53:50 +03:00
Oran Agra	109b5ccdcd	Fix failing tests due to issues with wait_for_log_message (#7572 ) - the test now waits for specific set of log messages rather than wait for timeout looking for just one message. - we don't wanna sample the current length of the log after an action, due to a race, we need to start the search from the line number of the last message we where waiting for. - when attempting to trigger a full sync, use multi-exec to avoid a race where the replica manages to re-connect before we completed the set of actions that should force a full sync. - fix verify_log_message which was broken and unused	2020-07-28 11:15:29 +03:00
Oran Agra	8a57969fd7	Stabilize bgsave test that sometimes fails with valgrind (#7559 ) on ci.redis.io the test fails a lot, reporting that bgsave didn't end. increaseing the timeout we wait for that bgsave to get aborted. in addition to that, i also verify that it indeed got aborted by checking that the save counter wasn't reset. add another test to verify that a successful bgsave indeed resets the change counter.	2020-07-23 13:06:24 +03:00
Yossi Gottlieb	f57e844b2e	Tests: drop TCL 8.6 dependency. (#7548 ) This re-implements the redis-cli --pipe test so it no longer depends on a close feature available only in TCL 8.6. Basically what this test does is run redis-cli --pipe, generates a bunch of commands and pipes them through redis-cli, and inspects the result in both Redis and the redis-cli output. To do that, we need to close stdin for redis-cli to indicate we're done so it can flush its buffers and exit. TCL has bi-directional channels can only offers a way to "one-way close" a channel with TCL 8.6. To work around that, we now generate the commands into a file and feed that file to redis-cli directly. As we're writing to an actual file, the number of commands is now reduced.	2020-07-21 14:17:14 +03:00
Oran Agra	05f8975d21	redis-cli tests, fix valgrind timing issue (#7519 ) this test when run with valgrind on github actions takes 160 seconds (cherry picked from commit 254c96255420e950bcad1a46bc4f8617b4373797)	2020-07-20 21:08:26 +03:00
Oran Agra	aea4db2f5a	fix recently added time sensitive tests failing with valgrind (#7512 ) interestingly the latency monitor test fails because valgrind is slow enough so that the time inside PEXPIREAT command from the moment of the first mstime() call to get the basetime until checkAlreadyExpired calls mstime() again is more than 1ms, and that test was too sensitive. using this opportunity to speed up the test (unrelated to the failure) the fix is just the longer time passed to PEXPIRE. (cherry picked from commit e5227aab899628653285478a9d1083e8e8f51b57)	2020-07-20 21:08:26 +03:00
Yossi Gottlieb	b057ff81ee	TLS: Add missing redis-cli options. (#7456 ) * Tests: fix and reintroduce redis-cli tests. These tests have been broken and disabled for 10 years now! * TLS: add remaining redis-cli support. This adds support for the redis-cli --pipe, --rdb and --replica options previously unsupported in --tls mode. * Fix writeConn(). (cherry picked from commit d9f970d8d3f0b694f1e8915cab6d4eab9cfb2ef1)	2020-07-20 21:08:26 +03:00
Oran Agra	2b5f23197c	stabilize tests that look for log lines (#7367 ) tests were sensitive to additional log lines appearing in the log causing the search to come empty handed. instead of just looking for the n last log lines, capture the log lines before performing the action, and then search from that offset. (cherry picked from commit 8e76e13472b7d277af78691775c2cf845f68ab90)	2020-07-20 21:08:26 +03:00
Oran Agra	1104113c07	tests/valgrind: don't use debug restart (#7404 ) * tests/valgrind: don't use debug restart DEBUG REATART causes two issues: 1. it uses execve which replaces the original process and valgrind doesn't have a chance to check for errors, so leaks go unreported. 2. valgrind report invalid calls to close() which we're unable to resolve. So now the tests use restart_server mechanism in the tests, that terminates the old server and starts a new one, new PID, but same stdout, stderr. since the stderr can contain two or more valgrind report, it is not enough to just check for the absence of leaks, we also need to check for some known errors, we do both, and fail if we either find an error, or can't find a report saying there are no leaks. other changes: - when killing a server that was already terminated we check for leaks too. - adding DEBUG LEAK which was used to test it. - adding --trace-children to valgrind, although no longer needed. - since the stdout contains two or more runs, we need slightly different way of checking if the new process is up (explicitly looking for the new PID) - move the code that handles --wait-server to happen earlier (before watching the startup message in the log), and serve the restarted server too. * squashme - CR fixes (cherry picked from commit 69ade87325eedebdb44760af9a8c28e15381888e)	2020-07-20 21:08:26 +03:00
Oran Agra	254c962554	redis-cli tests, fix valgrind timing issue (#7519 ) this test when run with valgrind on github actions takes 160 seconds	2020-07-14 18:04:08 +03:00
Oran Agra	e5227aab89	fix recently added time sensitive tests failing with valgrind (#7512 ) interestingly the latency monitor test fails because valgrind is slow enough so that the time inside PEXPIREAT command from the moment of the first mstime() call to get the basetime until checkAlreadyExpired calls mstime() again is more than 1ms, and that test was too sensitive. using this opportunity to speed up the test (unrelated to the failure) the fix is just the longer time passed to PEXPIRE.	2020-07-13 16:40:03 +03:00
John Sully	d4dd336834	Merge tag '6.0.5' into unstable Redis 6.0.5 Former-commit-id: b736a95b0d23e4b73daa88c676b76d1d18e8bd17	2020-07-13 00:55:23 +00:00
John Sully	c5f6cb1ba5	Add multi-master-no-forward command to reduce bus traffic with multi-master Former-commit-id: d99d06b1250a51ea4bc54f678f451acbb7901e33	2020-07-12 19:25:19 +00:00
John Sully	cd08792df7	Fix failure to merge databases on active replica sync, due to bad merge with Redis 6 Former-commit-id: cd9514f4c8624932df2ec60ae3c2244899844aa6	2020-07-12 01:13:22 +00:00
Yossi Gottlieb	d9f970d8d3	TLS: Add missing redis-cli options. (#7456 ) * Tests: fix and reintroduce redis-cli tests. These tests have been broken and disabled for 10 years now! * TLS: add remaining redis-cli support. This adds support for the redis-cli --pipe, --rdb and --replica options previously unsupported in --tls mode. * Fix writeConn().	2020-07-10 10:25:55 +03:00
Oran Agra	8e76e13472	stabilize tests that look for log lines (#7367 ) tests were sensitive to additional log lines appearing in the log causing the search to come empty handed. instead of just looking for the n last log lines, capture the log lines before performing the action, and then search from that offset.	2020-07-10 08:28:22 +03:00
Oran Agra	69ade87325	tests/valgrind: don't use debug restart (#7404 ) * tests/valgrind: don't use debug restart DEBUG REATART causes two issues: 1. it uses execve which replaces the original process and valgrind doesn't have a chance to check for errors, so leaks go unreported. 2. valgrind report invalid calls to close() which we're unable to resolve. So now the tests use restart_server mechanism in the tests, that terminates the old server and starts a new one, new PID, but same stdout, stderr. since the stderr can contain two or more valgrind report, it is not enough to just check for the absence of leaks, we also need to check for some known errors, we do both, and fail if we either find an error, or can't find a report saying there are no leaks. other changes: - when killing a server that was already terminated we check for leaks too. - adding DEBUG LEAK which was used to test it. - adding --trace-children to valgrind, although no longer needed. - since the stdout contains two or more runs, we need slightly different way of checking if the new process is up (explicitly looking for the new PID) - move the code that handles --wait-server to happen earlier (before watching the startup message in the log), and serve the restarted server too. * squashme - CR fixes	2020-07-10 08:26:52 +03:00
John Sully	2c560f27b8	replication test race Former-commit-id: e1f3cd6ec3bf2319484de04c3796dcfa75e0479c	2020-06-07 01:14:57 -04:00
Oran Agra	f33de403ed	fix pingoff test race	2020-06-06 11:44:21 +02:00
John Sully	9fb7552b63	PSYNC test shouldn't wait forever Former-commit-id: 130613e16636923296a8d5b2c4bc623e62fef2f5	2020-06-01 16:13:58 -04:00
John Sully	2b08505fed	PSYNC test reliability improvements (test only issue) Former-commit-id: 50fd4fa7e62f3996f15f6a8c4dcd892022f111ec	2020-06-01 16:01:26 -04:00
John Sully	4f7102f46c	Fix for issue #187 we need to properly handle the case where a key with a subkey expirey itself expires during load Former-commit-id: e6a9a6b428b91b6108df24ae6285ea9b582b7b23	2020-06-01 15:33:19 -04:00
John Sully	df5b0f0be5	sendfile has high latency in some scenarios, don't use it Former-commit-id: 1eb0e3c1c604e71c54423f1d11b8c709c847a516	2020-05-31 23:22:25 -04:00
John Sully	eddc1ad46a	Don't start multimaster tests until all nodes are connected Former-commit-id: 202b97eff76501e736a2f0969607e3297e9703a4	2020-05-31 22:50:30 -04:00
Oran Agra	c480af9007	fix pingoff test race	2020-05-31 15:51:52 +03:00
John Sully	2aed24d0a5	active replica tests on slow computers Former-commit-id: c9920849dd6d6d0f6ecfe0d1002cb0edd7f7bfa9	2020-05-29 01:58:15 -04:00
John Sully	acde7c340e	Fix test issue with TLS Former-commit-id: 81b240f81d1c52fd331c4e0e89659913380229c4	2020-05-29 01:44:52 -04:00
John Sully	cfe9f8f3bc	Merge tag '6.0.4' into unstable Redis 6.0.4. Former-commit-id: 9c31ac7925edba187e527f506e5e992946bd38a6	2020-05-29 00:57:07 -04:00
antirez	59cd4c9f65	Test: take PSYNC2 test master timeout high during switch. This will likely avoid false positives due to trailing pings.	2020-05-28 10:56:14 +02:00
antirez	23f2b4d0a8	Test: take PSYNC2 test master timeout high during switch. This will likely avoid false positives due to trailing pings.	2020-05-28 10:47:30 +02:00
Oran Agra	ab2984b1e2	adjust revived meaningful offset tests these tests create several edge cases that are otherwise uncovered (at least not consistently) by the test suite, so although they're no longer testing what they were meant to test, it's still a good idea to keep them in hope that they'll expose some issue in the future.	2020-05-28 10:09:51 +02:00
Oran Agra	1ff5a222de	revive meaningful offset tests	2020-05-28 10:09:51 +02:00
antirez	3f8d113f1b	Another meaningful offset test removed.	2020-05-28 10:09:51 +02:00
antirez	d4541349dc	Remove the PSYNC2 meaningful offset test.	2020-05-28 10:09:51 +02:00
antirez	8f10137227	Test: PSYNC2 test can now show server logs.	2020-05-28 10:09:51 +02:00
Oran Agra	2a8af8e675	adjust revived meaningful offset tests these tests create several edge cases that are otherwise uncovered (at least not consistently) by the test suite, so although they're no longer testing what they were meant to test, it's still a good idea to keep them in hope that they'll expose some issue in the future.	2020-05-28 09:10:51 +03:00
Oran Agra	90f3856fd5	revive meaningful offset tests	2020-05-28 08:21:24 +03:00
antirez	484cfc3d76	Another meaningful offset test removed.	2020-05-27 12:50:02 +02:00
antirez	32d0df0c1f	Remove the PSYNC2 meaningful offset test.	2020-05-27 12:47:34 +02:00
antirez	091fb64681	Test: PSYNC2 test can now show server logs.	2020-05-25 20:26:29 +02:00
John Sully	2d783a3cbf	Merge tag '6.0.2' into unstable Redis 6.0.2 Former-commit-id: a010e4a4b2cc2bcad1cb14604b7ebc596c35b05e	2020-05-22 16:45:18 -04:00
John Sully	1eeb5de69f	Merge commit 'c57d9146f41f4b661d9d2cb48b83b3abc757ba0e' into unstable Former-commit-id: d74871da40dea11bd1a226fbecb0974ff5f8ec8c	2020-05-22 15:36:44 -04:00
Qu Chen	58fc456cbd	Disconnect chained replicas when the replica performs PSYNC with the master always to avoid replication offset mismatch between master and chained replicas.	2020-05-22 12:37:59 +02:00

1 2 3 4 5 ...

278 Commits