futriix

Author	SHA1	Message	Date
Yossi Gottlieb	c5675c66bc	Tests: fix redis-cli with remote hosts. (#7693 ) (cherry picked from commit 257f9f462f7782dcaecf7bbf35f4701b20b88a45)	2020-09-01 09:27:58 +03:00
Oran Agra	b4a6b4f28d	fix new rdb test failing on timing issues (#7604 ) apparenlty on github actions sometimes 500ms is not enough (cherry picked from commit 191b1181023b0860ec60afde7a41bd4f03c55097)	2020-09-01 09:27:58 +03:00
Oran Agra	10a8407a4f	Fix failing tests due to issues with wait_for_log_message (#7572 ) - the test now waits for specific set of log messages rather than wait for timeout looking for just one message. - we don't wanna sample the current length of the log after an action, due to a race, we need to start the search from the line number of the last message we where waiting for. - when attempting to trigger a full sync, use multi-exec to avoid a race where the replica manages to re-connect before we completed the set of actions that should force a full sync. - fix verify_log_message which was broken and unused (cherry picked from commit 06aaeabaea9d9b248e8a790dde352cd14d66628a)	2020-09-01 09:27:58 +03:00
Oran Agra	558a343b3c	Stabilize bgsave test that sometimes fails with valgrind (#7559 ) on ci.redis.io the test fails a lot, reporting that bgsave didn't end. increaseing the timeout we wait for that bgsave to get aborted. in addition to that, i also verify that it indeed got aborted by checking that the save counter wasn't reset. add another test to verify that a successful bgsave indeed resets the change counter. (cherry picked from commit 49d4aebce0a0b94cd2b302d276be95d1a1ce8610)	2020-09-01 09:27:58 +03:00
Yossi Gottlieb	6d80011e73	Tests: drop TCL 8.6 dependency. (#7548 ) This re-implements the redis-cli --pipe test so it no longer depends on a close feature available only in TCL 8.6. Basically what this test does is run redis-cli --pipe, generates a bunch of commands and pipes them through redis-cli, and inspects the result in both Redis and the redis-cli output. To do that, we need to close stdin for redis-cli to indicate we're done so it can flush its buffers and exit. TCL has bi-directional channels can only offers a way to "one-way close" a channel with TCL 8.6. To work around that, we now generate the commands into a file and feed that file to redis-cli directly. As we're writing to an actual file, the number of commands is now reduced. (cherry picked from commit dbc0a64843ccd07515ac41ca80497a9e5ffd107a)	2020-09-01 09:27:58 +03:00
Yossi Gottlieb	257f9f462f	Tests: fix redis-cli with remote hosts. (#7693 )	2020-08-23 10:17:43 +03:00
Oran Agra	cad93ed273	Accelerate diskless master connections, and general re-connections (#6271 ) Diskless master has some inherent latencies. 1) fork starts with delay from cron rather than immediately 2) replica is put online only after an ACK. but the ACK was sent only once a second. 3) but even if it would arrive immediately, it will not register in case cron didn't yet detect that the fork is done. Besides that, when a replica disconnects, it doesn't immediately attempts to re-connect, it waits for replication cron (one per second). in case it was already online, it may be important to try to re-connect as soon as possible, so that the backlog at the master doesn't vanish. In case it disconnected during rdb transfer, one can argue that it's not very important to re-connect immediately, but this is needed for the "diskless loading short read" test to be able to run 100 iterations in 5 seconds, rather than 3 (waiting for replication cron re-connection) changes in this commit: 1) sync command starts a fork immediately if no sync_delay is configured 2) replica sends REPLCONF ACK when done reading the rdb (rather than on 1s cron) 3) when a replica unexpectedly disconnets, it immediately tries to re-connect rather than waiting 1s 4) when when a child exits, if there is another replica waiting, we spawn a new one right away, instead of waiting for 1s replicationCron. 5) added a call to connectWithMaster from replicationSetMaster. which is called from the REPLICAOF command but also in 3 places in cluster.c, in all of these the connection attempt will now be immediate instead of delayed by 1 second. side note: we can add a call to rdbPipeReadHandler in replconfCommand when getting a REPLCONF ACK from the replica to solve a race where the replica got the entire rdb and EOF marker before we detected that the pipe was closed. in the test i did see this race happens in one about of some 300 runs, but i concluded that this race is unlikely in real life (where the replica is on another host and we're more likely to first detect the pipe was closed. the test runs 100 iterations in 3 seconds, so in some cases it'll take 4 seconds instead (waiting for another REPLCONF ACK). Removing unneeded startBgsaveForReplication from updateSlavesWaitingForBgsave Now that CheckChildrenDone is calling the new replicationStartPendingFork (extracted from serverCron) there's actually no need to call startBgsaveForReplication from updateSlavesWaitingForBgsave anymore, since as soon as updateSlavesWaitingForBgsave returns, CheckChildrenDone is calling replicationStartPendingFork that handles that anyway. The code in updateSlavesWaitingForBgsave had a bug in which it ignored repl-diskless-sync-delay, but removing that code shows that this bug was hiding another bug, which is that the max_idle should have used >= and not >, this one second delay has a big impact on my new test.	2020-08-06 16:53:06 +03:00
Oran Agra	191b118102	fix new rdb test failing on timing issues (#7604 ) apparenlty on github actions sometimes 500ms is not enough	2020-08-04 08:53:50 +03:00
Oran Agra	06aaeabaea	Fix failing tests due to issues with wait_for_log_message (#7572 ) - the test now waits for specific set of log messages rather than wait for timeout looking for just one message. - we don't wanna sample the current length of the log after an action, due to a race, we need to start the search from the line number of the last message we where waiting for. - when attempting to trigger a full sync, use multi-exec to avoid a race where the replica manages to re-connect before we completed the set of actions that should force a full sync. - fix verify_log_message which was broken and unused	2020-07-28 11:15:29 +03:00
Oran Agra	49d4aebce0	Stabilize bgsave test that sometimes fails with valgrind (#7559 ) on ci.redis.io the test fails a lot, reporting that bgsave didn't end. increaseing the timeout we wait for that bgsave to get aborted. in addition to that, i also verify that it indeed got aborted by checking that the save counter wasn't reset. add another test to verify that a successful bgsave indeed resets the change counter.	2020-07-23 13:06:24 +03:00
Yossi Gottlieb	dbc0a64843	Tests: drop TCL 8.6 dependency. (#7548 ) This re-implements the redis-cli --pipe test so it no longer depends on a close feature available only in TCL 8.6. Basically what this test does is run redis-cli --pipe, generates a bunch of commands and pipes them through redis-cli, and inspects the result in both Redis and the redis-cli output. To do that, we need to close stdin for redis-cli to indicate we're done so it can flush its buffers and exit. TCL has bi-directional channels can only offers a way to "one-way close" a channel with TCL 8.6. To work around that, we now generate the commands into a file and feed that file to redis-cli directly. As we're writing to an actual file, the number of commands is now reduced.	2020-07-21 14:17:14 +03:00
Oran Agra	6bdc5a4a08	redis-cli tests, fix valgrind timing issue (#7519 ) this test when run with valgrind on github actions takes 160 seconds (cherry picked from commit 8a14ce8634c49d992aa929cf0f98e96f03bccba4)	2020-07-20 21:08:26 +03:00
Oran Agra	dde79afbf7	fix recently added time sensitive tests failing with valgrind (#7512 ) interestingly the latency monitor test fails because valgrind is slow enough so that the time inside PEXPIREAT command from the moment of the first mstime() call to get the basetime until checkAlreadyExpired calls mstime() again is more than 1ms, and that test was too sensitive. using this opportunity to speed up the test (unrelated to the failure) the fix is just the longer time passed to PEXPIRE. (cherry picked from commit 663e637da87ee9385527fe3a37edb241a1f97cc6)	2020-07-20 21:08:26 +03:00
Yossi Gottlieb	6af3d57beb	TLS: Add missing redis-cli options. (#7456 ) * Tests: fix and reintroduce redis-cli tests. These tests have been broken and disabled for 10 years now! * TLS: add remaining redis-cli support. This adds support for the redis-cli --pipe, --rdb and --replica options previously unsupported in --tls mode. * Fix writeConn(). (cherry picked from commit 99b920534f7710d544c38b870fd10c6053283d99)	2020-07-20 21:08:26 +03:00
Oran Agra	c994e73c8e	stabilize tests that look for log lines (#7367 ) tests were sensitive to additional log lines appearing in the log causing the search to come empty handed. instead of just looking for the n last log lines, capture the log lines before performing the action, and then search from that offset. (cherry picked from commit efc4189b6227a17f26ed9bd6bbac62bf4bf7ab66)	2020-07-20 21:08:26 +03:00
Oran Agra	298e93c360	tests/valgrind: don't use debug restart (#7404 ) * tests/valgrind: don't use debug restart DEBUG REATART causes two issues: 1. it uses execve which replaces the original process and valgrind doesn't have a chance to check for errors, so leaks go unreported. 2. valgrind report invalid calls to close() which we're unable to resolve. So now the tests use restart_server mechanism in the tests, that terminates the old server and starts a new one, new PID, but same stdout, stderr. since the stderr can contain two or more valgrind report, it is not enough to just check for the absence of leaks, we also need to check for some known errors, we do both, and fail if we either find an error, or can't find a report saying there are no leaks. other changes: - when killing a server that was already terminated we check for leaks too. - adding DEBUG LEAK which was used to test it. - adding --trace-children to valgrind, although no longer needed. - since the stdout contains two or more runs, we need slightly different way of checking if the new process is up (explicitly looking for the new PID) - move the code that handles --wait-server to happen earlier (before watching the startup message in the log), and serve the restarted server too. * squashme - CR fixes (cherry picked from commit 8d4f055e43ab554adfce617c971f10c4b6423484)	2020-07-20 21:08:26 +03:00
Oran Agra	8a14ce8634	redis-cli tests, fix valgrind timing issue (#7519 ) this test when run with valgrind on github actions takes 160 seconds	2020-07-14 18:04:08 +03:00
Oran Agra	663e637da8	fix recently added time sensitive tests failing with valgrind (#7512 ) interestingly the latency monitor test fails because valgrind is slow enough so that the time inside PEXPIREAT command from the moment of the first mstime() call to get the basetime until checkAlreadyExpired calls mstime() again is more than 1ms, and that test was too sensitive. using this opportunity to speed up the test (unrelated to the failure) the fix is just the longer time passed to PEXPIRE.	2020-07-13 16:40:03 +03:00
John Sully	84bf240caa	Merge tag '6.0.5' into unstable Redis 6.0.5 Former-commit-id: b736a95b0d23e4b73daa88c676b76d1d18e8bd17	2020-07-13 00:55:23 +00:00
John Sully	f853142083	Add multi-master-no-forward command to reduce bus traffic with multi-master Former-commit-id: d99d06b1250a51ea4bc54f678f451acbb7901e33	2020-07-12 19:25:19 +00:00
John Sully	785779ee40	Fix failure to merge databases on active replica sync, due to bad merge with Redis 6 Former-commit-id: cd9514f4c8624932df2ec60ae3c2244899844aa6	2020-07-12 01:13:22 +00:00
Yossi Gottlieb	99b920534f	TLS: Add missing redis-cli options. (#7456 ) * Tests: fix and reintroduce redis-cli tests. These tests have been broken and disabled for 10 years now! * TLS: add remaining redis-cli support. This adds support for the redis-cli --pipe, --rdb and --replica options previously unsupported in --tls mode. * Fix writeConn().	2020-07-10 10:25:55 +03:00
Oran Agra	efc4189b62	stabilize tests that look for log lines (#7367 ) tests were sensitive to additional log lines appearing in the log causing the search to come empty handed. instead of just looking for the n last log lines, capture the log lines before performing the action, and then search from that offset.	2020-07-10 08:28:22 +03:00
Oran Agra	8d4f055e43	tests/valgrind: don't use debug restart (#7404 ) * tests/valgrind: don't use debug restart DEBUG REATART causes two issues: 1. it uses execve which replaces the original process and valgrind doesn't have a chance to check for errors, so leaks go unreported. 2. valgrind report invalid calls to close() which we're unable to resolve. So now the tests use restart_server mechanism in the tests, that terminates the old server and starts a new one, new PID, but same stdout, stderr. since the stderr can contain two or more valgrind report, it is not enough to just check for the absence of leaks, we also need to check for some known errors, we do both, and fail if we either find an error, or can't find a report saying there are no leaks. other changes: - when killing a server that was already terminated we check for leaks too. - adding DEBUG LEAK which was used to test it. - adding --trace-children to valgrind, although no longer needed. - since the stdout contains two or more runs, we need slightly different way of checking if the new process is up (explicitly looking for the new PID) - move the code that handles --wait-server to happen earlier (before watching the startup message in the log), and serve the restarted server too. * squashme - CR fixes	2020-07-10 08:26:52 +03:00
John Sully	7384abfe56	replication test race Former-commit-id: e1f3cd6ec3bf2319484de04c3796dcfa75e0479c	2020-06-07 01:14:57 -04:00
Oran Agra	fed743b2e1	fix pingoff test race	2020-06-06 11:44:21 +02:00
John Sully	4820142896	PSYNC test shouldn't wait forever Former-commit-id: 130613e16636923296a8d5b2c4bc623e62fef2f5	2020-06-01 16:13:58 -04:00
John Sully	92de178bfe	PSYNC test reliability improvements (test only issue) Former-commit-id: 50fd4fa7e62f3996f15f6a8c4dcd892022f111ec	2020-06-01 16:01:26 -04:00
John Sully	9e87395c34	Fix for issue #187 we need to properly handle the case where a key with a subkey expirey itself expires during load Former-commit-id: e6a9a6b428b91b6108df24ae6285ea9b582b7b23	2020-06-01 15:33:19 -04:00
John Sully	08fca5ef31	sendfile has high latency in some scenarios, don't use it Former-commit-id: 1eb0e3c1c604e71c54423f1d11b8c709c847a516	2020-05-31 23:22:25 -04:00
John Sully	4b317392be	Don't start multimaster tests until all nodes are connected Former-commit-id: 202b97eff76501e736a2f0969607e3297e9703a4	2020-05-31 22:50:30 -04:00
Oran Agra	8422c4f9d6	fix pingoff test race	2020-05-31 15:51:52 +03:00
John Sully	2e0c684324	active replica tests on slow computers Former-commit-id: c9920849dd6d6d0f6ecfe0d1002cb0edd7f7bfa9	2020-05-29 01:58:15 -04:00
John Sully	688dceb3a8	Fix test issue with TLS Former-commit-id: 81b240f81d1c52fd331c4e0e89659913380229c4	2020-05-29 01:44:52 -04:00
John Sully	ed2e0e66f6	Merge tag '6.0.4' into unstable Redis 6.0.4. Former-commit-id: 9c31ac7925edba187e527f506e5e992946bd38a6	2020-05-29 00:57:07 -04:00
antirez	41bb699867	Test: take PSYNC2 test master timeout high during switch. This will likely avoid false positives due to trailing pings.	2020-05-28 10:56:14 +02:00
antirez	0071eb1311	Test: take PSYNC2 test master timeout high during switch. This will likely avoid false positives due to trailing pings.	2020-05-28 10:47:30 +02:00
Oran Agra	01039e5964	adjust revived meaningful offset tests these tests create several edge cases that are otherwise uncovered (at least not consistently) by the test suite, so although they're no longer testing what they were meant to test, it's still a good idea to keep them in hope that they'll expose some issue in the future.	2020-05-28 10:09:51 +02:00
Oran Agra	98e6f2cd5b	revive meaningful offset tests	2020-05-28 10:09:51 +02:00
antirez	0163e4e495	Another meaningful offset test removed.	2020-05-28 10:09:51 +02:00
antirez	24a0f7bf55	Remove the PSYNC2 meaningful offset test.	2020-05-28 10:09:51 +02:00
antirez	2411e4e33f	Test: PSYNC2 test can now show server logs.	2020-05-28 10:09:51 +02:00
Oran Agra	afc7ea44b5	adjust revived meaningful offset tests these tests create several edge cases that are otherwise uncovered (at least not consistently) by the test suite, so although they're no longer testing what they were meant to test, it's still a good idea to keep them in hope that they'll expose some issue in the future.	2020-05-28 09:10:51 +03:00
Oran Agra	49687e9cb6	revive meaningful offset tests	2020-05-28 08:21:24 +03:00
antirez	fafe3502da	Another meaningful offset test removed.	2020-05-27 12:50:02 +02:00
antirez	4c264e994e	Remove the PSYNC2 meaningful offset test.	2020-05-27 12:47:34 +02:00
antirez	d325091ba6	Test: PSYNC2 test can now show server logs.	2020-05-25 20:26:29 +02:00
John Sully	fa0be83fd9	Merge tag '6.0.2' into unstable Redis 6.0.2 Former-commit-id: a010e4a4b2cc2bcad1cb14604b7ebc596c35b05e	2020-05-22 16:45:18 -04:00
John Sully	5a7ce664d0	Merge commit '78cbd3039858407837632bc37abb36e36ec60ce5' into unstable Former-commit-id: d74871da40dea11bd1a226fbecb0974ff5f8ec8c	2020-05-22 15:36:44 -04:00
Qu Chen	5d59bbb6d9	Disconnect chained replicas when the replica performs PSYNC with the master always to avoid replication offset mismatch between master and chained replicas.	2020-05-22 12:37:59 +02:00

1 2 3 4 5 ...

278 Commits