futriix

Author	SHA1	Message	Date
Yossi Gottlieb	14305a876b	Fix crashes with io-threads-do-reads enabled. (#8230 ) Normally IO threads should simply read data from the socket into the buffer and attempt to parse it. If a protocol error is detected, a reply is generated which may result with installing a write handler which is not thread safe. This fix delays that until the client is processed back in the main thread. Fixes #8220 (cherry picked from commit 53f926dd9068a62795abc2e01dabac5be0ccbada)	2021-01-12 16:25:37 +02:00
George Prekas	fecbb48519	Add check for the MADV_FREE/fork arm64 Linux kernel bug (#8224 ) Older arm64 Linux kernels have a bug that could lead to data corruption during background save under the following scenario: 1) jemalloc uses MADV_FREE on a page, 2) jemalloc reuses and writes the page, 3) Redis forks the background save process, and 4) Linux performs page reclamation. Under these conditions, Linux will reclaim the page wrongfully and the background save process will read zeros when it tries to read the page. The bug has been fixed in Linux with commit: ff1712f953e27f0b0718762ec17d0adb15c9fd0b ("arm64: pgtable: Ensure dirty bit is preserved across pte_wrprotect()") This Commit adds an ignore-warnings config, when not found, redis will print a warning and exit on startup (default behavior). Co-authored-by: Oran Agra <oran@redislabs.com> (cherry picked from commit 75bbf56d37d1134cc9c4f36900a4a623604776a5) in 6.0 this warning is ignored by default in order to avoid adding regression, specifically for deployments that don't need persistence or replication	2021-01-12 16:25:37 +02:00
Qu Chen	01db11ce8d	Not over-allocate client query buffer when reading large objects. (#5954 ) In response to large client query buffer optimization introduced in 1898e6c. The calculation of the amount of remaining bytes we need to write to the query buffer was calculated wrong, as a result we are unnecessarily growing the client query buffer by sdslen(c->querybuf) always. This fix corrects that behavior. Please note the previous behavior prior to the before-mentioned change was correctly calculating the remaining additional bytes, and this change makes that calculate to be consistent. Useful context, the argument of size `ll` starts at qb_pos (which is now the beginning of the sds), but much of it may have already been read from the socket, so we only need to grow the sds for the remainder of it. (cherry picked from commit b8c98337cd8f90064085a7af4b416f6fb3e80e1c)	2021-01-12 16:25:37 +02:00
Oran Agra	fb0783c710	Handle output buffer limits for Module blocked clients (#8141 ) Module blocked clients cache the response in a temporary client, the reply list in this client would be affected by the recent fix in #7202, but when the reply is later copied into the real client, it would have bypassed all the checks for output buffer limit, which would have resulted in both: responding with a partial response to the client, and also not disconnecting it at all. (cherry picked from commit 11e0c4739b40c23a70aff5287e67e8ef6418bb2a)	2021-01-12 16:25:37 +02:00
Yossi Gottlieb	0b5312a4a6	Fix setproctitle related crashes. (#8150 ) Makes spt_init more careful with assumptions about what memory regions may be overwritten. It will now only consider a contiguous block of argv and envp elements and mind any gaps. (cherry picked from commit 9f272197ea0d8e59f565782a6a1b420b9b94266d)	2021-01-12 16:25:37 +02:00
Yossi Gottlieb	e2f6b95926	Fix use-after-free issue in spt_copyenv. (#8088 ) Seems to have gone unnoticed for a long time, because at least with glibc it will only be triggered if setenv() was called before spt_init, which Redis doesn't. Fixes #8064. (cherry picked from commit 87f59674b4f24fab3e98d3cb029ad85be5e68266)	2021-01-12 16:25:37 +02:00
Wang Yuan	f5d6057cbb	Backup keys to slots map and restore when fail to sync if diskless-load type is swapdb in cluster mode (#8108 ) When replica diskless-load type is swapdb in cluster mode, we didn't backup keys to slots map, so we will lose keys to slots map if fail to sync. Now we backup keys to slots map at first, and restore it properly when fail. This commit includes a refactory/cleanup of the backups mechanism (moving it to db.c and re-structuring it a bit). Co-authored-by: Oran Agra <oran@redislabs.com> (cherry picked from commit 10712afaf3e7f2ea859622fa5b27c96ee8f478c5)	2021-01-12 16:25:37 +02:00
Wang Yuan	ce6eee028d	Reset average ttl when empty databases (#8106 ) On FLUSHDB or full sync, reset old average TTL stat. This Stat is incrementally collected by the master over time when it searches for expired keys. (cherry picked from commit b6916ca91c623a892f61cc9d1958c19490eb73ae)	2021-01-12 16:25:37 +02:00
Oran Agra	626b8b2a13	Fix bug with module GIL being released prematurely (#8061 ) This is hopefully usually harmles. The server.ready_keys will usually be empty so the code after releasing the GIL will soon be done. The only case where it'll actually process things is when a module releases a client (or module) blocked on a key, by triggering this NOT from within a command (e.g. a timer event). This bug was introduced in redis 6.0.9, see #7903 (cherry picked from commit f5604de4c8962db75a770dafff3b0ac830b6c4d1)	2021-01-12 16:25:37 +02:00
Oran Agra	878d8476e5	Fix oom-score-adj-values range, abs options, and bug when used in config file (#8046 ) Fix: When oom-score-adj-values is provided in the config file after oom-score-adj yes, it'll take an immediate action, before readOOMScoreAdj was acquired, resulting in an error (out of range score due to uninitialized value. delay the reaction the real call is made by main(). Since the values are clamped to -1000..1000, and they're applied as an offset from the value at startup (which may be -1000), we need to allow the offsets to reach to +2000 so that a value of +1000 is achievable in case the value at startup was -1000. Adding an option for absolute values rather than relative ones. (cherry picked from commit f08fd95c6686467ecd46116dab150d47563aed1c)	2021-01-12 16:25:37 +02:00
guybe7	02236725af	EXISTS should not alter LRU, OBJECT should not reveal expired keys on replica (#8016 ) The bug was introduced by #5021 which only attempted avoid EXIST on an already expired key from returning 1 on a replica. Before that commit, dbExists was used instead of lookupKeyRead (which had an undesired effect to "touch" the LRU/LFU) Other than that, this commit fixes OBJECT to also come empty handed on expired keys in replica. And DEBUG DIGEST-VALUE to behave like DEBUG OBJECT (get the data from the key regardless of it's expired state) (cherry picked from commit 168dcb549c3d5253fc69d6150ce98f8eb9ca0c99)	2021-01-12 16:25:37 +02:00
Wang Yuan	bde35e3aaa	Disable rehash when redis has child process (#8007 ) In redisFork(), we don't set child pid, so updateDictResizePolicy() doesn't take effect, that isn't friendly for copy-on-write. The bug was introduced this in redis 6.0: e70fbad (cherry picked from commit 16e3af9d23bb4791a7571f889aad8f2c85546521)	2021-01-12 16:25:37 +02:00
Oran Agra	50012c9392	Merge 6.2 RC2	2021-01-12 16:21:03 +02:00
Oran Agra	51310e249a	Redis 6.2 RC2	2021-01-12 16:17:58 +02:00
Oran Agra	438a71e8bc	Merge unstable into 6.2	2021-01-12 10:14:04 +02:00
Oran Agra	7d23490bd7	fix race in cluster transactions test (#8312 ) we didn't wait for the commands executed on the master to reach the replica.	2021-01-12 10:03:45 +02:00
Madelyn Olson	2860aa88fc	Fix issues in wait test (#8310 ) This fixes three issues: 1. Using debug SLEEP was impacting the subsequent test, and causing it to pass reliably even though it should have failed. There was exactly 5 seconds of artificial pause (after 1000, wait 3000, wait 1000) between the debug sleep 5 and when we needed to unblock the client in the subsequent test. Now the test properly makes sure the client is unblocked, and the subsequent test is fixed. 2. Minor, the client pause types were using & comparisons instead of ==, since it was previously a flag. 3. Test is faster now that some of the hand wavy time is removed.	2021-01-12 09:46:24 +02:00
Oran Agra	37cf8d8f7f	Fix cluster diskless load swapdb test (#8308 ) The test was trying to wait for the replica to start loading the rdb from the master before it kills the master, but it was actually waiting for ROLE to be in "sync" mode, which corresponds to REPL_STATE_TRANSFER that starts before the actual loading starts. now instead it waits for the loading flag to be set. Besides, the test was dependent on the previous configuration of the servers, relying on the fact the replica is configured to persist (either RDB of AOF), now it is set explicitly.	2021-01-12 09:41:57 +02:00
Bob Li	3fb4197a74	Fix overflow of rdbWriteRaw return value (#8306 ) Saving string of more than 2GB to the RDB file, can result in corrupt RDB, or failure in rdbSave. S	2021-01-12 08:22:53 +02:00
John Sully	e9274a0a93	Fix intermittent module test failure, resizing the repl backlog is not good enough to clear the backlog Former-commit-id: d50e4bb8193a349eb3295a53e3111e764676ddcf	2021-01-12 00:29:07 +00:00
benschermel	dd58268712	update perf chart, build info, job post link Former-commit-id: 4485ca74f6d177803f7ca8d2a2b75dcba8e6817b	2021-01-11 11:40:00 -05:00
benschermel	ca7ca0ee3a	created ascii logo image Former-commit-id: 7f72b214e58164a9731508da6f95241874174210	2021-01-11 11:40:00 -05:00
sundb	c917e35c49	Assert that clusterAddNode can't fail (#8296 ) Assert that clusterAddNode can't fail	2021-01-09 10:24:58 -08:00
Oran Agra	be36d838dc	CLIENT PAUSE - don't drop together with other blocked clients (#8302 ) When the server state changes and blocked clients are being dropped, the paused clients should not be dropped, they're safe to keep since unlike other blocked types, these commands are not half way though processing, and the commands they sent may get rejected according to the new server state.	2021-01-09 10:22:20 -08:00
sundb	802fdc129b	Fix compile warning when define REDIS_TEST (#8261 ) Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-09 19:52:42 +02:00
Oran Agra	46729a7404	Fix last COW INFO report, Skip test on non-linux platforms (#8301 ) - the last COW report wasn't always read from the pipe (receiveLastChildInfo wasn't used) - but in fact, there's no reason we won't always try to drain that pipe so i'm unifying receiveLastChildInfo with receiveChildInfo - adjust threshold of the COW test when run in accurate mode - add some prints in case this test fails again - fix indentation, page size, and PID! in MacOS proc info p.s. it seems that pri_pages_dirtied is always 0	2021-01-08 23:35:30 +02:00
Yang Bodong	db1cc665f4	GEOSEARCH - ANY option, for limited search that returns ASAP (#8259 ) Support ANY option to return some results that match the criteria ASAP, without a complete search and implicit sorting.	2021-01-08 18:29:44 +02:00
guybe7	fc18c4ca6e	XADD and XTRIM, Trim by MINID, and new LIMIT argument (#8169 ) This PR adds another trimming strategy to XADD and XTRIM named MINID (complements the existing MAXLEN). It also adds a new LIMIT argument that allows incremental trimming by repeated calls (rather than all at once). This provides the ability to trim all records older than a certain ID (which makes it possible for the user to trim by age too). Example: XTRIM mystream MINID ~ 1608540753 will trim entries with id < 1608540753, but might not trim all (because of the ~ modifier) The purpose is to ease the use of streams. many users use streams as logs and the common case is wanting a log of the last X seconds rather than a log that contains maximum X entries (new MINID vs existing MAXLEN) The new LIMIT modifier is only supported when the trim strategy uses ~. i.e. when the user asked for exact trimming, it all happens in one go (no possibility for incremental trimming). However, when ~ is provided, we trim full rax nodes, up to the limit number of records. The default limit is 100*stream_node_max_entries (used when LIMIT is not provided). I.e. this is a behavior change (even if the existing MAXLEN strategy is used). An explicit limit of 0 means unlimited (but note that it's not the default). Other changes: Refactor arg parsing code for XADD and XTRIM to use common code.	2021-01-08 18:13:25 +02:00
Oran Agra	0967eff8be	Skip defrag tests on systems with bigger page sizes (#8294 ) The defragger works well on these systems, but the tests and their thresholds are not adjusted for these big pages, so the defragger isn't able to get down the fragmentation to the levels the test expects and it fails on "defrag didn't stop". Randomly choosing 8k as the threshold for the skipping Fixes #8265 (which had 65k pages)	2021-01-08 10:03:21 +02:00
Madelyn Olson	180cea0fe6	Throw error for conflicting bcast tracking prefixes (#8176 ) Throw an error if there are conflicting bcast tracking prefixes.	2021-01-08 00:00:35 -08:00
Madelyn Olson	6c2669397c	Add support for client pause WRITE (#8170 ) Implementation of client pause WRITE and client unpause	2021-01-07 23:36:54 -08:00
George Prekas	75bbf56d37	Add check for the MADV_FREE/fork arm64 Linux kernel bug (#8224 ) Older arm64 Linux kernels have a bug that could lead to data corruption during background save under the following scenario: 1) jemalloc uses MADV_FREE on a page, 2) jemalloc reuses and writes the page, 3) Redis forks the background save process, and 4) Linux performs page reclamation. Under these conditions, Linux will reclaim the page wrongfully and the background save process will read zeros when it tries to read the page. The bug has been fixed in Linux with commit: ff1712f953e27f0b0718762ec17d0adb15c9fd0b ("arm64: pgtable: Ensure dirty bit is preserved across pte_wrprotect()") This Commit adds an ignore-warnings config, when not found, redis will print a warning and exit on startup (default behavior). Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-07 17:06:05 +02:00
YaacovHazan	4dc4343d18	Report child copy-on-write info continuously Add INFO field, rdb_active_cow_size, to report COW of a live fork child while it's active. - once in 1024 keys check the time, and if there's more than one second since the last report send a report to the parent via the pipe. - refactor the child_info_data struct, it's an implementation detail that shouldn't be in the server struct, and not used to communicate data between caller and callee - remove the magic value from that struct (not sure what it was good for), and instead add handling of short reads. - add another value to the structure, cow_type, to indicate if the report is for the new rdb_active_cow_size field, or it's the last report of a successful operation - add new Module API to report the active COW - add more asserts variants to test.tcl	2021-01-07 16:14:29 +02:00
YaacovHazan	b43bcd0e8e	Refactory fork child related infra, Unify child pid This is a refactory commit, isn't suppose to have any actual impact. it does the following: - keep just one server struct fork child pid variable instead of 3 - have one server struct variable indicating the purpose of the current fork child. - redisFork is now responsible of updating the server struct with the pid, which means it can be the one that calls updateDictResizePolicy - move child info pipe handling into redisFork instead of having them repeated outside - there are two classes of fork purposes, mutually exclusive group (AOF, RDB, Module), and one that can create several forks to coexist in parallel (LDB, but maybe Modules some day too, Module API allows for that). - minor fix to killRDBChild: unlike killAppendOnlyChild and TerminateModuleForkChild, the killRDBChild doesn't clear the pid variable or call wait4, so checkChildrenDone does the cleanup for it. This commit removes the explicit calls to rdbRemoveTempFile, closeChildInfoPipe, updateDictResizePolicy, which didn't do any harm, but where unnecessary.	2021-01-07 16:14:29 +02:00
Jonah H. Harris	6211b360b7	Add ZRANGESTORE command, and improve ZSTORE command (#7844 ) Add ZRANGESTORE command, and improve ZSTORE command to deprecated Z[REV]RANGE[BYSCORE\|BYLEX]. Syntax for the new ZRANGESTORE command: ZRANGESTORE [BYSCORE \| BYLEX] [REV] [LIMIT offset count] New syntax for ZRANGE: ZRANGE [BYSCORE \| BYLEX] [REV] [WITHSCORES] [LIMIT offset count] Old syntax for ZRANGE: ZRANGE [WITHSCORES] Other ZRANGE commands remain unchanged. The implementation uses common code for all of these, by utilizing a consumer interface that in one command response to the client, and in the other command stores a zset key. Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-07 10:58:53 +02:00
Wen Hui	138ecf1de7	fix memory leak in processInlineBuffer error handling code (#8295 ) This code path is normally executed only when v6.0 and above replicates from v2.4	2021-01-06 21:20:53 +02:00
guybe7	68463baf82	Add XAUTOCLAIM (#7973 ) New command: XAUTOCLAIM <key> <group> <consumer> <min-idle-time> <start> [COUNT <count>] [JUSTID] The purpose is to claim entries from a stale consumer without the usual XPENDING+XCLAIM combo which takes two round trips. The syntax for XAUTOCLAIM is similar to scan: A cursor is returned (streamID) by each call and should be used as start for the next call. 0-0 means the scan is complete. This PR extends the deferred reply mechanism for any bulk string (not just counts) This PR carries some unrelated test code changes: - Renames the term "client" into "consumer" in the stream-cgroups test - And also changes DEBUG SLEEP into "after" Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-06 10:34:27 +02:00
huangzhw	f2bde2268a	sdscatfmt call sdsMakeRoomFor, asked for more space than intended (#8286 ) instead of asking for the extra new space it wanted, it asked to grow the string by the size it already has too. i.e. a string of 1000 bytes, needing to grow by 10 bytes, would have been asking for an additional 1010 bytes.	2021-01-05 18:41:53 +02:00
Oran Agra	a67621f495	Fix rdb checksum / crc64 on bigendian (#8270 ) Turns out the RDB checksum in Redis 6.0 on bigendian is broken. It always returned 0, so the RDB files are generated as if checksum is disabled, and will be loaded ok on littleendian, and on bigendian. But it'll not be able to load RDB files generated on littleendian or older versions. Similarly DUMP and RESTORE will work on the same version (0==0), but will be unable to exchange dump payloads with littleendian or old versions.	2021-01-05 09:15:10 +02:00
Oran Agra	bcde1d978d	Fix wrong order of key/value in Lua map response (#8266 ) When a Lua script returns a map to redis (a feature which was added in redis 6 together with RESP3), it would have returned the value first and the key second. If the client was using RESP2, it was getting them out of order, and if the client was in RESP3, it was getting a map of value => key. This was happening regardless of the Lua script using redis.setresp(3) or not. This also affects a case where the script was returning a map which it got from from redis by doing something like: redis.setresp(3); return redis.call() This fix is a breaking change for redis 6.0 users who happened to rely on the wrong order (either ones that used redis.setresp(3), or ones that returned a map explicitly). This commit also includes other two changes in the tests: 1. The test suite now handles RESP3 maps as dicts rather than nested lists 2. Remove some redundant (duplicate) tests from tracking.tcl	2021-01-05 08:29:20 +02:00
Oran Agra	0c9c1d3943	remove unused call to zmalloc_size in defrag.c (#8285 )	2021-01-04 23:16:19 +02:00
Itamar Haber	2a16c67f66	HELP subcommand, continued (#5531 ) * man-like consistent long formatting * Uppercases commands, subcommands and options * Adds 'HELP' to HELP for all * Lexicographical order * Uses value notation and other .md likeness * Moves const char help to top Keeps it under 80 chars * Misc help typos, consistent conjuctioning (i.e return and not returns) * Uses addReplySubcommandSyntaxError(c) all over Signed-off-by: Itamar Haber <itamar@redislabs.com>	2021-01-04 17:02:57 +02:00
Yang Bodong	f571f8467c	Swapdb should make transaction fail if there is any client watching keys (#8239 ) This PR not only fixes the problem that swapdb does not make the transaction fail, but also optimizes the FLUSHALL and FLUSHDB command to set the CLIENT_DIRTY_CAS flag to avoid unnecessary traversal of clients. FLUSHDB was changed to first iterate on all watched keys, and then on the clients watching each key. Instead of iterating though all clients, and for each iterate on watched keys. Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-04 14:48:28 +02:00
Meir Shpilraien (Spielrein)	fd4bc50b26	Fix assertion on loading AOF with timed out script. (#8284 ) If AOF file contains a long Lua script that timed out, then the `evalCommand` calls `blockingOperationEnds` which sets `server.blocked_last_cron` to 0. later on, the AOF `whileBlockedCron` function asserts that this value is not 0. The fix allows nesting call to `blockingOperationStarts` and `blockingOperationEnds`. The issue was first introduce in this commit: eca1014fe (Redis 6.2 RC1)	2021-01-04 13:42:17 +02:00
huangzhw	6230ba0811	sort Command lookupKeyRead and lookupKeyWrite are used on the opposite (#8283 ) This is a recent problem, introduced by 9b2a426 (redis 6.0) The implications are: The sole difference between LookupKeyRead and LookupKeyWrite is for command executed on a replica, which are not received from its master client. (for the master, and for the master client on the replica, these two functions behave the same)! Since SORT is a write command, this bug only implicates a writable-replica. And these are its implications: - SORT STORE will behave as it did before the above mentioned commit (like before redis 6.0). on a writable-replica an already logically expired the key would have appeared missing. (store dest key would be deleted, instead of being populated with the data from the already logically expired key) - SORT (the non store variant, which in theory could have been executed on read-only-replica if it weren't for the write flag), will (in redis 6.0) have a new bug and return the data from the already logically expired key instead of empty response.	2021-01-04 10:28:47 +02:00
kukey	bb75703b17	GEOADD - add [CH] [NX\|XX] options (#8227 ) New command flags similar to what SADD already has. Co-authored-by: huangwei03 <huangwei03@kuaishou.com> Co-authored-by: Itamar Haber <itamar@redislabs.com> Co-authored-by: Oran Agra <oran@redislabs.com>	2021-01-03 17:13:37 +02:00
Oran Agra	e0440ea3a3	Fix uninitialized variable in new errorstats commit (#8280 )	2021-01-03 16:14:52 +02:00
Oran Agra	23c7f27df2	Fix rare assertion as a result of: active defrag while loading (#8281 ) In #7726 (part of 6.2), we added a mechanism for whileBlockedCron, this mechanism has an assertion to make sure the timestamp in whileBlockedCron was always set correctly before the blocking operation starts. I now found (thanks to our CI) two bugs in that area: 1) CONFIG RESETSTAT (if it was allowed during loading) would have cleared this var 2) the call stopLoading (which calls whileBlockedCron) was made too early, while the rio is still in use, in which case the update_cksum (rdbLoadProgressCallback) may still be called and whileBlockedCron can assert.	2021-01-03 16:09:29 +02:00
Oran Agra	93b8b13930	fix crash in redis-cli after making cluster backup (#8267 ) getRDB is "designed" to work in two modes: one for redis-cli --rdb and one for redis-cli --cluster backup. in the later case it uses the hiredis connection from the cluster nodes and it used to free it without nullifying the context, so a later attempt to free the context would crash. I suppose the reason it seems to want to free the hiredis context ASAP is that it wants to disconnect the replica link, so that replication buffers will not be accumulated.	2021-01-03 11:56:26 +02:00
Oran Agra	ba4bff0723	Fix leak in new errorstats commit, and a flaky test (#8278 )	2021-01-02 08:37:19 +02:00

... 7 8 9 10 11 ...

11922 Commits