futriix

Author	SHA1	Message	Date
guybe7	8b33aa6661	Modules: Replicate lazy-expire even if replication is not allowed (#8816 ) Before this commit using RM_Call without "!" could cause the master to lazy-expire a key (delete it) but without replicating to replicas. This could cause the replica's memory usage to gradually grow and could also cause consistency issues if the master and replica have a clock diff. This bug was introduced in #8617 Added a test which demonstrates that scenario.	2021-04-19 17:16:02 +03:00
Harkrishn Patro	b5b604aebc	ACL channels permission handling for save/load scenario. (#8794 ) In the initial release of Redis 6.2 setting a user to only allow pubsub access to a specific channel, and doing ACL SAVE, resulted in an assertion when ACL LOAD was used. This was later changed by #8723 (not yet released), but still not properly resolved (now it errors instead of crash). The problem is that the server that generates an ACL file, doesn't know what would be the setting of the acl-pubsub-default config in the server that will load it. so ACL SAVE needs to always start with resetchannels directive. This should still be compatible with old acl files (from redis 6.0), and ones from earlier versions of 6.2 that didn't mess with channels. Co-authored-by: Harkrishn Patro <harkrisp@amazon.com> Co-authored-by: Oran Agra <oran@redislabs.com>	2021-04-19 13:27:44 +03:00
sundb	cb73d3a084	Fix ouput buffer limit test (#8803 ) The tail size of c->reply is 16kb, but in the test only publish a few chars each time, due to a change in #8699, the obuf limit is now checked a new memory allocation is made, so this test would have sometimes failed to trigger a soft limit disconnection in time. The solution is to write bigger payloads to the output buffer, but still limit their rate (not more than 100k/s).	2021-04-19 10:08:07 +03:00
Wen Hui	06c087e5ce	fix invalid master_link_down_since_seconds in info repication (#8785 ) When replica never successfully connect to master, server.repl_down_since will be initialized to 0, therefore, the info master_link_down_since_seconds was showing the current unix timestamp, which does not make much sense. This commit fixes the issue by showing master_link_down_since_seconds to -1. means the replica never connect to master before. This commit also resets this variable back to 0 when a replica is turned into a master, so that it'll behave the same if the master is later turned into a replica again. The implication of this change is that if some app is checking if the value is > 60 do something, like conclude the replica is stale, this could case harm (changing a big positive number with a small one).	2021-04-19 09:34:21 +03:00
Yossi Gottlieb	4d0706b759	Revert cluster slot migration tests. (#8806 ) Disables #8649 and subsequent attempts to stabilize the test.	2021-04-18 20:51:08 +03:00
Oran Agra	442bd29612	Fix timing of new replication test (#8807 ) In github actions CI with valgrind, i saw that even the fast replica (one that wasn't paused), didn't get to complete the replication fast enough, and ended up getting disconnected by timeout. Additionally, due to a typo in uname, we didn't get to actually run the CPU efficiency part of the test.	2021-04-18 15:12:34 +03:00
Oran Agra	ca1a42e3e6	Improve testsuite print of log file (#8805 ) 1. the `dump_logs` option would have printed only logs of servers that were spawn before the test proc started, and not ones that the test proc started inside it. 2. when a server proc catches an exception it should normally forward the exception upwards, specifically when it's an assertion that should be caught by a test proc above. however, in `durable` mode, we caught all exceptions printed them to stdout and let the code continue, this was wrong to do for assertions, which should have still been propagated to the test function. 3. don't bother to search for crash log to print if we printed the the entire log anyway 4. if no crash log was found, no need to print anything (i.e. the fact it wasn't found) 5. rename warnings_from_file to crashlog_from_file	2021-04-18 11:55:54 +03:00
guybe7	9be210e082	ReplicationCron: Prevent invalid access to freed pointer (#8799 ) Fixes #8797	2021-04-16 16:56:38 +03:00
Wen Hui	256ae69e16	Avoid close before logging to preserve errno (#8703 )	2021-04-15 21:11:40 +03:00
guybe7	47c54769ce	Fix error reply in case zset command is not the STORE variant (#8793 )	2021-04-15 18:36:51 +03:00
guybe7	0bf6c205db	Add a timeout mechanism for replicas stuck in fullsync (#8762 ) Starting redis 6.0 (part of the TLS feature), diskless master uses pipe from the fork child so that the parent is the one sending data to the replicas. This mechanism has an issue in which a hung replica will cause the master to wait for it to read the data sent to it forever, thus preventing the fork child from terminating and preventing the creations of any other forks. This PR adds a timeout mechanism, much like the ACK-based timeout, we disconnect replicas that aren't reading the RDB file fast enough.	2021-04-15 17:18:51 +03:00
Bonsai	eb08b431fe	clean condition and variable store to nwritten that is never read (#8788 )	2021-04-14 22:44:08 -07:00
Bonsai	0d8cd01476	fix typo, stracture to structure (#8784 )	2021-04-14 15:46:54 +03:00
YaacovHazan	ef92c48b70	stabilized and improve pendingquerybuf test suit (#8780 ) replace the hardcoded after 2000, with waiting for the sync and wait for condition	2021-04-14 11:49:00 +03:00
Viktor Söderqvist	bd024cf32a	Modules API docs: Sections and links (#8442 ) * Modules API docs: Link API function names to their definitions Occurrences of API functions are linked to their definition. A function index with links to all functions is added on the bottom of the page. Comment blocks in module.c starting with a markdown h2 heading are used as sections. A table of contents is generated from these headings. The functions names are changed from h2 to h3, since they are now rendered as sub-headings within each section. Existing sections in module.c are used with some minor changes. Some documentation text is added or sligtly modified. The markdown renderer will add IDs which may clash with our generated IDs. By prefixing section IDs with "section-" we make them different. Replace double dashes with a unicode long ndash	2021-04-14 00:58:05 +03:00
Huang Zhw	ec267f173f	Remove extra param in role-change event of sentinelEvent. (#8742 )	2021-04-13 20:19:42 +03:00
Viktor Söderqvist	59a7a1e353	Small doc fix for stream module API (#8757 ) In a code example, using RedisModule_FreeString instead of RedisModule_Free makes it behave correctly regardless of whether automatic memory is used or not.	2021-04-13 20:14:12 +03:00
Oran Agra	1ab7cdff30	Revert "Fix: server will crash if rdbload or rdbsave method is not provided in module (#8670 )" (#8771 ) This reverts commit 15e2b68214042c4a72a4f793799420fac59f7b4c.	2021-04-13 17:41:46 +03:00
Oran Agra	5518114bc4	Add more attempts to a timing sensitive test (#8770 )	2021-04-13 17:35:10 +03:00
Wen Hui	05bc6b8679	Clean up no-conf server warning for sentinel mode (#8769 )	2021-04-13 16:28:54 +03:00
Oran Agra	8133b72268	fix access to uninitialized var in checkClientPauseTimeoutAndReturnIfPaused (#8765 ) server.client_pause_end_time is uninitialized, or actually 0, at startup, which means this method would think the timeout was reached and go look for paused clients. This causes no harm since unpauseClients will not find any paused clients.	2021-04-13 08:41:12 +03:00
Oran Agra	e57e8779d8	Fix busy loop in ae.c when timer event is about to fire (#8764 ) The code used to decide on the next time to wake on a timer with microsecond accuracy, but when deciding to go to sleep it used milliseconds accuracy (with truncation), this means that it would wake up too early, see that there's no timer to process, and go to sleep again for 0ms again and again until the right microsecond arrived. i.e. a timer for 100ms, would sleep for 99ms, but then do a busy loop through the kernel in the last millisecond, triggering many calls to beforeSleep. The fix is to change all the logic in ae.c to work with microseconds, which is good since most of the ae backends support micro (or even nano) seconds. however the epoll backend, doesn't support micro, so to avoid this problem it needs to round upwards, rather than truncate. Issue created by the monotonic timer PR #7644 (redis 6.2) Before that, all the timers in ae.c were in milliseconds (using mstime), so when it requested the backend to sleep till the next timer event, it would have worked ok.	2021-04-13 07:35:03 +03:00
Yossi Gottlieb	43e1f93f25	Fix failing cluster tests. (#8763 ) Disable replica migration to avoid a race condition where the migrated-from node turns into a replica. Long term, this test should probably be improved to handle multiple slots and accept such auto migrations but this is a quick fix to stabilize the CI without completely dropping this test.	2021-04-13 00:00:57 +03:00
John Sully	7f68765981	Allow prefetch even with a single thread Former-commit-id: 2e68821b330a6bae8a352e37c3da643d504b2ce3	2021-04-12 04:44:52 +00:00
John Sully	92028ddb5c	Avoid posting unnecessary async tasks Former-commit-id: 1661d2a05943f96992be195f5dc32dd9a67c0e68	2021-04-12 03:44:29 +00:00
John Sully	2eb3710039	Fix issue where GC is not free'd until a BGSAVE Former-commit-id: 38523e6b508cf5f4b40c178dfe98554abea8f6bd	2021-04-12 03:44:15 +00:00
John Sully	2c8540c4fd	Make prefetch more aggressive Former-commit-id: 25a5cfefcf7fa3451e92500f5d310290c4b6bbde	2021-04-12 03:42:05 +00:00
John Sully	5793bb0c34	Don't prefetch when lock contention is low, it increases latency Former-commit-id: 0d21614e0e5aba28acd364231823d51a3073081f	2021-04-12 03:41:53 +00:00
John Sully	3ed9963363	Reduce P99 latency with async rehash Former-commit-id: 7ea4c26fd82c0cdfa377183083f38a43336e480b	2021-04-12 03:39:13 +00:00
John Sully	aa64a260fc	Make prefetch configurable Former-commit-id: 3b660347d70cc25d57119080bd43fb4671e36488	2021-04-12 03:38:57 +00:00
Wang Yuan	8c5634f4eb	Fix wrong check for aof fsync and handle aof fsync errno (#8751 ) The bio aof fsync fd may be closed by main thread (AOFRW done handler) and even possibly reused for another socket, pipe, or file. This can can an EBADF or EINVAL fsync error, which will lead to -MISCONF errors failing all writes. We just ignore these errno because aof fsync did not really fail. We handle errno when fsyncing aof in bio, so we could know the real reason when users get -MISCONF Errors writing to the AOF file error Issue created with #8419	2021-04-11 08:14:31 +03:00
yjph	2fa3e1d999	Fix the display of make install (#8667 )	2021-04-10 21:25:53 +03:00
John Sully	c92b90eb91	DO not set the timethread as high priority, it can starve the server threads Former-commit-id: faeac65371af9d6b6effe0886bcbdefaec24ad6d	2021-04-09 01:06:24 +00:00
Wen Hui	dfbfe4c558	use getPositiveLongFromObjectOrReply for positive check of args (#8750 ) just a cleanup	2021-04-07 10:28:53 +03:00
Yang Bodong	79793c3d55	Fix out of range confusing error messages (XAUTOCLAIM, RPOP count) (#8746 ) Fix out of range error messages to be clearer (avoid mentioning 9223372036854775807) * Fix XAUTOCLAIM COUNT option confusing error msg * Fix other RPOP and alike error message to mention positive	2021-04-07 10:01:28 +03:00
Oran Agra	7af6fff7b9	update help.h (#8747 )	2021-04-06 12:42:18 +03:00
yjph	1ea4f128d3	Update the location information in some URLs (#8595 )	2021-04-06 12:29:02 +03:00
Bonsai	15e2b68214	Fix: server will crash if rdbload or rdbsave method is not provided in module (#8670 ) With this fix, module data type registration will fail if the load or save callbacks are not defined, or the optional aux load and save callbacks are not either both defined or both missing.	2021-04-06 12:09:36 +03:00
Yossi Gottlieb	e9ae932bc7	Clean up and stabilize cluster migration tests. (#8745 ) This is work in progress, focusing on two main areas: * Avoiding race conditions with cluster configuration propagation. * Ignoring limitations with redis-cli --cluster fix which makes it hard to distinguish real errors (e.g. failure to fix) from expected conditions in this test (e.g. nodes not agreeing on configuration).	2021-04-06 11:57:57 +03:00
Huang Zhw	f5cc88e65d	Fix "default" and overwritten / reset users will not have pubsub channels permissions by default. (#8723 ) Background: Redis 6.2 added ACL control for pubsub channels (#7993), which were supposed to be permissive by default to retain compatibility with redis 6.0 ACL. But due to a bug, only newly created users got this `acl-pubsub-default` applied, while overwritten (updated) users got reset to `resetchannels` (denied). Since the "default" user exists before loading the config file, any ACL change to it, results in an update / overwrite. So when a "default" user is loaded from config file or include ACL file with no channels related rules, the user will not have any permissions to any channels. But other users will have default permissions to any channels. When upgraded from 6.0 with config rewrite, this will lead to "default" user channels permissions lost. When users are loaded from include file, then call "acl load", users will also lost channels permissions. Similarly, the `reset` ACL rule, would have reset the user to be denied access to any channels, ignoring `acl-pubsub-default` and breaking compatibility with redis 6.0. The implication of this fix is that it regains compatibility with redis 6.0, but breaks compatibility with redis 6.2.0 and 2.0.1. e.g. after the upgrade, the default user will regain access to pubsub channels. Other changes: Additionally this commit rename server.acl_pubusub_default to server.acl_pubsub_default and fix typo in acl tests.	2021-04-05 23:13:20 +03:00
christianEQ	ec11cc177e	check for musl in logging.tcl as backtrace() is not available with it Former-commit-id: 2c8239f9cb30aa32de936be09522e6429aa40326	2021-04-05 11:09:49 -04:00
Huang Zhw	84414bb33f	redis-cli --bigkeys / memkeys, report detailed error on dbsize failure (#8740 ) When DBSIZE failed (e.g. on AUTH error), the printed error didn't reflect the reason.	2021-04-05 08:30:41 +03:00
Sokolov Yura	0d7a17b252	Add cluster-allow-replica-migration option. (#5285 ) Previously (and by default after commit) when master loose its last slot (due to migration, for example), its replicas will migrate to new last slot holder. There are cases where this is not desired: * Consolidation that results with removed nodes (including the replica, eventually). * Manually configured cluster topologies, which the admin wishes to preserve. Needlessly migrating a replica triggers a full synchronization and can have a negative impact, so we prefer to be able to avoid it where possible. This commit adds 'cluster-allow-replica-migration' configuration option that is enabled by default to preserve existed behavior. When disabled, replicas will not be auto-migrated. Fixes #4896 Co-authored-by: Oran Agra <oran@redislabs.com>	2021-04-04 09:43:24 +03:00
Wang Yuan	689b9ce067	Handle remaining fsync errors (#8419 ) In `aof.c`, we call fsync when stop aof, and now print a log to let user know that if fail. In `cluster.c`, we now return error, the calling function already handles these write errors. In `redis-cli.c`, users hope to save rdb, we now print a message if fsync failed. In `rio.c`, we now treat fsync errors like we do for write errors. In `server.c`, we try to fsync aof file when shutdown redis, we only can print one log if fail. In `bio.c`, if failing to fsync aof file, we will set `aof_bio_fsync_status` to error , and reject writing just like last writing aof error, moreover also set INFO command field `aof_last_write_status` to error.	2021-04-01 12:45:15 +03:00
Valentino Geron	f2a8041cd5	Fix XAUTOCLAIM response to return the next available id as the cursor (#8725 ) This command used to return the last scanned entry id as the cursor, instead of the next one to be scanned. so in the next call, the user could / should have sent `(cursor` and not just `cursor` if he wanted to avoid scanning the same record twice. Scanning the record twice would look odd if someone is checking what exactly was scanned, but it also has a side effect of incrementing the delivery count twice.	2021-04-01 12:13:55 +03:00
Oran Agra	9ab2625ce2	Solve sentinel test issue in TLS due to recent tests change. (#8728 ) 371d7f120 added a change that configures the tcp (plaintext) port alongside the tls port, this causes the INFO command for tcp_port to return that instead of the tls port when running in tls, and that broke the sentinel tests that query it. the fix is to add a method that gets the right port from CONFIG instead of relying on the tcp_port info field.	2021-04-01 09:44:44 +03:00
guybe7	41832aee11	zsetAdd: Fix wrong reply in case of INCR and GT/LT (#8717 ) If GT/LT fails the operation we need to reply with nill (like failure due to NX). Other changes: Add the missing $encoding suffix to many zset tests Note: there's a behavior change just in case of INCR + GT/LT that fails. The old code was replying with the wrong (rejected) score, and now it'll reply with nil. Note that that's anyway a corner case so this "behavior change" shouldn't have too much affect. Using GT/LT with INCR has a predictable result even before we run the command (INCR GT will only only / always fail if the increment is negative).	2021-04-01 09:33:53 +03:00
Wen Hui	b37ac422e7	generalize config file check for sentinel (#8730 ) The implications of this change is just that in the past when a config file was missing, in some cases it was exiting before printing the sever startup prints and sometimes after, and now it'll always exit before printing them.	2021-04-01 09:01:05 +03:00
wuYin	38c3522388	reuse existing range comparators in the zset (#8714 ) There are 2 common range comparators for skiplist: zslValueGteMin and zslValueLteMax, but they're not being reused in zslDeleteRangeByScore This is a small change to make code cleaner.	2021-04-01 08:50:23 +03:00
Dvir Volk	7225b3cb53	Added macros for RM_log logging levels (#4246 ) Added macros for RM_log logging levels, to avoid typos and the need to memorize the level strings by heart	2021-04-01 08:44:57 +03:00

1 2 3 4 5 ...

12492 Commits