futriix

Author	SHA1	Message	Date
John Sully	a265f815e2	Merge OSS back into pro	2022-05-18 01:29:15 +00:00
jsully	5e4dec1a16	Merge branch 'multithread_load' into 'keydbpro' Multithread load See merge request external-collab/keydb-pro-6!5 Former-commit-id: 20e712244071028b0f75ccad477308efd139261f	2021-10-08 17:55:55 +00:00
Malavan Sotheeswaran	5ed9217c15	Merge fix to dict resize during rdb load Former-commit-id: c398d5f8a027c67acac64bdbfbd01486dde555eb	2021-09-17 16:10:48 +00:00
malavan	51fe42b10e	improve overwrite key performance Former-commit-id: 56f9d5528385ea78074a308c6d3987b920d6cc35	2021-09-14 17:06:04 +00:00
John Sully	6b4f686f5f	Merge branch 'unstable' into keydbpro Former-commit-id: 205d8f18d2bb8df5253bab40578b006b7aa73fd5	2021-05-28 23:32:46 +00:00
John Sully	5267928381	Merge tag '6.2.2' into unstable Former-commit-id: 93ebb31b17adec5d406d2e30a5b9ea71c07fce5c	2021-05-21 05:54:39 +00:00
John Sully	fe8efa916b	Merge tag '6.2.1' into unstable Former-commit-id: bfed57e3e0edaa724b9d060a6bb8edc5a6de65fa	2021-05-19 02:59:48 +00:00
John Sully	61e054f826	Fix test hang Former-commit-id: 23647390e628de07759f8e7d8768a7f638edf01d	2021-05-07 00:28:10 +00:00
John Sully	d71956e1ce	Fix bug where we skip valid dict elements in dictGetRandomKey Former-commit-id: 626b56b00824573660af0c47b210fd1e8d2cfeb2	2021-03-24 20:26:33 +00:00
John Sully	4f06fb2b4f	Make async rehash behave with snapshots (thread safety issues) Former-commit-id: 372adf39a80252b8035e3c948fcaf7d5ef6f928f	2021-03-16 02:38:41 +00:00
sundb	95d6297db8	Add run all test support with define REDIS_TEST (#8570 ) 1. Add `redis-server test all` support to run all tests. 2. Add redis test to daily ci. 3. Add `--accurate` option to run slow tests for more iterations (so that by default we run less cycles (shorter time, and less prints). 4. Move dict benchmark to REDIS_TEST. 5. fix some leaks in tests 6. make quicklist tests run on a specific fill set of options rather than huge ranges 7. move some prints in quicklist test outside their loops to reduce prints 8. removing sds.h from dict.c since it is now used in both redis-server and redis-cli (uses hiredis sds)	2021-03-10 09:13:11 +02:00
John Sully	91afeb5d0e	Fix issue where finding random keys is slow due to not shrinking the hash table. Former-commit-id: fd05010cdcf9d6a6187ca2e18bc55adbaa680a02	2021-02-22 09:14:24 +00:00
Jim Brunner	06966d2a0e	dict: pause rehash, minor readability refactor (#8515 ) The `dict` field `iterators` is misleading and incorrect. This variable is used for 1 purpose - to pause rehashing. The current `iterators` field doesn't actually count "iterators". It counts "safe iterators". But - it doesn't actually count safe iterators either. For one, it's only incremented once the safe iterator begins to iterate, not when it's created. It's also incremented in `dictScan` to prevent rehashing (and commented to make it clear why `iterators` is being incremented during a scan). This update renames the field as `pauserehash` and creates 2 helper macros `dictPauseRehashing(d)` and `dictResumeRehashing(d)`	2021-02-20 12:56:30 +02:00
John Sully	e5343f47c2	Merge branch 'unstable' into keydbpro Former-commit-id: e2140793f2bf565972ada799af73bf4457e2718d	2021-02-08 18:17:09 +00:00
John Sully	7c700f1987	Ensure rehash completes even when we're in a long running task Former-commit-id: f107746e90f7a8ff3c7094145ee1ad438911e8c2	2021-02-07 19:11:05 -05:00
John Sully	bef72e5f6a	Implement rehash during spinlock Former-commit-id: f68a26381a35b27948046d46c2c7bcfbdc21143d	2021-02-07 19:11:05 -05:00
John Sully	5ab1095022	Allow multiple threads to rehash simultaneously Former-commit-id: 5a2cc90786dfd1bfd341dbf5713bcde01f0cfff3	2021-02-07 19:11:05 -05:00
John Sully	c6c1084dd8	Initial implementation Former-commit-id: 958f2c00c8efc15dc91fdeec2ff2e2ae2016c124	2021-02-07 19:11:05 -05:00
Greg Femec	266949c7fc	Fix random element selection for large hash tables. (#8133 ) When a database on a 64 bit build grows past 2^31 keys, the underlying hash table expands to 2^32 buckets. After this point, the algorithms for selecting random elements only return elements from half of the available buckets because they use random() which has a range of 0 to 2^31 - 1. This causes problems for eviction policies which use dictGetSomeKeys or dictGetRandomKey. Over time they cause the hash table to become unbalanced because, while new keys are spread out evenly across all buckets, evictions come from only half of the available buckets. Eventually this half of the table starts to run out of keys and it takes longer and longer to find candidates for eviction. This continues until no more evictions can happen. This solution addresses this by using a 64 bit PRNG instead of libc random(). Co-authored-by: Greg Femec <gfemec@google.com>	2020-12-23 15:52:07 +02:00
Oran Agra	7ca00d694d	Sanitize dump payload: fail RESTORE if memory allocation fails When RDB input attempts to make a huge memory allocation that fails, RESTORE should fail gracefully rather than die with panic	2020-12-06 14:54:34 +02:00
Wang Yuan	75f9dec644	Limit the main db and expires dictionaries to expand (#7954 ) As we know, redis may reject user's requests or evict some keys if used memory is over maxmemory. Dictionaries expanding may make things worse, some big dictionaries, such as main db and expires dict, may eat huge memory at once for allocating a new big hash table and be far more than maxmemory after expanding. There are related issues: #4213 #4583 More details, when expand dict in redis, we will allocate a new big ht[1] that generally is double of ht[0], The size of ht[1] will be very big if ht[0] already is big. For db dict, if we have more than 64 million keys, we need to cost 1GB for ht[1] when dict expands. If the sum of used memory and new hash table of dict needed exceeds maxmemory, we shouldn't allow the dict to expand. Because, if we enable keys eviction, we still couldn't add much more keys after eviction and rehashing, what's worse, redis will keep less keys when redis only remains a little memory for storing new hash table instead of users' data. Moreover users can't write data in redis if disable keys eviction. What this commit changed ? Add a new member function expandAllowed for dict type, it provide a way for caller to allow expand or not. We expose two parameters for this function: more memory needed for expanding and dict current load factor, users can implement a function to make a decision by them. For main db dict and expires dict type, these dictionaries may be very big and cost huge memory for expanding, so we implement a judgement function: we can stop dict to expand provisionally if used memory will be over maxmemory after dict expands, but to guarantee the performance of redis, we still allow dict to expand if dict load factor exceeds the safe load factor. Add test cases to verify we don't allow main db to expand when left memory is not enough, so that avoid keys eviction. Other changes: For new hash table size when expand. Before this commit, the size is that double used of dict and later _dictNextPower. Actually we aim to control a dict load factor between 0.5 and 1.0. Now we replace 2 with +1, since the first check is that used >= size, the outcome of before will usually be the same as _dictNextPower(used+1). The only case where it'll differ is when dict_can_resize is false during fork, so that later the _dictNextPower(used2) will cause the dict to jump to 4 (i.e. _dictNextPower(10252) will return 4096). Fix rehash test cases due to changing algorithm of new hash table size when expand.	2020-12-06 11:53:04 +02:00
John Sully	a14b2097c3	Remove unnecessary key comparisons in perf critical snapshot paths Former-commit-id: 40f8a8d102fdca9443399ef03a47df609b146d58	2020-08-15 23:25:58 +00:00
John Sully	6c83ecbb48	Prehash the tombstone for cleanup Former-commit-id: c9d97a7c7448fc769486175bea1648589487c87c	2020-08-14 16:05:39 +00:00
John Sully	54cc984d86	Make snapshot completion faster and add latency monitor Former-commit-id: 8063be6ee70a652c22c3263dccf318366e208891	2020-06-04 01:07:14 -04:00
John Sully	d8dfc76673	Add new faster dictionary merging for use by snapshotting code Former-commit-id: b6f120b3d401c92ef5cf1cc6f5e77da139e33a97	2020-02-01 20:17:40 -05:00
John Sully	dc47a20da3	Fix multithreading data races Former-commit-id: 80f6e5818fd575cb08a5f620c35eed1cd862eb57	2019-11-24 13:44:43 -05:00
John Sully	7859e0562f	Move remaning files dependent on server.h over to C++ Former-commit-id: 8c133b605c65212b023d35b3cb71e63b6a4c443a	2019-04-08 01:00:48 -04:00
John Sully	ebf0ae3e97	Merge branch 'unstable' of https://github.com/antirez/redis into Multithread	2019-02-21 18:17:12 -05:00
antirez	61a01793ed	Better distribution for set get-random-element operations.	2019-02-18 18:27:18 +01:00
John Sully	5fc8747feb	make headers C++ safe	2019-02-15 16:55:40 -05:00
zhaozhao.zz	af08cd716d	dict: fix the int problem for defrag	2017-12-05 15:38:03 +01:00
zhaozhao.zz	7c6ddbc37d	dict: fix the int problem for defrag	2017-12-05 15:38:03 +01:00
antirez	b49721d57d	Use SipHash hash function to mitigate HashDos attempts. This change attempts to switch to an hash function which mitigates the effects of the HashDoS attack (denial of service attack trying to force data structures to worst case behavior) while at the same time providing Redis with an hash function that does not expect the input data to be word aligned, a condition no longer true now that sds.c strings have a varialbe length header. Note that it is possible sometimes that even using an hash function for which collisions cannot be generated without knowing the seed, special implementation details or the exposure of the seed in an indirect way (for example the ability to add elements to a Set and check the return in which Redis returns them with SMEMBERS) may make the attacker's life simpler in the process of trying to guess the correct seed, however the next step would be to switch to a log(N) data structure when too many items in a single bucket are detected: this seems like an overkill in the case of Redis. SPEED REGRESION TESTS: In order to verify that switching from MurmurHash to SipHash had no impact on speed, a set of benchmarks involving fast insertion of 5 million of keys were performed. The result shows Redis with SipHash in high pipelining conditions to be about 4% slower compared to using the previous hash function. However this could partially be related to the fact that the current implementation does not attempt to hash whole words at a time but reads single bytes, in order to have an output which is endian-netural and at the same time working on systems where unaligned memory accesses are a problem. Further X86 specific optimizations should be tested, the function may easily get at the same level of MurMurHash2 if a few optimizations are performed.	2017-02-20 17:29:17 +01:00
antirez	adeed29a99	Use SipHash hash function to mitigate HashDos attempts. This change attempts to switch to an hash function which mitigates the effects of the HashDoS attack (denial of service attack trying to force data structures to worst case behavior) while at the same time providing Redis with an hash function that does not expect the input data to be word aligned, a condition no longer true now that sds.c strings have a varialbe length header. Note that it is possible sometimes that even using an hash function for which collisions cannot be generated without knowing the seed, special implementation details or the exposure of the seed in an indirect way (for example the ability to add elements to a Set and check the return in which Redis returns them with SMEMBERS) may make the attacker's life simpler in the process of trying to guess the correct seed, however the next step would be to switch to a log(N) data structure when too many items in a single bucket are detected: this seems like an overkill in the case of Redis. SPEED REGRESION TESTS: In order to verify that switching from MurmurHash to SipHash had no impact on speed, a set of benchmarks involving fast insertion of 5 million of keys were performed. The result shows Redis with SipHash in high pipelining conditions to be about 4% slower compared to using the previous hash function. However this could partially be related to the fact that the current implementation does not attempt to hash whole words at a time but reads single bytes, in order to have an output which is endian-netural and at the same time working on systems where unaligned memory accesses are a problem. Further X86 specific optimizations should be tested, the function may easily get at the same level of MurMurHash2 if a few optimizations are performed.	2017-02-20 17:29:17 +01:00
oranagra	763f49243d	active defrag improvements	2017-01-02 09:42:32 +02:00
oranagra	5ab6a54cc6	active defrag improvements	2017-01-02 09:42:32 +02:00
oranagra	53511a429c	active memory defragmentation	2016-12-30 03:37:52 +02:00
oranagra	7aa9e6d2ae	active memory defragmentation	2016-12-30 03:37:52 +02:00
antirez	42acf62e5f	dict.c: dictReplaceRaw() -> dictAddOrFind(). What they say about "naming things" in programming?	2016-09-14 16:43:38 +02:00
antirez	09a50d34a2	dict.c: dictReplaceRaw() -> dictAddOrFind(). What they say about "naming things" in programming?	2016-09-14 16:43:38 +02:00
oranagra	40cf4d9a0a	dict.c: introduce dictUnlink(). Notes by @antirez: This patch was picked from a larger commit by Oran and adapted to change the API a bit. The basic idea is to avoid double lookups when there is to use the value of the deleted entry. BEFORE: entry = dictFind( ... ); /* 1st lookup. / / Do somethjing with the entry. / dictDelete(...); / 2nd lookup. / AFTER: entry = dictUnlink( ... ); / 1st lookup. / / Do somethjing with the entry. / dictFreeUnlinkedEntry(entry); / No lookups!. */	2016-09-14 12:18:59 +02:00
oranagra	afcbcc0e58	dict.c: introduce dictUnlink(). Notes by @antirez: This patch was picked from a larger commit by Oran and adapted to change the API a bit. The basic idea is to avoid double lookups when there is to use the value of the deleted entry. BEFORE: entry = dictFind( ... ); /* 1st lookup. / / Do somethjing with the entry. / dictDelete(...); / 2nd lookup. / AFTER: entry = dictUnlink( ... ); / 1st lookup. / / Do somethjing with the entry. / dictFreeUnlinkedEntry(entry); / No lookups!. */	2016-09-14 12:18:59 +02:00
oranagra	1ef16debfb	Optimize repeated keyname hashing. (Change cherry-picked and modified by @antirez from a larger commit provided by @oranagra in PR #3223).	2016-09-12 13:19:05 +02:00
oranagra	68bf45fa1e	Optimize repeated keyname hashing. (Change cherry-picked and modified by @antirez from a larger commit provided by @oranagra in PR #3223).	2016-09-12 13:19:05 +02:00
antirez	0c05436cef	Lazyfree: a first implementation of non blocking DEL.	2015-10-01 13:00:19 +02:00
antirez	0f64080dcb	DEBUG HTSTATS <dbid> added. The command reports information about the hash table internal state representing the specified database ID. This can be used in order to investigate rehashings, memory usage issues and for other debugging purposes.	2015-07-14 17:15:37 +02:00
antirez	9feee428f2	SPOP: reimplemented for speed and better distribution. The old version of SPOP with "count" argument used an API call of dict.c which was actually designed for a different goal, and was not capable of good distribution. We follow a different three-cases approach optimized for different ratiion between sets and requested number of elements. The implementation is simpler and allowed the removal of a large amount of code.	2015-02-11 10:52:28 +01:00
antirez	5792a217f8	dict.c: add dictGetSomeKeys(), specialized for eviction.	2015-02-11 10:52:27 +01:00
antirez	064d5c96ac	Use long for rehash and iterator index in dict.h. This allows to support datasets with more than 2 billion of keys (possible in very large memory instances, this bug was actually reported). Closes issue #1814.	2014-08-26 10:18:56 +02:00
xiaoyu	d786fb6e94	Clarify argument to dict macro d is more clear because the type of argument is dict not dictht Closes #513	2014-08-18 10:59:01 +02:00

1 2

69 Commits