-
-
Notifications
You must be signed in to change notification settings - Fork 989
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
optimize redis cluster slotmap with compact slot range object #1493
optimize redis cluster slotmap with compact slot range object #1493
Conversation
…b.com/messikiller/predis into feature-v2.x-optimize-cluster-slotmap
Thanks, sorry for forgetting phpstan check, the problem has been fixed right now. @tillkruss |
Is there anything I can do to get this PR merged? @tillkruss |
@messikiller: No, looks good to me, but to double check this won't cause any breaking changes with an existing data set, right? |
Thanks for your approvement ! 😄 |
Is there anything else that can be improved in this modification? Could you please give some suggestions for improvement on this PR when you have time? @vladvildanov @tillkruss |
@messikiller LGTM! Great job |
Recently, I discovered a noteworthy code snippet located in the
Predis\Cluster\SlotMap.php
file. Upon reviewing and debugging the code, I realized that this class handles slot mapping for Redis in its default cluster mode. This class stores slot mappings using a simple attribute,Predis\Cluster\SlotMap::slots
. It's worth mentioning that this array-type attribute uses the array keys as slots and the values to store the corresponding connection information for those slots. Since Redis clusters have 16384 slots by default, the maximum possible length of this array is 16384. When a large number of slot mappings are stored (normally, this involves synchronizing all slot data from the cluster), the array takes the following format:Evidently, this array is quite large and contains numerous duplicate values (connection information), which I believe consumes a significant amount of memory. To investigate, I wrote a test script to measure memory usage when storing a large number of slot mappings:
Upon running this script, I obtained the following results:
As evident from the results, memory usage increased by 260kb when storing a small number of slot mappings. With a larger number of mappings, the memory footprint would be even greater.
To address this, I experimented with a new approach using the
Predis\Cluster\SlotRange
object to store slot range information. Instead of maintaining a gigantic array with all slot mappings, this object only records the start and end nodes along with connection information for each slot range. This approach simplifies the process of merging and splitting multiplePredis\Cluster\SlotRange
objects during each call to thePredis\Cluster\SlotMap::setSlots()
method. For detailed code, please refer to the PR I submitted.Furthermore, I renamed the original
Predis\Cluster\SlotMap
toPredis\Cluster\SimpleSlotMap
and compared the memory usage of both solutions using a simple test script. The script is as follows:After running the comparison with 10,000
setSlots
method calls, here are the memory usage results:Additionally, I conducted unit tests for the new
SlotMap
class, reusing the existingSlotMapTest
class to ensure functional compatibility between the old and new solutions.That's all for now. I hope this PR will be accepted and merged into the main branch. I welcome any questions or suggestions and look forward to receiving feedback. Thanks.