Help:Toolforge/Redis for Toolforge
Redis is a key-value store that can be used to implement publish/subscribe protocols between processes, maintain persistent queues, and serve as a caching service for data that is expensive to compute. Stored values can be different data structures, such as hash tables, lists, queues, etc. Stored data persists across service restarts.
Redis instances
A Redis instance that can be used by all tools is available via the service name redis.svc.tools.eqiad1.wikimedia.cloud
, on the standard port 6379
. It has been allocated a maximum of 12G of memory, which should be enough for most usage. You can set limits for how long your data stays in Redis; otherwise it will be evicted when memory limits are exceeded. See the Redis documentation for a list of available
commands.
Redis libraries
Libraries for interacting with Redis from PHP (phpredis
), Python (redis-py
), and Perl (perl-redis
) are installed on all the bastions and grid engine nodes. For an example of a bot using Redis, see gerrit-to-redis.
For quick & dirty debugging, you can connect directly to the Redis server with nc -C redis.svc.tools.eqiad1.wikimedia.cloud 6379
and execute commands (for example "INFO") or using redis-cli (also installed on the grid-engine nodes) redis-cli -h redis.svc.tools.eqiad1.wikimedia.cloud
.
Celery
The redis service can be used as a broker between the worker and the web frontend to run a celery worker in a kubernetes container, as continuous job, see the documentation for Kubernetes (for instance to execute long-running tasks triggered by a web frontend).
![]() | Make sure you use a unique queue name so that your tasks get sent to the right workers. |
If using Django, assuming you use following definition of the namespace in your celery.py file:
app.config_from_object('django.conf:settings', namespace='CELERY')
Then, you can adapt the settings.py as following:
REDIS_PASSWORD=""
REDIS_HOST="redis.svc.tools.eqiad1.wikimedia.cloud"
REDIS_PORT="6379"
REDIS_DB=0
REDIS_URL = ':%s@%s:%s/%d' % (
REDIS_PASSWORD,
REDIS_HOST,
REDIS_PORT,
REDIS_DB)
CELERY_BROKER_URL = 'redis://'+REDIS_URL
CELERY_DEFAULT_QUEUE = "abcdefghijklm" # Pick a random string here
To run the Celery worker, use something like this:
$ toolforge-jobs run --continuous --image python3.9 --command "/data/project/your-tool-name/www/python/venv/bin/celery -A yourDjangoApp worker" celery-worker
Of course, if your virtual environment is at a different position, adapt the path.
Security
Redis has no access control mechanism, so other users can accidentally/intentionally overwrite and access the keys you set. Even if you are not worried about security, it is highly probable that multiple tools will try to use the same key (such as lastupdated
, etc). To prevent
this, it is highly recommended that you prefix all your keys with an application-specific, lengthy, randomly generated secret key.
PLEASE PREFIX YOUR KEYS! We have also disabled the Redis commands that let users 'list' keys. This protection however should not be trusted to protect any secret data. Do not store plain text secrets or decryption keys in Redis for your own protection.
You can generate a prefix by running the following command:
openssl rand -base64 32
Disabled Redis commands
Some built-in Redis commands have been disabled in an attempt to make Redis safer for multi-tenant usage:
- CLIENT
- CONFIG
- DEBUG
- FLUSHALL
- FLUSHDB
- KEYS
- MONITOR
- RANDOMKEY
- SCAN
- SHUTDOWN
- SLAVEOF
See Also
Communication and support
Support and administration of the WMCS resources is provided by the Wikimedia Foundation Cloud Services team and Wikimedia Movement volunteers. Please reach out with questions and join the conversation:
- Chat in real time in the IRC channel #wikimedia-cloud connect, the bridged Telegram group, or the bridged Mattermost channel
- Discuss via email after you subscribed to the cloud@ mailing list