Help:Trove database user guide
This page explains how to add and manage databases within a Cloud VPS project. Cloud Services implements Trove, an OpenStack component, to provide database services for Cloud VPS projects. Toolforge tools may use Trove if they obtain approval through the Toolforge Quota Request process, but most tools should use the user database options in Toolforge.
Why add Trove to a project?
- Your Cloud VPS project needs persistent data storage and retrieval, but you don't want the burden of directly managing that database, a virtual machine and its install of MySQL/MariaDB/PostgreSQL software. Trove can handle much of this complexity for you.
- Your tool's data has become too large for ToolsDB. For example, in phab:T323502, the popularity and usage of a tool has grown to the point that it has 160 GB of saved data. This is so much data that it's causing issues for other ToolsDB users. For this tool, "graduating" from ToolsDB to a dedicated Trove instance is the next step in managing growth.
A Trove instance (or database instance) is a database server that may contain one or more databases. A Trove instance is not the same as a Cloud VPS instance, which is a virtual machine (VM). A Trove instance uses Cinder volumes for storage, and a Nova instance for compute. However, database instances and their storage are managed separately from Cloud VPS instances and their Cinder volumes.
A Cloud VPS project can have multiple Trove instances, as long as a project has sufficient Trove quota. Multiple Cloud VPS instances may access the same Trove instance. A Trove instance doesn't count against your project's "instance" quotas, and doesn't show up in your list of "instances", because it's a different kind of instance – a database instance, rather than a VM. Trove storage quota defaults to around 10GB, but you can request more.
Toolforge tools that are approved to use Trove have a Cloud VPS project that exists only to provide their database instance.
Before you start
To create and manage Trove instances and databases:
- Cloud VPS users: You must be a project admin for the Cloud VPS project to which you are adding a database.
- Toolforge users: To use Trove for a Toolforge tool, you must first get approval through the Toolforge Quota Request process.
- You should know the database type and version of the datastore(s) you intend to use. Trove primarily targets MySQL and MariaDB servers, but has some limited support for Postgres databases.
Launch a Trove instance
Before you can create a database, you must launch a Trove instance:
- In the Horizon main menu, navigate to the Databases section.
- In the upper right side of the screen, click the "Launch Instance" button.
- Specify values in the required fields to launch your instance.
- For Volume Size, choose an option that is slightly more than what you think you might need, so you can avoid resizing later.
- For Volume Type, select "standard".
- If your Datastore type is mariadb, select a Flavor with at least 2GB of RAM. (The amount of RAM is indicated by the number after "ram" in the flavor name.)
Create a database
A Trove instance can contain one or more databases. To add a database to an instance:
- In Horizon (mysql and Mariadb instances only): click on the instance name and go to the "Databases" tab.
- On the command line: enable root access and use a mysql, psql, or equivalent CLI.
Manage access and users
If you prefer a command line interface for database management (or need more fine-grained control than that offered by Horizon) you can enable root access to an instance.
Enable root access
To use the command line to manage your databases, enable root access in Horizon:
- In the "Actions" menu for an instance, select "Manage Root Access".
- Click the "Enable root" button for the instance.
- Use the resulting credentials to access a root database login.
Access databases from a VM
Horizon provides an example shell command and an example URI as shortcuts for database access. To view these examples, click an Instance name and go to the "Connection Information" section on the Overview tab.
Copy this command and paste it into your shell to get a mysql, psql, or equivalent prompt. Substitute the appropriate username and password. For a root shell, enable root access.
By default you can only access the DB from VMs within the same Cloud VPS project, if you need to connect directly from your desktop/laptop, use ssh tunnels to a VM with access.
You can add users to specific databases within a Trove instance, or provide access at the instance level. Database usernames are not the same as Cloud VPS account names.
To add users to an instance:
- Go to your list of instances and click the instance name.
- Navigate to the "Users" tab. Use the "+ Create User" button in the upper right corner of the screen to add users.
Modify database configuration
Trove databases are configured through OpenStack – not through Puppet or text files. In Horizon, the "Defaults" tab for the Trove instance displays its default configurations. These values are applied to your database at startup, and may be different from the database system's defaults. To change the configuration of your database so that it persists across restarts, follow the steps below to create a Configuration Group and then attach it to an instance.
Create a Configuration Group
- Under "Database" in the Horizon menu, select "Configurations".
- Select "+ Create Configuration Group" and give your Configuration Group a descriptive name.
- In the "datastore" field, select the database type and version you're using.
- After you create your Configuration Group, click its name to specify its parameters. The "Add Parameter" menu lists possible parameters for your database, and a text box for your custom values. Some parameters require a restart of the database when you apply them, while others apply immediately without disruption.
Attach Configuration Group
After you add parameters, attach the Configuration Group to an instance:
- Return to your list of instances.
- Select "Attach Configuration Group" from the dropdown menu for the instance, and choose the custom Configuration Group you want to add.
- Your configuration settings now apply persistently to all databases in that instance.
To view which instances a Configuration applies to, navigate to the Configurations section and click the name of a Configuration Group, then select the "Instances" tab.
Manage storage and compute
Resize a volume
Resizing a volume means changing the amount of disk space for a database. WMCS uses Cinder to provide attachable block storage for Trove instances and for Cloud VPS instances, but quota for those volumes is tracked and managed separately. As long as you're not exceeding your quota, you can resize your storage without needing to rebuild your database.
Resize an instance
Resizing an instance means adding cores or RAM to the database server that is running the database software. WMCS uses Nova to provision compute instances. As long as you're not exceeding your quota, you can resize your instance without needing to rebuild your database.
Request a quota increase
By default, any Cloud VPS project should be able to create one or two small Trove instances. If you need additional cores or gigabytes, open a quota request in Phabricator. Database instances don't count against a project's storage or virtualization quotas; Trove has separate quotas to govern database use.
Backup or snapshot your databases
To backup a database, run a database dump using mysqldump, and store the backups on a separate Cinder volume.
WMCS has plans to add Swift storage backend and Trove database backup and snapshot features, but that is not yet available (phab:T276961).
Database instance creation fails with cryptic error messages
A properly functioning Trove instance is quite complicated and relies on many different OpenStack features working in concert: storage, compute, DNS, networking, and Docker. This means that if OpenStack APIs are the least bit fragile, database instance creation may fail with cryptic error messages. Fragility in the orchestration layer does not necessarily mean that your data or database itself is fragile. Contact the WMCS team if you encounter ongoing errors in Trove functionality, but also please do not panic.
Flavor is not supported for datastore version (HTTP 400)
This error appears if you attempt to launch an instance but the Flavor you selected in the "Launch Instance" menu is insufficient for the type of Datastore you selected. For example, if your Datastore type is mariadb, you must select a Flavor with at least 2GB of RAM, because it requires that much memory to launch a mariadb instance.
Running database instances appear as 'down' in Horizon
Existing, running database instances may appear as 'down' in Horizon if any of OpenStack's communication channels are unreliable. The good news is that Trove and OpenStack have very little interaction with the database once it's up and running. Fragility in the OpenStack orchestration layer does not necessarily mean that your data or database itself is fragile. Contact the WMCS team if you encounter ongoing errors in Trove functionality, but also please do not panic.
Operation not supported for datastore postgresql. (HTTP 422)
This error appears if you click a tab or menu item for an instance that targets a Postgres datastore, but that action isn't supported for Postgres databases. Trove can create and manage Postgres databases, but many runtime configuration options are unsupported.
Internal server error (HTTP 500)
Error: Unable to retrieve database instances. May be resolved by reloading the page. If reload doesn't resolve the issue, use one of the Cloud communication or support channels to get help or file a bug.
Communication and support
Support and administration of the WMCS resources is provided by the Wikimedia Foundation Cloud Services team and Wikimedia movement volunteers. Please reach out with questions and join the conversation: