Jump to content

Nova Resource:Wiktionary-tools/Documentation

From Wikitech

Wiktionary-tools/Documentation

Description
The purpose of this project is to develop and maintain tools for the various Wiktionary languages.

Wiktionary

Purpose of the project

There are a lot of tools that were developed by Wiktionarians individually, a lot of them on the toolserver. These tools should soon migrate to the Labs, and this project was created for that purpose.

Any tool that can be used for Wiktionary projects is welcome here.

Joining

To join this project, you need a Labs account. Then contact one of the current members of the project (right side) to be added.

Code repositories

Git can be used for version control of your source code (in place of svn on the toolserver). Here you can request a repository on gerrit (with a web interface).

To create a git repository on gerrit:

  1. Login with the same account as the Labs on gerrit
    • You should import your public ssh key if you want to be able to push
  2. Ask to create a repository here: Requests
    • If your project is named foo, you should ask for the name wiktionary/foo
  3. Once the repository is created, go visit gerrit and find your project.
    • In your project page, you'll find a "clone" line to copy and paste in a terminal to checkout the repository

To push:

  1. You need to have the rights to push (automatic if you requested the repository creation)
  2. You need to have your public ssh key uploaded, as well as git-review (on Ubuntu: sudo apt-get install get-review)
  3. Set your username and email address the same as in gerrit:
    • git config --global user.name MyUserName
    • git config --global user.email MyMailAddress
  4. after your commits (git commit -m comment), add your changes to review: git review
  5. Your changes should be visible in gerrit and ready for review

Tools

Anagrimes

Perl scripts to extract data from Wiktionary dumps.

Data

Mediawiki dumps are available at /public/datasets/public/:

  • English Wiktionary: /public/datasets/public/enwiktionary
  • French Wiktionary: /public/datasets/public/frwiktionary