Refinery contains scripts, artifacts, and configuration for WMF's analytics cluster.
-
Install git-fat from https://github.com/jedbrown/git-fat on your system.
-
Make sure the docopt and dateutil Python packages are available on your system. On Ubuntu systems, you can achieve this by running
sudo apt-get install python-docopt
sudo apt-get install python-dateutil
-
Clone the repository.
You can find the commands to clone the repository on WMF's Gerrit.
To clone anonymously, just run
git clone https://gerrit.wikimedia.org/r/analytics/refinery
-
Change to the cloned repository by running
cd refinery
-
Initialize git-fat by running
git fat init
-
Pull existing artifacts into the repository by running
git fat pull
(Depending on your internet connection, this step may take some time.)
-
Add the refinery/python directory to your PYTHONPATH. To add it only in the running shell, you can use
export PYTHONPATH=/path/to/analytics/refinery/python
Please refer to your operating system's documentation on how to do this globally.
-
Done.
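
The plain export shown in the PYTHONPATH step replaces any value the variable already holds. If you would rather keep existing entries, something like the following sketch may help. The function name add_to_pythonpath is a hypothetical helper, not part of refinery, and the path is the same placeholder used above.

```shell
# Hypothetical helper (not part of refinery): prepend a directory to
# PYTHONPATH unless it is already listed, keeping any existing entries.
add_to_pythonpath() {
    _dir=$1
    case ":${PYTHONPATH:-}:" in
        *":$_dir:"*)
            # Directory is already on the path; leave PYTHONPATH unchanged.
            ;;
        *)
            # Prepend the directory; add a ':' separator only if PYTHONPATH is non-empty.
            PYTHONPATH="$_dir${PYTHONPATH:+:$PYTHONPATH}"
            ;;
    esac
    export PYTHONPATH
}

# Placeholder path from the instructions; replace it with your checkout location.
add_to_pythonpath /path/to/analytics/refinery/python
```

Because the function checks for an existing entry first, sourcing it repeatedly (for example from a shell startup file) should not grow PYTHONPATH with duplicates.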