Spark learning materials compiled by members of the Office for National Statistics (ONS). The materials in the book are intended as general resources on using Spark and should also be applicable to users outside of ONS. Please note that for internal ONS users, the guidance in this book is not intended to be specific to any particular platform where Spark can be run. As a result, you may find it useful to consult internal sources of guidance (such as DAPCATS training materials) if you need support with your platform specific setup.
Contributions are welcome, and they are greatly appreciated! Every little bit helps, and credit will always be given. You can contribute in the ways listed below.
We have recently made use of GitHub issue forms which we hope will make suggesting content even easier!
- If you want to suggest a new page, please use this link for the issue form.
- For bugs, please use this link for the issue form.
- To create any other type of issue, use the blank issue form and label your issue with any of the following tags:
label | description |
---|---|
Bug | Used for general bug reports |
Documentation | Improvements or additions to documentation |
Duplicate | This issue or pull request already exists |
Help wanted | External help or guidance (outside of DAPCATS) is needed before this can be implemented |
Question | Further information is needed or something needs clarification |
New page | Suggestions for new topics and pages |
Addition to existing page | Suggestions to add information to existing pages e.g. Python/R equivalent code, new subsection etc. |
Enhancement | Suggestion for change not relating to content eg. improving the usability or appearance of the book |
Other | An issue that is not covered by any of the other labels |
If you wish to take an active role in developing this book please refer to the contributing guidance.
Please see the Jupyter Book documentation to discover options for deploying a book online using services such as GitHub, GitLab, or Netlify.
For GitHub and GitLab deployment specifically, the cookiecutter-jupyter-book includes templates for, and information about, optional continuous integration (CI) workflow files to help easily and automatically deploy books online with GitHub or GitLab. For example, if you chose github
for the include_ci
cookiecutter option, your book template was created with a GitHub actions workflow file that, once pushed to GitHub, automatically renders and pushes your book to the gh-pages
branch of your repo and hosts it on GitHub Pages when a push or pull request is made to the main branch.
We welcome and recognize all contributions. You can see a list of current contributors in the contributors tab.
This project is created using the excellent open source Jupyter Book project and the executablebooks/cookiecutter-jupyter-book template.