Guidelines for contribution
This document described guidelines for contributions for projects led by members of the Poisot lab of computational ecology, Université de Montréal. It should be followed by lab members (at all times), as well as people with suggested added features, or willing to contribute enhancement or bug fixes.
Edits to this document must be suggested as pull requests on the github project
located at https://github.com/PoisotLab/PLCG
, and not made on the document
itself.
It is suggested to include this file to your projects via a Makefile
rule:
CONTRIBUTING.md:
wget -O $@ https://raw.githubusercontent.com/PoisotLab/PLCG/master/README.md
General code of behavior
We follow the Contributor Covenant and The No Asshole Rule. An amazing technical or scientific contribution is no excuse for creating a toxic environment.
Issues
Reporting issues is the simplest and most efficient way to contribute. A useful issue includes:
- The version of all relevant software
- Your operating system
- A minimal reproducible example (the shortest code we can copy/paste to reproduce the issue)
Authorship
Any contribution to the code of a software grants authorship on the next paper
describing this software, and immediate authorship to the next release (which
will receive a DOI). Any contribution to a paper hosted as a public
version-controlled repository grants authorship of the paper. For this reason,
it is important to correctly identify yourself. For R
packages, this is done
through the DESCRIPTION
file. Whenever a CONTRIBUTORS
file is found, add
your name to it as part of the workflow described next. For authorship of
papers, the first and last authors will decide on the position.s
Workflow
This section describes the general steps to make sure that your contribution is integrated rapidly. The general workflow is as follows:
- Fork the repository (see Branches, etc. below)
- Create an explicitely named branch
- Create a pull request even if you haven't pushed code yet
- Be as explicit as possible on your goals
Pull requests
Creating a pull request before you push any code will signal that you are interested in contributing to the project. Once this is done, push often, and be explicit about what the commits do. This gives the opportunity for feedback during your work, and allow for tweaks in what you are doing.
A good pull request (in addition to satisfying to all of the criteria below) is:
- Single purpose - it should do one thing, and one thing only
- Short - it should ideally involve less than 250 lines of code
- Limited in scope - it should ideally not span more than a handful of files
This will ensure that your contribution is rapidly reviewed and evaluated.
Branches, etc.
The master
branch is always the currently stable, build passing, adequately
covered version. Ideally, no one should ever push directly into master. The
reference branch for development is dev
.
For the development of new features, it is generally recommended to fork the
master
branch, and submit your pull request to the dev
branch. For
"hotfixes" (changes that address issues), forking either branch is fine, and
we'll take care of pulling the changes into dev
and/or master
as needed.
Of emojis
Atom's guideline suggest the use of emojis to easily identify what is the purpose of each commit. This is a good idea and should be followed, and it also saves a few characters from the commit first line. Specifically, prepend your commits as follow
What the commit is about | emoji | example |
---|---|---|
Add test | :white_check_mark: | :white_check_mark: wget JSON resource |
Fix bug | :bug: | :bug: mean fails if NA |
Documentation | :page_facing_up: | :page_facing_up: null models wrapper |
New data | :bar_chart: | :bar_chart: example pollination network |
Improve performance | :clock2: | :clock2: parallelizes null model by default |
New feature | :metal: | :metal: (anything amazing) |
Upcoming release | :package: | :package: v1.0.2 |
Tests and coverage
Most of our repositories undergo continuous integration, and code coverage analysis.
It is expected that:
- Your commits will not break a passing repo
- Your commits can make a breaking repo pass (thank you so much!)
- Your additions or changes will be adequately tested
This information will be given in the thread of your pull request. Most of our repositories are set-up in a way that new commits that decrease coverage by more than 10% won't be pulled.
Good tests make sure that:
- The code works as it should
- The code breaks as it should (see Program defensively below)
- Functions play nicely together
Before merging the content of any pull request, the following tests are done:
- The branch to be merged from passes the build
- The future branch (i.e. with the changeset) passes the build
- The code coverage does not decrease by more than (usually) 10%
Coding practices
Use functions
Don't repeat yourself. If you have to do something more than once, write a function for it. Functions should be as short as possible, and should be single purpose. Think of every piece of code you write as an element of a library.
Program defensively
Check user input, check data structure, and fail often but explicitly.
Bad:
function add(x, y)
return x + y
end
Acceptable:
function add(x, y)
@assert typeof(x) == Int64
@assert typeof(y) == Int64
return x + y
end
Good:
function add(x, y)
if typeof(x) != Int64
throw(TypeError("The first argument should be an Int64", "add", Int64, typeof(x)))
end
if typeof(y) != Int64
throw(TypeError("The second argument should be an Int64", "add", Int64, typeof(y)))
end
return x + y
end
Use type-static functions
Any given function must always return an object of the same type.
A function like y = (x) -> x>3?true:x
(returns true
if x
is larger than 3,
else returns x
) is not acceptable, as it is hard to predict, hard to debug,
and hard to use.
Comment
Comment your code -- comments should not explain what the code does (this is for the documentation), but how it does it.
Variables names
Be as explicit as possible. Have a look at the rest of the codebase before you
start. Using i
, j
, k
for loops is, as usual, fine.
Docstrings and documentation
Write some. R
packages, must be compiled with roxygen
, python
code must
have docstrings for all documentable objects, and Julia
functions must be
documented using Docile.jl
and Lexicon.jl
.##
There are three levels of documentation: the API documentation (which will be generated from the docstrings), the documentation for fellow developers (which resides ideally entirely in the comments), and documentation for the end users. Your pull request must include relevant changes and additions to the documentation.
Versioning
We use semantic versioning (major
.minor
.patch
). Anything that adds
no new feature should increase the patch
number, new non-API-breaking changes
should increase minor
, and major changes should increase major
. Any increase
of a higher level resets the number after it (e.g, 0.3.1
becomes 1.0.0
for
the first major release). It is highly recommended that you do not start working
on API-breaking changes without having received a go from us first.