I reproduce here below principles from the amazing paper Best Practices for Scientific Computing, published on 2012 by a group of US and UK professors. The main purpose of the paper is to “teach” good programming habits shared from professional developers to people that weren’t born developer, and became developers just for professional purposes.
Scientists spend an increasing amount of time building and using software. However, most scientists are never taught how to do this efficiently
Best Practices for Scientific Computing
-
Write programs for people, not computers.
- a program should not require its readers to hold more than a handful of facts in memory at once
- names should be consistent, distinctive and meaningful
- code style and formatting should be consistent
- all aspects of software development should be broken down into tasks roughly an hour long
-
Automate repetitive tasks.
- rely on the computer to repeat tasks
- save recent commands in a file for re-use
- use a build tool to automate scientific workflows
-
Use the computer to record history.
- software tools should be used to track computational work automatically
-
Make incremental changes.
- work in small steps with frequent feedback and course correction
-
Use version control.
- use a version control system
- everything that has been created manually should be put in version control
-
Don’t repeat yourself (or others).
- every piece of data must have a single authoritative representation in the system
- code should be modularized rather than copied and pasted
- re-use code instead of rewriting it
-
Plan for mistakes.
- add assertions to programs to check their operation
- use an off-the-shelf unit testing library
- use all available oracles when testing programs
- turn bugs into test cases
- use a symbolic debugger
-
Optimize software only after it works correctly.
- use a profiler to identify bottlenecks
- write code in the highest-level language possible
-
Document design and purpose, not mechanics.
- document interfaces and reasons, not implementations
- refactor code instead of explaining how it works
- embed the documentation for a piece of software in that software
-
Collaborate.
- use pre-merge code reviews
- use pair programming when bringing someone new up to speed and when tackling particularly tricky problems
if you want to discover more, you can download your copy of Best Practice Scientific Computing here below
Thanks for the summary. Since I read it a year ago & I wanted to blog a post like this. Thumbs up 🙂
Reblogged this on Ibrahim El Merehbi and commented:
A summary of a very interesting paper on “Best Practices for Scientific Computing” I read a year ago.
Just use iPython 🙂
thanks for commenting, and thumbs up for iPython! 🙂