Project management in Computational Biology

In a recent PLoS Computational Biology article William Stafford Noble highlights few basic principles and strategies for carrying out computational biology experiments.

The core guiding principle is simple: Someone unfamiliar with your project should be able to look at your computer files and understand in detail what you did and why. This “someone” could be any of a variety of people: someone who read your published article and wants to try to reproduce your work, a collaborator who wants to understand the details of your experiments, a future student working in your lab who wants to extend your work after you have moved on to a new job, your research advisor, who may be interested in understanding your work or who may be evaluating your research skills. Most commonly, however, that “someone” is you. A few months from now, you may not remember what you were up to when you created a particular set of files, or you may not remember what conclusions you drew. You will either have to then spend time reconstructing your previous experiments or lose whatever insights you gained from those experiments.

This leads to the second principle, which is actually more like a version of Murphy’s Law: Everything you do, you will probably have to do over again. Inevitably, you will discover some flaw in your initial preparation of the data being analyzed, or you will get access to new data, or you will decide that your parameterization of a particular model was not broad enough. This means that the experiment you did last week, or even the set of experiments you’ve been working on over the past month, will probably need to be redone. If you have organized and documented your work clearly, then repeating the experiment with the new data or the new parameterization will be much, much easier.

We all know that project management is currently a gray area in bioinformatics and computational biology. This article describes few very basic and practical strategies based on above two principles for better organization of the computational biology projects. Starting from file and directory organization, and then follow up discussions on scripting and version control, author have outlined few rules of thumb. In summary, a highly recommended paper for the the computational biology peers.


Reference:

Noble, W. (2009). A Quick Guide to Organizing Computational Biology Projects PLoS Computational Biology, 5 (7) DOI: 10.1371/journal.pcbi.1000424

Share and Enjoy:
  • HackerNews
  • Twitter
  • Facebook
  • Google Buzz
  • LinkedIn
  • Posterous
  • Tumblr
  • Digg
  • Reddit
  • del.icio.us
  • DZone
  • FriendFeed
  • Suggest to Techmeme via Twitter
  • Print
  • RSS
  • Slashdot

2 Responses to “Project management in Computational Biology”
  1. 08.30.2009

    Project management in Computational Biology- by Fisheye Perspective http://bit.ly/11n7Q6

  2. 08.30.2009

    Project management in Computational Biology: In a recent PLoS Computational Biology article William Stafford Nob.. http://bit.ly/8kSLx