This is a home for various data science projects, where I try to analyse some interesting collaborate datasets, build pretty and informative graphics. Example: Document your workflow In this example, the coder doesn’t need to present every line of code, but rather needs to present the overall process of loading, crunching, and reporting the data, so another scientist can understand the whole process, and if necessary, replicate it. References, links, and provenance of data files are more important here, so the reader can understand where the data sets are coming from.
I picked up the R programming language during my MSc at University of California San Diego, and use it constantly in my day-job, along with some Python. For fun I sometimes apply these tools to interesting-looking datasets that are lying around the web, and try to tell their stories through well-designed data visualisations. Some blog posts are mirrored on R-bloggers, a blogging community for the R language. Useful bash one-liners useful for bioinformatics (and some, more generally useful).