High-Performance Computing at the NIH


FastQC on Helix

FastQC provides a simple way to run quality control checks on raw sequence data from high-throughput sequencing pipelines. It offers a modular set of analyses that give a quick impression of whether your data has any problems you should be aware of before doing further analysis.

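The main functions of FastQC are importing data from BAM, SAM, or FastQ files; giving a quick overview of which areas may have problems; producing summary graphs and tables for assessing your data; exporting results to an HTML-based report; and running offline, so that reports can be generated without the interactive application.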

FastQC is developed by Simon Andrews, Babraham Bioinformatics.

 

Make sure an X Windows server is running on your local machine when you connect to Helix; FastQC's interactive mode opens a graphical window.

Before running FastQC, the environment variables it needs must be set. The easiest way to do this is with the module commands, as in the example below.

[user@helix]$ module avail fastqc
-------------------- /usr/local/Modules/3.2.9/modulefiles --------------------------------------
fastqc/0.10.0 fastqc/0.10.1(default) fastqc/0.9

[user@helix]$ module load fastqc
[user@helix]$ module list
Currently Loaded Modulefiles:
1) fastqc/0.10.1

[user@helix]$ module unload fastqc
[user@helix]$ module load fastqc/0.9
[user@helix]$ module list
Currently Loaded Modulefiles:
1) fastqc/0.9

[user@helix]$ module show fastqc
-------------------------------------------------------------------
/usr/local/Modules/3.2.9/modulefiles/fastqc/0.10.1:

module-whatis Sets up fastqc 0.10.1
prepend-path PATH /usr/local/apps/fastqc/0.10.1
-------------------------------------------------------------------
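
The prepend-path line in the module show output suggests that loading the module mainly puts the FastQC install directory at the front of PATH. As a rough sketch of the same effect done by hand (assuming the module changes nothing else, which the output above implies but does not guarantee):

```shell
# Roughly what 'module load fastqc/0.10.1' appears to do, based on the
# 'module show' output: put the FastQC install directory first on PATH.
fastqc_dir="/usr/local/apps/fastqc/0.10.1"
PATH="$fastqc_dir:$PATH"
export PATH

# Check which fastqc (if any) the shell will now pick up.
command -v fastqc || echo "fastqc not found on PATH"
```

Using the module commands is still preferable, since module unload reverses the change cleanly.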

How To Use

$ module load fastqc
$ fastqc -o output_dir [-f fastq|bam|sam] [-c contaminant_file] seqfile1 .. seqfileN
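
As an illustration of the usage line, here is a minimal sketch of a non-interactive run. The directory name qc_reports and the file names sample_1.fastq and sample_2.fastq are hypothetical placeholders. FastQC expects the output directory to exist already, so the sketch creates it first. It runs FastQC only if the command is on PATH, that is, after module load fastqc.

```shell
# Hypothetical output directory; FastQC does not create it for you.
outdir="qc_reports"
mkdir -p "$outdir"

# Hypothetical input files, held as positional parameters so that
# names containing spaces are passed through intact.
set -- sample_1.fastq sample_2.fastq

if command -v fastqc >/dev/null 2>&1; then
    # Giving sequence files on the command line runs FastQC without the GUI.
    fastqc -o "$outdir" -f fastq "$@"
else
    # Module not loaded: report what would have been run.
    echo "fastqc not found; would run: fastqc -o $outdir -f fastq $*"
fi
```

The reports for each input file are written into the output directory.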

 

Documentation

http://www.bioinformatics.babraham.ac.uk/projects/fastqc/