Opened 11 years ago

Last modified 10 years ago

#533 closed task

Add secondary analysis section to Reggie — at Version 4

Reported by: Nicklas Nordborg Owned by: Nicklas Nordborg
Priority: critical Milestone: Reggie v2.16
Component: net.sf.basedb.reggie Keywords:
Cc:

Description (last modified by Nicklas Nordborg)

This is the master ticket for adding secondary analysis registration functionality to Reggie. The secondary analysis is the steps done from sequencing down to expression values have been generated, including demux and alignment against a reference genome.

Note! Primary analysis is the base calling performed by the Illumina software during the sequencing.

The pipeline will be something like this. See the other tickets (to be created) for more information about each step:

  1. (#545) Register sequencing as ended. Part of the "Library preparation wizards" section and done by someone in the lab.
  1. (#546) Confirm sequencing as completed. First wizard in the "Secondary analysis wizards" section. Used to decide if the sequenced data is ok or not. If ok, continue with demuxing, otherwise flag pools for re-sequencing.
  1. Start demux and merge. This wizard is starts the demux and merge operations.
  1. Register demux and merge as ended. At the end we have one "MergedSequences" item for each "Library" from the flow cells that was sequenced. A count of the number of reads for each library must be recorded and is used to determine if the library needs to be re-sequenced or not. FASTQ files for each library are stored on the server.
  1. Start filtering and alignment. Bowtie and TopHat is used to first filter and then align against a pre-defined set of transcripts.
  1. Register filtering and alignment as ended. At the end we have one "AlignedSequences" item for each "Library" from the flow cells that was sequenced. BAM files for each library are stored on the server.
  1. Start feature extraction. Cufflinks is used to calculate expression values.
  1. Register feature extraction. At the end we have one "RawBioAssay" item for each "Library" from the flow cells that was sequenced. FPKM files are uploaded to BASE and imported into the database.

Change History (4)

comment:1 by Nicklas Nordborg, 11 years ago

Milestone: Reggie v2.xReggie v2.15

comment:2 by Nicklas Nordborg, 11 years ago

Description: modified (diff)
Summary: Add primary analysis section to ReggieAdd secondary analysis section to Reggie

comment:3 by Nicklas Nordborg, 11 years ago

Description: modified (diff)

comment:4 by Nicklas Nordborg, 11 years ago

Description: modified (diff)
Note: See TracTickets for help on using tickets.