Opened 10 years ago

Closed 10 years ago

#684 closed enhancement (fixed)

Run picard AddOrReplaceReadGroups after alignment

Reported by: Nicklas Nordborg Owned by: Nicklas Nordborg
Priority: major Milestone: Reggie v2.17
Component: net.sf.basedb.reggie Keywords:
Cc:

Description

Downstream analysis tools need this information. Typical command is:

./picard AddOrReplaceReadGroups
  INPUT=accepted_hits.bam
  OUTPUT=accepted_hits.bam.tmp
  RGLB=${library}
  RGPL=Illumina
  RGPU=NotApplicable
  RGSM=${sample}
  RGCN=SCAN-B
  CREATE_INDEX=true

where ${library} is the name of the 'Library' item in BASE and ${sample} is the name of the 'Specimen' item in BASE.

To simplify code we don't follow items all the way up to specimen (which would break for external rna). We simply use the same rule as when generating the file structure for FASTQ files (eg sample name = everything before the first dot not followed by a number).

With the CREATE_INDEX option we can also skip the call to samtool index.

The INPUT/OUTPUT file names may also have to be swapped since we run this after MarkDuplicates which has generated a temporary file already which should be used as input to the AddOrReplaceReadGroups step.

Change History (3)

comment:1 by Nicklas Nordborg, 10 years ago

(In [2882]) References #684: Run picard AddOrReplaceReadGroups after alignment

Merge configuration changes to trunk since this will not be automatic due to file name change.

comment:2 by Nicklas Nordborg, 10 years ago

(In [2881]) Fixes #684: Run picard AddOrReplaceReadGroups after alignment

comment:3 by Nicklas Nordborg, 10 years ago

Resolution: fixed
Status: newclosed
Note: See TracTickets for help on using tickets.