Opened 4 years ago
Closed 4 years ago
#1253 closed defect (fixed)
Change the name in VCF file created by variant calling
Reported by: | Nicklas Nordborg | Owned by: | Nicklas Nordborg |
---|---|---|---|
Priority: | major | Milestone: | Reggie v4.27.2 |
Component: | net.sf.basedb.reggie | Keywords: | |
Cc: |
Description
The current variant calling pipeline passes the name of the parent Library item as the name (-N parameter to VarDict) to use in the VCF file.
This can be a problem for downstream tools, since that name is not unique among all VCF files: some Library items have been part of more than one pool.
Instead, we should use the name of the Alignment item.
Existing VCF files should be fixed manually (with the help of some clever grep/sed/awk/... commands). Basically, we can run
sed 's/S0123456.l.r.m.c.lib/S0123456.l.r.m.c.lib.g.k2.a/'
to replace all library names (S0123456.l.r.m.c.lib) with alignment names (S0123456.l.r.m.c.lib.g.k2.a). The variants-filtered.vcf file is not compressed, so this can be done in-place with the -i flag to sed. The raw and annotated files, however, are compressed with bgzip, so we need to decompress them before the replacement and re-compress them afterwards, and we probably need to store the result in a temporary file before overwriting the original file.
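The manual fix described above could be sketched as a small shell function. This is only an illustration, not the procedure that was actually used: the function name rename_vcf_sample is hypothetical, it assumes that compressed files can be recognised by a .gz extension, that bgzip is on the PATH, and that the names contain no characters that are special to sed other than dots. Unlike the one-liner above, it escapes the dots in the library name and replaces every occurrence on a line.

```shell
# Hypothetical helper: replace a library name with an alignment name in a VCF file.
# Usage: rename_vcf_sample FILE LIBRARY_NAME ALIGNMENT_NAME
rename_vcf_sample() {
    vcf=$1
    lib=$2
    aln=$3

    # Escape the dots so that "." in the library name matches only a literal dot.
    pattern=$(printf '%s' "$lib" | sed 's/\./\\./g')

    # Write to a temporary file next to the original so it is never half-written.
    tmp=$(mktemp "${vcf}.XXXXXX") || return 1

    case "$vcf" in
        *.gz)
            # Assumed to be bgzip-compressed: decompress, substitute, re-compress.
            bgzip -dc "$vcf" | sed "s/$pattern/$aln/g" | bgzip -c > "$tmp"
            ;;
        *)
            # Plain text, e.g. variants-filtered.vcf.
            sed "s/$pattern/$aln/g" "$vcf" > "$tmp"
            ;;
    esac || { rm -f "$tmp"; return 1; }

    # Overwrite the original only after the substitution has finished.
    mv "$tmp" "$vcf"
}
```

It would be called once per file, for example as rename_vcf_sample variants-filtered.vcf S0123456.l.r.m.c.lib S0123456.l.r.m.c.lib.g.k2.a. Since the alignment name starts with the library name, running it twice on the same file would append the suffix a second time, so it should be applied only to files that have not been fixed yet.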
Change History (3)
comment:1, 4 years ago
comment:2, 4 years ago
Description: | modified (diff) |
---|
comment:3, 4 years ago
Resolution: | → fixed |
---|---|
Status: | new → closed |
In 5964: