Opened 4 years ago

Closed 4 years ago

#1229 closed enhancement (fixed)

Import some variants as annotations

Reported by: Nicklas Nordborg Owned by: Nicklas Nordborg
Priority: major Milestone: Reggie v4.26
Component: net.sf.basedb.reggie Keywords:
Cc:

Description (last modified by Nicklas Nordborg)

If we find one or more variants in a sample that belongs to a predefined list of "important" variants this information should be imported after the variant calling as an annotation in BASE.

The predefined list of variants should probably be a VCF file with specified chr:pos:ref>alt and some extra information that we can use. We should at least have the GENE name and match that to an annotation type in BASE. For example, if we have a variant:

#CHROM	POS		ID	REF	ALT	QUAL	FILTER	INFO	FORMAT
chr3	179234297	.	A	G	.	.	GENE=PIK3CA;AA=H1047R

we should import chr3:179234297:A>G to the PIK3CA_Variants annotation (and possibly also the AA tag).

NOTE! The idea is to run this scan/import at the end of the regular variant calling pipeline. To import the same information for existing variant call items, a special script that creates a tab-separated text file (or similar) is needed. The text file can then be imported with the regular annotation importer plug-in.

The list of interesting genes:

  • PIK3CA
  • AKT1
  • SF3B1
  • GATA3
  • ERBB2
  • TP53
  • FOXA1
  • CDH1
  • MAP3K1
  • PTEN
  • KMT2C
  • TBX3
  • CBFB
  • ARID1A
  • RNF213
  • PLEC
  • RUNX1
  • TRPS1
  • ARHGAP35
  • NFIC

The ESR1 gene is also interesting but here we should only import variants that are known drug resistance mutations.

  • chr6:152011697:G>C (E380Q)
  • chr6:152011733:G>A (V392I)
  • chr6:152061020:ATGG>A (V422del)
  • chr6:152094402:T>C (S463P)
  • chr6:152098775:G>A (V533M)
  • chr6:152098779:T>A (V534E)
  • chr6:152098785:TC>AG (L536Q)
  • chr6:152098785:T>C (L536P)
  • chr6:152098785:T>G (L536R)
  • chr6:152098787:T>G (Y537D)
  • chr6:152098787:T>A (Y537N)
  • chr6:152098788:A>C (Y537S)
  • chr6:152098791:A>G (D538G)

The original list had some more variants, but they turned out to have unknown genomic positions.

Change History (10)

comment:1 by Nicklas Nordborg, 4 years ago

Description: modified (diff)

comment:2 by Nicklas Nordborg, 4 years ago

Description: modified (diff)

comment:3 by Nicklas Nordborg, 4 years ago

Description: modified (diff)

comment:4 by Nicklas Nordborg, 4 years ago

Status: newaccepted

comment:5 by Nicklas Nordborg, 4 years ago

In 5872:

References #1229: Import some variants as annotations

All variants that are found in the specified list of genes should now be imported as annotations. The import is made as part of the completion handler for variant calling jobs.

comment:6 by Nicklas Nordborg, 4 years ago

In 5876:

References #1229: Import some variants as annotations

Added a temporary item list that is filled with all existing variant call raw bioassays. All items in this list need to check the variants-filtered.vcf file if there are any variants that should be imported as annotations.

It should be relatively simple to implement a wizard for this with the existing importer.

comment:7 by Nicklas Nordborg, 4 years ago

In 5877:

References #1229: Import some variants as annotations

Added a wizard that parses all filtered VCF files and import variants as annotations.

comment:8 by Nicklas Nordborg, 4 years ago

In 5878:

References #1229: Import some variants as annotations

Use the AutoProcessing=Disable annotation to disabled the import wizard once the list has been processed.

comment:9 by Nicklas Nordborg, 4 years ago

In 5879:

References #1229: Import some variants as annotations

Only commit for every 500 raw bioassays.

comment:10 by Nicklas Nordborg, 4 years ago

Resolution: fixed
Status: acceptedclosed
Note: See TracTickets for help on using tickets.