Opened 17 years ago
Last modified 17 years ago
#91 closed task
Import Illumina scandata files as raw data — at Initial Version
Reported by: | Nicklas Nordborg | Owned by: | Johan Enell |
---|---|---|---|
Priority: | major | Milestone: | Illumina package v1.0 |
Component: | net.sf.basedb.illumina | Keywords: | |
Cc: |
Description
The scandata files are the rawest form of data generated by the Illumina scanner. Each sub-array can have one or two files (depending on the number of "stripes"). The scandata files are simple and have only four columns:
- Illumicode: The ID of the feature. Same as Array_Address_Id in the bgx files, but without 0-padding.
- N: The number of beads
- Mean GRN: Mean intensity
- Dev GRN: Standard deviation of the intensities
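As a rough illustration of the column layout above, the following Python sketch reads one scandata file into a map keyed by feature ID. The tab delimiter and the presence of a header row are assumptions, since the ticket does not specify them, and the function names `parse_scandata` and `normalize_address_id` are hypothetical. Converting the ID to an integer is one simple way to make the unpadded Illumicode comparable with a 0-padded Array_Address_Id from a bgx file.

```python
import csv
import io


def parse_scandata(text, delimiter="\t"):
    """Return {feature_id: (n_beads, mean, dev)} for one scandata file.

    Assumes a header row with the four column names from the ticket and
    a tab delimiter; both are assumptions, not confirmed by the ticket.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    features = {}
    for row in reader:
        feature_id = int(row["Illumicode"])
        features[feature_id] = (
            int(row["N"]),            # number of beads
            float(row["Mean GRN"]),   # mean intensity
            float(row["Dev GRN"]),    # standard deviation of the intensities
        )
    return features


def normalize_address_id(array_address_id):
    """Convert a 0-padded Array_Address_Id (bgx) to the Illumicode form."""
    return int(array_address_id)
```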
In each file there is only one entry for each feature, but the same feature may have an entry in both files. When importing, the values for a feature that appears in both files must be merged. The merged intensity is the mean of the two file values, weighted by the number of beads. The bead counts are simply added. We need to find a formula for how to merge the standard deviation values.
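One candidate for the standard deviation is the standard formula for combining two groups from their summary statistics: add the within-file sums of squared deviations, then add a between-file term that accounts for the difference between the two means. The Python sketch below shows the full merge under that approach. It assumes Dev GRN is a sample standard deviation (divisor N - 1); if the scanner reports a population standard deviation instead, pass `ddof=0`. The function name `merge_feature` and the `ddof` parameter are hypothetical and not part of the package.

```python
import math


def merge_feature(n1, mean1, dev1, n2, mean2, dev2, ddof=1):
    """Merge the per-file statistics for one feature.

    Returns (n, mean, dev). ddof=1 treats Dev GRN as a sample standard
    deviation (an assumption); ddof=0 treats it as a population value.
    """
    n = n1 + n2  # bead counts are simply added
    if n == 0:
        return 0, math.nan, math.nan

    # Mean intensity weighted by the number of beads.
    mean = (n1 * mean1 + n2 * mean2) / n

    # Within-file sums of squared deviations, recovered from each deviation.
    ss1 = max(n1 - ddof, 0) * dev1 ** 2
    ss2 = max(n2 - ddof, 0) * dev2 ** 2

    # Between-file term: spread caused by the two means being different.
    between = n1 * (mean1 - mean) ** 2 + n2 * (mean2 - mean) ** 2

    if n - ddof <= 0:
        return n, mean, math.nan  # deviation is undefined for too few beads
    dev = math.sqrt((ss1 + ss2 + between) / (n - ddof))
    return n, mean, dev
```

When the assumption about the divisor holds, the result is identical to computing the standard deviation over all beads from both files, so no information is lost by merging the summaries rather than the raw bead intensities.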