Changes between Version 34 and Version 37 of Ticket #887
- Timestamp:
- May 19, 2016, 8:00:48 AM (9 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Ticket #887 – Description
v34 v37 4 4 - `tidmatrix.features.txt`: Array design features with some annotations. The first line is a header line: 5 5 * `id`, `geneSymbol`, `refSeq`, `protAcc`, `description`, `chr`, `entrez`. 6 * Rows are sorted by external ID (or internal ID??see [comment:33 comment below] for more information)6 * Rows are sorted by internal ID (see [comment:33 comment below] for more information) 7 7 * All raw bioassays in the input list must use the same array design. 8 8 - `tidmatrix_data.txt`: FPKM values for all raw bioassays. Each row represents a feature and each column a raw bioassay. … … 15 15 - `genematrix_data.txt`: Sum of FPKM values per gene symbol. 16 16 * The first line is a header line with raw bioassay names. 17 * The first column is the gene symbol (in ~~no particular~~ '''alphabetical'''order).17 * The first column is the gene symbol (in no particular order). 18 18 - `is.NM.gene.txt`: TRUE/FALSE flag for each gene indicating if the refSeq ID starts with `NM_` or not. 19 19 * No header line. 20 * Rows must be in the same order as in `genematrix_data.txt`. 20 21 * First column is the line number (in this file, add +1 for getting the line number in `genematrix_data.txt`). 21 22 * Second column is `TRUE` or `FALSE`. 23 * Third column is the gene symbol. 22 24 23 25 * Cohort data (in folder `cohortTables`): A set of tab-separated files with data for each raw bioassay and the parent items it is derived from. Each file starts with a header line. Each row contains data for one raw bioassay. The first column (`rba`) is always the name of the raw bioassay. Columns ending with `.A.` are annotation columns. Date values are formatted as `YYYY-MM-DD` unless otherwise noted.