Changes between Version 10 and Version 11 of Ticket #525, comment 6


Ignore:
Timestamp:
Feb 22, 2016, 8:43:23 AM (8 years ago)
Author:
olle

Legend:

Unmodified
Added
Removed
Modified
  • Ticket #525, comment 6

    v10 v11  
    2222The first version of INCA data import should only import data for patients, for which data had been requested in the csv file sent to INCA, i.e., those with personal numbers in the INCA export files.
    2323
    24 No database schema for the INCA data was available. Inspection of data in the two example export files gave the following result:
     24A description of the variables in the INCA database from 2014-01-01 was available. This together with inspection of data in the two example export files gave the following result:
    2525
    26 || '''INCA example file''' || '''# Column headers''' || '''# Date columns''' || '''# String columns''' || '''# Integer columns''' || '''# Float columns''' || '''# Unknown type columns''' ||
     26|| '''INCA example file''' || '''# Column headers''' || '''# Date columns''' || '''# String columns''' || '''# Integer columns''' || '''# Float columns''' ||
    2727|| INCA_file_a || 145 || 5 || 59 || 81 || 0 || 0 ||
    28 || INCA_file_b || 146 || 23 || 37 || 77 || 4 || 5 ||
    29 || Both files (all columns) || 291 || 28 || 96 || 158 || 4 || 5 ||
    30 || Both files (unique columns) || 286 || 28 || 94 || 155 || 4 || 5 ||
     28|| INCA_file_b || 146 || 23 || 37 || 82 || 4 ||
     29|| Both files (all columns) || 291 || 28 || 96 || 163 || 4 ||
     30|| Both files (unique columns) || 286 || 28 || 94 || 160 || 4 ||
    3131
    3232Types of the columns represented in both example files:
     
    3939|| KON_VALUE || Integer ||
    4040
    41 Of the 5 columns with unknown type in the seconds example type (all data lines had blank cells for these columns), 3 could be identified as integer columns with high probability, based on other columns with the same header name suffix being of integer type. The remaining column could also be identified as integer with reasonable probability, based on other columns with the same header name prefix being of integer type. In conclusion, probably all the 5 columns with unknown type are integer columns.
     41Inspection of the example files indicated that Boolean variables related to check boxes (checked = true), were coded as 1 for checked = true, and `null` (blank) for unchecked = false.