Opened 7 years ago
Closed 7 years ago
#1008 closed enhancement (fixed)
The release exporter should be able to export expression data from the cufflinks file
Reported by: | Nicklas Nordborg | Owned by: | Nicklas Nordborg |
---|---|---|---|
Priority: | major | Milestone: | Reggie v4.14 |
Component: | net.sf.basedb.reggie | Keywords: | |
Cc: |
Description
Since #984, data from the cufflinks file are no longer imported to the database. Most things that need the data have already been fixed (for example #985) to read it from the file instead. The exception is the release exporter plug-in, which still only supports exporting data that is in the database.
It should not be that difficult to parse and extract information from the cufflinks file. We are already doing it in #985. We probably need some modification to pass on the data to the other files. We also need to make sure that data is sorted in the same order (it may be dangerous to assume that every cufflinks file is sorted in the same order). We can probably re-use the existing code for this since we already do similar things for data from the database.
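As a rough illustration of the ordering concern, the following Python sketch reads per-sample files and builds matrix rows in one shared, sorted order, so no file's own line order is trusted. It is not the Reggie implementation (which is Java). The tab-separated layout, the `tracking_id` and `FPKM` column names, and the helper names `read_fpkm` and `build_matrix` are assumptions made for this example.

```python
import csv
import io


def read_fpkm(handle, id_column="tracking_id", value_column="FPKM"):
    """Read one tab-separated file into a dict of id -> value.

    The column names are assumptions for this sketch; adjust them to
    match the header of the actual file.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return {row[id_column]: row[value_column] for row in reader}


def build_matrix(samples):
    """Build rows with samples on rows and ids on columns.

    `samples` maps a sample name to the dict returned by read_fpkm().
    The ids are sorted once and that order is used for every sample,
    so the order of lines inside each file does not matter.
    """
    ids = sorted(set().union(*(values.keys() for values in samples.values())))
    rows = [
        [name] + [values.get(i, "") for i in ids]
        for name, values in samples.items()
    ]
    return ids, rows


# Two hypothetical files that list the same ids in different orders.
file_a = io.StringIO("tracking_id\tFPKM\nT2\t5.0\nT1\t1.5\n")
file_b = io.StringIO("tracking_id\tFPKM\nT1\t2.5\nT2\t0.0\n")

ids, rows = build_matrix({"A": read_fpkm(file_a), "B": read_fpkm(file_b)})
```

Looking values up by id rather than by line position means a missing id yields an empty cell instead of shifting the remaining columns.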
A "side-effect" is that the matrix files will get the transcripts on columns and samples on rows instead of the (traditional) other way around.
If required, I think R or some other tool may be able to transpose the matrix files.
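For reference, transposing such a file does not need much code. This is a minimal Python sketch, assuming a tab-separated, rectangular matrix that fits in memory. The function name `transpose_matrix` and the sample data are made up for the example.

```python
import csv
import io


def transpose_matrix(source, target, delimiter="\t"):
    """Write the transpose of a delimited matrix from source to target.

    Assumes every row has the same number of cells; zip() would
    silently truncate to the shortest row otherwise.
    """
    rows = list(csv.reader(source, delimiter=delimiter))
    writer = csv.writer(target, delimiter=delimiter, lineterminator="\n")
    writer.writerows(zip(*rows))


# Hypothetical matrix with samples on rows and transcripts on columns.
source = io.StringIO("sample\tT1\tT2\nA\t1.5\t5.0\nB\t2.5\t0.0\n")
target = io.StringIO()
transpose_matrix(source, target)
transposed = target.getvalue()
```

For very large matrices the whole file is held in memory at once, so a tool that streams or works by column may be a better fit.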
Change History (5)
comment:1 by , 7 years ago
Milestone: | Reggie v4.x → Print package v1.4 |
---|
comment:2 by , 7 years ago
Milestone: | Print package v1.4 → Reggie v4.14 |
---|
comment:3 by , 7 years ago
comment:4 by , 7 years ago
Status: | new → assigned |
---|
(In [4641]) References #1008: The release exporter should be able to export expression data from the cufflinks file
Changed the exporter so that it uses the cufflinks data files instead. The data files should be equivalent to those produced before. Sort order of transcripts/genes should now be based on transcript_id/gene symbol instead of the internal id.
Performance-wise it seems to be ok: about 40 raw bioassays per minute, which is a bit slower than before but still good enough.