
The Profile (.PRO) format describes the simulated characteristics of each transcript from the reference annotation, one transcript per line. After each step of a simulation run, tab-separated columns are added to the file.

Nr  Name                            Value                 Description
1   Locus (LOCUS_ID)                chrom:start-end[W|C]  identifier of the transcriptional locus, given by the chromosome (chrom), the start and end positions, and the strand (Watson or Crick)
2   Transcript_ID (TRANSCRIPT)      String                transcript identifier from the reference annotation
3   Coding (CODING)                 [CDS|NC]              specifies whether the transcript has an annotated coding sequence (CDS) or not (NC)
4   Length (LENGTH)                 Integer               the mature length of the transcript after splicing out introns, disregarding the poly-A tail, as annotated in the reference annotation
5   Expressed Fraction (RFREQ_EXP)  Float                 fraction of RNA molecules that represent transcripts qualitatively equal to this RNA form
6   Expressed Number (AFREQ_EXP)    Integer               absolute number of expressed RNA molecules
7   Library Fraction (RFREQ_LIB)    Float                 fraction of cDNA molecules in the final library that have been produced from this transcript
8   Library Number (AFREQ_LIB)      Integer               absolute number of cDNA fragments generated from this transcript
9   Sequenced Fraction (RFREQ_SEQ)  Float                 fraction of total reads that have been sequenced from this transcript
10  Sequenced Number (AFREQ_SEQ)    Integer               absolute number of reads sequenced from this transcript
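
The column layout above can be read with a few lines of Python. The following is a minimal sketch, not part of the simulator itself: the names ProfileRecord and parse_profile_line are hypothetical, and it assumes that columns 5 to 10 may be absent when the corresponding simulation step has not yet run, in which case the fields are left as None.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional

# Locus layout taken from column 1 of the table: chrom:start-end, then W or C.
LOCUS_RE = re.compile(r"^(?P<chrom>.+):(?P<start>\d+)-(?P<end>\d+)(?P<strand>[WC])$")


@dataclass
class ProfileRecord:
    """One line of a .PRO file (hypothetical container, for illustration)."""
    chrom: str
    start: int
    end: int
    strand: str          # "W" (Watson) or "C" (Crick)
    transcript_id: str
    coding: bool         # True for CDS, False for NC
    length: int
    # Assumption: columns 5-10 are only present after the matching step has run.
    expressed_fraction: Optional[float] = None
    expressed_number: Optional[int] = None
    library_fraction: Optional[float] = None
    library_number: Optional[int] = None
    sequenced_fraction: Optional[float] = None
    sequenced_number: Optional[int] = None


def _optional(fields: List[str], index: int, convert: Callable):
    """Convert fields[index] if the column exists and is non-empty, else None."""
    if index < len(fields) and fields[index] != "":
        return convert(fields[index])
    return None


def parse_profile_line(line: str) -> ProfileRecord:
    """Parse one tab-separated .PRO line into a ProfileRecord."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) < 4:
        raise ValueError(f"expected at least 4 columns, got {len(fields)}")

    locus = LOCUS_RE.match(fields[0])
    if locus is None:
        raise ValueError(f"malformed locus: {fields[0]!r}")
    if fields[2] not in ("CDS", "NC"):
        raise ValueError(f"unexpected coding flag: {fields[2]!r}")

    return ProfileRecord(
        chrom=locus.group("chrom"),
        start=int(locus.group("start")),
        end=int(locus.group("end")),
        strand=locus.group("strand"),
        transcript_id=fields[1],
        coding=(fields[2] == "CDS"),
        length=int(fields[3]),
        expressed_fraction=_optional(fields, 4, float),
        expressed_number=_optional(fields, 5, int),
        library_fraction=_optional(fields, 6, float),
        library_number=_optional(fields, 7, int),
        sequenced_fraction=_optional(fields, 8, float),
        sequenced_number=_optional(fields, 9, int),
    )


# Usage with a made-up line that has only the expression columns filled in.
record = parse_profile_line("chr1:1000-5000W\tTX1\tCDS\t1200\t0.25\t250")
print(record.transcript_id, record.expressed_number, record.library_number)
```

Keeping the later columns optional lets the same function read a profile at any stage of a run, at the cost of callers having to check for None before using the library and sequencing values.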

...