Expression | |||
NB_MOLECULES | 5,000,000 | Number of RNA molecules initially in the experiment | |
TSS_MEAN | 25 | Average deviation from the annotated transcription start site (TSS) | |
POLYA_SCALE | 300 | Scale of the Weibull distribution, shifts the average length of poly-A tail sizes | |
POLYA_SHAPE | 2 | Shape of the Weibull distribution describing poly-A tail sizes | |
Fragmentation | |||
FRAG_SUBSTRATE | RNA | Specifies RNA as the substrate of fragmentation | |
FRAG_METHOD | UR | Uniform random fragmentation | |
FRAG_UR_ETA | 170 | Average expected framgent size after fragmentations, i.e., number of breaks per unit length (exhautiveness of fragmentation) | |
FRAG_UR_D0 | 1 | Minimum length of fragments produced by UR fragmentation | |
FRAG_UR_DELTA | NaN | Geometry of molecules in the UR process depends logarithmically on molecule length | |
Reverse Transcription | |||
RTRANSCRIPTION | YES | Switch on the reverse transcription | |
RT_PRIMER | RH | Use random hexamer primers used for first strand synthesis | |
RT_MOTIF | default | A default PWM of the current Illumina protocol is used | |
RT_LOSSLESS | YES | Flag to force every molecule to be reversely transcribed | |
RT_MIN | 500 | Minimum length observed after reverse transcription of full-length transcripts | |
RT_MAX | 5,500 | Maximum length observed after reverse transcription of full-length transcripts | |
Amplification and Size Segregation | |||
PCR_DISTRIBUTION | default | Default PCR distribution with 15 rounds and 20 bins | |
GC_MEAN | 0.5 | Mean value of a gaussian distribution that reflects GC bias amplification probability | |
GC_SD | 0.1 | Standard deviation of a gaussian distribution that reflects GC bias amplification probability | |
FILTERING | YES | Enables size filtering of fragments | |
SIZE_DISTRIBUTION | null | Employ an empirical Illumina fragment size distribution | |
SIZE_SAMPLING | MH | The Metropolis-Hastings algorithm is used for filtering | |
Sequencing | |||
READ_NUMBER | 15,000,000 | Produce 15 million reads | |
READ_LENGTH | 75 | Each read sequence is 75nt long | |
PAIRED_END | YES | NO | Single reads are simulated (one per fragment) |
Code Block |
---|
[INFO] I am collecting information on the run.
initializing profiler **********
|
Code Block |
[INFO] Checking GTF file ********** OK (00:00:0403) [PROFILING] I am assigning the expression profile ********** OK (00:00:05) Reading reference annotation ********** OK (00:00:0706) found 28045 transcripts [PROFILING] Parameters NB_MOLECULES 5000000 EXPRESSION_K -0.6 EXPRESSION_X0 5.0E7 EXPRESSION_X1 9500.0 PRO_FILE_NAME /Users/micha/Desktop/mm9_hydrolysis.pro profiling ********** OK (00:00:00) Updating .pro file ********** OK (00:00:00) molecules 4999457 4999480 [LIBRARY] creating the cDNA libary Initializing Fragmentation File ********** OK (00:00:0506) 49994574999480 mol initialized [LIBRARY] Fragmentation UR [LIBRARY] Configuration D0: 1.0 Delta: Not specified, depends on sequence length Eta: 170.0 Processing Fragments ********** OK (00:0203:3420) 9336323499433550 mol: in 49994574999480, new 8836377794434070, out 9336323499433550 avg Len 154.0819113617, maxLen 513499 preparing transcript sequences ********** OK (00:0102:21) [04) [INFO] Initializing PWM cache [INFO] Done [LIBRARY] Reverse Transcription [LIBRARY] Configuration Mode: RH PWM: Nomotif_1mer_0-5.pwm RT MIN: 500 RT MAX: 5500 Processing Fragments ********** OK (00:0518:3253) 9336323499436361 mol: in 9336323499433550, new 02811, out 9334720099436361 avg Len 145226.2810849129, maxLen 506718 initializing Selected Size distribution [LIBRARY] Segregating cDNA (MCMC FilterAcceptance) Processing Fragments ********** OK (00:0302:1432) 9334720099436361 mol: in 9334720099436361, new 0, out 497129783935454 avg Len 153183.102318074, maxLen 299 start amplification [INFO] Loading default PCR distribution [INFO] Initializing PWM cache [INFO] Done [LIBRARY] Amplification [LIBRARY] Configuration Rounds: 15 Mean: 0.5 Standard Deviation: 0.1 Processing Fragments ********** OK (00:0300:1319) Amplification done. In: 497129783935454 Out: 1340698400 106111525 497129783935454 mol: in 497129783935454, new 0, out 1340698400106111525 avg Len 153183.1066616734, maxLen 299 Copied results to /Users/micha/Desktop/mm9_hydrolysis.lib Updating .pro file ********** OK (00:00:00) [SEQUENCING] getting the reads Initializing Fragment Index Indexing ********** OK (00:00:3003) 266818802112053 lines indexed (1340698400106111525 fragments, 1875016421 entries) sequencing ********** OK (00:3504:0511) 1340698400106111525 fragments found (266818802112053 without PCR duplicates) 2999375015000653 reads sequenced 23146442333333 reads fall in poly-A tail 2538184511978 truncated reads Moving temporary BED file Updating .pro file ********** OK (00:00:00) Updating .pro file ********** OK (00:00:00) Updating .pro file ********** OK (00:00:00) Updating .pro file ********** OK (00:00:00) [END] I finished, took me 32522291 sec. Process finished with exit code 0 |