A.2.1 - Parameter File Format .PAR

The PAR format in the Flux Simulator is used to administrate all parameters of a run. It is a simple format containing key value pairs (one per line) with the following parameter names (i.e., keys):

File Locations

Key	Type	Default Value	Description
REF_FILE_NAME	String		Path to the GTF reference annotation, either absolute or relative to the location of the parameter file
PRO_FILE_NAME	String	{REF_FILE_NAME}.PRO	Path to the profile of the run, either absolute or relative to the location of the parameter file; the default profile uses the name of the parameter file with the extension .pro.
LIB_FILE_NAME	String	{REF_FILE_NAME}.LIB	Path to the library file of the run, either absolute or relative to the location of the parameter file; the default profile uses the name of the parameter file with the extension .lib.
SEQ_FILE_NAME	String	{REF_FILE_NAME}.BED	Path to the sequencing file of the run, either absolute or relative to the location of the parameter file; the default profile uses the name of the parameter file with the extension .bed.
GEN_DIR	String		Path to the directory with the genomic sequences, i.e., one fasta file per chromosome/scaffold/contig with a file name corresponding to the identifiers of the first column in the GTF annotation.

Expression

Key	Type	Default Value	Description
LOAD_CODING	Boolean	YES	Coding messengers, i.e., transcripts that have an annotated CDS, are extracted from the cell.
LOAD_NONCODING	Boolean	YES	Non-coding RNAs, i.e., transcripts without an annotated ORF are extracted from the cell.
NB_MOLECULES	Long	5,000,000	Number of RNA molecules initially in the experiment.
EXPRESSION_K	Double	(-0.6)	Exponent of power-law underlying the expression profile [-1;0]
EXPRESSION_X0	Double	9,500	Linear parameter of the exponential decay.
EXPRESSION_X1	Double	90,250,000	Quadratic parameter of the exponential decay.

Library prepeparation

Fragmentation

Key	Type	Default Value	Description
FRAGMENTATION	Boolean	YES	Turn fragmentation on/off.
FRAG_SUBSTRATE	{DNA,RNA}	DNA	Substrate of fragmentation, determines the order of fragmentation and reverse transcription (RT): for substrate DNA, fragmentation is carried out after RT, substrate RNA triggers fragmentation before RT.
FRAG_METHOD	{EZ,NB,UR}	UR	Fragmentation method employed: * [EZ] Fragmentation by enzymatic digestion * [NB] Fragmentation by nebulization * [UR] Uniformal random fragmentation
Enzymatic Digestion
FRAG_EZ_MOTIF	String		Sequence motif caused by selective restriction with an enzyme, choose pre-defined NlaIII, DpnII, or a file with a custom position weight matrix.
Nebulization
FRAG_NB_LAMBDA	Double	900.0	Threshold on molecule length that cannot be broken by the shearfield of nebulization.
FRAG_NB_THOLD	Double	0.1	Threshold on the fraction of the molecule population; if less molecules break per time unit, convergence to steady state is assumed.
FRAG_NB_M	Double	1.0	Strength of the nebulization shearfield (i.e., rotor speed).
Uniformal Random (UR) Fragmentation
FRAG_UR_ETA	Double	NaN	Average expected framgent size after fragmentations, i.e., number of breaks per unit length (exhautiveness of fragmentation); NaN optimizes the fragmentation process w.r.t. the size filtering
FRAG_UR_DELTA	Double	NaN	Geometry of molecules in the UR process: * NaN= depends logarithmically on molecule length, * 1= always linear, * 2= always surface-diameter, * 3= volume-diameter, ...
FRAG_UR_D0	Double	1.0	Minimum length of fragments produced by UR fragmentation.

Key	Type	Default Value	Description
RT_PRIMER	[RANDOM\|POLY-DT]		Flag to switch between random priming and poly-dT priming for the first strand synthesis of the reverse transcription
RT_MIN	Integer		Minimum length (in [nt]) of the expected reversely transcribed cDNA molecules
RT_MAX	Integer		Maximum length (in [nt]) of the expected reverse transcription products
FRAGMENTATION	[YES\|NO]		Optional: flag that determines whether a fragmentation step is carried out
FRAG_LAMBDA	Integer		Upper boundary of fragment lengths (in [nt]) that are not expected to be fragmented by the applied technique
FILTERING	[YES\|NO]		Flag to indicate whether a length filtering step is carried out on the cDNA library.
FILT_MIN	Integer		Minimum length that is retained during filtering.
FILT_MAX	Integer		Maximum length that is retained during filtering.
READ_NUMBER	Integer		Number of reads that are intented to produce. Note: this number is an upper boundary and gets adapted to the actual size of the intermediary generated library.
READ_LENGTH	Integer		Length of the generated reads, depends on filtering settings.
PAIRED_END	[YES\|NO]		Flag to indicate whether read pairs are produced.
FASTQ	[YES\|NO]		Flag that indicates whether additionally the read sequences and qualities are output. Depends on GENOME_DIR and ERR_FNAME.
QTHOLD	Integer		Quality value below which base-calls are considered problematic.
TMP_DIR	String		Path to folder for temporary files, if different from system standard (commonly /tmp on Unix clones).

Space shortcuts

Child pages

File Locations

Expression

Library prepeparation

Fragmentation