We have been using FluxSimulator successfully to simulate paired-end reads of up to 100nt, using uniformly random (UR) fragmentation.  However, the 200nt fragment size limitation makes it difficult to try longer read lengths (such as 120nt or 150nt) as we anticipate continued improvements in sequencing technology.

Would it be possible for us to "fool" the simulator into using longer fragments with UR fragmentation to generate longer paired-end reads?  Or would it be better to abandon UR fragmentation in favor of another kind?  We would like our simulations to be directly comparable so that read length is the only variable that changes.  At the same time, we would like our simulations to reflect the kind of experiments people really are (or will be) doing.

  • No labels

1 Comment

  1. Hi Mark,

    Im not sure whether I quite understood the question; both random fragmentation protocols we implement (UR and Nebulization) provide a parameter that allows you to control for the expected length of a fragment:

    Nebulization
    FRAG_NB_LAMBDADouble900.0Threshold on molecule length that cannot be broken by the shearfield of nebulization.
    Uniformal Random (UR) Fragmentation
    FRAG_UR_ETADoubleNaN

    Average expected framgent size after fragmentations, i.e., number of breaks per unit length (exhautiveness of fragmentation);

    NaN optimizes the fragmentation process w.r.t. the size filtering

    Please let us know whether you found any problems with these parameters. 

    Cheers