i still got an error while preparing transcript sequence during PCR distribution:
I just want to get a simple human experiment with 2 groups 3 replicates (fastq files) each. Some of them should be even expressed between groups and some should be highly different expressed. How can i acomplish this?
My genome: Homo_sapiens.GRCh37.73.dna.primary_assembly.fa
My gtf:gencode.v18
the key error, i think:
and i dont want to alter my genome file, because the position wont match anymore with the gtf-file:
[INFO] Loading default PCR distribution
preparing transcript sequences ERROR
[ERROR] Error while preparing sequences: Problems reading sequence chr1: pos -1, len 124,
check whether chromosomal sequence exists / has the correct size
[INFO] Flux-Simulator v1.2.2-20140614022955 (Flux Library: 1.30-20140614022955)
[INFO] No mode selected, executing the full pipeline (-x -l -s)
[INFO] I am collecting information on the run.
initializing profiler **********
[CAUTION] I overwrite the expression values in file parameters.pro, please confirm:
(Yes,No,Don't know) Yes
[INFO] Reading error model 76 bases model
[WARN] The error model supports a read length of 76 but
you are trying to create reads of length 36. We are scaling.
[INFO] Checking GTF file
Checking GTF *[WARN] Unsorted in line 5 - cannot perform gene clustering: chr1 + ENST00000591440.1 @ 719750 after ENST00000590848.1 @ 728262
Checking GTF ********** OK (00:00:00)
[GTF FILE] The GTF reference file given is not sorted, but we found a sorted version.
[GTF FILE] The Simulator will use /mountpoints/wntshare/project/aczerny/Untitled_Folder/filtered_gtf_v18_sorted.gtf
[GTF FILE] You might want to update your parameters file
[PROFILING] I am assigning the expression profile
Checking GTF ********** OK (00:00:00)
Reading reference annotation OK (00:00:00)
found 144 transcripts
[PROFILING] Parameters
NB_MOLECULES 100000
EXPRESSION_K -0.6
EXPRESSION_X0 9500.0
EXPRESSION_X1 9.025E7
PRO_FILE_NAME /mountpoints/wntshare/project/aczerny/Untitled_Folder/parameters.pro
profiling ********** OK (00:00:00)
Updating .pro file ********** OK (00:00:00)
molecules 99996
[LIBRARY] creating the cDNA libary
Initializing Fragmentation File ********** OK (00:00:00)
99996 mol initialized
[LIBRARY] Fragmentation UR
[LIBRARY] Configuration
D0: 1.0
Delta: Not specified, depends on sequence length
Eta: 223.09448802182916
Processing Fragments ********** OK (00:00:01)
721945 mol: in 99996, new 621949, out 721945
avg Len 201.0221, maxLen 572
[LIBRARY] Reverse Transcription
[LIBRARY] Configuration
Mode: RH
PWM: No
RT MIN: 500
RT MAX: 5500
Processing Fragments ********** OK (00:00:02)
721945 mol: in 721945, new 0, out 721805
avg Len 175.60474, maxLen 562
start amplification
[INFO] Loading default PCR distribution
preparing transcript sequences ERROR
[ERROR] Error while preparing sequences: Problems reading sequence chr1: pos -1, len 124,
check whether chromosomal sequence exists / has the correct size
java.lang.RuntimeException: Problems reading sequence chr1: pos -1, len 124,
check whether chromosomal sequence exists / has the correct size
at barna.model.Graph.readSequence(Graph.java:721)
at barna.model.Graph.readSequence(Graph.java:608)
at barna.model.Graph.readSequence(Graph.java:430)
at barna.model.Transcript.getSplicedSequence(Transcript.java:451)
at barna.flux.simulator.fragmentation.Fragmenter.getMapTxSeq(Fragmenter.java:381)
at barna.flux.simulator.fragmentation.Fragmenter.process(Fragmenter.java:545)
at barna.flux.simulator.fragmentation.Fragmenter.call(Fragmenter.java:264)
at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:440)
at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:54)
at barna.commons.launcher.Flux.main(Flux.java:197)
Caused by: java.lang.NullPointerException
at barna.model.Graph.getRAF(Graph.java:1198)
at barna.model.Graph.readSequence(Graph.java:651)
... 9 more
FAILED
[ERROR] Error while fragmenting : Problems reading sequence chr1: pos -1, len 124,
check whether chromosomal sequence exists / has the correct size
java.lang.RuntimeException: Problems reading sequence chr1: pos -1, len 124,
check whether chromosomal sequence exists / has the correct size
at barna.model.Graph.readSequence(Graph.java:721)
at barna.model.Graph.readSequence(Graph.java:608)
at barna.model.Graph.readSequence(Graph.java:430)
at barna.model.Transcript.getSplicedSequence(Transcript.java:451)
at barna.flux.simulator.fragmentation.Fragmenter.getMapTxSeq(Fragmenter.java:381)
at barna.flux.simulator.fragmentation.Fragmenter.process(Fragmenter.java:545)
at barna.flux.simulator.fragmentation.Fragmenter.call(Fragmenter.java:264)
at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:440)
at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:54)
at barna.commons.launcher.Flux.main(Flux.java:197)
Caused by: java.lang.NullPointerException
at barna.model.Graph.getRAF(Graph.java:1198)
at barna.model.Graph.readSequence(Graph.java:651)
... 9 more
[ERROR] Fragmentation file /mountpoints/wntshare/project/aczerny/Untitled_Folder/parameters.lib not found. Make sure fragmentation was done !
java.lang.RuntimeException: Fragmentation file /mountpoints/wntshare/project/aczerny/Untitled_Folder/parameters.lib not found. Make sure fragmentation was done !
at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:450)
at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:54)
at barna.commons.launcher.Flux.main(Flux.java:197)
1 Comment
Vitor Lima Coelho
Post your command line and try this:
divide your genome file into chromosomes such that each chromosome must be a fasta file according to the seqnames present on the first column of your gtf file (chr1.fa, chr2.fa, etc)