[mira_talk] MITObim/Mira aborting process with Genbank data

  • From: Hernan Vazquez Miranda <miran050@xxxxxxx>
  • To: mira_talk@xxxxxxxxxxxxx
  • Date: Mon, 6 Apr 2015 12:08:47 -0400

Hello everybody,

I've been fighting with the following problem trying to extract the
mitogenome of the recently deposited woodpecker genome in Genbank. I
downloaded one full lane, merge all pair-end reads, and concatenated them
in a single file. I had to subsample the full file to 10% because it
wouldn't run (Killed:9 error). Now it doesn't die but Mira won't work. I
even tried the dos2unix trick and the reduced fastq file still doesn't
work. I've tried it with my experimental reads and it works fine.

Am I doing something wrong? Most likely, but I cannot find what it is. Any
help will be greatly appreciated. This is my run.log:

MITObim - mitochondrial baiting and iterative mapping

version 1.7

author: Christoph Hahn, (c) 2012-2014


quick option selected! -maf option will be ignored (if given)


Full command run:

/Users/Odontodactylus/Programs/MITObim/MITObim_1.7_beta.pl
<http://mitobim_1.7_beta.pl/> -sample downy -ref DQ780882_Pteroglossus
-readpool downyS10.fastq --quick DQ780882_Pteroglossus.fasta -end 100
--denovo --clean


All paramters seem to make sense:

startiteration: 0

enditeration: 100

sample: downy

refname: DQ780882_Pteroglossus

readpool: /Users/Odontodactylus/Sanger_data/melanerpes/downyS10.fastq

maf: 0

quick:
/Users/Odontodactylus/Sanger_data/melanerpes/DQ780882_Pteroglossus.fasta

paired: 0 (off=0, on=1)

assembly mode: 1 (mapping=0, denovo=1)

verbose: 0 (off=0, on=1)

clean: 1 (off=0, on=1)

trim reads: 0 (off=0, on=1)

trim overhang: 0 (no=0, yes=1)

platform: SOLEXA

kmer baiting: 31


Starting MITObim


==============

ITERATION 0

==============

Apr 3 21:53:47



quick option baits reads from provided reference in iteration 0


fishing readpool using mirabait (k = 31)


/Users/Odontodactylus/Sanger_data/melanerpes/downyS10.fastq 1
/Users/Odontodactylus/Sanger_data/melanerpes 2 downyS10 3 fastq

Loading baits ...Localtime: Fri Apr 3 21:53:47 2015

Loading data from FASTA file:

[0%] ....|.... [10%] ....|.... [20%] ....|.... [30%] ....|.... [40%]
....|.... [50%] ....|.... [60%] ....|.... [70%] ....|.... [80%] ....|....
[90%] ....|.... [100%]

Localtime: Fri Apr 3 21:53:47 2015

rnm size: 0

No FASTA quality file given, using default qualities for all reads just
loaded.

Localtime: Fri Apr 3 21:53:47 2015



Done.

Loaded 1 reads with 0 reads having quality accounted for.

baitrp.size(): 1

Localtime: Fri Apr 3 21:53:47 2015

Writing temporary hstat files:

freemem: 27016318976

TNH: 18706

XME 1: 0.00074331

XME 2: 0.1

NEPB 1: 104857

NEPB 2: 104857

[0%] ....|.... [10%] ....|.... [20%] ....|.... [30%] ....|.... [40%]
....|.... [50%] ....|.... [60%] ....|.... [70%] ....|.... [80%] ....|....
[90%] ....|.... [100%] done

Flushing buffers to disk:

[0%] ....|.... [10%] ....|.... [20%] ....|.... [30%] ....|.... [40%]
....|.... [50%] ....|.... [60%] ....|.... [70%] ....|.... [80%] ....|....
[90%] ....|.... [100%] done

Localtime: Fri Apr 3 21:53:47 2015



Analysing hstat files:

[0%] ....|.... [10%] ....|.... [20%] ....|.... [30%] ....|.... [40%]
....|.... [50%] ....|.... [60%] ....|.... [70%] ....|.... [80%] ....|....
[90%] ....|.... [100%]

Localtime: Fri Apr 3 21:53:47 2015

clean up temporary stat files...Localtime: Fri Apr 3 21:53:47 2015

Raw MHI: 17003

Raw avg. freq. : 1

HSS 33824 HSST: 30442

Localtime: Fri Apr 3 21:53:47 2015

Will save to: fastq

Loading data from fastq ...Localtime: Fri Apr 3 21:53:47 2015

Loading data from FASTQ file:
/Users/Odontodactylus/Sanger_data/melanerpes/downyS10.fastq

(sorry, no progress indicator for that, possible only with zlib >=1.34)

Read SRR949787.4.1: invalid quality 170

Read SRR949787.20.1: invalid quality 171

Read SRR949787.20.1: invalid quality 167

... 4GB od these error lines


Read SRR949787.119142778.1: invalid quality 171

Read SRR949787.119142778.1: invalid quality 167

Read SRR949787.119142778.1: invalid quality 165





In func: size_t ReadPool::loadDataFromFASTQ_rgid(const string & filename,
int8 fastqoffset, const ReadGroupLib::ReadGroupID rgid, bool countonly,
void (*callback)(ReadPool &))

Throw message: Unrecoverable error while loading data from FASTQ (see
output above) Fix your input please.



Fatal error (may be due to problems of the input data or parameters):




********************************************************************************

* Unrecoverable error while loading data from FASTQ (see output above) Fix
*

* your input please.
*


********************************************************************************

->Thrown: size_t ReadPool::loadDataFromFASTQ_rgid(const string & filename,
int8 fastqoffset, const ReadGroupLib::ReadGroupID rgid, bool countonly,
void (*callback)(ReadPool &))

->Caught: main



Aborting process, probably due to error in the input data or
parametrisation.

Please check the output log for more information.

For help, please write a mail to the mira talk mailing list.

Subscribing / unsubscribing to mira talk, see:
//www.freelists.org/list/mira_talk



CWD: /Users/Odontodactylus/Sanger_data/melanerpes/iteration0

Thank you for noticing that this is *NOT* a crash, but a

controlled program stop.



mirabait seems to have failed - see detailed output above

Thank you for your help!

Hernán

--


*----*

__ __
/ <` '> \
( / @ @ \ )
\(_ _\_/_ _)/
(\ `-/ \-' /)
"===\ /==="
.==')___(`==. hjw
' .=' `=.

*Hernán Vázquez Miranda, PhD*



*Postdoctoral Research AssociateFlorida International
UniversityBracken-Grissom Lab*

http://www.brackengrissomlab.com

Other related posts: