[mira_talk] Re: 500Mb assembly
- From: Bastien Chevreux <bach@xxxxxxxxxxxx>
- To: mira_talk@xxxxxxxxxxxxx
- Date: Thu, 10 Dec 2009 18:32:20 +0100
On Donnerstag 10 Dezember 2009 Filip Van Nieuwerburgh wrote:
> I have been asked to de novo assemble an insect genome.
> Until now, I only de novo assembled bacterial genomes (max 5 Mb). Mira
> proved to be the best assembler (in my hands) to tackle these
> assemblies.
>
> What would be the best strategy (also taking into account that I cannot
> wait more than a month for MIRA to finish) to de novo assemble the 500Mb
> insect genome?
>
> I have:
> - 100E6 reads of 2x100 bp Illumina data
> - 1 full Roche titanium run: paired-end insert size= 3000 bp.
> - A 16 core 64-bit linux with 128Gb of RAM
Hi Filip,
take another assembler. Though I never tried it, I'd put my money on the
Celera assembler. 100 bp might be just long enough for it to get working
halfway right.
This is out of MIRA's league, I wouldn't want to use more than 10 to 20
million reads at the moment. You might also think of hiring half a dozen
bioinformaticians for one or two years to get that thing assembled / finished.
And if you are the only one "who has been tasked" (mind the 't' :-) to
assemble that thing in a month: don't walk, run away! Sheeeesh. Doing things
right takes time (and manpower).
Regards,
Bastien
--
You have received this mail because you are subscribed to the mira_talk mailing
list. For information on how to subscribe or unsubscribe, please visit
http://www.chevreux.org/mira_mailinglists.html
Other related posts: