[mira_talk] High GC genomes and mira

From: Phillip San Miguel <pmiguel@xxxxxxxxxx>
To: mira_talk@xxxxxxxxxxxxx
Date: Tue, 17 May 2011 11:15:59 -0400

On 5/16/2011 1:17 PM, Bastien Chevreux wrote:

On May 16, 2011, at 15:16 , Phillip San Miguel wrote:
I tried MIRA V3.2.1.15 on a 70% GC bacterial genome (Deinococcus) ataround 100x coverage with solexa PE 101 base reads. My N50 contigsize was 4630 bases. That seems short to me, but it might be a resultof the 70% GC. So I decided to de novo assemble a 50% GC data setfrom the same run.
That's bad, really bad. You are the second report I get thatapparently, MIRA has problems with high GC Solexa data sets. The firstbeing a supersecret bug of a big company, I cannot get the data to seewhat's causing havoc. Would it be possible for me to have a look atthat thing? No promises, but it might help.
B.

    Probably, just let me check with the owner of the sequences.

However, the short contig lengths may derive from somethingtrivial: read distribution bias. An Eland/Gerald mapping of our IlluminaSalmonella reads produces a reasonably even coverage depth across thethe genome. A similar mapping of our Illumina Deinococcus reads showsmostly 50-150x coverage, but also frequent regions with very lowcoverage (a few X coverage -- or zero).


--
Phillip

--
You have received this mail because you are subscribed to the mira_talk mailing 
list. For information on how to subscribe or unsubscribe, please visit 
http://www.chevreux.org/mira_mailinglists.html

Follow-Ups:
- [mira_talk] Re: High GC genomes and mira
  - From: Shaun Tyler

References:
- [mira_talk] Re: Call for testing: MIRA 3.2.1.17 and Ion Torrent
  - From: Bastien Chevreux
- [mira_talk] Re: Call for testing: MIRA 3.2.1.17 and Ion Torrent
  - From: Phillip San Miguel
- [mira_talk] Re: Call for testing: MIRA 3.2.1.17 and Ion Torrent
  - From: Bastien Chevreux

[mira_talk] High GC genomes and mira

Other related posts: