[mira_talk] assembly options for non-redundant contigs

From: Richard Gregory <R.Gregory@xxxxxxxxxxxxxxx>
To: "mira_talk@xxxxxxxxxxxxx" <mira_talk@xxxxxxxxxxxxx>
Date: Tue, 02 Jun 2009 17:30:23 +0100

Hi All,

We been using Mira for a while now, handles cDNA much better thanNewbler and gives us more confidence in the result.

The latest batch of data is proving to be a problem, the current projectrequires contigs that contain all similar reads and not be split intomultiple contigs of minor (or maybe large) differences. We are havingtrouble finding the options to achieve this. Does anybody know if/howthis can be done?

The sequence data is 1.5 plates of 454 Titanium cDNA and half a plate ofpre-Titanium cDNA. Have tried many options, genome or est,--noclippings, -SK:mmhr=1, -DP:ure=no, -AS:ard=no, -AL:egp=no,-AS:sd=no, -CO:mr=no, -AL:mrs=55, -SK:mnr=yes, -SK:hss=1:pr=70, but theresult is basically the same. The better option set was -fasta-job=denovo,est,draft,454 --noclippings -SK:mmhr=1 -DP:ure=no-AS:ard=no -AS:sd=no -GE:not=4 -SK:hss=1:pr=70 , but this wasmarginally better and doesn't achieve the desired result.

The only clue comes previous assemblies with earlier versions of Mira,which produced much less redundancy, ie, was ~8000 contigs, now V2.9.43produces ~18000. Mapping this onto a reference showed ~1500 contigscould be the same gene. Assembling the ~1500 contigs with cap3produced ~3 contigs, one containing hundreds of contigs.



Thanks,

Richard

--
You have received this mail because you are subscribed to the mira_talk mailing 
list. For information on how to subscribe or unsubscribe, please visit 
http://www.chevreux.org/mira_mailinglists.html

Follow-Ups:
- [mira_talk] Re: assembly options for non-redundant contigs
  - From: Bastien Chevreux
- [mira_talk] Re: assembly options for non-redundant contigs
  - From: Laurent MANCHON

[mira_talk] assembly options for non-redundant contigs

Other related posts: