Ok. We have a hundred 454 reads which belong to a given previously identified protein from the same organism. They align to a known reference sequence in different places. According to this, we thought that their assembly in several contigs could be a raw estimation of the number of genes since non-overlap contigs could be considered 2 different genes (or maybe exons?) and those reads in a single contig could belong to the same gene. What do you think about that? Maybe we are trying to do a clustering process and MIRA isn't the best tool to do that. Regards, 2011/2/8 Bastien Chevreux <bach@xxxxxxxxxxxx> > On Tuesday 08 February 2011 18:27:59 Jordi Durban wrote: > > But what about using some clustering scripts?? > > Uhm ... it's not entirely clear to me what your question is? Care to > elaborate? > > B. > > -- > You have received this mail because you are subscribed to the mira_talk > mailing list. For information on how to subscribe or unsubscribe, please > visit http://www.chevreux.org/mira_mailinglists.html > -- Jordi