[mira_talk] Re: Alignment errors in Solexa mapping
- From: Björn Nystedt <bjorn.nystedt@xxxxxxxxx>
- To: mira_talk@xxxxxxxxxxxxx
- Date: Mon, 14 Dec 2009 08:36:18 +0100
> As long as what you show me are two reads and not the reference and a read,
> I'm fine. I don't like it, but I've learned that whatever clever trick one
> applies, there will always be a combination of mapping sequence and
> sequencing
> errors that will lead to situations like this.
You are right, the situtation occurs at homopolymer frameshifts in the
reference, so exactly as you describe below. Sorry for being slow :)
> I have an idea how this kind of effect could be toned down a bit and the
> pieces for it are almost in place in MIRA, but I need to test it (and hope it
> really works).
That sounds great!
B
On Fri, 11 Dec 2009 19:49:48 +0100
Bastien Chevreux <bach@xxxxxxxxxxxx> wrote:
> On Freitag 11 Dezember 2009 Björn Nystedt wrote:
> > I am still somewhat confused about exactly how the second error can occur.
> > The situation is that this is what you would expect
> > ...CATTTTTTC...
> > ATTTTTTC...
> > but this is what you get
> > ...CATTTTTTC...
> > A*TTTTTTC...
> > I can see the point of relaxing gap penalties at ends of reads, but to
> > prefer a mismatch over a match seems strange to me. Note also that there
> > was no IUPAC in the reference in this case!
>
> As long as what you show me are two reads and not the reference and a read,
> I'm fine. I don't like it, but I've learned that whatever clever trick one
> applies, there will always be a combination of mapping sequence and
> sequencing
> errors that will lead to situations like this.
> Welcome to the wonderful world of mapping.
>
> Imagine you have a backbone:
>
> B ...GCTTTTTTCG...
>
> You now map the first read which has one sequencing error right at the
> beginning/end (an A instead of a C):
>
> B ...GCTTTTTTCG...
> r1 ATTTTTTCG...
>
> Now you map a second read, where the sequecning error is an insertion (an
> additional A) somewhere in the middle of the read. Remember, you map against
> the reference:
>
> B ...GC*TTTTTTCG...
> r1 A*TTTTTTCG...
> r2 ...GCATTTTTTCG...
>
> Wham! You have the situation you described. From there on, things can only
> deteriorate with additional reads :-)
>
> I have an idea how this kind of effect could be toned down a bit and the
> pieces for it are almost in place in MIRA, but I need to test it (and hope it
> really works).
>
> Regards,
> Bastien
>
> --
> You have received this mail because you are subscribed to the mira_talk
> mailing list. For information on how to subscribe or unsubscribe, please
> visit http://www.chevreux.org/mira_mailinglists.html
--
====================================
Björn Nystedt, PhD
Molecular Evolution
EBC, Uppsala University
Norbyv. 18C, 752 36 Uppsala
Sweden
phone: +46 (0)18-471 45 88
email: Bjorn.Nystedt@xxxxxxxxx
====================================
--
You have received this mail because you are subscribed to the mira_talk mailing
list. For information on how to subscribe or unsubscribe, please visit
http://www.chevreux.org/mira_mailinglists.html
Other related posts: