[mira_talk] New Solexa seqence identifier format - Paired end reads

Hi,

The manual gives instructions on how MIRA will treat paired end reads in Solexa 
files.

However, the identifier format has changed.

Old format: @HWUSI-EAS100R:6:73:941:1973#0/1
New format: @EAS139:136:FC706VJ:2:2104:15343:197393 1:Y:18:ATCACG

In the old format, the number 1 at the end of the line indicates which member 
of the paired end read this is.
In the new format, the number 1 is the first number after the space (before the 
Y in this example).

Will MIRA understand the new format?
Do I have to convert the new format to the old format for MIRA to distinguish 
the paired end reads?
If so, what is the sed command to do this?

See Wikipedia for full details of new format for identifier in Solexa FASTQ 
files.
The format of the '@' line has changed since Casava 1.8, according to Wikipedia.
http://en.wikipedia.org/wiki/FASTQ_format

Thank you for your help,

Alexis Blanchet-Cohen

--
You have received this mail because you are subscribed to the mira_talk mailing 
list. For information on how to subscribe or unsubscribe, please visit 
http://www.chevreux.org/mira_mailinglists.html

Other related posts: