[mira_talk] Re: miraconvert maf to fastq contains asterix

  • From: Bastien Chevreux <bach@xxxxxxxxxxxx>
  • To: mira_talk@xxxxxxxxxxxxx
  • Date: Wed, 15 Oct 2014 17:50:46 +0200 (CEST)

> On October 15, 2014 at 5:14 PM Bert Brutzel <bertbrutzel@xxxxxxxxxxxxxx>
> wrote:
>  the -d Option helped, this avoids my awk, sed tr, solution I build... I
>nevertheless still have a problem using sort -u since the
> reads extracted from a mapping and from the raw data are not identical
> anymore, as there are some changes as indicated below. These
> duplicates I can luckily still remove using uniq -w 50 -d , but I still wonder
> where they come from....

You certainly want to track the source of the duplicates. There's a problem in
your upstream pipeline if duplicates end up in your data.

B.

--
You have received this mail because you are subscribed to the mira_talk mailing 
list. For information on how to subscribe or unsubscribe, please visit 
http://www.chevreux.org/mira_mailinglists.html

Other related posts: