[mira_talk] Re: large hybrid assembly w/ minimal ram

  • From: Sven Klages <sir.svencelot@xxxxxxxxxxxxxx>
  • To: mira_talk@xxxxxxxxxxxxx
  • Date: Mon, 15 Nov 2010 12:18:35 +0100

Hi Michael,

2010/11/15 Wachholtz, Michael <mwachholtz@xxxxxxxxxxx>

> [...]

> it is safe to use such strict criteria. After that, for each lane, we
> used the fastq program to collapse/remove any identical reads. This
> [...]

Just a short question. You have successfully used the FASTX-Toolkit to
quality-clip your data; this tool collection also contains a program to
remove duplicates from NGS data:

FASTQ/A Collapser
Collapsing identical sequences in a FASTQ/A file into a single sequence
(while maintaining reads counts)

Have you tried this for your data?
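
In case it helps to see what "collapsing" means in practice, here is a
minimal Python sketch of the idea. It is not the FASTX-Toolkit's actual
implementation; the function names (read_fastq_sequences, collapse_reads)
are made up for this example, and it assumes plain four-line FASTQ records
with no wrapped sequence lines.

```python
from collections import Counter


def read_fastq_sequences(lines):
    """Yield the sequence line of each record.

    Assumption: every record is exactly four lines
    (header, sequence, '+', qualities), with no line wrapping.
    """
    for index, line in enumerate(lines):
        if index % 4 == 1:
            yield line.strip()


def collapse_reads(sequences):
    """Collapse identical sequences into one entry each, keeping the count.

    Returns a list of (sequence, count) pairs, most abundant first;
    ties are broken alphabetically so the order is deterministic.
    """
    counts = Counter(sequences)
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Tiny in-memory example: three reads, two of them identical.
example_fastq = [
    "@read1", "ACGT", "+", "IIII",
    "@read2", "ACGT", "+", "IIII",
    "@read3", "TTGA", "+", "IIII",
]

collapsed = collapse_reads(read_fastq_sequences(example_fastq))
for sequence, count in collapsed:
    print(sequence, count)
```

The sketch holds one dictionary entry per distinct sequence, so its memory
use grows with the number of unique reads rather than the total number of
reads. Note that it discards the quality values; only the sequence and its
count survive.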

cheers,
Sven
