Hey Bastien. I know you've got a lot on your plate, but I'm still trying to find out why mira won't do hybrid assembly for me on OSX. If you'll remember, when I run a bare bones (i.e. basic command line --project=EO -job=denovo,genome,accurate,454,solexa -GE:not=8) hybrid 454 / solexa assembly on my Mac, I wind up with 76814 contigs, none larger than 1745 bases. Running the exact same command line on my university's server using the linux binaries of the same version (3.2.1.17) I run out of disk space after the first pass due to my being on a test account, but at that point I have 18498 total contigs, 88 of which are larger than 500 bp. The university server is busy and the queues are long, so even if I can get a larger disk space allocation, that's not really a preferable option since I have a perfectly good computer with plenty of RAM and disk space and no queue sitting right on my desk that I think should be capable of doing this. Here's what else I've tried so far: - Installed Ubuntu on a virtualbox, but the memory limit of virtualbox is 16G of RAM and it keeps killing mira runs while trying to load the blocks after skim, so that's no help. - Built 3.2.1.17 from source on OSX instead of using the pre-built binaries, just in case. Output from the hybrid run was nearly identical to the pre-built binaries (big surprise, I know) If it's any help, I was scanning through the log files from my last OSX run and the partial server run. What I found was they start off exactly the same. The first several pool statistics, deduced thresholds, and repeat ratio histograms are identical. There are a few minor differences after the skim: OSX: ... truncating EO_assembly/EO_d_tmp/EO_int_posmatchf_pass.1.lst from 5685491832 to 2526204024 truncating EO_assembly/EO_d_tmp/EO_int_posmatchc_pass.1.lst from 5601074328 to 2506863336 Skim summary: accepted: 478708281 possible: 608170028 permbans: 0 Hits chosen: 470273590 ... Linux: ... truncating EO_assembly/EO_d_tmp/EO_int_posmatchf_pass.1.lst from 5685493824 to 2526205752 truncating EO_assembly/EO_d_tmp/EO_int_posmatchc_pass.1.lst from 5601074688 to 2506864824 Skim summary: accepted: 478708281 possible: 608170028 permbans: 0 Hits chosen: 470273688 ... There's further divergence in the output after this: OSX output: ... Total megahubs: 0 tcmalloc: large alloc 4294967296 bytes == 0x473800000 @ Can load up to 134217702 skim edges at once. Partitioning into 4 blocks. We have 419422280 skims in file. ... Linux output: ... Total megahubs: 0 System memory: 50620112896 Mem2keepfree: 7593016934 Used by MIRA: 14746963968 Mem avail: 28280131994 rsh increased memtouse to: 28280131994 tcmalloc: large alloc 13421522944 bytes == 0x38c9cc000 @ Can load up to 419422548 skim edges at once. Partitioning into 1 blocks. We have 419422548 skims in file. ... There are some more differences in the block loading outputs after that, a lot of which I'm assuming is due to mira not being able to use memory management on OSX. After that both log files go through "Filtering forward skims" and "Filtering complement skims." After the next step, "Aligning possible forward matches," there are really divergent "Alignment stats": OSX: ... Potential: 2971963 Calculated: 40864 Evaded (PB): 0 Rejected (checkfun): 0 Trans 100 saved: 2931099 Banned overlap pairs: 2753 in 1673 sets. ... Linux: ... Potential: 18008746 Calculated: 2326435 Evaded (PB): 3 Rejected (checkfun): 0 Trans 100 saved: 15682308 Banned overlap pairs: 23229 in 15217 sets. ... Similarly the output from "Aligning possible complement matches" is widely divergent between the two. OSX soon goes on to kill an enormous number of clusters (50,198 lines of "Killing cluster") versus Linux version producing 985 "Killing cluster" lines, ultimately culminating with OSX version moving 209187 reads to debris, Linux version moving 2188 reads to debris. The Linux version then goes on to build a few long contigs, OSX a huge number of teeny-tiny ones. Does any of this help you to figure out why OSX is going off the rails? Any thoughts would be appreciated. Thanks. - E On May 15, 2011, at 3:03 PM, Bastien Chevreux wrote: > On Friday 06 May 2011 15:27:29 you wrote: > > ?????????????? > > Didn't work again! > > Got my DSL back, haven't forgotten you. > > Would you still like to run a couple of tests to see whether we can pinpoint > the cause of the problem on OSX? > > Best, > Bastien > > >