On Sun, Jun 14, 2009 at 2:16 PM, Christian Packmann<Christian.Packmann@xxxxxx> wrote: > I could need a few volunteers now to run the benchmark on various systems > and post/mail the results. This would help me in deciding which routines > should be aggressively optimized. > I'd be especially interested in the following systems (but other systems > would be welcome as well): > * Intel Atom > * Intel Core2 65nm (can be recognized by lack of SSE4.1 support) > * Intel Pentium 4 > * Intel Core/Pentium M > * AMD K10 - Phenom/Shanghai > * AMD K8 - Athlon64/Sempron > * AMD K7 - Athlon(XP)/Duron Here's another Core 2 example running on gcc4 Haiku r30955: ~> runme_haiku Benchmark: Haiku app_server bilinear copy Compile date: Jun 14 2009 14:38:02 GCC version: 2.95.3-haiku-081024 CPU vendor ID: GenuineIntel CPU: Intel(R) Core(TM)2 Duo CPU E6750 @ 2.66GHz SIMD instructions: MMX SSE SSE-Integer SSE2 SSE3 SSSE3 Can't lock process to CPU on this platform. Estimated CPUID/RDTSC overhead: 248 clock cycles. 10 runs per benchmark. -- Results -- Minimum Average Maximum # 1: 358488 371818 484328 - 'C, original' # 2: 331824 332024 332552 - 'C, precise' # 3: 350560 351513 355840 - 'C, precise DIV' # 4: 189456 190421 192176 - 'MMX/SSE' # 5: 177768 177912 178232 - 'MMX/SSE optim-test' # 6: 249120 250858 259200 - 'SSE2' # 7: 230360 230464 230712 - 'SSSE3'