Christian Packmann wrote:
I could need a few volunteers now to run the benchmark on various systems and post/mail the results. This would help me in deciding which routines should be aggressively optimized. I'd be especially interested in the following systems (but other systems would be welcome as well):* Intel Atom * Intel Core2 65nm (can be recognized by lack of SSE4.1 support) * Intel Pentium 4 * Intel Core/Pentium M * AMD K10 - Phenom/Shanghai * AMD K8 - Athlon64/Sempron * AMD K7 - Athlon(XP)/Duron
Benchmark: Haiku app_server bilinear copy Compile date: Jun 14 2009 14:38:02 GCC version: 2.95.3-haiku-081024 CPU vendor ID: AuthenticAMD CPU: AMD Athlon(tm) 64 Processor 3000+ SIMD instructions: MMX SSE SSE-Integer SSE2 Can't lock process to CPU on this platform. Estimated CPUID/RDTSC overhead: 59 clock cycles. 10 runs per benchmark. -- Results -- Minimum Average Maximum # 1: 437597 465076 581038 - 'C, original' # 2: 449468 450649 457440 - 'C, precise' # 3: 457459 458318 462643 - 'C, precise DIV' # 4: 242621 243435 250598 - 'MMX/SSE' # 5: 242013 244420 247426 - 'MMX/SSE optim-test' # 6: 304589 305326 311670 - 'SSE2' Skipped 'SSSE3', insufficient SIMD support ~/projects>