[haiku-development] Re: Optimizing Painter::_DrawBitmapBilinearCopy32

  • From: Rob Judd <haiqu@xxxxxxxxxx>
  • To: haiku-development@xxxxxxxxxxxxx
  • Date: Mon, 15 Jun 2009 09:46:34 +1000

Christian Packmann wrote:
I could need a few volunteers now to run the benchmark on various systems and post/mail the results. This would help me in deciding which routines should be aggressively optimized. I'd be especially interested in the following systems (but other systems would be welcome as well):
* Intel Atom
* Intel Core2 65nm (can be recognized by lack of SSE4.1 support)
* Intel Pentium 4
* Intel Core/Pentium M
* AMD K10 - Phenom/Shanghai
* AMD K8 - Athlon64/Sempron
* AMD K7 - Athlon(XP)/Duron

Benchmark: Haiku app_server bilinear copy
Compile date: Jun 14 2009 14:38:02
GCC version: 2.95.3-haiku-081024

CPU vendor ID: AuthenticAMD
CPU: AMD Athlon(tm) 64 Processor 3000+
  SIMD instructions: MMX SSE SSE-Integer SSE2

Can't lock process to CPU on this platform.
Estimated CPUID/RDTSC overhead: 59 clock cycles.
10 runs per benchmark.

                    --  Results  --

       Minimum    Average    Maximum
# 1:    437597     465076     581038  - 'C, original'
# 2:    449468     450649     457440  - 'C, precise'
# 3:    457459     458318     462643  - 'C, precise DIV'
# 4:    242621     243435     250598  - 'MMX/SSE'
# 5:    242013     244420     247426  - 'MMX/SSE optim-test'
# 6:    304589     305326     311670  - 'SSE2'
Skipped 'SSSE3', insufficient SIMD support
~/projects>

Other related posts: