[haiku-development] Re: Optimizing Painter::_DrawBitmapBilinearCopy32

  • From: Urias McCullough <umccullough@xxxxxxxxx>
  • To: haiku-development@xxxxxxxxxxxxx
  • Date: Sun, 14 Jun 2009 18:37:20 +0000

On Sun, Jun 14, 2009 at 2:16 PM, Christian
Packmann<Christian.Packmann@xxxxxx> wrote:
> I could need a few volunteers now to run the benchmark on various systems
> and post/mail the results. This would help me in deciding which routines
> should be aggressively optimized.
> I'd be especially interested in the following systems (but other systems
> would be welcome as well):
> * Intel Atom
> * Intel Core2 65nm (can be recognized by lack of SSE4.1 support)
> * Intel Pentium 4
> * Intel Core/Pentium M
> * AMD K10 - Phenom/Shanghai
> * AMD K8 - Athlon64/Sempron
> * AMD K7 - Athlon(XP)/Duron

Here's another Core 2 example running on gcc4 Haiku r30955:

~> runme_haiku
Benchmark: Haiku app_server bilinear copy
Compile date: Jun 14 2009 14:38:02
GCC version: 2.95.3-haiku-081024

CPU vendor ID: GenuineIntel
CPU: Intel(R) Core(TM)2 Duo CPU     E6750  @ 2.66GHz
  SIMD instructions: MMX SSE SSE-Integer SSE2 SSE3 SSSE3

Can't lock process to CPU on this platform.
Estimated CPUID/RDTSC overhead: 248 clock cycles.
10 runs per benchmark.

                    --  Results  --

       Minimum    Average    Maximum
# 1:    358488     371818     484328  - 'C, original'
# 2:    331824     332024     332552  - 'C, precise'
# 3:    350560     351513     355840  - 'C, precise DIV'
# 4:    189456     190421     192176  - 'MMX/SSE'
# 5:    177768     177912     178232  - 'MMX/SSE optim-test'
# 6:    249120     250858     259200  - 'SSE2'
# 7:    230360     230464     230712  - 'SSSE3'

Other related posts: