[haiku-development] Re: Compositing window management

From: Stephan Aßmus <superstippi@xxxxxx>
To: haiku-development@xxxxxxxxxxxxx
Date: Wed, 15 Jun 2011 11:19:50 +0200

Hi,

On 15.06.2011 06:37, Mark Watts wrote:

Hi, I'm not exactly new to this list (lots of lurking), but I haven't
done any hacking on the Haiku sources.
I want to implement compositing in Haiku (along with other interface
things). I've gotten Haiku compiled and running in a VM, so from here
I guess I should start with the interface kit? Can anyone who's done
related work suggest a starting point in the code or potential
problems?

Great to hear, I hope you are indeed serious about getting your feet wet! :-) I am going to give you an overview, but it is intended to help you along when reading the code and piecing things together. It will just give you a direction that is guaranteed to work OK and with the minimal amount of changes, but you need to figure out the actual implementation yourself. Here it goes:

Compositing can initially be implemented without doing lots of changes to the drawing code or how updates work. In the Interface Kit we have BWindow and BView. The BWindow has a messaging port to the app_server. Inside app_server, a ServerWindow (one instance per BWindow) receives any drawing commands which BViews send via their owning BWindow. BViews also have a server counterpart, which is called View. There is another class called Window. Window represents the on-screen object (holds clipping, decorator, ...) while ServerWindow rather manages the communication and owns one Window object. View objects are owned by Window and mirror the client-side BView hierarchy, as long as those BViews are attached. Window also owns a DrawingEngine, which abstracts the entire drawing interface. The default implementation for all these calls is in the Painter class. Painter itself uses AGG for the general purpose drawing algorithms, and also has a whole bunch of optimized implementations for cases where we can take a shortcut. Each Window instance owns a DrawingEngine and Painter instance.
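
To keep the ownership relations straight while reading the code, here is a rough sketch of who owns what. These are simplified stand-in structs that only reuse the class names from above; the members and the CountViews() helper are made up for illustration and are not the real app_server declarations.

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Simplified stand-ins, NOT the real app_server declarations.
struct Painter {};                 // does the actual pixel work

struct DrawingEngine {             // abstracts the drawing interface
    Painter painter;               // default implementation of the calls
};

struct View {                      // server-side mirror of an attached BView
    std::vector<std::unique_ptr<View>> children;
};

struct Window {                    // the on-screen object (clipping, decorator, ...)
    DrawingEngine drawingEngine;
    std::unique_ptr<View> topView = std::make_unique<View>();
};

struct ServerWindow {              // one per client BWindow, handles the messaging
    std::unique_ptr<Window> window = std::make_unique<Window>();
};

// Hypothetical helper: counts a view and everything below it.
int CountViews(const View* view)
{
    assert(view != nullptr);
    int count = 1;
    for (const auto& child : view->children)
        count += CountViews(child.get());
    return count;
}
```

The point is only the direction of ownership: ServerWindow owns Window, and Window owns both the View tree and the drawing objects.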

Now comes the interesting part: Painter has the notion of being attached to a memory address which represents the frame buffer of the screen. All the Painter objects owned by each Window are attached to the same memory address at the moment. They each expect to be set up with a correct clipping region before any drawing calls are invoked on them (this happens in ServerWindow right before invoking drawing calls). The clipping is updated for all on-screen windows in an "atomic" operation, by the Desktop object.

The Desktop object runs in its own thread and whenever it calls into any ServerWindow/Window/View/... code it is expected to hold the global window lock in write-mode. This means all ServerWindow threads are then blocking. Whenever ServerWindow threads run, they hold the global lock in read-only mode, which means all other ServerWindow threads are allowed to run concurrently, and only the Desktop thread is blocking (since it always wants a write-lock). This means when the global clipping is supposed to be changed, for example by moving a window on screen, this operation blocks until all other ServerWindow threads have released the read-lock. Note that you can also invoke Desktop functions from a ServerWindow thread. In that case you have to first give up the read-lock, acquire the write-lock, call into the Desktop function, release the write-lock, and re-acquire the read-lock. I mention this just so you understand these bits of code when you come across them.

That's basically the locking inside app_server, but there are additional utility classes which perform their own (inner) locking. It is relatively easy, unfortunately, to mess things up and cause a dead-lock, so keep this in mind. To recap: A ServerWindow thread only runs after it successfully acquired the read-lock (or the write-lock, but the write-lock then blocks all other threads including the Desktop thread). The Desktop thread only runs when it successfully acquired the write-lock, it never read-locks.
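
The read-lock/write-lock dance from a ServerWindow thread can be sketched roughly like this. It is a minimal illustration built on std::shared_mutex; GlobalWindowLock, the Desktop struct and UpdateClipping() are hypothetical names for this sketch, and app_server uses its own locking classes rather than the standard library.

```cpp
#include <atomic>
#include <cassert>
#include <shared_mutex>

// Hypothetical stand-in for the global window lock.
class GlobalWindowLock {
public:
    void ReadLock()    { fLock.lock_shared(); }
    void ReadUnlock()  { fLock.unlock_shared(); }
    void WriteLock()   { fLock.lock(); fWriteLocked = true; }
    void WriteUnlock() { fWriteLocked = false; fLock.unlock(); }
    bool IsWriteLocked() const { return fWriteLocked; }

private:
    std::shared_mutex fLock;
    std::atomic<bool> fWriteLocked{false};
};

// Hypothetical, heavily reduced Desktop.
struct Desktop {
    GlobalWindowLock lock;
    int clippingGeneration = 0;

    // Changes global state, so the caller must hold the write-lock.
    void UpdateClipping()
    {
        assert(lock.IsWriteLocked());
        ++clippingGeneration;
    }
};

// Called from a ServerWindow thread that currently holds the read-lock.
void CallDesktopFromServerWindow(Desktop& desktop)
{
    desktop.lock.ReadUnlock();   // 1. give up the read-lock
    desktop.lock.WriteLock();    // 2. blocks until all readers have left
    desktop.UpdateClipping();    // 3. call into the Desktop function
    desktop.lock.WriteUnlock();  // 4. release the write-lock
    desktop.lock.ReadLock();     // 5. continue as an ordinary reader
}
```

Note that between steps 1 and 2 the thread holds no lock at all, so anything it looked at under the read-lock may have changed by the time it gets the write-lock.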

Anyway, the important bit is that all Painters are attached to the frame buffer, pointing to the same top-left pixel address, and they are already allowed to paint concurrently. All threads are blocked when the clipping is updated, then they can continue drawing again. The frame buffer which these Painters are attached to is called the "back-buffer". It resides in system RAM and is always 32 bit per pixel.

Since all Painter objects point to the top-left pixel of the back-buffer, all the coordinates of client drawing commands are converted to screen coordinate space! (Important since this needs to change later on.)

Access to the frame buffer is managed by instances of a class called HWInterface. There is only a single instance of this class per Screen. In a regular desktop situation, this instance would be of the type AccelerantHWInterface. It is connected to the graphics card via the accelerant interface by which the graphics card provides a frame buffer of its own. It may have a different color space. HWInterface provides utility methods for copying parts of the back-buffer (RAM) to the front-buffer (graphics card memory).
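
As an illustration of what such a back-to-front copy has to do when the color spaces differ, here is a minimal sketch that copies a rectangle from a 32 bit back-buffer into a 16 bit (RGB565) front-buffer. The function names, the Rect struct with exclusive right/bottom edges, and the choice of RGB565 are assumptions for this example and not the HWInterface API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch-only rectangle; right and bottom are exclusive here.
struct Rect { int left, top, right, bottom; };

// Converts one 0xAARRGGBB pixel to RGB565, dropping the low bits.
inline uint16_t ToRGB565(uint32_t argb)
{
    uint32_t r = (argb >> 16) & 0xFF;
    uint32_t g = (argb >> 8) & 0xFF;
    uint32_t b = argb & 0xFF;
    return static_cast<uint16_t>(((r >> 3) << 11) | ((g >> 2) << 5) | (b >> 3));
}

// Copies the given rect from the back-buffer (system RAM) to the
// front-buffer (standing in for graphics memory). Only writes to front.
void CopyBackToFront(const std::vector<uint32_t>& back,
    std::vector<uint16_t>& front, int width, Rect rect)
{
    assert(back.size() == front.size());
    for (int y = rect.top; y < rect.bottom; y++) {
        for (int x = rect.left; x < rect.right; x++) {
            std::size_t index = static_cast<std::size_t>(y) * width + x;
            front[index] = ToRGB565(back[index]);
        }
    }
}
```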

Inside the DrawingEngine, you will come across code which performs this back-to-front copying. Here is another important bit of information: To avoid flickering, a Window object has the notion of an "update session". Basically it means it collects all "dirty regions". Eventually it will have a chance to tell the client BWindow object (via a back channel which is just regular BMessages being sent to the client BWindow) that it needs to paint these dirty regions. This is all asynchronous of course. Sometime later, the BWindow has become aware that the Window companion in the server wants it to paint. It sends a command "begin update session". The first thing that happens is that Window locks the session's dirty region and tells the BWindow which views are affected. Updates which arrive while a session is ongoing are batched into the next (future) update session. Also, an update session puts the Window drawing code into a special mode: It does not perform back-to-front copying of painted regions. Once the BWindow has finished calling all the Draw() hooks of the affected BViews, it sends a command "end update session". This in turn will then finally trigger the back-to-front copy. This means there is no flickering when BViews draw in Haiku.

However, BViews can draw at any time. An application may just invoke someView->Draw(), or you can even invoke any drawing methods directly. When this happens, the server Window is not inside an update session and the back-to-front copy happens immediately, with flickering and the obvious performance hit. That's why the proper way to redraw a BView is by calling Invalidate() instead of Draw().
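
The update session logic boils down to something like the following sketch. The class and its methods are hypothetical and reduce regions to a list of rects; the real Window code works with proper regions and sends messages to the client. It only shows the batching and the deferred back-to-front copy.

```cpp
#include <cassert>
#include <utility>
#include <vector>

struct Rect { int left, top, right, bottom; };

// Hypothetical, heavily reduced model of the server-side Window.
class UpdateSessionWindow {
public:
    // Collects a dirty rect; never paints by itself.
    void Invalidate(Rect rect) { fPendingDirty.push_back(rect); }

    // "begin update session": lock in what is dirty right now.
    std::vector<Rect> BeginUpdateSession()
    {
        assert(!fInSession);
        fSessionDirty = std::move(fPendingDirty);
        fPendingDirty.clear();   // later invalidations go to the next session
        fInSession = true;
        return fSessionDirty;
    }

    // A drawing command from the client arrives.
    void Draw(Rect rect)
    {
        // (painting into the back-buffer would happen here)
        if (!fInSession)
            CopyToFront(rect);   // outside a session: copy immediately
    }

    // "end update session": one copy per dirty rect, after all drawing.
    void EndUpdateSession()
    {
        assert(fInSession);
        for (const Rect& rect : fSessionDirty)
            CopyToFront(rect);
        fSessionDirty.clear();
        fInSession = false;
    }

    int CopiesToFront() const { return fCopiesToFront; }

private:
    void CopyToFront(Rect) { ++fCopiesToFront; }

    std::vector<Rect> fPendingDirty;
    std::vector<Rect> fSessionDirty;
    bool fInSession = false;
    int fCopiesToFront = 0;
};
```

Inside a session, any number of Draw() calls leads to a single copy at the end, while each Draw() outside of a session causes a copy of its own.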

So how do you implement compositing now? Basically, you would want to give each Window object currently on screen its own private frame buffer. The Painter objects are then each attached to a frame buffer of their own. Nothing would /need/ to change except that the coordinate conversion is done for the window coordinate system rather than the screen system. The buffer allocation management can be simple at first.
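
To illustrate the change in coordinate conversion, here is a small sketch. The types and function names are invented for this example. With the shared buffer, a view-local point has to end up in screen space before it is turned into a pixel address; with a private window buffer, the conversion stops at window space.

```cpp
#include <cassert>
#include <cstddef>

struct Point { int x, y; };

// Hypothetical description of a buffer a Painter is attached to.
struct PixelBuffer {
    int width;   // pixels per row

    // Index of a pixel relative to the buffer's top-left pixel.
    std::size_t IndexOf(Point p) const
    {
        assert(p.x >= 0 && p.x < width && p.y >= 0);
        return static_cast<std::size_t>(p.y) * width + p.x;
    }
};

// View-local coordinates to window coordinates.
Point ViewToWindow(Point p, Point viewOriginInWindow)
{
    return { p.x + viewOriginInWindow.x, p.y + viewOriginInWindow.y };
}

// Window coordinates to screen coordinates (only needed while all
// Painters share the one screen-sized buffer).
Point WindowToScreen(Point p, Point windowOriginOnScreen)
{
    return { p.x + windowOriginOnScreen.x, p.y + windowOriginOnScreen.y };
}
```

For a Painter attached to a private window buffer, the pixel index comes from the result of ViewToWindow() alone, and the window's position on screen only matters to the compositing step.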

As your second step, you need something which actually composes all window buffers into the back-buffer at the right moment and transfers the result into the front-buffer inside the graphics memory. Notice that you absolutely *cannot* get rid of the back-buffer in system RAM. You want to access the graphics memory only in one direction -- writing -- you never want to read from it. Reading the graphics memory is insanely slow, yet reading is exactly what compositing needs to do.
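
A naive version of this second step could look like the sketch below: compose all window buffers bottom-to-top into the back-buffer in system RAM, then push the result to the front-buffer in one write-only pass. Bitmap, WindowLayer, Compose() and CopyToFront() are names made up for this example, and there is no dirty-region tracking or blending yet.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch-only 32 bit bitmap.
struct Bitmap {
    int width, height;
    std::vector<uint32_t> pixels;

    Bitmap(int w, int h, uint32_t fill = 0)
        : width(w), height(h), pixels(static_cast<std::size_t>(w) * h, fill)
    {
        assert(w > 0 && h > 0);
    }

    uint32_t& At(int x, int y)
    { return pixels[static_cast<std::size_t>(y) * width + x]; }
    uint32_t At(int x, int y) const
    { return pixels[static_cast<std::size_t>(y) * width + x]; }
};

// A window's private buffer plus its position on screen.
struct WindowLayer {
    int left, top;
    Bitmap buffer;
};

// Composes all windows (bottom-most first) into the back-buffer.
void Compose(Bitmap& back, const std::vector<WindowLayer>& bottomToTop,
    uint32_t desktopColor)
{
    std::fill(back.pixels.begin(), back.pixels.end(), desktopColor);
    for (const WindowLayer& window : bottomToTop) {
        for (int y = 0; y < window.buffer.height; y++) {
            for (int x = 0; x < window.buffer.width; x++) {
                int screenX = window.left + x;
                int screenY = window.top + y;
                if (screenX < 0 || screenX >= back.width
                    || screenY < 0 || screenY >= back.height)
                    continue;   // clipped by the screen edge
                back.At(screenX, screenY) = window.buffer.At(x, y);
            }
        }
    }
}

// Transfers the composed result; the front-buffer is only written to.
void CopyToFront(const Bitmap& back, Bitmap& front)
{
    assert(back.width == front.width && back.height == front.height);
    front.pixels = back.pixels;
}
```

All reads happen on buffers in system RAM; the front-buffer only ever appears on the left side of an assignment.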

The third step (rather simple, but here for completeness) is to change BDirectWindow so that it now points to the window buffer instead of the frame buffer.

When you trigger the compositing at the right time, you should now have a system which works exactly as before, only with a lot of memory overhead and no other benefits.

Obviously your next steps would then be to realize some benefits from your changes:

 * You would change how dirty regions are triggered when parts of windows are exposed (all this code is in the Desktop class). Obviously exposing parts of a window does not actually need anything to be redrawn anymore, since the exposed part is already fully valid in the private Window frame buffer. So expose events do not need to invalidate client BWindows anymore, but they simply need to trigger an update at the compositing step. This change will be one of the biggest benefits, since it greatly reduces the CPU consumption when windows are moved on screen. You can consider it performance optimization by "caching".


 * The only events that trigger actual redraw would be:
   - When a Window is resized. Note you can copy the valid
     region of the old buffer and limit the dirty region, but
     Views with B_FULL_UPDATE_ON_RESIZE, or views which follow
     the right/bottom edge of their parents still invalidate parts
     of the new buffer that you could copy from the old... the code
     already does this, so nothing needs to change.
   - When the client requests a redraw via BView::Invalidate().
   - When the View hierarchy changes.
   - When other properties change, like the decorator look.
   (All this already happens, no need to change anything except to
    avoid triggering the wrong kind of update when windows move.)

 * Your next chance of introducing some benefit is by using an alpha
   channel for Windows. Giving them a drop shadow would be nice.
   Compositing these in software may be fast enough, you just freed
   up some time by caching window contents.
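
For the alpha channel idea, the per-pixel work in a software compositor is roughly the following. This is a minimal sketch of the usual "source over" blend for non-premultiplied 0xAARRGGBB pixels on an opaque destination; the function names are my own, and a real implementation would work on whole spans and probably use premultiplied alpha.

```cpp
#include <cassert>
#include <cstdint>

// Blends one 8 bit channel of src over dst with the given alpha, rounded.
inline uint32_t BlendChannel(uint32_t src, uint32_t dst, uint32_t alpha)
{
    return (src * alpha + dst * (255 - alpha) + 127) / 255;
}

// "Source over" for a non-premultiplied src pixel on an opaque dst pixel.
uint32_t BlendOver(uint32_t src, uint32_t dst)
{
    assert((dst >> 24) == 0xFF);   // this sketch expects an opaque destination
    uint32_t alpha = src >> 24;
    uint32_t r = BlendChannel((src >> 16) & 0xFF, (dst >> 16) & 0xFF, alpha);
    uint32_t g = BlendChannel((src >> 8) & 0xFF, (dst >> 8) & 0xFF, alpha);
    uint32_t b = BlendChannel(src & 0xFF, dst & 0xFF, alpha);
    return 0xFF000000u | (r << 16) | (g << 8) | b;
}
```

A drop shadow is then just a mostly transparent black pixel blended over whatever lies below the window, for example BlendOver(0x80000000, 0xFFFFFFFF) for a half-transparent shadow on white.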

Once you have this system in place, you can start thinking about doing the compositing via the graphics card hardware. For this to work you would set up the hardware to pull textures from main memory and compose in the graphics buffer. This allows you to get rid of the back-buffer in system RAM, but only for the situation when hardware acceleration is available. Obviously there is no driver for Haiku yet which has these capabilities. There isn't even an official accelerant API for this new functionality and the requirements of compositing.

There are some nice properties which can be coded into the compositor: It can be locked to the screen refresh rate, and you may cause window painting to happen in separate temporary buffers. The compositor would then compose the dirty parts of the back-buffer at a fixed rate, in its own thread. It requests buffers from each Window to do so, but the buffers would always be clean. When a Window draws, which is undisturbed by the compositor asking for the buffer, it draws into a temporary buffer. When it is done, it switches the new clean buffer for the old clean one by holding the compositor lock for a short time (also telling the compositor to recompose at the same time). This way you can never see dirty parts of any window anymore as a user, and since the compositor is locked to the screen refresh, you see no tearing either.
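
The buffer switching could be sketched like this. All names here (Compositor, BufferedWindow, Redraw(), CleanBuffer()) are hypothetical, and handing out the buffer as a std::shared_ptr is just one way to keep the old buffer alive while the compositor may still be reading from it.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <mutex>
#include <utility>
#include <vector>

using Buffer = std::vector<uint32_t>;

// Hypothetical compositor state shared with all windows.
struct Compositor {
    std::mutex lock;
    bool needsRecompose = false;   // protected by lock
};

class BufferedWindow {
public:
    BufferedWindow(Compositor& compositor, std::size_t pixelCount)
        : fCompositor(compositor),
          fClean(std::make_shared<Buffer>(pixelCount, 0))
    {
        assert(pixelCount > 0);
    }

    // Window thread: paint into a temporary buffer without holding the
    // compositor lock, then switch buffers while holding it briefly.
    template<typename DrawFunc>
    void Redraw(DrawFunc draw)
    {
        auto scratch = std::make_shared<Buffer>(*fClean);
        draw(*scratch);   // may take long, the compositor is not blocked

        std::lock_guard<std::mutex> guard(fCompositor.lock);
        fClean = std::move(scratch);
        fCompositor.needsRecompose = true;
    }

    // Compositor thread: call with the compositor lock held. The returned
    // buffer is never modified again, so it is always fully painted.
    std::shared_ptr<const Buffer> CleanBuffer() const { return fClean; }

private:
    Compositor& fCompositor;
    std::shared_ptr<Buffer> fClean;
};
```

The lock is only held for the pointer switch, never while painting, so a slow Draw() hook cannot stall the compositor.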


Hope this helps,
-Stephan

