[opendtv] Re: Precision

  • From: Craig Birkmaier <craig@xxxxxxxxx>
  • To: opendtv@xxxxxxxxxxxxx
  • Date: Thu, 31 May 2007 09:29:54 -0400

At 4:14 PM -0400 5/28/07, Tom Barry wrote:
I'd prefer to think of practical (lossy) compression as removing:

1) Redundancy, as you stated

2) Information we don't care about enough to encode, say very high frequency information, and

Careful here. You are treading on thin ice, and some of the 1080P zealots may become upset with your logic.

Very high frequency information may be important as the screen size increases. On smaller screens these details are too small to see, even if they make it through the emission channel. As we increase the screen size, however, we may need these details to deliver what appears as a sharp picture.

Admittedly, these details are the first thing that gets quantized away when we compress for emission, but this does not make them less important to "some" viewers. A viewer with a 27" screen probably does not even care about HDTV - they just want a clean sharp SDTV picture.

For the vast majority of HDTV owners, 720P is sufficient to deliver sharp pictures, especially when the high frequency information is DELIVERED, not quantized away.

But those who have gone to the expense of building a really BIG SCREEN home theater system with 1080P resolution want the extra detail, and they DON'T want no stinking compression artifacts.

And don't bother trying to tell them that MOST OF THE TIME this extra detail will be left on the "compression room" floor. They bought a 1080P display and they know that compression technology is improving, so don't waste your effort trying to explain how stuff "really" works.

3) Related, but not the same as 2), information we don't know about or don't trust. This is information that was captured, but not reliably due to sampling error, noise, whatever. There is a point of diminishing returns on how many bits we can afford to spend encoding unreliable samples or extra bit depth once these things become lost in the noise.

Yup. Entropy is a bitch.

As Mark pointed out, however, it is an integral part of the sampling process. Most of the extra information in 1080P versus 720P lies in that borderline area where noise starts to compromise sample integrity (not to mention that most of these details are captured with limited contrast due to MTF considerations - that is, they are already inaccurate, though hopefully just attenuated, not completely wrong).

Cameras are already designed to deal with some of this. The designers know the frequencies at which useful details can be captured, and those above which noise makes the samples highly unreliable. So they design the cameras and downstream processing gear to smoothly roll off the response in the region where some useful details are captured, and to cut off everything above a certain frequency. Without the benefits of oversampling, the frequency response of a 1920 x 1080 camera extends only to about 22-24 MHz before noise overwhelms the high frequency details. Most cameras start to roll off the response just above 20 MHz.
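For what it's worth, here is a minimal numpy sketch of that kind of smooth roll-off - a windowed-sinc low-pass evaluated against a nominal 74.25 MHz HD luma sample clock. The ~20 MHz corner and the filter length are chosen purely for illustration, not taken from any real camera design:

import numpy as np

# Sketch of a smooth low-pass roll-off like the one described above.
# Assumptions (illustrative only): 74.25 MHz luma sample clock, response
# held flat to roughly 20 MHz, then rolled off well before Nyquist.
fs = 74.25e6          # nominal HD luma sample rate, Hz
cutoff = 20.0e6       # start of the roll-off region, Hz (assumed)
numtaps = 63          # filter length (assumed)

# Windowed-sinc FIR low-pass (Hamming window), normalized to unity gain at DC.
n = np.arange(numtaps) - (numtaps - 1) / 2.0
h = np.sinc(2.0 * cutoff / fs * n) * np.hamming(numtaps)
h /= h.sum()

# Inspect the magnitude response: flat through the useful detail,
# smoothly attenuating the noisy region above the corner.
H = np.abs(np.fft.rfft(h, 2048))
freqs = np.fft.rfftfreq(2048, d=1.0 / fs)
for f_mhz in (10, 20, 25, 30):
    i = np.argmin(np.abs(freqs - f_mhz * 1e6))
    print("%2d MHz: %6.1f dB" % (f_mhz, 20.0 * np.log10(H[i])))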


The best way to minimize the impact of noise and sampling dither is to OVERSAMPLE. When we resample to a smaller raster we filter out much of the entropy, improving the precision of all of the higher frequency details that are left. We also improve the contrast, which contributes to the perception of a sharper picture.
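A minimal numpy sketch of the noise side of that argument, under some obvious simplifications (uncorrelated Gaussian noise, a plain 2x2 box filter, a 2:1 resample rather than the exact 1080-to-720 ratio):

import numpy as np

rng = np.random.default_rng(0)

# A clean raster plus uncorrelated sensor noise (sigma assumed = 4 code values).
clean = rng.uniform(16, 235, size=(1080, 1920))
noisy = clean + rng.normal(0.0, 4.0, size=clean.shape)

# Filter-and-decimate by 2 in each direction with a simple 2x2 box average.
# (A real downconverter would use a better low-pass kernel than a box.)
def box_downsample(img):
    return (img[0::2, 0::2] + img[1::2, 0::2] +
            img[0::2, 1::2] + img[1::2, 1::2]) / 4.0

# Averaging N uncorrelated samples scales the noise by 1/sqrt(N).
print("noise std before:", np.std(noisy - clean))                                   # ~4.0
print("noise std after :", np.std(box_downsample(noisy) - box_downsample(clean)))   # ~2.0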


And then we need to acknowledge real world encoding practices, and the techniques that are used when the peak bit rate requirements exceed the channel bandwidth. We can:

Let the encoder do the best it can, replacing real image detail with quantization noise and, in the extreme, blocking artifacts.

OR

Pre-filter the source to reduce the amount of high frequency detail that is presented to the encoder. This is more insidious than resampling to a lower resolution for emission, as we reduce the information but keep all of the encoding overhead of the higher resolution format. I have heard experts say that for the 1080 line formats, up to half of the available bits can be consumed just by the transmission of motion vectors when the encoder is stressed.
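Here is a back-of-the-envelope sketch of the overhead point, assuming classic 16x16 macroblocks and at least one motion vector per coded block (H.264 can partition blocks further, so this understates the gap): a pre-filtered 1080 line frame still has to signal every block and vector of the full raster, while a 720 line downconvert signals far fewer.

# Back-of-the-envelope overhead comparison. Assumes classic 16x16
# macroblocks and at least one motion vector per coded block.
def macroblocks(width, height, block=16):
    # ceiling division: 1080 lines are coded as 68 block rows (1088 lines)
    return (-(-width // block)) * (-(-height // block))

mb_1080 = macroblocks(1920, 1080)   # 120 * 68 = 8160
mb_720  = macroblocks(1280, 720)    #  80 * 45 = 3600

print("macroblocks per 1080-line frame:", mb_1080)
print("macroblocks per  720-line frame:", mb_720)
print("pre-filtered 1080 still signals %.2fx the blocks and vectors of a 720 downconvert"
      % (mb_1080 / float(mb_720)))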

Once again, however, it is important to recognize that a small percentage of viewers have 1080P displays that need this extra detail. So we need to at least pretend that this stuff is making it through the emission channel to keep them satisfied. Who cares if the program breaks up into blocks on occasion and the delivered resolution modulates with the encoding stress...

At least we are delivering the very best HDTV possible...

;-(


For 2) and 3) however it may be best to not filter or discard them but instead allow the encoder to opportunistically choose whichever values happen to encode nicest. That's one of the reasons I think capturing at much higher bit depths and allowing encoders to quantize them away seems, in some tests, to work more efficiently than some might predict.


I've never seen such a test. Unfortunately, encoders are not smart enough to encode what looks nicest - even with H.264. They run algorithms that are VERY limited in terms of the decisions that can be made. H.264 introduced two features that help a little - one improves the decisions, the other masks the mistakes. The frequency domain transform can select from a range of choices based on the content of a transform block; it can be weighted for increased H or V detail and to deal with several types of gradients. The deblocking filter helps to mask the artifacts when a block is over-quantized.

The reality of how compression works is that there is a compression range where only the highest frequencies are quantized away; the actual samples are replaced with correlated noise. As long as we operate in this range, the pictures look very good. But there are several issues that cause severe problems.
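As a toy illustration of that operating range, here is an orthonormal 8x8 DCT in plain numpy; the block content and the frequency-dependent quantization ladder are made up for the example, and real codecs use integer transforms and standard matrices:

import numpy as np

def dct_basis(n=8):
    # Orthonormal DCT-II matrix: forward Y = C @ X @ C.T, inverse X = C.T @ Y @ C
    k, j = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * j + 1) * k / (2.0 * n))
    C[0, :] = np.sqrt(1.0 / n)
    return C

C = dct_basis()
rng = np.random.default_rng(1)

# An 8x8 block of "easy" content: a gentle gradient plus a little fine texture.
x = np.linspace(100, 140, 8)[None, :] * np.ones((8, 1)) + rng.normal(0, 2, (8, 8))

# Quantization step that grows with spatial frequency (illustrative ladder only).
u, v = np.meshgrid(np.arange(8), np.arange(8), indexing="ij")
qstep = 2 + 3 * (u + v)

coeff = C @ x @ C.T
coeff_q = np.round(coeff / qstep) * qstep
x_hat = C.T @ coeff_q @ C

# The low-frequency structure survives; the error is small and noise-like.
print("coefficients zeroed:", int(np.sum((coeff_q == 0) & (coeff != 0))))
print("max error:", float(np.max(np.abs(x - x_hat))))
print("rms error:", float(np.sqrt(np.mean((x - x_hat) ** 2))))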

One is high frequency edge information, which we recently established as being critical to human visual perception. Unfortunately, when an encoding block contains this edge information, the transform typically produces a long run of significant coefficients to represent the edge. When we quantize these coefficients we introduce distortions in the edge, which are typically seen as ringing or noise around the edge. This is particularly bothersome for text overlays. We simply cannot quantize too much, or the distortions become a major problem - like black pixels adjacent to white pixels - which violates sampling theory and in turn makes the artifacts easier to detect.
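And the same toy transform applied to a block containing a hard edge shows the ringing problem; again the coarse quantization step is an illustrative assumption, not any codec's actual behavior:

import numpy as np

def dct_basis(n=8):
    # Same orthonormal DCT-II matrix as in the previous sketch.
    k, j = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * j + 1) * k / (2.0 * n))
    C[0, :] = np.sqrt(1.0 / n)
    return C

C = dct_basis()

# An 8x8 block with a hard vertical edge: a white field meeting a dark stroke.
edge = np.full((8, 8), 235.0)
edge[:, 4:] = 16.0

# The edge spreads energy across many horizontal-frequency coefficients.
coeff = C @ edge @ C.T

# Quantize everything coarsely, as a stressed encoder would (step is assumed).
qstep = 40.0
rec = C.T @ (np.round(coeff / qstep) * qstep) @ C

# The reconstructed rows over- and undershoot around the edge: ringing.
print("original row:     ", edge[0].round(1))
print("reconstructed row:", rec[0].round(1))
print("overshoot above 235:", round(float(rec.max() - 235.0), 1))
print("undershoot below 16:", round(float(rec.min() - 16.0), 1))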

Adding bit depth allows us to limit the range of sampling dither. Adding resolution allows us to localize the impact of the frequency transform. But both of these things increase the overhead that must be dedicated to delivering the compressed pictures - i.e. more encoding blocks, more motion vectors, more bits per sample, etc.
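To put rough numbers on that overhead, here is some straight arithmetic on uncompressed 4:2:0 payloads; the 60 Hz progressive frame rate is an assumption for the comparison:

# Uncompressed 4:2:0 payload (luma plus two quarter-rate chroma planes = 1.5
# samples per pixel), as a rough proxy for how much work the encoder faces.
def raw_mbps(width, height, fps, bits_per_sample):
    samples_per_sec = width * height * 1.5 * fps
    return samples_per_sec * bits_per_sample / 1e6

for label, w, h, bits in (("720p/8-bit",   1280,  720,  8),
                          ("1080p/8-bit",  1920, 1080,  8),
                          ("1080p/10-bit", 1920, 1080, 10)):
    print("%-13s %6.0f Mb/s uncompressed" % (label, raw_mbps(w, h, 60, bits)))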

The BEST way to deal with this is to resample to a lower resolution raster. We get more accurate samples - less entropy - and there is less encoding overhead, which translates into higher quality samples delivered to the decoder.

Regards
Craig

