[opendtv] Re: Time to give up on 1080i for football

  • From: Craig Birkmaier <craig@xxxxxxxxx>
  • To: opendtv@xxxxxxxxxxxxx
  • Date: Wed, 9 Dec 2009 07:41:34 -0500

At 3:19 PM -0500 12/8/09, John Shutt wrote:
True, the bit rates are capped so that the maximum bitrate of any one channel is the total ATSC video payload (for us about 16.7 Mbps) minus the minimum bitrates of the other three streams. That results in bitrate constraints of 7 Mbps to 14.4 Mbps for the HD service, and 800 Kbps to 7.7 Mbps for each of the three SD services.

Minimum bitrates are determined by the vintage of our Tandberg encoders. Bits are of course further divvied up between encoders by weighting in the stat mux.


Following up on my previous encoding post.

The major reason that VBR and stat muxing work so well is that the information content of any video source is constantly changing. Rapid motion can cause spikes in information content, but many other aspects of the imagery can drive the bit rate up as well.
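
The arithmetic behind the mux is easy to sketch. Here is a toy statistical multiplexer in Python (the payload, floors, and ceilings are the figures John quotes above; the per-encoder complexity numbers are invented for illustration):

# Toy statistical multiplexer: split a fixed video payload among encoders
# in proportion to the complexity each one reports for the current frame,
# then clamp each share to that service's floor and ceiling. (A real mux
# iterates to hand back any bits freed or borrowed by the clamping; this
# sketch skips that step.) All figures are in Mbps.

def stat_mux(total_mbps, services):
    total_complexity = sum(s["complexity"] for s in services)
    shares = [total_mbps * s["complexity"] / total_complexity for s in services]
    return [min(max(share, s["min"]), s["max"]) for share, s in zip(shares, services)]

# One HD service carrying fast motion plus three quiet SD services.
services = [
    {"name": "HD",  "complexity": 9.0, "min": 7.0, "max": 14.4},
    {"name": "SD1", "complexity": 1.0, "min": 0.8, "max": 7.7},
    {"name": "SD2", "complexity": 0.5, "min": 0.8, "max": 7.7},
    {"name": "SD3", "complexity": 0.5, "min": 0.8, "max": 7.7},
]

for s, mbps in zip(services, stat_mux(16.7, services)):
    print(f"{s['name']}: {mbps:.2f} Mbps")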

There is a misguided assumption that as the resolution of the capture device (camera) increases, the information content of the captured scenes also increases. This is only true IF the scene the camera is pointed at contains high-frequency detail that would be filtered out by a lower-resolution camera. And yes, thanks to sampling theory, every camera employs filters to limit the highest frequencies that reach the image sensor.
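
A one-dimensional sketch of what happens without that filtering (NumPy, made-up signal): detail above half the sampling rate does not simply disappear, it aliases into false low-frequency structure, whereas even a crude low-pass before sampling leaves something close to the real scene.

import numpy as np

# A 1-D "scene" on a fine grid: a slow ramp plus detail at a spatial frequency
# (900 cycles) that a coarse 512-sample "sensor" cannot represent - its Nyquist
# limit is 256 cycles.
fine, coarse = 8192, 512
step = fine // coarse
x = np.arange(fine) / fine
scene = x + 0.3 * np.sin(2 * np.pi * 900 * x)

naive = scene[::step]                              # point sampling: the detail aliases
prefiltered = scene.reshape(coarse, step).mean(1)  # crude box low-pass, then sample

ramp = x[::step]
print("max error vs. underlying ramp, no prefilter :", round(np.abs(naive - ramp).max(), 3))
print("max error vs. underlying ramp, prefiltered  :", round(np.abs(prefiltered - ramp).max(), 3))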

The major reason that SD-DVD works so well for movies is that cinematographers go out of their way to limit resolution when capturing the images. Depth of field is one of the major tools used to limit resolution, leaving significant areas of the typical cinema frame blurred or limited in detail. Likewise, because of the low taking frame rate, motion blur is essential to prevent motion discontinuity, and cinematographers are highly skilled at controlling camera motion to limit or prevent motion artifacts. Bottom line: 24P is NOT about high-def detail - it is used to create the look and feel that Hollywood strives for, which conveniently limits resolution, minimizing the additional detail that can be delivered via Blu-ray.

There is another aspect of image capture that can make the encoding of High Def material more difficult.

NOISE and sampling errors.

Noise is the enemy of entropy coding because it is random and cannot be predicted - by the way, the same is true for those Hollywood types who think it is important to capture every detail of the film grain.
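
A quick way to see this, with a general-purpose lossless coder (zlib) standing in for a real entropy coder and purely synthetic data: the same "image" compresses far worse once random noise - a stand-in for sensor noise or grain - is added, because the noise has no structure for the coder to exploit.

import zlib
import numpy as np

rng = np.random.default_rng(0)

# A smooth synthetic 8-bit "image": a plain horizontal gradient.
h, w = 480, 640
clean = np.tile(np.linspace(0, 255, w), (h, 1)).astype(np.uint8)

# The same image with mild random noise added (a stand-in for sensor noise
# or film grain).
noisy = np.clip(clean + rng.normal(0, 8, clean.shape), 0, 255).astype(np.uint8)

for label, img in (("clean", clean), ("noisy", noisy)):
    compressed = zlib.compress(img.tobytes(), level=9)
    print(f"{label}: {img.size:6d} bytes raw -> {len(compressed):6d} bytes compressed")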

IF the optics are held constant, as camera resolution increases, the number of photons hitting each sensor site decreases - this directly impacts the noise floor for the capture device. So while on one hand the accuracy of each sample may improve as camera resolution increases, the potential for noise and sampling errors also increases. This is why oversampling is so important, as it allows us to literally filter out entropy before the source gets to the encoder.
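
A companion sketch of the oversampling point (again synthetic numbers, a 2x oversampled capture): averaging each 2x2 block of the noisy, oversampled capture down to the delivery resolution cuts the random component roughly in half before the encoder ever sees it.

import numpy as np

rng = np.random.default_rng(1)

# "True" scene at the delivery resolution, and a 2x-oversampled noisy capture.
scene = np.tile(np.linspace(0, 255, 640), (480, 1))
up = np.kron(scene, np.ones((2, 2)))        # each pixel becomes a 2x2 block
capture = up + rng.normal(0, 10, up.shape)  # add sensor noise at capture resolution

# Downsample by averaging each 2x2 block back to the delivery resolution.
down = capture.reshape(480, 2, 640, 2).mean(axis=(1, 3))

print("noise std at capture resolution :", round((capture - up).std(), 2))
print("noise std after 2x2 averaging   :", round((down - scene).std(), 2))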

Dan noted that entropy encoding algorithms are very complex, and that processing power limits the complexity of the algorithms that can be applied. The most difficult aspect of entropy coding is motion compensated prediction. As the accuracy of the P and B frame predictions improves, the differences that must be encoded decrease. When you see significant coding noise and/or blocking artifacts, the main problem is that the predictions are poor, which causes the difference information to overwhelm the available channel capacity.
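
To make the prediction point concrete, here is a toy block-matching search over synthetic frames - a pure translation that the search can find exactly, which real encoders are rarely lucky enough to get. The better the match the predictor finds, the smaller the residual the entropy coder has to carry.

import numpy as np

rng = np.random.default_rng(2)

# Previous frame: random texture. Current frame: the same texture shifted by
# (3, 5) pixels - a pure translation that a motion search can recover.
prev = rng.integers(0, 256, (64, 64)).astype(np.int16)
curr = np.roll(prev, (3, 5), axis=(0, 1))

def best_match(block, ref, top, left, radius=7):
    """Exhaustive search for the 16x16 reference block with the lowest
    sum of absolute differences (SAD) within +/- radius pixels."""
    best = None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            if 0 <= y <= ref.shape[0] - 16 and 0 <= x <= ref.shape[1] - 16:
                sad = int(np.abs(block - ref[y:y + 16, x:x + 16]).sum())
                if best is None or sad < best[0]:
                    best = (sad, dy, dx)
    return best

block = curr[24:40, 24:40]
no_search = int(np.abs(block - prev[24:40, 24:40]).sum())  # co-located "prediction"
sad, dy, dx = best_match(block, prev, 24, 24)

print("residual (SAD), no motion search :", no_search)
print("residual (SAD), best match       :", sad, "at offset", (dy, dx))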

While increased processing power can improve compression algorithms, it is important to note that high quality motion compensated prediction is VERY DIFFICULT. It is not just a matter of tracking motion, but also of dealing with information that may not exist in ANY frame within the encoding GOP. Here are a few of the worst pathological cases:

1. The revealing of information that does not exist in any available prediction frame. For example, a football player twisting and turning reveals portions of his body/uniform that are not seen in other frames.

2. Reflections and plastic deformations - bright surfaces reflect light, creating images that must be encoded. These reflections may be deformed by the shape of the surface. To accurately predict this information you must know what the original scene that is being reflected looks like as well as the geometry of the surface that is causing the reflection. Now add motion to the reflecting surface and you get the idea that predicting this kind of imagery takes massive computing resources.

3. Sudden changes in lighting - strobing lights, camera flashes, etc. can cause short transients that the encoder must deal with (see the sketch after this list).
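
To see why case 3 hurts even a perfect motion search, here is a small sketch (synthetic frames again): a global brightness jump such as a camera flash leaves a residual at every pixel even though nothing in the scene has moved.

import numpy as np

rng = np.random.default_rng(3)

prev = rng.integers(0, 200, (64, 64)).astype(np.int16)
still = prev.copy()   # next frame: nothing moved, no flash
flash = prev + 50     # next frame: same scene lifted by a camera flash

# Even with a perfect (zero-motion) prediction, the flash frame leaves a
# residual at every pixel that the encoder has to spend bits on.
print("residual (SAD), static frame :", int(np.abs(still - prev).sum()))
print("residual (SAD), flash frame  :", int(np.abs(flash - prev).sum()))

(Weighted prediction in H.264 exists in part for fades and global brightness changes like this; plain block matching has to carry the whole difference as residual.)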

The bottom line is that there is still a huge amount of territory to exploit in terms of improving compression efficiency. Most encoding algorithms still use crude block matching techniques rather than true motion compensated prediction to create the prediction frames. The major improvements in H.264 have more to do with improved block matching techniques, including sub-pixel positioning of the blocks being matched. The basic image transform is also more efficient.
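
A small extension of the block-matching idea shows what sub-pixel positioning buys (synthetic 1-D signal; linear interpolation for the half-sample position, which is cruder than the interpolation filters H.264 actually specifies):

import numpy as np

# A smooth 1-D "reference" line and a current line displaced by half a sample.
x = np.arange(256)
ref = 100 * np.sin(x / 7.0)
cur = 100 * np.sin((x - 0.5) / 7.0)

# Full-pel: no integer shift lines up with a half-sample displacement, so the
# co-located samples are about as good as it gets. Half-pel: averaging adjacent
# reference samples approximates the half-sample position and shrinks the residual.
full_pel = np.abs(cur[1:] - ref[1:]).sum()
half_pel = np.abs(cur[1:] - (ref[1:] + ref[:-1]) / 2).sum()

print("residual, full-pel prediction :", round(full_pel))
print("residual, half-pel prediction :", round(half_pel))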

So for the moment, and probably well into the future, we can do more to improve encoding efficiency by improving the quality of the samples presented to the encoder than by improving the encoder itself. This means improved camera designs that minimize noise and reduce sampling errors. It means oversampling to reduce the impact of entropy on the source images. And it means that we should place more emphasis on sample quality than on the number of samples.

Bert tried to use some simple logic - i.e. A is to B as B is to C - to suggest that we can get more information through the channel; specifically, that it might be possible to encode HD as efficiently as SD.

I have a similar bit of logic relative to encoding.

In a bit-rate-constrained channel there is a maximum amount of information that can be carried. If we stress the channel by trying to carry more information than it can hold, we start to reduce the quality of the information that reaches the decoder.

Thus it is not only possible, but highly likely, that in a bit-rate-constrained channel a high-quality 480P encoding may deliver higher quality on an HD screen than an over-compressed HD version of the same source.
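
Some back-of-the-envelope arithmetic makes the point. The raster sizes and frame rates below are the nominal ATSC ones; the 12 Mbps video payload is an assumed figure for illustration only.

# Bits available per pixel at a fixed channel rate.
channel_bps = 12_000_000

formats = {
    "480P60":  (704, 480, 60),
    "720P60":  (1280, 720, 60),
    "1080i30": (1920, 1080, 30),   # 30 full frames/s, delivered as 60 fields
}

for name, (w, h, fps) in formats.items():
    print(f"{name:8s}: {channel_bps / (w * h * fps):.3f} bits per pixel")

At the same channel rate, the SD raster gets roughly three times the bits per pixel of the 1080-line raster, which is why a clean 480P encode can survive upconversion better than a starved HD one.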

This is today's reality, and the main reason why services like iTunes are focused on sample quality rather than sample quantity.

Regards
Craig

