T

Maxime Ripard e7c09a336f h264: Implement local cache of the latest decoded pictures

The libva only provides the reference images needed to decode the current
picture, but not the full DPB. However, some codecs need that whole DPB in
order to decode a picture.

For example, the Allwinner hardware codec has an internal SRAM, with each
picture getting a slot in that SRAM, and during each decoding process, some
metadata will then be generated from that SRAM content to a separate
buffer. Therefore, each frames must be located at the same SRAM position
each time so that the metadata are then re-used properly.

However, since libva will only pass a few reference images, we can end up
in a situation where multiple, subsequent, frames will have the same
reference images set, but might all be used as reference later on and
cannot therefore be located at the same position.

And from a more theorical point of view, Linux expects a full blown DPB in
its H264 control.

In order to work around this, we can create a shadow of the DPB by simply
maintaining a list of 16 decoded images, each associated with their
VAPictureH264 and an age. This age is the last time we used that frame as
reference. When a new picture is decoded, either we assign it to a free
slot, or we reuse the slot from the frame that hasn't been used as a
reference for the longest time.

This is a much simpler approach than the one documented in the H264 spec,
but this shouldn't really be a problem since we don't handle the reference
frames ourselves, but just re-use the one from the libva, and taken from
the bitstream before. As such, frames that are not supposed to be used for
reference will not be anymore, their age will not increase, and therefore
after a while we will garbage-collect their slot to store a much newer
frame.

Signed-off-by: Maxime Ripard <maxime.ripard@bootlin.com>

2018-07-17 15:30:33 +02:00

src

h264: Implement local cache of the latest decoded pictures

2018-07-17 15:30:33 +02:00

.clang-format

tree: Run clang-format to conform to the kernel coding style

2018-07-17 10:12:15 +02:00

.gitignore

Rename va_config to config for consistency

2018-04-23 17:09:19 +02:00

AUTHORS

Adds a sunxi-cedrus-drv-video libVA backend

2016-08-25 16:19:34 +02:00

autogen.sh

Adds a sunxi-cedrus-drv-video libVA backend

2016-08-25 16:19:34 +02:00

configure.ac

Remove X11 support with vaPutSurface

2018-06-21 09:30:06 +02:00

COPYING

COPYING: Reformulate and make more concise

2018-04-23 15:52:03 +02:00

COPYING.LGPL

Clarify licenses text

2016-08-26 15:43:09 +02:00

COPYING.MIT

Clarify licenses text

2016-08-26 15:43:09 +02:00

Makefile.am

Cleanup automake files

2018-04-23 17:31:57 +02:00

README.md

Rework comments by splitting them into README and removing redundant ones

2018-04-23 16:40:00 +02:00

README.md

Sunxi-Cedrus libVA Backend

About

This libVA backend is designed to work with the Sunxi-Cedrus VPU kernel driver, that supports the Video Engine found in most Allwinner SoCs.

Status

The Sunxi-Cedrus libVA backend currently only supports MPEG2 video decoding.

Instructions

In order to use the Sunxi-Cedrus libVA backend, the sunxi_cedrus driver has to be specified through the LIBVA_DRIVER_NAME environment variable, as such:

export LIBVA_DRIVER_NAME=sunxi_cedrus

A media player that supports VAAPI (such as VLC) can then be used to decode a video in a supported format:

vlc path/to/video.mpg

Sample media files can be obtained from:

http://samplemedia.linaro.org/MPEG2/
http://samplemedia.linaro.org/MPEG4/SVT/

Technical Notes

Surface

A Surface is an internal data structure never handled by the VA's user containing the output of a rendering. Usualy, a bunch of surfaces are created at the begining of decoding and they are then used alternatively. When created, a surface is assigned a corresponding v4l capture buffer and it is kept until the end of decoding. Syncing a surface waits for the v4l buffer to be available and then dequeue it.

Note: since a Surface is kept private from the VA's user, it can ask to directly render a Surface on screen in an X Drawable. Some kind of implementation is available in PutSurface but this is only for development purpose.

Context

A Context is a global data structure used for rendering a video of a certain format. When a context is created, input buffers are created and v4l's output (which is the compressed data input queue, since capture is the real output) format is set.

Picture

A Picture is an encoded input frame made of several buffers. A single input can contain slice data, headers and IQ matrix. Each Picture is assigned a request ID when created and each corresponding buffer might be turned into a v4l buffers or extended control when rendered. Finally they are submitted to kernel space when reaching EndPicture.

The real rendering is done in EndPicture instead of RenderPicture because the v4l2 driver expects to have the full corresponding extended control when a buffer is queued and we don't know in which order the different RenderPicture will be called.

Image

An Image is a standard data structure containing rendered frames in a usable pixel format. Here we only use NV12 buffers which are converted from sunxi's proprietary tiled pixel format with tiled_yuv when deriving an Image from a Surface.

Languages

C 96.3%

Shell 1.9%

Meson 0.8%

Assembly 0.4%

Makefile 0.4%

Other 0.2%