
NOVEL METHODS FOR VIDEO COMPRESSION

TECHNIQUES SURVEY PAPER


Nicole Verdugo Paul Andrade
Escuela Superior Politécnica del Litoral

1 Abstract
Compression techniques essentially reduce the amount of image data; a digitized analog video sequence can require up to 165 Mbps. Many of the compression techniques known to date are therefore employed to achieve desirable reductions in image data, color resolution and other aspects. This paper first surveys a video compression technique that uses variable block sizes in the motion estimation process, which operates on consecutive images. Image-based compression techniques compress and evaluate only one frame at a time, whereas video compression techniques compare different frames in order to reduce the amount of image data. High dynamic range (HDR) imaging and video have also become more common in the electronics field, as they allow people to view a picture or a video with minimal eye adaptation. HDR can capture the full range of light in a frame, unlike other methods that capture only 8 f-stops. Although HDR video compression techniques are still under continuous development, this paper focuses on the shortcomings of the present method of HDR video coding and on a novel two-layer backward-compatible video encoding method, accompanied by chroma subsampling and luminance masking, which improves color accuracy while removing the non-visible portion of the video.

2 Introduction
The first method surveyed is the use of variable block sizes in the motion estimation process. It is well known that video and images play an important role in conveying visual information. Among all the information managed by telecommunication networks, visual information is the most appealing to humans because it carries a significant amount of the message, and it plays an important role in real telecommunication services such as video conferencing and video telephony. Digital systems give better performance than conventional analog systems, which is why image and video communication is becoming digital. This has opened a path for computers to process video and images: digitization has made it easier to manipulate and modify images and videos, and to extract the necessary information and then encode it. With the increasing number of services, more information must be transmitted over channels with limited capacity, and the challenge of reducing storage and transmission bandwidth requirements has led to the development of different video compression techniques.

Dynamic range refers to the ratio between the darkest and the brightest parts of a scene. The human eye can see only approximately 20 f-stops, and the traditional dynamic range of digital cameras reaches only 8 f-stops of light. HDR imaging was developed to enhance the quality of the content. The proposed method uses a two-layer backward-compatible video encoding method, accompanied by chroma subsampling and luminance masking, for HDR video content in order to improve the quality, preserve the luminance and avoid the non-visible portions of the content.

3 Variable block size in motion estimation process

3.1 Media and Compression
The mathematical foundation of compression was laid years ago, but it became more relevant with technological advances and the need to store larger amounts of data. The first compression schemes were lossless, meaning that the compressed data is fully recovered after decompression; with textual data there can be no tolerance for error. Lossless compression schemes generally provide ratios of 2:1 to 4:1. Newer media formats, however, are different and more effective, and the techniques they use are not all lossless: for instance, motion video can be compressed at 30:1, color images at 15:1 and stereo sound at 6:1. Some loss is acceptable for motion video because the human eye exhibits a certain tolerance for differences within a frame and between frames, and is not strongly affected by the subtle differences that may result from data loss when a video is compressed. With these advances, the problem lies in making compression techniques more efficient and practical. The principle of data reduction is the removal of data redundancy.

3.2 Video Formats
A video is a three-dimensional array of color pixels: two dimensions serve as the spatial directions of the moving picture and the remaining dimension represents the time domain. A frame is a picture consisting of macro blocks, each of which is usually composed of two or more blocks of pixels. A pixel value is represented by luminance (Y) and chrominance (U, V) parameters: luminance is the brightness of the pixel and chrominance is its color. The human visual system is less sensitive to color than to luminance, and three sampling patterns that exploit this are used in video formats.

4:4:4 sampling means that the three components have the same resolution.

4:2:2 sampling means that the chrominance components have the same vertical resolution as the luma but half the horizontal resolution; it is used for high-quality color reproduction.

4:2:0 sampling means that Cb and Cr each have half the horizontal and half the vertical resolution of Y.

3.3 Video Encoding Technique
Video compression employs both intra-frame and inter-frame coding techniques. The main challenge in video compression is the efficient reduction of temporal redundancy. Some of the compression methods applicable to video are entropy coding, quantization, sub-sampling, interpolation, predictive coding, and motion estimation/compensation.
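As an illustration of the sampling patterns described in Section 3.2, the following minimal Python sketch computes the plane sizes and the number of samples per frame for each pattern, and subsamples one chroma plane to 4:2:0. The function names and the choice of a 2x2 average as the downsampling filter are assumptions made for this example; they are not taken from any standard or codec library.

```python
# Hypothetical helpers for illustration only.
# Each pattern maps to (horizontal divisor, vertical divisor) of the chroma planes.
SAMPLING = {
    "4:4:4": (1, 1),  # chroma at full resolution
    "4:2:2": (2, 1),  # half horizontal, full vertical resolution
    "4:2:0": (2, 2),  # half horizontal, half vertical resolution
}


def plane_sizes(width, height, pattern):
    """Return ((Y width, Y height), (chroma width, chroma height))."""
    h_div, v_div = SAMPLING[pattern]
    return (width, height), (width // h_div, height // v_div)


def samples_per_frame(width, height, pattern):
    """Total number of samples in one frame: one Y plane plus two chroma planes."""
    (yw, yh), (cw, ch) = plane_sizes(width, height, pattern)
    return yw * yh + 2 * cw * ch


def subsample_420(plane):
    """Reduce a chroma plane (list of rows, even dimensions) by averaging 2x2 blocks.

    Averaging is an assumption; real encoders may use other filters.
    """
    out = []
    for r in range(0, len(plane), 2):
        row = []
        for c in range(0, len(plane[0]), 2):
            total = (plane[r][c] + plane[r][c + 1]
                     + plane[r + 1][c] + plane[r + 1][c + 1])
            row.append(total // 4)
        out.append(row)
    return out


if __name__ == "__main__":
    for pattern in SAMPLING:
        print(pattern, samples_per_frame(1920, 1080, pattern))
```

For a given frame size, 4:2:0 carries half as many samples as 4:4:4 and 4:2:2 carries two thirds as many, which is where the saving from chroma subsampling comes from before any further coding is applied.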
3.4 Model for Video Compression

The algorithms and techniques discussed in the previous section are combined to form a practical video compression system. In most video codecs, the DCT is used to remove intra-frame redundancy, while motion estimation/compensation takes care of the inter-frame redundancy.

3.5 Video Quality Measurement

The most widely used measure is the Peak Signal to Noise Ratio (PSNR), which is measured on a logarithmic scale and depends on the mean squared error (MSE) of an image or video frame relative to $(2^n - 1)^2$, the square of the highest possible signal value in an image, where n is the number of bits per image sample:

$$\mathrm{PSNR} = 10 \log_{10} \frac{(2^n - 1)^2}{\mathrm{MSE}}$$

A higher PSNR implies a better-quality image, while a lower PSNR means that the image is less crisp, noisier and/or contains more artefacts.

3.6 Inter Frame Prediction Compression Technique

Motion estimation of a macro block involves finding a 16 × 16-sample region in a reference frame that closely matches the current macro block. The output of the motion estimation process is the motion vector, that is, the difference between the coordinates of the current macro block and those of the best match in the reference frame.
Moving objects in a video rarely follow 'neat' 16x16-pixel boundaries, so it may be more efficient to use a variable block size for motion estimation and compensation. A smaller block size, however, leads to increased complexity (more search operations must be carried out) and an increase in the number of motion vectors transmitted.

3.7 Variable Block Sizes In Motion Estimation Process

The latest digital video processing standards support variable block sizes in the motion estimation process. Smaller block sizes give a better motion prediction, but they also lead to a larger number of motion vectors. The H.264 standard, the most popular video coding standard, supports block sizes from 16x16 down to 4x4 (16x16, 16x8, 8x16, 8x8, 8x4, 4x8 and 4x4).

HEVC (also known as H.265) is the video compression standard that succeeds H.264. It aims to improve coding efficiency by a further factor of 2, and it involves many innovative algorithms and techniques, such as variable transform sizes and entropy coding schemes. The analysis of video compression was performed for different Coding Unit (CU) block sizes at 4 different quantization values (which determine the quality of the output video).

The three sets of CU sizes are as follows. The first set uses a fixed CU size, while the second and third sets allow variable CU sizes.
1) Set A: Fixed CU size, viz. 16x16 only.
2) Set B: 2 variable CU sizes are allowed, viz. 16x16 and 32x32.
3) Set C: 3 variable CU sizes are allowed, viz. 16x16, 32x32, and 64x64.

The experiments were performed keeping the quality constant, so the effect of the variable block sizes is visible in the size of the output bit streams. The results were generated with the standard video test streams Cactus, BasketballPass and BQMall.
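The motion estimation step of Section 3.6 and the block-size trade-off of Section 3.7 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the search used by H.264 or HEVC encoders: it performs an exhaustive (full) search, uses the sum of absolute differences (SAD) as the matching criterion, and represents frames as lists of rows of luma values. All function names and the synthetic test frames are hypothetical.

```python
def sad(cur, ref, top, left, dy, dx, size):
    """Sum of absolute differences between a block of the current frame
    and the block displaced by (dy, dx) in the reference frame."""
    total = 0
    for r in range(size):
        for c in range(size):
            total += abs(cur[top + r][left + c]
                         - ref[top + dy + r][left + dx + c])
    return total


def best_motion_vector(cur, ref, top, left, size, search):
    """Full search over displacements in [-search, search].

    Returns (dy, dx, sad), where (dy, dx) is the offset from the current
    block position to its best match in the reference frame.
    """
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            r0, c0 = top + dy, left + dx
            if r0 < 0 or c0 < 0 or r0 + size > height or c0 + size > width:
                continue  # candidate block falls outside the reference frame
            cost = sad(cur, ref, top, left, dy, dx, size)
            if best is None or cost < best[2]:
                best = (dy, dx, cost)
    return best


def motion_vectors_per_frame(width, height, block):
    """Number of motion vectors if the whole frame uses one square block size."""
    return (width // block) * (height // block)


def make_frames(size=16, shift=(1, 2)):
    """Synthetic test data: the current frame is the reference frame
    displaced by `shift` (edge samples are clamped)."""
    ref = [[7 * r + 13 * c for c in range(size)] for r in range(size)]
    dy, dx = shift
    cur = [[ref[min(r + dy, size - 1)][min(c + dx, size - 1)]
            for c in range(size)] for r in range(size)]
    return cur, ref


if __name__ == "__main__":
    cur, ref = make_frames()
    print(best_motion_vector(cur, ref, top=4, left=4, size=4, search=3))
    print(motion_vectors_per_frame(64, 64, 16), motion_vectors_per_frame(64, 64, 4))
```

In the synthetic frames the whole picture moves uniformly, so small and large blocks find the same vector. With real content, smaller blocks can follow object boundaries more closely, at the cost of more search operations and more motion vectors to transmit, as `motion_vectors_per_frame` shows for a frame of fixed size.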
4 HDR video encoding, compression and quality evaluation

4.1 Double Layer Backward-Compatible Encoding

The backward-compatible double-layer method of coding high dynamic range (HDR) video uses two codec instances in a two-stream scalable manner to ensure the necessary compatibility between HDR and low dynamic range (LDR) distribution. The first layer, called the base layer, provides LDR content to maintain compatibility with LDR decoders and displays. This LDR content is created from the HDR video by tone mapping or color mapping.

The next layer encodes the differences between the generated LDR representation and the HDR representation. The encoded data is sent in quantized residual form. The reconstruction function uses the correlation between LDR and HDR to predict the HDR content from the LDR content. Two-layer backward-compatible encoding has many advantages over single-layer backward-compatible encoding, which uses metadata to invert the mapping curves used to generate the LDR content from the HDR representation; such single-layer solutions rely heavily on a trade-off between the amount of information required and the quality of the reconstructed content.

4.2 Distribution of high dynamic range

4.2.1 Chroma subsampling:
Color information takes up part of the space a video occupies, and that space is directly proportional to the amount of color information: the more color information there is, the more space it takes up. A simple way to reduce the color information in a video is to compress it or discard some of it. This is called chroma subsampling. Subsampling is accomplished by converting the red, green and blue components to Y'CbCr. The luma component Y' is sent at full resolution, while Cb and Cr are subsampled to save space.

4.2.2 Luminance masking:
Luminance masking improves compression efficiency for HDR content. It is applied to reduce the distortion visible in the dark and bright regions of the content. According to the Weber-Fechner law, the higher the luminance of the background, the stronger the masking effect and the larger the smallest perceptible difference in the visual stimulus. Spatial frequency masking and luminance masking are related: for HDR, luminance masking is integrated into the just-noticeable difference (JND) profile together with the frequency masking derived from the quantization matrix.

5 Conclusions

• This paper presents a two-layer backward-compatible video coding method that uses chroma subsampling to improve color accuracy and luminance masking to remove the non-visible parts of the video content.
• The results obtained for that method are the mean opinion score, the bit rate savings, and a comparison between HDR10 and HDR-HLG. The method can further be used in applications requiring image classification and segmentation processes.
• The bit rate for all the streams at all the quality levels is lowest for Set C, which has the maximum number (3) of variable coding unit sizes (64x64, 32x32, 16x16). This fact was established earlier for H.264, and the experiments show that, even with the latest video coding standard H.265/HEVC, variable coding unit sizes give better results than fixed coding unit sizes.
