
Video Coding with Semantic Image Analysis and Synthesis


 
Goal: Improve coding efficiency in video coding through texture analysis at the encoder side and texture synthesis at the decoder side, integrating semantic hints.
A new content-based approach for improved H.264/AVC video coding is presented. The framework is generic because it is based on a closed-loop texture analysis-by-synthesis algorithm that can automatically identify video quality impairments through artifact detectors and recover from them through appropriate countermeasures. The algorithm is also flexible, as it can in principle be integrated into any standards-compliant video codec. The fundamental assumption of our approach is that many video scenes can be classified into subjectively relevant and irrelevant textures. The texture categorization is performed by a texture analyzer at the encoder side, while the corresponding texture synthesizer at the decoder side replaces the subjectively irrelevant textures, given the side information generated by the texture analyzer. When the proposed approach is integrated into an H.264/AVC codec, bit rate savings of up to 33.3% are achieved compared to an H.264/AVC video codec without our approach.
Structure of the Semantic Coding Approach  
In this work, we have developed the closed-loop analysis-synthesis algorithm depicted in Fig. 1. The incoming video sequence is divided into overlapping groups of pictures (GoPs). The first GoP begins with the first I picture of the sequence and ends with the first P picture; between this I and P picture are B pictures. For example, when 3 B pictures are used, the first GoP has the structure IBBBP1 in temporal order. The second GoP begins with the last picture of the first GoP (the P1 picture) and ends with the next P picture; in our example, it has the structure P1BBBP2. I and P pictures are key pictures and are coded using MSE distortion and an H.264/AVC encoder. B pictures (between the key pictures) are candidates for a possible partial texture synthesis and are otherwise also coded using MSE distortion and H.264/AVC.
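
To make the GoP partitioning concrete, here is a minimal Python sketch (a hypothetical helper, not part of the codec) that splits a sequence of frame indices into overlapping GoPs as described above:

```python
def split_into_gops(num_frames, num_b=3):
    """Return GoPs as lists of frame indices; consecutive GoPs share
    their boundary key picture (I or P)."""
    gop_len = num_b + 2          # key picture + B pictures + key picture
    gops = []
    start = 0
    while start + gop_len - 1 < num_frames:
        gops.append(list(range(start, start + gop_len)))
        start += gop_len - 1     # the last key picture starts the next GoP
    return gops

# Example: 9 frames with 3 B pictures -> [[0, 1, 2, 3, 4], [4, 5, 6, 7, 8]],
# i.e. I B B B P1 and P1 B B B P2 in temporal order.
print(split_into_gops(9))
```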
 
Each GoP is analyzed by the texture analyzer (TA) and synthesized by the texture synthesizer (TS), given the (quantized) side information generated by the TA. The synthesized GoP is then submitted to the video quality assessment unit (VQA) for detection of possible spatial or temporal impairments in the reconstructed video.

Fig. 1: Principle of the closed-loop analysis-synthesis video coding approach


In subsequent iterations, the degrees of freedom of the system are explored by a state machine (SM) in search of even better side information. Once all relevant system states have been visited for the given input GoP, a rate-distortion decision is made and the optimized side information is transmitted to the decoder. Detail-irrelevant textures for which no rate-distortion gains can be achieved are coded by the reference codec, which acts as a fallback coding solution. Furthermore, the GoP structure used in our framework prevents infinite error propagation, as the key pictures are coded based on MSE.
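
The loop can be summarized in a toy, runnable Python sketch. Every component below (the candidate "states", the TA/TS models, the VQA threshold, the rate model) is a made-up stand-in; the real system operates on pictures and H.264/AVC bitstreams:

```python
def encode_gop(gop, states=(0.25, 0.5, 1.0), lagrange=0.1, vqa_limit=10.0):
    """Search all SM states for the side information with the lowest
    rate-distortion cost; return None to signal the fallback to the
    reference codec."""
    best_state, best_cost = None, float("inf")
    for state in states:                              # SM: visit all states
        side_info = state                             # TA output (toy scalar)
        synthesized = [x * side_info for x in gop]    # TS (toy model)
        distortion = sum((x - y) ** 2 for x, y in zip(gop, synthesized))
        if distortion > vqa_limit:                    # VQA: reject impairments
            continue
        rate = 1.0 / side_info                        # toy rate model
        cost = distortion + lagrange * rate           # rate-distortion cost
        if cost < best_cost:
            best_state, best_cost = state, cost
    return best_state                                 # None -> reference codec

print(encode_gop([1.0, 2.0, 3.0]))   # picks 1.0 in this toy case
```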
Bit-Rate Savings with the Semantic Coding Approach
We have integrated our approach into an H.264/AVC codec. The test sequences “Concrete”, “City”, “Preakness”, and “Coastguard” are used to demonstrate that an approximate representation of some rigid and non-rigid textures is possible without subjectively noticeable loss of quality.

Fig. 2: Bit rate savings w.r.t. quantization accuracy


 

The following set-up was used for the H.264/AVC codec: three B pictures, one reference picture for each P picture, CABAC entropy coding, rate-distortion optimization, and 30 Hz progressive video at CIF resolution. The quantization parameter QP was set to 16, 20, 24, 28, and 32. Fig. 2 depicts the bit rate savings obtained for each of the test sequences. Here we have assumed, and verified through visual inspection, that the MSE-coded and synthesized textures cannot be distinguished. It can be seen that the highest savings are measured for the highest quantization accuracy considered. The most substantial bit rate savings (33.3%) are measured for the “City” sequence. The bit rate savings decrease with the quantization accuracy because the volume of the side information remains constant over the different QP settings. All results are derived from decoded bitstreams, and the encoder is run automatically for each sequence.
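
The mechanism behind this trend can be illustrated with a small calculation. All rates below are invented purely for illustration; only the mechanism, a constant side-information rate set against a reference rate that falls with coarser quantization, comes from the text:

```python
SIDE_INFO = 10.0  # kbit/s; assumed constant over all QP settings (per the text)

for qp, reference_rate, replaced_texture_rate in [
    (16, 1000.0, 300.0),   # fine quantization: high rates (invented numbers)
    (24, 400.0, 120.0),
    (32, 150.0, 45.0),     # coarse quantization: low rates (invented numbers)
]:
    proposed_rate = reference_rate - replaced_texture_rate + SIDE_INFO
    saving = 100.0 * (reference_rate - proposed_rate) / reference_rate
    print(f"QP={qp}: {saving:.1f}% bit rate saving")
```

The constant side-information rate consumes an ever larger fraction of the shrinking total rate, so the relative saving decreases with QP, matching the behavior shown in Fig. 2.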

Video is basically a three-dimensional array of color pixels. Two dimensions serve as the spatial (horizontal and vertical) directions of the moving pictures, and one dimension represents the time domain. A frame is the set of all pixels that correspond to a single time instant; basically, a frame is the same as a still picture.

Video data contains spatial and temporal redundancy. Similarities can thus be encoded by merely registering differences within a frame (spatial) and/or between frames (temporal). Spatial encoding takes advantage of the fact that the human eye cannot distinguish small differences in color as easily as it can perceive changes in brightness, so very similar areas of color can be "averaged out" in a similar way to JPEG images (JPEG image compression FAQ, part 1/2). With temporal compression, only the changes from one frame to the next are encoded, since a large number of pixels will often be the same across a series of frames.
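
As a minimal sketch of the temporal idea, consider a toy difference coder on one-dimensional "frames" (real codecs operate on 2-D blocks with motion compensation):

```python
def frame_delta(prev, curr):
    """Return {pixel_index: new_value} for the pixels that changed."""
    return {i: c for i, (p, c) in enumerate(zip(prev, curr)) if p != c}

def apply_delta(prev, delta):
    """Reconstruct the current frame from the previous one plus the delta."""
    return [delta.get(i, p) for i, p in enumerate(prev)]

prev = [10, 10, 10, 10]
curr = [10, 12, 10, 10]
delta = frame_delta(prev, curr)          # {1: 12}: only one pixel changed
assert apply_delta(prev, delta) == curr  # bit-exact reconstruction
```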

Lossless compression
Some forms of data compression are lossless. This means that when the data is decompressed, the result is a
bit-for-bit perfect match with the original. While lossless compression of video is possible, it is rarely used, as
lossy compression results in far higher compression ratios at an acceptable level of quality.
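
The bit-for-bit claim can be made concrete with Python's built-in zlib, a general-purpose lossless codec (not a video codec):

```python
import zlib

original = b"example video payload " * 100
compressed = zlib.compress(original)
# Lossless round trip: decompression reproduces the input bit-for-bit.
assert zlib.decompress(compressed) == original
print(len(original), "->", len(compressed), "bytes")
```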

Intraframe versus interframe compression


One of the most powerful techniques for compressing video is interframe compression. Interframe
compression uses one or more earlier or later frames in a sequence to compress the current frame, while
intraframe compression uses only the current frame, which is effectively image compression.
The most commonly used method works by comparing each frame in the video with the previous one. If the frame contains areas where nothing has moved, the system simply issues a short command that copies that part of the previous frame, bit-for-bit, into the next one. If sections of the frame move in a simple manner, the compressor emits a slightly longer command that tells the decompressor to shift, rotate, lighten, or darken the copy; this is still much shorter than intraframe compression. Interframe compression works well for programs that will simply be played back by the viewer, but can cause problems if the video sequence needs to be edited.
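
A toy version of such a command stream follows; the COPY/SHIFT/RAW format is invented for illustration, since real codecs use motion vectors over 2-D blocks:

```python
def encode_block(prev, curr, start, size=8, max_shift=2):
    """Emit COPY if the block is unchanged, SHIFT dx if it matches the
    previous frame displaced by dx pixels, else fall back to RAW pixels."""
    block = curr[start:start + size]
    if prev[start:start + size] == block:
        return ("COPY",)
    for dx in range(-max_shift, max_shift + 1):
        lo, hi = start + dx, start + dx + size
        if 0 <= lo and hi <= len(prev) and prev[lo:hi] == block:
            return ("SHIFT", dx)
    return ("RAW", block)

prev = [0, 0, 1, 2, 3, 4, 0, 0]
curr = [0, 1, 2, 3, 4, 0, 0, 0]              # content moved left by one pixel
print(encode_block(prev, curr, 0, size=5))   # ("SHIFT", 1)
```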
Since interframe compression copies data from one frame to another, the following frames cannot be reconstructed properly if the original frame is simply cut out (or lost in transmission). Some video formats, such as DV, compress each frame independently using intraframe compression. Making 'cuts' in intraframe-compressed video is almost as easy as editing uncompressed video: one finds the beginning and end of each frame, copies each desired frame bit-for-bit, and discards the unwanted frames.
Another difference between intraframe and interframe compression is that with intraframe systems, each frame uses a similar amount of data. In most interframe systems, certain frames (such as "I frames" in MPEG-2) aren't allowed to copy data from other frames, and so require much more data than other frames nearby.
It is possible to build a computer-based video editor that spots the problems caused when I frames are edited out while other frames need them. This has allowed newer formats like HDV to be used for editing. However, this process demands much more computing power than editing intraframe-compressed video with the same picture quality.
