(Press Release)

3 September, 2015

Nippon Telegraph and Telephone Corporation
NTT Advanced Technology Corporation

Development of World's First HEVC Software Encoding Engine Supporting 60P/120P Simultaneous Transmission of Ultra high-Definition Video
-Launch of SDK and Transcoder Application from NTT-AT-

Nippon Telegraph and Telephone Corporation (abbreviated to NTT; Head Office: Chiyoda-Ku, Tokyo; Managing Director: Unoura Hiroo) has developed a software encoding engine which complies with the International Standard H.265/HEVC (High Efficiency Video Coding) (abbreviated to HEVC), and which enables the simultaneous transmission of video at the 60 frames-per-second that is currently used for 4K distribution as well as ultra high-definition video at 120 frames-per-second.
Encoding done by this technology will enable viewers to watch video at 60 frames-per-second on existing TVs and other devices when distribution of 120 frames per second video starts in the future, facilitating a smooth migration. This functionality is achieved by implementing temporal-direction hierarchical encoding, which enables embedding of 60 frames-per-second HEVC bitstream in a 120 frames-per-second HEVC bitstream. Original bit rate control algorithm by NTT is used to achieve high visual quality while keeping the 60 and 120 frames-per-second HEVC bitstreams at the desired bit rates. Furthermore, by introducing an original variable bit rate encoding technique, we have also achieved an approximately 40% reduction in the amount of data compared to constant bit rate, with image quality that is equivalent to that seen previously, so we can expect a reduction in storage costs, even with current distribution services.
This technique has been acquired by NTT Advanced Technology Corporation (abbreviated to NTT-AT, Head Office: Saiwai-ku, Kawasaki, Kanagawa; Managing Director : George Kimura) and upgraded versions of the software codec development kit "HEVC-1000 SDK" and the file conversion software "FileConverter FC4000" are going on sale.
Note that this technique will be exhibited at "IBC 2015" which will be held in Amsterdam, Netherlands from September 11 to 15.

1. Background and objectives

A high-definition 4K video distribution service is currently being commercialized and there are studies into the distribution of 120 frames-per-second 4K/8K video that is suitable for video footage in which subjects are actively moving, such as sports videos. To ensure that 120 frames-per-second video can be viewed even on decoders that were designed for existing 60 frames-per-second video alone, temporal-direction hierarchical encoding is specified in STD-B32, which is a standard issued by ARIB. In temporal-direction hierarchical encoding, a different bit rate is set for each frame rate, necessitating an accurate technique of estimating the amount of encoding.
The start of fully-fledged 4K video distribution means that the volume of content has also been increasing year by year. Together with expansion of resolution and frame rate, the amount of data has become huge, so compression of the video footage is required and improvements in the compression performance of H.265/HEVC encoders are expected.

2. Technical points (1) Temporal-direction hierarchical encoding function (Fig. 1)

This software encoding engine is the world's first encoder to support the temporal-direction hierarchical encoding specified by ARIB. In this temporal-direction hierarchical encoding, the encoding is such that some frames can be extracted from the 120 frames-per-second video stream. When the encoding is performed, each frame is analyzed and the amount of data is estimated. If the accuracy of the estimation of the amount of data is low in the temporal-direction hierarchical encoding, the allocation of the amount of encoding to each frame rate is not optimized and it may result in low image quality. At NTT, we have developed a technique of efficiently allocating the amount of data to improve the accuracy of estimating the amount of data, by resolving and analyzing the elements of each frame.

(2) Technique of controlling the amount of encoding according to variable bit rate (Fig. 2)

In services that store and distribute video footage, not only has constant bit rate (CBR) control, in which the distribution bit rate is fixed, been attracting attention, but also variable bit rate (VBR) control, in which allocation of the amount of data is varied temporally, which can reduce the overall amount of data. With VBR, the various conditions necessary for decoding must be considered when varying the amount of data, depending on the complexity of the video footage. We have made it possible to estimate the amount of data to a high degree of accuracy with respect to a combination of a number of conditions to allocate the amount of data efficiently, by applying a highly accurate technique of estimating the amount of data which NTT has developed. As a result, we have implemented a reduction in 40% of the amount of data, in comparison with the fixed bit rate control.

Figure 2 NTT has developed a novel algorithm for optimizing data size allocation according to the analyzed local complexity of video, while conforming target bitrate and various constraints for decoding

3. Expansion of this technique

The software encoding engine we developed last year is being sold by NTT-AT and is being used by customers, ranging from equipment manufacturers to post-production operators and distribution business operators, through our software development kit and file conversion software as a file transcoding application to H.265/HEVC. By incorporating these techniques we have developed into our products, we enable users to get an early start on preparing for verification with respect to the 4K/8K 120P video that is planned for the future, and also contribute to improvements in existing HD video and 4K video compression rates.

For further information, please contact:
Nippon Telegraph and Telephone Corporation

NTT Service Integration Laboratories,
TEL: 046-859-2032
E-mail: randd@lab.ntt.co.jp

NTT Advanced Technology Corporation

International Business Division US Office
Contacts : Mayumi Yamashita
TEL : +1-408-392-4280
E-mail: codec_sales@ntt-at.com

  • *1 Association of Radio Industries and Businesses
  • *2 ARIB standard: "Video Coding, Audio Coding and Multiplexing Specifications for Digital Broadcasting"
Attachment 1 NTT-AT HEVC Software Codec Development Kit "HEVC-1000 SDK" and File Conversion Software "FileConverter FC4000"
Product Summary

NTT-AT has previously launched the software codec product that takes advantage of video compression techniques, and has seen increasing sales performance. At the end of August, 2015, NTT-AT has begun to sell upgraded versions of the "HEVC-1000 SDK" and "FileConverter FC4000" using the HEVC software encoding engine that was developed at the NTT Media Intelligence Laboratories. It has the highest level of compression in the world and can simultaneously transmit ultra high-definition video at both 60 frames-per-second and 120 frames-per-second.

1. HEVC software codec development kit "HEVC-1000 SDK"

This is a software codec development kit consisting of a set of encoder and decoder. It can be used to develop software such as file conversion software that converts the video data compression method and the bit rate, and authoring software for editing video data.
In this upgraded version, the following features are added so that users can also take advantage of video distribution with low bit rate and high image quality in mobile distribution or the next generation of video distribution with a high frame rate.

  • Enables high-compression encoding and low bit rate distribution by the incorporation of a 2-pass VBR*1 function.
  • Supports ARIB-standard temporal-direction hierarchical encoding and field-pair encoding.
  • Speeds up encoding processing by improved efficiency of prediction-mode decision algorithm.
  • *1: 2-pass VBR: Method of encoding with a high image quality and high level of compression which uses the technique of controlling the amount of data by variable bit rate. After the first pass of encoding, the system analyzes the complexity of all the data, and in the second pass of encoding, readjusts the bit rate to optimize.
2. File conversion software "FileConverter FC4000"

With this file conversion software, users can input all major video media formats and convert various types of content from typical resolution to 4K into formats intended for distribution over IP networks to multiple devices (MPEG-DASH, HTTP live streaming, smooth streaming, etc.).
In this version, it supports the 2-pass VBR function, for transmitting a highly realistic feeling of 60 frames-per-second 4K content of a high image quality at a low bit rate of 12 to 15 Mbps.
With this upgraded version, we aim to further improve compression performance and encoding speed, to reduce time of encoding time 4K content which the volume is growing, and meet the demands of post-production and distribution business operators.

About "FileConverter FC4000",
At the end of October, 2015, It is scheduled to support audio lossless (ALS) coding, which is a lossless compression method that is anticipated as an audio format for transmitting a highly realistic feeling.
At the end of February, 2016, It is scheduled to support High Dynamic Range(HDR), 8K and 12bit, more tenderhearted color melody expression of the next generation becomes possible.

Information is current as of the date of issue of the individual press release.
Please be advised that information may be outdated after that point.

distributed by