fanbird2008

yuv

http://www.fourcc.org/yuv.php

http://msdn.microsoft.com/en-us/library/aa904813(VS.80).aspx

Video Rendering with 8-Bit YUV Formats

Gary Sullivan and Stephen Estrop
Microsoft Digital Media Division

April 2002

Updated August 2003

Applies To:
Microsoft® Windows®, Microsoft DirectShow®

Summary: This article describes the 8-bit YUV formats that are recommended for video rendering in the Microsoft Windows operating system. This article presents techniques for converting between YUV and RGB formats, and also provides techniques for upsampling YUV formats. This article is intended for anyone working with YUV video decoding or rendering in Windows.

Introduction

Numerous YUV formats are defined throughout the video industry. This article identifies the 8-bit YUV formats that are recommended for video rendering in the Microsoft® Windows® operating system. Decoder vendors and display vendors are encouraged to support the formats described in this article. This article does not address other uses of YUV color, such as still photography.

The formats described in this article all use 8 bits per pixel location to encode the Y channel (also called the luma channel) and use 8 bits per sample to encode each U or V chroma sample. However, most YUV formats use fewer than 24 bits per pixel on average, because they contain fewer samples of U and V than of Y. This article does not cover YUV formats with 10-bit and 12-bit Y channels.

Note For the purposes of this article, the term U is equivalent to Cb, and the term V is equivalent to Cr.

This article covers the following topics:

Identifying YUV Formats in DirectShow: Explains how to describe Microsoft DirectShow® YUV format types.
YUV Sampling: Describes the most common YUV sampling techniques.
Surface Definitions: Describes the recommended YUV formats.
Color Space and Chroma Sampling Rate Conversions: Provides guidelines for converting between YUV and RGB formats, and for converting between different YUV formats.
Additional Information: Provides additional information.

Identifying YUV Formats in DirectShow

Each of the YUV formats described in this article has an assigned FOURCC code. A FOURCC code is a 32-bit unsigned integer that is created by concatenating four ASCII characters.

There are various C/C++ macros that make it easier to declare FOURCC values in source code. For example, the MAKEFOURCC macro is declared in Mmsystem.h, and the FCC macro is declared in Aviriff.h. Use them as follows:

    DWORD fccYUY2 = MAKEFOURCC('Y','U','Y','2');

    // Equivalent:
    DWORD fccYUY2b = FCC('YUY2');

You can also declare a FOURCC code directly as a character literal simply by reversing the order of the characters. For example:

    DWORD fccYUY2 = '2YUY';  // Declares the FOURCC 'YUY2'

Reversing the order is necessary because the Windows operating system uses a little-endian architecture. 'Y' = 0x59, 'U' = 0x55, and '2' = 0x32, so '2YUY' is 0x32595559.
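The byte-order argument above can be checked without the Windows headers. The following is a minimal sketch: make_fourcc is a hypothetical stand-in written for illustration, not the SDK's MAKEFOURCC macro, and it assumes only that the first character belongs in the least significant byte.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for MAKEFOURCC: the first character lands in the
// least significant byte, which is the first byte in little-endian memory.
constexpr std::uint32_t make_fourcc(char c0, char c1, char c2, char c3) {
    return  static_cast<std::uint32_t>(static_cast<unsigned char>(c0))
         | (static_cast<std::uint32_t>(static_cast<unsigned char>(c1)) << 8)
         | (static_cast<std::uint32_t>(static_cast<unsigned char>(c2)) << 16)
         | (static_cast<std::uint32_t>(static_cast<unsigned char>(c3)) << 24);
}

// 'Y' = 0x59, 'U' = 0x55, '2' = 0x32, so "YUY2" packs to 0x32595559.
static_assert(make_fourcc('Y', 'U', 'Y', '2') == 0x32595559u,
              "FOURCC characters are stored low byte first");
```

Because the function is constexpr, the check runs at compile time; a mismatch would stop the build rather than surface at run time.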

In DirectShow, formats are identified by a major-type globally unique identifier (GUID) and a subtype GUID. The major type for computer video formats is always MEDIATYPE_Video. The subtype can be constructed by mapping the FOURCC code to a GUID, as follows:

    XXXXXXXX-0000-0010-8000-00AA00389B71

where XXXXXXXX is the FOURCC code. Thus, the subtype GUID for YUY2 is:

    32595559-0000-0010-8000-00AA00389B71
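As a rough illustration of the mapping, the sketch below builds the textual form of the subtype GUID from a FOURCC value. The function name subtype_guid_string is hypothetical and the code only formats a string; it does not produce a GUID structure and is not a substitute for FOURCCMap.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical helper: formats the subtype GUID text for a FOURCC code by
// placing the code, as eight hexadecimal digits, in the first GUID field.
inline std::string subtype_guid_string(std::uint32_t fourcc) {
    char buffer[40];  // 36 characters plus the terminating null fit easily
    std::snprintf(buffer, sizeof buffer,
                  "%08lX-0000-0010-8000-00AA00389B71",
                  static_cast<unsigned long>(fourcc));
    return std::string(buffer);
}
```

For example, passing 0x32595559 (the FOURCC for YUY2) yields the GUID text shown above.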

Many of these GUIDs are defined already in the header file Uuids.h. For example, the YUY2 subtype is defined as MEDIASUBTYPE_YUY2. The DirectShow base class library also provides a helper class, FOURCCMap, which can be used to convert FOURCC codes into GUID values. The FOURCCMap constructor takes a FOURCC code as an input parameter. You can then cast the FOURCCMap object to the corresponding GUID:

    FOURCCMap fccMap(FCC('YUY2'));
    GUID g1 = (GUID)fccMap;

    // Equivalent:
    GUID g2 = (GUID)FOURCCMap(FCC('YUY2'));

YUV Sampling

One of the advantages of YUV is that the chroma channels can have a lower sampling rate than the Y channel without a dramatic degradation of the perceptual quality. A notation called the A:B:C notation is used to describe how often U and V are sampled relative to Y:

4:4:4 means no downsampling of the chroma channels.
4:2:2 means 2:1 horizontal downsampling, with no vertical downsampling. Every scan line contains four Y samples for every two U or V samples.
4:2:0 means 2:1 horizontal downsampling, with 2:1 vertical downsampling.
4:1:1 means 4:1 horizontal downsampling, with no vertical downsampling. Every scan line contains four Y samples for every U or V sample. 4:1:1 sampling is less common than other formats, and is not discussed in detail in this article.
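The notation translates directly into downsampling factors, and from those into the average number of bits per pixel mentioned in the introduction. The sketch below is illustrative only; the type and function names are invented for this example, and it counts just the Y, U, and V samples (no alpha or padding).

```cpp
#include <cassert>

// Horizontal and vertical chroma downsampling factors for one notation.
struct ChromaFactors {
    int horizontal;
    int vertical;
};

enum class Sampling { S444, S422, S420, S411 };

inline ChromaFactors chroma_factors(Sampling s) {
    switch (s) {
        case Sampling::S444: return {1, 1};  // no downsampling
        case Sampling::S422: return {2, 1};  // 2:1 horizontal only
        case Sampling::S420: return {2, 2};  // 2:1 in both directions
        case Sampling::S411: return {4, 1};  // 4:1 horizontal only
    }
    return {1, 1};  // not reached for valid enumerators
}

// Average bits per pixel with 8-bit samples: 8 bits of Y plus one U and one
// V sample shared by (horizontal * vertical) pixels.
inline int average_bits_per_pixel(Sampling s) {
    const ChromaFactors f = chroma_factors(s);
    return 8 + (2 * 8) / (f.horizontal * f.vertical);
}
```

This gives 24, 16, 12, and 12 bits for 4:4:4, 4:2:2, 4:2:0, and 4:1:1. Actual surface formats can use more than this, for example when they carry an alpha channel or unused stride.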

Figure 1 shows the sampling grid used in 4:4:4 pictures. Luma samples are represented by a cross, and chroma samples are represented by a circle.

Figure 1. YUV 4:4:4 sample positions

The dominant form of 4:2:2 sampling is defined in ITU-R Recommendation BT.601. Figure 2 shows the sampling grid defined by this standard.

Figure 2. YUV 4:2:2 sample positions

There are two common variants of 4:2:0 sampling. One of these is used in MPEG-2 video, and the other is used in MPEG-1 and in ITU-T recommendations H.261 and H.263. Figure 3 shows the sampling grid used in the MPEG-1 scheme, and Figure 4 shows the sampling grid used in the MPEG-2 scheme.

Figure 3. YUV 4:2:0 sample positions (MPEG-1 scheme)

Figure 4. YUV 4:2:0 sample positions (MPEG-2 scheme)

Compared with the MPEG-1 scheme, it is simpler to convert between the MPEG-2 scheme and the sampling grids defined for 4:2:2 and 4:4:4 formats. For this reason, the MPEG-2 scheme is preferred in Windows, and should be considered the default interpretation of 4:2:0 formats.

Surface Definitions

This section describes the 8-bit YUV formats that are recommended for video rendering. These fall into several categories:

4:4:4 Formats, 32 Bits per Pixel
4:2:2 Formats, 16 Bits per Pixel
4:2:0 Formats, 16 Bits per Pixel
4:2:0 Formats, 12 Bits per Pixel

First, you should be aware of the following concepts in order to understand what follows:

Surface origin. For the YUV formats described in this article, the origin (0,0) is always the upper-left corner of the surface.
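Stride. The stride of a surface, sometimes called the pitch, is the width of the surface in bytes, that is, the number of bytes from the start of one line to the start of the next, including any padding. Given a surface origin at the upper-left corner, the stride is always positive.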
Alignment. The alignment of a surface is at the discretion of the graphics display driver. The surface must always be DWORD aligned, that is, individual lines within the surface are guaranteed to originate on a 32-bit (DWORD) boundary. The alignment can be larger than 32 bits, however, depending on the needs of the hardware.
Packed format versus planar format. YUV formats are divided into packed formats and planar formats. In a packed format, the Y, U, and V components are stored in a single array. Pixels are organized into groups of macropixels, whose layout depends on the format. In a planar format, the Y, U, and V components are stored as three separate planes.

4:4:4 Formats, 32 Bits per Pixel

A single 4:4:4 format is recommended, with the FOURCC code AYUV. This is a packed format, where each pixel is encoded as four consecutive bytes, arranged in the following sequence.

Figure 5. AYUV memory layout

The bytes marked A contain values for alpha.

4:2:2 Formats, 16 Bits per Pixel

Two 4:2:2 formats are supported, with the following FOURCC codes:

YUY2
UYVY

Both are packed formats, where each macropixel is two pixels encoded as four consecutive bytes. This results in horizontal downsampling of the chroma by a factor of two.

YUY2

In YUY2 format, the data can be treated as an array of unsigned char values, where the first byte contains the first Y sample, the second byte contains the first U (Cb) sample, the third byte contains the second Y sample, and the fourth byte contains the first V (Cr) sample, as shown in Figure 6.

Figure 6. YUY2 memory layout

If the image is addressed as an array of two little-endian WORD values, the first WORD contains Y0 in the least significant bits (LSBs) and U in the most significant bits (MSBs). The second WORD contains Y1 in the LSBs and V in the MSBs.
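To make the macropixel layout concrete, here is a small sketch that fetches the Y, U, and V values for one pixel of a YUY2 surface. The names YuvSample and read_yuy2 are made up for this example, and the caller is assumed to pass coordinates that lie inside the surface.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

struct YuvSample {
    std::uint8_t y, u, v;
};

// Reads pixel (x, row) from a YUY2 surface. Each 4-byte macropixel holds
// Y0 U Y1 V and covers two horizontally adjacent pixels, which share U and V.
inline YuvSample read_yuy2(const std::uint8_t* surface, std::ptrdiff_t stride,
                           int x, int row) {
    assert(surface != nullptr && x >= 0 && row >= 0);
    const std::uint8_t* macropixel = surface + row * stride + (x / 2) * 4;
    // Even x selects Y0 (byte 0); odd x selects Y1 (byte 2).
    return YuvSample{macropixel[(x % 2) * 2], macropixel[1], macropixel[3]};
}
```

Both pixels of a macropixel return the same U and V, which is the 2:1 horizontal chroma downsampling in practice.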

YUY2 is the preferred 4:2:2 pixel format for Microsoft DirectX® Video Acceleration (DirectX VA). It is expected to be an intermediate-term requirement for DirectX VA accelerators supporting 4:2:2 video.

UYVY

This format is the same as YUY2, except the byte order is reversed — that is, the chroma and luma bytes are flipped (Figure 7). If the image is addressed as an array of two little-endian WORD values, the first WORD contains U in the LSBs and Y0 in the MSBs, and the second WORD contains V in the LSBs and Y1 in the MSBs.

Figure 7. UYVY memory layout
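Since the two formats differ only in the order of the bytes within each 16-bit pair, one line can be converted to the other by swapping adjacent bytes. The following sketch shows the idea; swap_luma_chroma is a name chosen for this example, and it works in place on a single line of pixel data.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>

// Swaps the two bytes of every 16-bit pair in place. Applied to one line of
// YUY2 data (Y0 U Y1 V) it yields UYVY (U Y0 V Y1), and vice versa.
inline void swap_luma_chroma(std::uint8_t* line, std::size_t byte_count) {
    assert(byte_count % 2 == 0);
    for (std::size_t i = 0; i + 1 < byte_count; i += 2) {
        std::swap(line[i], line[i + 1]);
    }
}
```

Applying the function twice restores the original line, so the same routine serves for both directions.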

4:2:0 Formats, 16 Bits per Pixel

Two 4:2:0 16-bits per pixel formats are recommended, with the following FOURCC codes:

IMC1
IMC3

Both FOURCC codes are planar formats. The chroma channels are subsampled by a factor of two in both the horizontal and vertical dimensions.

IMC1

All of the Y samples appear first in memory as an array of unsigned char values. This is followed by all of the V (Cr) samples, and then all of the U (Cb) samples. The V and U planes have the same stride as the Y plane, resulting in unused areas of memory, as shown in Figure 8.

Figure 8. IMC1 memory layout

IMC3

This format is identical to IMC1, except the U and V planes are swapped:

Figure 9. IMC3 memory layout

4:2:0 Formats, 12 Bits per Pixel

Four 4:2:0 12-bpp formats are recommended, with the following FOURCC codes:

IMC2
IMC4
YV12
NV12

In all of these formats, the chroma channels are subsampled by a factor of two in both the horizontal and vertical dimensions.

IMC2

This format is the same as IMC1 except that the V (Cr) and U (Cb) lines are interleaved at half-stride boundaries. In other words, each full-stride line in the chroma area starts with a line of V samples, followed by a line of U samples that begins at the next half-stride boundary (Figure 10). This layout makes more efficient use of address space than IMC1. It cuts the chroma address space in half, and thus the total address space by 25 percent. Among 4:2:0 formats, IMC2 is the second-most preferred format, after NV12.

Figure 10. IMC2 memory layout
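The 25 percent figure can be reproduced with a little arithmetic. The sketch below assumes an even height and no extra padding lines between planes; both function names are hypothetical.

```cpp
#include <cassert>
#include <cstddef>

// Bytes spanned by an IMC1 or IMC3 surface: a full-stride Y plane followed by
// two half-height chroma planes that each keep the full Y stride.
inline std::size_t imc1_size(std::size_t stride, std::size_t height) {
    return stride * height + 2 * (stride * (height / 2));
}

// Bytes spanned by an IMC2 or IMC4 surface: each full-stride chroma line
// holds a V line and a U line, so one half-height chroma area is enough.
inline std::size_t imc2_size(std::size_t stride, std::size_t height) {
    return stride * height + stride * (height / 2);
}
```

With a stride of 640 bytes and 480 lines, IMC1 spans 614,400 bytes and IMC2 spans 460,800 bytes, which is three quarters of the IMC1 figure.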

IMC4

This format is identical to IMC2, except the U (Cb) and V (Cr) lines are swapped:

Figure 11. IMC4 memory layout

YV12

All of the Y samples appear first in memory as an array of unsigned char values. This array is followed immediately by all of the V (Cr) samples. The stride of the V plane is half the stride of the Y plane, and the V plane contains half as many lines as the Y plane. The V plane is followed immediately by all of the U (Cb) samples, with the same stride and number of lines as the V plane (Figure 12).

Figure 12. YV12 memory layout
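The plane positions follow directly from this description. The sketch below computes them under the assumption that the stride and height are both even and that no padding is inserted between planes; Yv12Layout and yv12_layout are illustrative names.

```cpp
#include <cassert>
#include <cstddef>

struct Yv12Layout {
    std::size_t v_offset;       // byte offset of the V (Cr) plane
    std::size_t u_offset;       // byte offset of the U (Cb) plane
    std::size_t chroma_stride;  // stride of each chroma plane in bytes
    std::size_t total_size;     // bytes spanned by the whole surface
};

inline Yv12Layout yv12_layout(std::size_t stride, std::size_t height) {
    assert(stride % 2 == 0 && height % 2 == 0);
    const std::size_t chroma_stride = stride / 2;
    const std::size_t chroma_plane = chroma_stride * (height / 2);
    Yv12Layout layout;
    layout.v_offset = stride * height;                 // V follows Y
    layout.u_offset = layout.v_offset + chroma_plane;  // U follows V
    layout.chroma_stride = chroma_stride;
    layout.total_size = layout.u_offset + chroma_plane;
    return layout;
}
```

For a 640-byte stride and 480 lines, the V plane starts at byte 307,200, the U plane at byte 384,000, and the surface spans 460,800 bytes, or 12 bits per pixel.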

NV12

All of the Y samples are found first in memory as an array of unsigned char values with an even number of lines. The Y plane is followed immediately by an array of unsigned char values that contains packed U (Cb) and V (Cr) samples, as shown in Figure 13. When the combined U-V array is addressed as an array of little-endian WORD values, the LSBs contain the U values, and the MSBs contain the V values. NV12 is the preferred 4:2:0 pixel format for DirectX VA. It is expected to be an intermediate-term requirement for DirectX VA accelerators supporting 4:2:0 video.

Figure 13. NV12 memory layout
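A sketch of chroma addressing in NV12 follows. It assumes that the combined U-V array uses the same stride as the Y plane and starts immediately after it; that stride assumption is not stated explicitly in the text above, so treat it as such. The names ChromaPair and read_nv12_chroma are invented for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

struct ChromaPair {
    std::uint8_t u, v;
};

// Returns the U and V samples that apply to luma pixel (x, row).
// Assumption: the U-V array shares the Y stride and directly follows the
// Y plane, whose height (in lines) is even.
inline ChromaPair read_nv12_chroma(const std::uint8_t* surface,
                                   std::ptrdiff_t stride, int height,
                                   int x, int row) {
    assert(surface != nullptr && height % 2 == 0);
    assert(x >= 0 && row >= 0 && row < height);
    const std::uint8_t* uv_plane = surface + stride * height;
    // One chroma line serves two luma lines; one U-V pair serves two pixels.
    const std::uint8_t* pair = uv_plane + (row / 2) * stride + (x / 2) * 2;
    return ChromaPair{pair[0], pair[1]};  // U is the low byte, V the high byte
}
```

Each 2 x 2 block of luma pixels resolves to the same U-V pair, matching the 4:2:0 subsampling of the format.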

Color Space and Chroma Sampling Rate Conversions

This section provides guidelines for converting between YUV and RGB, and for converting between some different YUV formats. We consider two RGB encoding schemes in this section: 8-bit computer RGB, also known as sRGB or "full-scale" RGB, and studio video RGB, or "RGB with head-room and toe-room." These are defined as follows:

Computer RGB uses 8 bits for each sample of red, green, and blue. Black is represented by R = G = B = 0, and white is represented by R = G = B = 255.
Studio video RGB uses some number of bits N for each sample of red, green, and blue, where N is 8 or more. Studio video RGB uses a different scaling factor than computer RGB, and it has an offset. Black is represented by R = G = B = 16*2^(N-8), and white is represented by R = G = B = 235*2^(N-8). However, actual values may fall outside this range.
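As a quick worked example of the studio video RGB levels, the sketch below evaluates the black and white points for a given bit depth N. The function names are hypothetical, and N is assumed to be at least 8.

```cpp
#include <cassert>

// Nominal black level for N-bit studio video RGB: 16 * 2^(N-8).
constexpr int studio_black(int n) { return 16 << (n - 8); }

// Nominal white level for N-bit studio video RGB: 235 * 2^(N-8).
constexpr int studio_white(int n) { return 235 << (n - 8); }

static_assert(studio_black(8) == 16 && studio_white(8) == 235,
              "8-bit studio range is 16 to 235");
static_assert(studio_black(10) == 64 && studio_white(10) == 940,
              "10-bit studio range is 64 to 940");
```

The difference between the two levels, 219*2^(N-8), is the same quantity that appears later as the scaling variable for studio video RGB.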

Studio video RGB is the preferred RGB definition for video in Windows, while computer RGB is the preferred RGB definition for non-video applications. In either form of RGB, the chromaticity coordinates are as specified in ITU-R BT.709 for the definition of the RGB color primaries. The (x,y) coordinates of R, G, and B are (0.64, 0.33), (0.30, 0.60), and (0.15, 0.06), respectively. Reference white is D65 with coordinates (0.3127, 0.3290). Nominal gamma is 1/0.45 (approximately 2.2), with precise gamma defined in detail in ITU-R BT.709.

Conversion between RGB and 4:4:4 YUV

We first describe conversion between RGB and 4:4:4 YUV. To convert 4:2:0 or 4:2:2 YUV to RGB, we recommend converting the YUV data to 4:4:4 YUV, and then converting from 4:4:4 YUV to RGB. The AYUV format, which is a 4:4:4 format, uses 8 bits each for the Y, U, and V samples. YUV can also be defined using more than 8 bits per sample for some applications.

Two dominant YUV conversions from RGB have been defined for digital video. Both are based on the specification known as ITU-R Recommendation BT.709. The first conversion is the older YUV form defined for 50-Hz use in BT.709. It is the same as the relation specified in ITU-R Recommendation BT.601, also known by its older name, CCIR 601. It should be considered the preferred YUV format for standard-definition TV resolution (720 x 576) and lower-resolution video. It is characterized by the values of two constants Kr and Kb:

    Kr = 0.299
    Kb = 0.114

The second conversion is the newer YUV form defined for 60-Hz use in BT.709, and should be considered the preferred format for video resolutions above SDTV. It is characterized by different values for these two constants:

    Kr = 0.2126
    Kb = 0.0722

Conversion from RGB to YUV is defined by starting with the following:

    L = Kr * R + Kb * B + (1 - Kr - Kb) * G

The YUV values are then obtained as follows:

    Y =                 floor(2^(M-8) * (219*(L-Z)/S + 16) + 0.5)
    U = clip3(0, 2^M-1, floor(2^(M-8) * (112*(B-L) / ((1-Kb)*S) + 128) + 0.5))
    V = clip3(0, 2^M-1, floor(2^(M-8) * (112*(R-L) / ((1-Kr)*S) + 128) + 0.5))

where

M is the number of bits per YUV sample (M >= 8).
Z is the black-level variable. For computer RGB, Z equals 0. For studio video RGB, Z equals 16*2^(N-8), where N is the number of bits per RGB sample (N >= 8).
S is the scaling variable. For computer RGB, S equals 255. For studio video RGB, S equals 219*2^(N-8).

The function floor(x) returns the largest integer less than or equal to x. The function clip3(x, y, z) is defined as follows:

    clip3(x, y, z) = ((z < x) ? x : ((z > y) ? y : z))

The Y sample represents brightness, and the U and V samples represent the color deviations toward blue and red, respectively. The nominal range for Y is 16*2^(M-8) to 235*2^(M-8). Black is represented as 16*2^(M-8), and white is represented as 235*2^(M-8). The nominal range for U and V is 16*2^(M-8) to 240*2^(M-8), with the value 128*2^(M-8) representing neutral chroma. However, actual values may fall outside these ranges.

For input data in the form of studio video RGB, the clip operation is necessary to keep the U and V values within the range 0 to 2^M-1. If the input is computer RGB, the clip operation is not required, because the conversion formula cannot produce values outside of this range.

These are the exact formulas without approximation. Everything that follows in this document is derived from these formulas.
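The exact formulas can be transcribed almost literally into floating-point code. The sketch below is one such transcription, not a production converter; the struct and function names are invented here, and double precision is assumed to be accurate enough for the rounding step.

```cpp
#include <cassert>
#include <cmath>

struct Yuv {
    int y, u, v;
};

struct ConversionParams {
    double kr;  // Kr constant (0.299 for BT.601, 0.2126 for BT.709)
    double kb;  // Kb constant (0.114 for BT.601, 0.0722 for BT.709)
    int m;      // M: bits per YUV sample, M >= 8
    double z;   // Z: black level of the RGB input
    double s;   // S: scaling variable of the RGB input
};

inline int clip3(int x, int y, int z) {
    return (z < x) ? x : ((z > y) ? y : z);
}

inline Yuv rgb_to_yuv_exact(double r, double g, double b,
                            const ConversionParams& p) {
    const double l = p.kr * r + p.kb * b + (1.0 - p.kr - p.kb) * g;
    const double scale = std::ldexp(1.0, p.m - 8);  // 2^(M-8)
    const int max_value = (1 << p.m) - 1;           // 2^M - 1
    Yuv out;
    out.y = static_cast<int>(
        std::floor(scale * (219.0 * (l - p.z) / p.s + 16.0) + 0.5));
    out.u = clip3(0, max_value, static_cast<int>(std::floor(
        scale * (112.0 * (b - l) / ((1.0 - p.kb) * p.s) + 128.0) + 0.5)));
    out.v = clip3(0, max_value, static_cast<int>(std::floor(
        scale * (112.0 * (r - l) / ((1.0 - p.kr) * p.s) + 128.0) + 0.5)));
    return out;
}
```

With BT.601 constants, M = 8, Z = 0, and S = 255 (computer RGB input), white (255, 255, 255) maps to (235, 128, 128) and black (0, 0, 0) maps to (16, 128, 128), which matches the nominal ranges given above.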

Example: Converting RGB888 to YUV 4:4:4
Example: Converting 8-bit YUV to RGB888
Converting 4:2:0 YUV to 4:2:2 YUV
Converting 4:2:2 YUV to 4:4:4 YUV
Converting 4:2:0 YUV to 4:4:4 YUV

Example: Converting RGB888 to YUV 4:4:4

In the case of computer RGB input and 8-bit BT.601 YUV output, we believe that the formulas given in the previous section can be reasonably approximated by the following:

    Y = ( (  66 * R + 129 * G +  25 * B + 128) >> 8) +  16
    U = ( ( -38 * R -  74 * G + 112 * B + 128) >> 8) + 128
    V = ( ( 112 * R -  94 * G -  18 * B + 128) >> 8) + 128

These formulas produce 8-bit results using coefficients that require no more than 8 bits of (unsigned) precision. Intermediate results will require up to 16 bits of precision.
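The integer approximation above maps onto code as follows. This is a sketch with hypothetical names (Yuv8, rgb888_to_yuv_bt601); it assumes inputs in the range 0 to 255 and relies on the right shift of a negative int being an arithmetic shift, which is guaranteed from C++20 onward and is the behavior of common compilers before that.

```cpp
#include <cassert>
#include <cstdint>

struct Yuv8 {
    std::uint8_t y, u, v;
};

// Integer approximation of computer RGB to 8-bit BT.601 YUV.
// Assumption: >> on a negative int shifts arithmetically (rounds toward
// negative infinity), as the U and V sums can be negative.
inline Yuv8 rgb888_to_yuv_bt601(int r, int g, int b) {
    assert(r >= 0 && r <= 255 && g >= 0 && g <= 255 && b >= 0 && b <= 255);
    const int y = (( 66 * r + 129 * g +  25 * b + 128) >> 8) +  16;
    const int u = ((-38 * r -  74 * g + 112 * b + 128) >> 8) + 128;
    const int v = ((112 * r -  94 * g -  18 * b + 128) >> 8) + 128;
    return Yuv8{static_cast<std::uint8_t>(y),
                static_cast<std::uint8_t>(u),
                static_cast<std::uint8_t>(v)};
}
```

For instance, pure red (255, 0, 0) gives Y = 82, U = 90, V = 240, and white gives (235, 128, 128).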

Example: Converting 8-bit YUV to RGB888

From the original RGB-to-YUV formulas, one can derive the following relationships for the 8-bit BT.601 definition of YUV:

    Y = round( 0.256788 * R + 0.504129 * G + 0.097906 * B) +  16
    U = round(-0.148223 * R - 0.290993 * G + 0.439216 * B) + 128
    V = round( 0.439216 * R - 0.367788 * G - 0.071427 * B) + 128

Therefore, given:

    C = Y - 16
    D = U - 128
    E = V - 128

the formulas to convert YUV to computer RGB can be derived as follows:

    R = clip( round( 1.164383 * C                   + 1.596027 * E  ) )
    G = clip( round( 1.164383 * C - (0.391762 * D) - (0.812968 * E) ) )
    B = clip( round( 1.164383 * C +  2.017232 * D                   ) )

where clip() denotes clipping to a range of [0..255]. These formulas can be reasonably approximated by the following:

    R = clip(( 298 * C           + 409 * E + 128) >> 8)
    G = clip(( 298 * C - 100 * D - 208 * E + 128) >> 8)
    B = clip(( 298 * C + 516 * D           + 128) >> 8)

These formulas use some coefficients that require more than 8 bits of precision to produce each 8-bit result, and intermediate results will require more than 16 bits of precision.
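A direct transcription of the integer YUV-to-RGB approximation is sketched below. The names Rgb8, clip8, and yuv_to_rgb888_bt601 are hypothetical, 32-bit int is assumed for the intermediate results, and the right shift of negative values is again assumed to be arithmetic.

```cpp
#include <cassert>
#include <cstdint>

struct Rgb8 {
    std::uint8_t r, g, b;
};

// Clips to the range [0..255].
inline std::uint8_t clip8(int value) {
    return static_cast<std::uint8_t>(value < 0 ? 0 : (value > 255 ? 255 : value));
}

// Integer approximation of 8-bit BT.601 YUV to computer RGB.
// Assumption: >> on a negative int shifts arithmetically.
inline Rgb8 yuv_to_rgb888_bt601(int y, int u, int v) {
    const int c = y - 16;
    const int d = u - 128;
    const int e = v - 128;
    return Rgb8{clip8((298 * c           + 409 * e + 128) >> 8),
                clip8((298 * c - 100 * d - 208 * e + 128) >> 8),
                clip8((298 * c + 516 * d           + 128) >> 8)};
}
```

Feeding in (82, 90, 240), the approximate YUV value of pure red, returns (255, 1, 0): the red channel clips from 256 to 255 and green is off by one, which gives a feel for the error introduced by the two rounds of integer approximation.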

Converting 4:2:0 YUV to 4:2:2 YUV

Converting 4:2:0 YUV to 4:2:2 YUV requires vertical upconversion by a factor of two. This section describes an example method for performing the upconversion. The method assumes that the video pictures are progressive scan.

Note The 4:2:0 to 4:2:2 interlaced scan conversion process presents atypical problems and is difficult to implement. This article does not address the issue of converting interlaced scan from 4:2:0 to 4:2:2.

Let each vertical line of input chroma samples be an array Cin[] that ranges from 0 to N - 1. The corresponding vertical line on the output image will be an array Cout[] that ranges from 0 to 2N - 1. To convert each vertical line, perform the following process:

    Cout[0]     = Cin[0];
    Cout[1]     = clip((9 * (Cin[0] + Cin[1]) - (Cin[0] + Cin[2]) + 8) >> 4);
    Cout[2]     = Cin[1];
    Cout[3]     = clip((9 * (Cin[1] + Cin[2]) - (Cin[0] + Cin[3]) + 8) >> 4);
    Cout[4]     = Cin[2];
    Cout[5]     = clip((9 * (Cin[2] + Cin[3]) - (Cin[1] + Cin[4]) + 8) >> 4);
    ...
    Cout[2*i]   = Cin[i];
    Cout[2*i+1] = clip((9 * (Cin[i] + Cin[i+1]) - (Cin[i-1] + Cin[i+2]) + 8) >> 4);
    ...
    Cout[2*N-3] = clip((9 * (Cin[N-2] + Cin[N-1]) - (Cin[N-3] + Cin[N-1]) + 8) >> 4);
    Cout[2*N-2] = Cin[N-1];
    Cout[2*N-1] = clip((9 * (Cin[N-1] + Cin[N-1]) - (Cin[N-2] + Cin[N-1]) + 8) >> 4);

where clip() denotes clipping to a range of [0..255].

Note The equations for handling the edges can be mathematically simplified. They are shown in this form to illustrate the clamping effect at the edges of the picture.

In effect, this method calculates each missing value by interpolating the curve over the four adjacent pixels, weighted toward the values of the two nearest pixels (Figure 14). The specific interpolation method used in this example generates missing samples at half-integer positions using a well-known method called Catmull-Rom interpolation, also known as cubic convolution interpolation.

Figure 14. 4:2:0 to 4:2:2 upsampling
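The per-line procedure can be written as a single loop if out-of-range indexes are clamped to the first and last input samples, which reproduces the edge equations listed above. The sketch below takes that approach; upsample_2x and clip8 are names chosen for this example, and the right shift of a negative intermediate value is assumed to be arithmetic.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Clips to the range [0..255].
inline std::uint8_t clip8(int value) {
    return static_cast<std::uint8_t>(std::min(255, std::max(0, value)));
}

// Doubles the number of samples in one line of chroma. Even outputs copy the
// input; odd outputs are interpolated from four neighbors with weights
// (-1, 9, 9, -1) / 16. Indexes past either end are clamped to the edge sample.
inline std::vector<std::uint8_t> upsample_2x(const std::vector<std::uint8_t>& cin) {
    const int n = static_cast<int>(cin.size());
    std::vector<std::uint8_t> out(cin.size() * 2);
    auto at = [&](int i) {
        return static_cast<int>(cin[std::min(n - 1, std::max(0, i))]);
    };
    for (int i = 0; i < n; ++i) {
        out[2 * i] = cin[i];
        out[2 * i + 1] = clip8(
            (9 * (at(i) + at(i + 1)) - (at(i - 1) + at(i + 2)) + 8) >> 4);
    }
    return out;
}
```

For the input line {0, 16, 32, 48} the function returns {0, 7, 16, 24, 32, 41, 48, 49}. The original samples survive unchanged at the even positions, which is the property that lets the 4:2:0 data be recovered from the upsampled result.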

In signal processing terms, the vertical upconversion should ideally include a phase shift compensation to account for the half-pixel vertical offset (relative to the output 4:2:2 sampling grid) between the locations of the 4:2:0 sample lines and the location of every other 4:2:2 sample line. However, introducing this offset would increase the amount of processing required to generate the samples, and make it impossible to reconstruct the original 4:2:0 samples from the upsampled 4:2:2 image. It would also make it impossible to decode video directly into 4:2:2 surfaces and then use those surfaces as reference pictures for decoding subsequent pictures in the stream. Therefore, the method provided here does not take into account the precise vertical alignment of the samples. Ignoring this alignment is probably not visually harmful at reasonably high picture resolutions.

If you start with 4:2:0 video that uses the sampling grid defined in H.261, H.263, or MPEG-1 video, the phase of the output 4:2:2 chroma samples will also be shifted by a half-pixel horizontal offset relative to the spacing on the luma sampling grid (a quarter-pixel offset relative to the spacing of the 4:2:2 chroma sampling grid). However, the MPEG-2 form of 4:2:0 video is probably more commonly used on PCs and does not suffer from this problem. Moreover, the distinction is probably not visually harmful at reasonably high picture resolutions. Trying to correct for this problem would create the same sort of problems discussed for the vertical phase offset.

Converting 4:2:2 YUV to 4:4:4 YUV

Converting 4:2:2 YUV to 4:4:4 YUV requires horizontal upconversion by a factor of two. The method described previously for vertical upconversion can also be applied to horizontal upconversion. For MPEG-2 and ITU-R BT.601 video, this method will produce samples with the correct phase alignment.

Converting 4:2:0 YUV to 4:4:4 YUV

To convert 4:2:0 YUV to 4:4:4 YUV, you can simply follow the two methods described previously. Convert the 4:2:0 image to 4:2:2, and then convert the 4:2:2 image to 4:4:4. You can also switch the order of the two upconversion processes, as the order of operation does not really matter to the visual quality of the result.

Additional Information

To learn more about Microsoft DirectShow, see the DirectShow SDK Documentation.
