mci2004

Multimedia Standards Introduction——专业术语

[Quto from H.261 specification]

The standard supports two video frame sizes: CIF(352x288 luma with 176x144 chroma) and QCIF (176x144 with 88x72 chroma) using a 4:2:0

352x288 luma with 176x144 chroma<----- 这个怎么理解？

1， 4：2：0的采样方法表示在水平方向上采用 2：1(明度比色度)的水平采样，垂直方向上采用 2：1(明度比色度)的采样。

Multimedia Standards Introduction——专业术语_第1张图片

2，所以，luma的resolution(352*288)可以理解成图像的大小(尺寸)，而chroma的resolution(176*144 in this)指的是在luma resoltuion一定的情况下，色度信息的size。

YUV specification

由上面的知识点我们引出了 YUV的概念，下面我们将单独讲解一下YUV的相关知识。

YUV 是一种颜色编码方式，它较RGB的优势是，YUV在颜色编码的过程中，将人的视觉感知纳入考虑。

YUV is a color space typically used as part of a color image pipeline. It encodes a color image or video taking human perception into account, allowing reduced bandwidth for chrominance components, thereby typically enabling transmission errors or compression artifacts to be more efficiently masked by the human perception than using a "direct" RGB-representation. Other color spaces have similar properties, and the main reason to implement or investigate properties of Y'UV would be for interfacing with analog or digital television or photographic equipment that conforms to certain Y'UV standards.
（其中 Y‘ 表示经过 Gamma 矫正后的 Y数据）

Y'UV was invented when engineers wanted color television in a black-and-white infrastructure.

下面的图展示了，U-V 在 y=0.5情况下的调色板

Multimedia Standards Introduction——专业术语_第2张图片

一副图片中，对应的 Y‘ ，U， V数据示例：

YUV数据的抽样，色度抽样，Sampling systems and ratios：

The subsampling scheme is commonly expressed as a three part ratio J:a:b (e.g. 4:2:2), although sometimes expressed as four parts (e.g. 4:2:2:4), that describe the number of luminance and chrominance samples in a conceptual region that is J pixels wide, and 2 pixels high. The parts are (in their respective order):
• J: horizontal sampling reference (width of the conceptual region). Usually, 4.
• a: number of chrominance samples (Cr, Cb) in the first row of J pixels.
• b: number of (additional) chrominance samples (Cr, Cb) in the second row of J pixels.
• Alpha: horizontal factor (relative to first digit). May be omitted if alpha component is not present, and is equal to J when present.
An explanatory image of different chroma subsampling schemes can be seen at the following link: http://lea.hamradio.si/~s51kq/subsample.gif (source: "Basics of Video":http://lea.hamradio.si/~s51kq/V-BAS.HTM) or in details in Chrominance Subsampling in Digital Images, by Douglas Kerr.

Multimedia Standards Introduction——专业术语_第3张图片

The mapping examples given are only theoretical and for illustration. Also note that the diagram does not indicate any chroma filtering, which should be applied to avoid aliasing.
To calculate required bandwidth factor relative to 4:4:4 (or 4:4:4:4), one needs to sum all the factors and divide the result by 12 (or 16, if alpha is present).

YUV 4:2:0

4:2:0又稱I420。I420是YUV格式的一種，屬於 planar format。4:2:0并不意味着只有Y,Cb而没有Cr分量。它指的是对每行扫描线来说，只有一种色度分量以2:1的抽样率存储。相邻的扫描行存储不同的色度分量，也就是说，如果一行是4:2:0的话，下一行就是4:0:2，再下一行是4:2:0...以此类推。对每个色度分量来说，水平方向和竖直方向的抽样率都是2:1，所以可以说色度的抽样率是4:1。 PAL制式和 SECAM制式的色彩系统特别适合于用这种方式来存储。绝大多数视频编解码器都采用这种格式作为标准的输入格式。

映射:

码流
Yo0 Uo0 Yo1 Yo2 Uo2 Yo3
Ye0 Ve0 Ye1 Ye2 Ve2 Ye3

将被映射为下面的两行各四个像素:
[Yo0 Uo0 Ve0] [Yo1 Uo0 Ve0] [Yo2 Uo2 Ve2] [Yo3 Uo2 Ve2]
[Ye0 Uo0 Ve0] [Ye1 Uo0 Ve0] [Ye2 Uo2 Ve2] [Ye3 Uo2 Ve2]

YUV 4:2:0的采样周期和存放方式：

YUV数据的存放方式

YUV formats fall into two distinct groups, the packed formats where Y, U (Cb) and V (Cr) samples are packed together into macropixels which are stored in a single array, and the planar formats where each component is stored as a separate array, the final image being a fusing of the three separate planes.

Packed YUV Formats

click me	click me	click me	click me
Label	FOURCC in Hex	Bits per pixel	Description
AYUV	0x56555941	32	Combined YUV and alpha
CLJR	0x524A4C43	8	Cirrus Logic format with 4 pixels packed into a u_int32. A form of YUV 4:1:1 wiht less than 8 bits per Y, U and V sample.
cyuv	0x76757963	16	Essentially a copy of UYVY except that the sense of the height is reversed - the image is upside down with respect to the UYVY version.
GREY	0x59455247	8	Apparently a duplicate of Y800 (and also, presumably, "Y8 ")
IRAW	0x57615349	?	Intel uncompressed YUV. I have no information on this format - can you help?
IUYV	0x56595549	16	Interlaced version of UYVY (line order 0, 2, 4,....,1, 3, 5....) registered by Silviu Brinzei of LEAD Technologies.
IY41	0x31345949	12	Interlaced version of Y41P (line order 0, 2, 4,....,1, 3, 5....) registered by Silviu Brinzei of LEAD Technologies.
IYU1	0x31555949	12	12 bit format used in mode 2 of the IEEE 1394 Digital Camera 1.04 spec. This is equivalent to Y411
IYU2	0x32555949	24	24 bit format used in mode 0 of the IEEE 1394 Digital Camera 1.04 spec
HDYC	0x43594448	16	YUV 4:2:2 (Y sample at every pixel, U and V sampled at every second pixel horizontally on each line). A macropixel contains 2 pixels in 1 u_int32. This is a suplicate of UYVY except that the color components use the BT709 color space (as used in HD video).
UYNV	0x564E5955	16	A direct copy of UYVY registered by NVidia to work around problems in some old codecs which did not like hardware which offered more than 2 UYVY surfaces.
UYVP	0x50565955	24?	YCbCr 4:2:2 extended precision 10-bits per component in U0Y0V0Y1 order. Registered by Rich Ehlers of Evans & Sutherland. (Awaiting confirmation of component packing structure)
UYVY	0x59565955	16	YUV 4:2:2 (Y sample at every pixel, U and V sampled at every second pixel horizontally on each line). A macropixel contains 2 pixels in 1 u_int32.
V210	0x30313256	32	10-bit 4:2:2 YCrCb equivalent to the Quicktime format of the same name.
V422	0x32323456	16	I am told that this is an upside down version of UYVY.
V655	0x35353656	16?	16 bit YUV 4:2:2 format registered by Vitec Multimedia. I have no information on the component ordering or packing.
VYUY	0x59555956	?	ATI Packed YUV Data (format unknown but you can get hold of a codec supporting it here)
Y422	0x32323459	16	Direct copy of UYVY as used by ADS Technologies Pyro WebCam firewire camera.
YUY2	0x32595559	16	YUV 4:2:2 as for UYVY but with different component ordering within the u_int32 macropixel.
YUYV	0x56595559	16	Duplicate of YUY2
YUNV	0x564E5559	16	A direct copy of YUY2 registered by NVidia to work around problems in some old codecs which did not like hardware which offered more than 2 YUY2 surfaces.
YVYU	0x55595659	16	YUV 4:2:2 as for UYVY but with different component ordering within the u_int32 macropixel.
Y41P	0x50313459	12	YUV 4:1:1 (Y sample at every pixel, U and V sampled at every fourth pixel horizontally on each line). A macropixel contains 8 pixels in 3 u_int32s.
Y411	0x31313459	12	YUV 4:1:1 with a packed, 6 byte/4 pixel macroblock structure.
Y211	0x31313259	8	Packed YUV format with Y sampled at every second pixel across each line and U and V sampled at every fourth pixel.
Y41T	0x54313459	12	Format as for Y41P but the lsb of each Y component is used to signal pixel transparency .
Y42T	0x54323459	16	Format as for UYVY but the lsb of each Y component is used to signal pixel transparency .
YUVP	0x50565559	24?	YCbCr 4:2:2 extended precision 10-bits per component in Y0U0Y1V0 order. Registered by Rich Ehlers of Evans & Sutherland.
Y800	0x30303859	8	Simple, single Y plane for monochrome images.
Y8	0x20203859	8	Duplicate of Y800 as far as I can see.
Y16	0x20363159	16	16-bit uncompressed greyscale image.

YUY2 (and YUNV and V422 and YUYV)YUY2 is another in the family of YUV 4:2:2 formats and appears to be used by all the same codecs as UYVY.

click me	click me	click me
	Horizontal	Vertical
Y Sample Period	1	1
V Sample Period	2	1
U Sample Period	2	1

Effective bits per pixel : 16
Positive biHeight implies top-down image (top line first)

Planar YUV Formats

click me	click me	click me	click me
Label	FOURCC in Hex	Bits per pixel	Description
YVU9	0x39555659	9	8 bit Y plane followed by 8 bit 4x4 subsampled V and U planes. Registered by Intel.
YUV9	0x39565559	9?	Registered by Intel., this is the format used internally by Indeo video code
IF09	0x39304649	9.5	As YVU9 but an additional 4x4 subsampled plane is appended containing delta information relative to the last frame. (Bpp is reported as 9)
YV16	0x36315659	16	8 bit Y plane followed by 8 bit 2x1 subsampled V and U planes.
YV12	0x32315659	12	8 bit Y plane followed by 8 bit 2x2 subsampled V and U planes.
I420	0x30323449	12	8 bit Y plane followed by 8 bit 2x2 subsampled U and V planes.
IYUV	0x56555949	12	Duplicate FOURCC, identical to I420.
NV12	0x3231564E	12	8-bit Y plane followed by an interleaved U/V plane with 2x2 subsampling
NV21	0x3132564E	12	As NV12 with U and V reversed in the interleaved plane
IMC1	0x31434D49	12	As YV12 except the U and V planes each have the same stride as the Y plane
IMC2	0x32434D49	12	Similar to IMC1 except that the U and V lines are interleaved at half stride boundaries
IMC3	0x33434D49	12	As IMC1 except that U and V are swapped
IMC4	0x34434D49	12	As IMC2 except that U and V are swapped
CLPL	0x4C504C43	12	Format similar to YV12 but including a level of indirection.
Y41B	0x42313459	12?	Weitek format listed as "YUV 4:1:1 planar". I have no other information on this format.
Y42B	0x42323459	16?	Weitek format listed as "YUV 4:2:2 planar". I have no other information on this format.
Y800	0x30303859	8	Simple, single Y plane for monochrome images.
Y8	0x20203859	8	Duplicate of Y800 as far as I can see.
CXY1	0x31595843	12	Awaiting clarification of format.
CXY2	0x32595842	16	Awaiting clarification of format.

关于YUV 420 平面存储方式，上面有提到过，这里不赘述。思考，彩色电视机-->黑白电视机的转变？

答：由于 YUV 420被应用于电视信号的传播，而又由于起平面存储方式（planar mode），所以彩色的信号传递给黑白电视机，黑白电视机只需要处理Y分量上的数据就可以了，而忽略掉其后的色度信息。

什么是MP3

MP3并不是指 MPEG-3，事实上MPEG标准里，MPEG-3的方案制定已经终止，废弃了。因为，Moving Picture Experts Group在准备指定MPEG-3的时候，发现MPEG-2的标准已经足够强大，所以MPEG-3被废弃了。

现在的MP3通常指，MPEG-1 or MPEG-2 Audio Layer III，对应标准号分别为，ISO/IEC 11172-3, ISO/IEC 13818-3。

Multimedia Standards Introduction——专业术语_第4张图片

MPEG-1/Audio/Layer3：

MPEG-1 Layer 3 (or MP3) is a 1- or 2-channel perceptual audio coder that provides excellent compression of music signals. Compared to Layer 1 and Layer 2 it provides a higher compression efficiency. It can typically compress high quality audio CD data by a factor of 12 while maintaining a high audio quality. In general MP3 is appropriate for applications involving storage or transmission of mono or stereo music or other audio signals. Since it is implemented on virtually all digital audio devices playback is always ensured.

MPEG-2/Audio/Layer3：

与MPEG-1该部分完全向后兼容.

MP3的常见bit rate：

MP3的常见采样频率：

Multimedia Standards Introduction——专业术语_第5张图片

MP3各个Layer的采样个数:

Multimedia Standards Introduction——专业术语_第6张图片

思考如何计算一帧MP3数据的时长？

一帧数据的时间长度 = 帧的采样个数（1152） * 采样频率（1/44100Hz）

思考如何计算mp3文件的duration，在CBR/VBR（固定bit率/可变bit率）的情况下？

✨【CosyVoice2-0.5B 实战】Segmentation fault (core dumped) 终极解决方案（保姆级教程）杨靳言先语音识别语音生成 python 人工智能
【CosyVoice2-0.5B实战】Segmentationfault(coredumped)终极解决方案|torchaudio.save崩溃全流程排查与替代方案（保姆级教程）“运行没报错就是胜利，结果没崩溃就是奇迹。”——每一位搞TTS的开发者内心独白本文聚焦使用CosyVoice2-0.5B进行TTS推理过程中，常见的torchaudio.save()崩溃问题——Segmentationfa
Pydub音频处理库核心API详解滕娴殉
Pydub音频处理库核心API详解pydubManipulateaudiowithasimpleandeasyhighlevelinterface项目地址:https://gitcode.com/gh_mirrors/py/pydub概述Pydub是一个功能强大的Python音频处理库，它提供了简洁直观的API来处理各种音频操作。本文将深入解析Pydub的核心功能，帮助开发者快速掌握音频处理的关键
强化学习 16G实践以下是基于CQL（Conservative Q-Learning）与QLoRA（Quantized Low-Rank Adaptation）结合的方案相关开源项目及资源，【ai技】行云流水AI笔记开源人工智能
根据你提供的CUDA版本（11.5）和NVIDIA驱动错误信息，以下是PyTorch、TensorFlow的兼容版本建议及环境修复方案：1.版本兼容性表框架兼容CUDA版本推荐安装命令（CUDA11.5）PyTorch11.3/11.6pipinstalltorchtorchvisiontorchaudio--extra-index-urlhttps://download.pytorch.org/
高通 audio pal 配置文件盼雨落，等风起 audio 音视频
一、PAL配置文件解析1.mixer_paths.xml-硬件控制中枢核心作用：物理通路定义：建立Codec寄存器到音频端点的信号链路动态控制：运行时通过ALSAControlAPI（如amixerset"SpkrLeftPAVolume"25）实时调整参数平台适配：文件命名规则mixer_paths__.xml（如mixer_paths_sm8550-demo.xml）调试技巧：使用tinymi
九、buildroot系统 usb配置
3.3、usb配置源码中kernel默认已经打开了相关的usb配置，只需要在buildroot中打开相关配置。1、基本功能类别简称功能描述ADB(AndroidDebugBridge)ADB是一种功能多样的命令行调试工具，可以实现文件传输，UnixShell登录等功能。UAC（USBAudioClass）UAC通过USB虚拟标准PCM接口给Host设备，实现Device和Host之间音频互传功能。
Android实时获取声音音量大小泓博 android
使用AudioRecord实时获取音量创建一个AudioRecord实例并持续读取音频数据，计算音量大小。AudioRecord适用于需要原始音频数据的场景。privatevoidstartRecording(){intminBufferSize=AudioRecord.getMinBufferSize(SAMPLE_RATE,AudioFormat.CHANNEL_IN_MONO,AudioFo
Android15音频进阶之MIC设备通路之间对应关系(一百二十四) Android系统攻城狮 Android Audio工程师进阶系列 Android15 AudioReach 音频高通
简介：CSDN博客专家、《Android系统多媒体进阶实战》一书作者新书发布：《Android系统多媒体进阶实战》优质专栏：Audio工程师进阶系列【原创干货持续更新中……】优质专栏：多媒体系统工程师系列【原创干货持续更新中……】优质视频课程：AAOS车载系统+AOSP14系统攻城狮入门视频实战课
torch-gpu版本 anaconda配置教程 GXYGGYXG python
教程Pytorch的GPU版本安装，在安装anaconda的前提下安装pytorch_pytorch-gpu-CSDN博客版本对应PyTorch中torch、torchvision、torchaudio、torchtext版本对应关系_torch2.0.1对应的torchvision-CSDN博客cuda下载地址CUDAToolkitArchive|NVIDIADevelopercudacudnn
Android端直播SDK实现方案
概述直播系统的架构总体上分为采集模块、预览模块、处理模块、编码模块、推流模块。把这五个模块串联起来就构成了整个直播系统的数据流。如下图所示：音频采集：采集原始的PCM数据。音频处理：对音频进行混音消除、降噪、自动增益等处理。音频编码：把PCM格式的数据编码为AAC格式。视频采集：相机/屏幕流的采集；YUV格式或者纹理格式。视频处理：对视频进行美颜/滤镜等处理。预览：把视频处理后的视频流在屏幕上进行
微软ASR与开源模型分析老兵发新帖 microsoft 开源
一、微软ASR核心能力1.支持场景场景功能实时语音转文本低延迟流式识别（会议字幕/直播转录）音频文件转文本支持多种格式（WAV/MP3等），批量处理长音频定制化模型针对特定行业术语（医疗/金融）训练专属模型多语言混合识别中英文混合、方言识别（如中文普通话+粤语）说话人分离区分不同发言人（声纹识别）2.关键性能指标识别准确率：中文普通话>95%（安静环境）英文>96%（MicrosoftResear
ffmpeg（六）：图片与视频互转命令却道天凉_好个秋 #ffmpeg命令 ffmpeg 音视频
图像序列转视频（多张图片➜视频）ffmpeg-framerate25-iimage_%03d.jpg-c:vlibx264-pix_fmtyuv420poutput.mp4参数说明：image_%03d.jpg：文件名格式（如image_001.jpg、image_002.jpg）。-framerate25：输入帧率（25fps）。-c:vlibx264：使用H.264编码。-pix_fmtyuv
H5新增特性大全小夏啥也不会 html中的新特性 video audio css html5 前端
一、HTML概述1.1什么是HTMLHTML5是HTML最新的修订版本（超文本标记语言的第五次重大修改），2014年10月由万维网联盟（W3C）完成标准制定。HTML5的设计目的是为了在移动设备上支持多媒体。HTML5简单易学，HTML5是下一代HTML标准。1.2HTML中的新特性用于绘画的canvas元素用于媒介回放的video和audio元素对本地离线存储的更好的支持新的特殊内容元素，比如a
Android10 音频系统之HAL分析 @OuYang 音视频
一、AudioHAL架构分析Android音频架构定义了如何实现音频功能，并指出实现过程中涉及的相关源码Applicationframeworkapplicationframework包括应用程序代码，该代码使用android.media包中的API接口去与音频硬件交互。在内部，这些代码通过jni去访问与硬件交互的native层的代码。JNI与android.media相关的jni代码会调用nat
【libyuv】windows cmake 构建 for webrtc 等风来不如迎风去 WebRTC入门与实战 windows git bash libyuv
使用vs直接构建webrtc的部分源码，发现libyuv是webrtc源码的依赖库，会有链接错误官方说明https://github.com/frankpapenmeier/libyuv/blob/master/docs/getting_started.md看起来官方灭有推荐windows用cmake构建实测，用cmake也是可以的。deptoolsYou’llneedtohavedepottoo
深度学习Day-38：Pytorch文本分类入门 Point__Nemo 深度学习自然语言处理人工智能
本文为：[365天深度学习训练营]中的学习记录博客原作者：[K同学啊|接辅导、项目定制]任务：了解文本分类的基本流程学习常用数据清洗方法学习如何使用jieba实现英文分词学习如何构建文本向量1.前期准备1.1环境安装pipinstalltorchvision==0.15.0pipinstalltorchaudio==2.0.1pipinstalltorch==2.0.01.2加载数据importt
Ubuntu24.04 ProteinMPNN安装 lamovrevx pytorch 人工智能深度学习
安装建立环境，python=3.9condacreate--nameproteinmpnnpython=3.9condaactivateproteinmpnncondainstallpytorch=1.12.0torchvision=0.13.0torchaudiocudatoolkit=11.3-cpytorch#不指定的话cudapytorch和GPU又不能好好配合#验证pytorchimpo
Android GlSurfaceView渲染YUV图形菠萝加点糖 android OpenGL
OpenGLES2.0的代码，用来显示YUV格式的视频数据。这个示例将包括初始化OpenGL环境、加载Shader程序、绘制纹理等步骤importandroid.content.Context;importandroid.opengl.GLES20;importjava.nio.ByteBuffer;importjava.nio.ByteOrder;importjava.nio.FloatBuff
[特殊字符] 一键搭建AI语音助理：基于DashScope+GRadio的智能聊天机器人技术全解来自于狂人人工智能机器人
一、项目核心技术架构（图1）交互层核心模块pyaudio实时采集流式响应PCM编码GRadio界面状态控制实时对话展示语音输出历史记录管理ASR回调类ASR语音识别聊天处理引擎GPT大模型处理语音合成回调TTS语音合成语音输入DashScopeAPI二、四大核心技术实现1.智能语音识别引擎（附关键源码注释）classASRCallback(TranslationRecognizerCallback
Qt音频采集：QAudioInput详解与示例
1.简介QAudioInput是QtMultimedia模块中用于音频采集的核心类，能够从麦克风等输入设备实时获取原始音频数据（PCM格式）。本文将通过原理讲解和代码示例，帮助开发者快速掌握音频采集的核心技术。2.核心功能支持多种音频格式（采样率/声道/位深）提供实时音频流访问自动管理音频设备资源支持多平台（Windows/Linux/macOS/移动端）3.开发准备3.1环境要求#.pro文件添
【音视频】PJSIP库——pjsua命令使用详解郭老二视频音视频
1、源码编译1）安装依赖库sudoaptinstalllibsrtp2-devsudoaptinstalllibopus-devalsa-toolslibalsaplayer-devffmpeglibalsa*pulseaudio-module-jacksudoaptinstalljackdlibjack-jackd2-devlibjack-devlibsdl2-devlibv4l-devliba
Unreal 文件夹命名----理解引擎坤坤子的世界 unreal unreal
一个项目一般包括两个文件夹：Assert（资源）和Maps（管卡文件夹）这两大部分。在资源文件夹里一般包括：声音（Audio）、蓝图（BlueprintBP）、特效（effect）、材质（Materials）、网格（Mesh）、纹理贴图（Textures）等文件，其中一般材质很多时，材质可按布料、玻璃、地面、金属、木制等进行进一步细分。
【学习笔记】码率&带宽&RGB&YUV HaiQinyanAN 工作中的学习笔记学习笔记
【学习笔记】码率&带宽&RGB&YUV一、码率码率（BitRate）指单位时间内传输或存储的比特（bit）数量，单位为bps（bitpersecond，比特/秒），常用单位还有kbps（千比特/秒）、Mbps（兆比特/秒）。码率：相当于“每秒需要流出的水量”——水流需求（码率）不能超过水管直径的最大承载能力（带宽），否则水流会中断（视频卡顿）。1080p电影若码率为20Mbps，画面细节（如人物发
Win10/11: Windows Audio无法启动错误 0x80070005:拒绝访问积跬步至千里PRO Windows windows
解决办法进入目录C:\Windows\System32，找到cmd.exe，右键->以管理员身份运行在cmd窗口中输入：netlocalgroupAdministrators/addnetworkservice，回车在cmd窗口中输入：netlocalgroupAdministrators/addlocalservice，回车右击我的电脑-管理-服务和运用程序-服务，找到WindowsAudio-
Mac电脑-媒体文件格式转换-Permute 2401_88856700 媒体 mac macos 格式转换
Permute是一款功能强大的媒体文件格式转换工具。支持多种音视频和图像格式，包括但不限于MP4、AVI、MOV、MKV、MP3、WAV、FLAC、JPEG、PNG等。操作界面简洁明了，只需拖拽文件或点击添加按钮来选择需要转换的文件。转换设置区域，可自由选择输出格式、输出路径、输出参数等，实现个性化转换。原文地址：Permute媒体文件格式转换工具
鸿蒙AI语音翻译便签应用设计与实现鸿蒙大白 ui ArKUI-X wpf 物联网 HarmonyOS5 仓颉
鸿蒙AI语音翻译便签应用设计与实现一、系统架构设计基于HarmonyOS的AI能力和分布式技术，我们设计了一个语音翻译便签应用，能够实时将语音输入转换为文字并进行翻译，最终生成多语言便签，支持跨设备同步。https://example.com/ai-voice-translator-arch.png系统包含三个核心模块：语音识别模块-使用@ohos.multimedia.audio和AI语音识别服
Unreal Engine：声音设计与音频集成技术教程_2024-07-13_00-24-34.Tex chenjj4003 游戏开发虚幻音视频 javascript unity ar 游戏引擎网络
UnrealEngine：声音设计与音频集成技术教程声音设计基础音频格式与质量在声音设计中，理解音频格式和质量至关重要。不同的格式适用于不同的场景，而音频质量则直接影响游戏体验的沉浸感。音频格式WAV(WaveformAudioFileFormat)WAV是一种无损音频格式，保留了原始音频的所有数据，适用于编辑和处理阶段，但文件大小较大，不适合游戏中的实时加载。MP3(MPEG-1AudioLay
MP34DT05TR-A MEMS音频传感器全向数字麦克风：122.5dB AOP抗爆破音设计在工业警报系统中的应用验证 Hailey深力科 MP34DT05TR-A MEMS麦克风 MEMS音频传感器全向数字麦克风
一、产品架构与核心性能MP34DT05TR-A采用硅微加工电容传感单元+CMOSASIC双芯片集成架构，通过PDM接口输出数字音频流。其突破性在于：122.5dB声学过载点(AOP)：超越消费级麦克风常规100dB极限，耐受强声压冲击64dBSNR：1kHz频点底噪低至29dBA，保留语音高频细节（>6kHz）-26dBFS±3dB灵敏度一致性：产线匹配公差缩小50%，降低阵列设计校准成本二、关键
RGB与YUV格式的转换五月的鱼
一、实验原理：1.图像数据存储方式图像中RGB以像素为单位，存储顺序为B、G、RYUV以整幅图为单位，先存Y，亮度分量，再存U、V，色差分量分别提取rgb图片和yuv图片的RGB与YUV数值，通过转换公式，即可得到另一种图像格式所需数值，再写入新图像，即可转换图像格式。2.RGB与YUV转换关系由电视原理可知，亮度和色差信号的构成如下：Y=0.2990R+0.5870G+0.1140BR-Y＝0.
python rgb转yuv_python – rgb转换为yuv并访问Y,U和V通道 weixin_39564368 python rgb转yuv
我一直在寻找这种转换.有什么方法可以在Linux上使用Python将RGB图像转换为YUV图像并访问Y,U和V通道？(使用opencv,skimage等等…)更新：我用过opencvimg_yuv=cv2.cvtColor(image,cv2.COLOR_BGR2YUV)y,u,v=cv2.split(img_yuv)cv2.imshow('y',y)cv2.imshow('u',u)cv2.im
HTML5 更新的功能 TE-茶叶蛋面试复习系列 html知识 html5 前端 html
文章目录前言**一、语义化标签（SemanticElements）****二、多媒体支持（Audio&Video）****三、图形与绘图（Canvas&SVG）****1.``****2.SVG内联支持****四、表单增强（FormFeatures）****1.新输入类型****2.新属性****五、本地存储（WebStorage）****六、地理定位（Geolocation）****七、拖放AP
JAVA基础灵静志远位运算加载 Date 字符串池覆盖
一、类的初始化顺序 1 （静态变量，静态代码块）-->（变量，初始化块）--> 构造器同一括号里的，根据它们在程序中的顺序来决定。上面所述是同一类中。如果是继承的情况，那就在父类到子类交替初始化。二、String 1 String a = "abc"; JAVA虚拟机首先在字符串池中查找是否已经存在了值为"abc"的对象，根
keepalived实现redis主从高可用 bylijinnan redis
方案说明两台机器（称为A和B），以统一的VIP对外提供服务 1.正常情况下，A和B都启动，B会把A的数据同步过来（B is slave of A） 2.当A挂了后，VIP漂移到B；B的keepalived 通知redis 执行：slaveof no one，由B提供服务 3.当A起来后，VIP不切换，仍在B上面；而A的keepalived 通知redis 执行slaveof B，开始
java文件操作大全 0624chenhong java
最近在博客园看到一篇比较全面的文件操作文章，转过来留着。 http://www.cnblogs.com/zhuocheng/archive/2011/12/12/2285290.html 转自http://blog.sina.com.cn/s/blog_4a9f789a0100ik3p.html 一.获得控制台用户输入的信息 &nbs
android学习任务不懂事的小屁孩工作
任务完成情况搞清楚带箭头的pupupwindows和不带的使用已完成熟练使用pupupwindows和alertdialog，并搞清楚两者的区别已完成熟练使用android的线程handler,并敲示例代码进行中了解游戏2048的流程，并完成其代码工作进行中-差几个actionbar 研究一下android的动画效果，写一个实例已完成复习fragem
zoom.js 换个号韩国红果果 oom
它的基于bootstrap 的 https://raw.github.com/twbs/bootstrap/master/js/transition.js transition.js模块引用顺序 <link rel="stylesheet" href="style/zoom.css"> <script src=&q
详解Oracle云操作系统Solaris 11.2 蓝儿唯美 Solaris
当Oracle发布Solaris 11时，它将自己的操作系统称为第一个面向云的操作系统。Oracle在发布Solaris 11.2时继续它以云为中心的基调。但是，这些说法没有告诉我们为什么Solaris是配得上云的。幸好，我们不需要等太久。Solaris11.2有4个重要的技术可以在一个有效的云实现中发挥重要作用：OpenStack、内核域、统一存档（UA）和弹性虚拟交换（EVS）。
spring学习——springmvc（一） a-john springMVC
Spring MVC基于模型-视图-控制器（Model-View-Controller，MVC）实现，能够帮助我们构建像Spring框架那样灵活和松耦合的Web应用程序。 1，跟踪Spring MVC的请求请求的第一站是Spring的DispatcherServlet。与大多数基于Java的Web框架一样，Spring MVC所有的请求都会通过一个前端控制器Servlet。前
hdu4342 History repeat itself-------多校联合五 aijuans 数论
水题就不多说什么了。 #include<iostream>#include<cstdlib>#include<stdio.h>#define ll __int64using namespace std;int main(){ int t; ll n; scanf("%d",&t); while(t--)
EJB和javabean的区别 asia007 bean ejb
EJB不是一般的JavaBean,EJB是企业级JavaBean,EJB一共分为3种,实体Bean,消息Bean,会话Bean,书写EJB是需要遵循一定的规范的,具体规范你可以参考相关的资料.另外,要运行EJB,你需要相应的EJB容器,比如Weblogic,Jboss等,而JavaBean不需要,只需要安装Tomcat就可以了 1.EJB用于服务端应用开发, 而JavaBeans
Struts的action和Result总结百合不是茶 struts Action配置 Result配置
一:Action的配置详解: 下面是一个Struts中一个空的Struts.xml的配置文件 <?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE struts PUBLIC &quo
如何带好自已的团队 bijian1013 项目管理团队管理团队
在网上看到博客" 怎么才能让团队成员好好干活"的评论，觉得写的比较好。原文如下：我做团队管理有几年了吧，我和你分享一下我认为带好团队的几点： 1.诚信对团队内成员，无论是技术研究、交流、问题探讨，要尽可能的保持一种诚信的态度，用心去做好，你的团队会感觉得到。 2.努力提
Java代码混淆工具 sunjing ProGuard
Open Source Obfuscators ProGuard http://java-source.net/open-source/obfuscators/proguardProGuard is a free Java class file shrinker and obfuscator. It can detect and remove unused classes, fields, m
【Redis三】基于Redis sentinel的自动failover主从复制 bit1129 redis
在第二篇中使用2.8.17搭建了主从复制，但是它存在Master单点问题，为了解决这个问题，Redis从2.6开始引入sentinel，用于监控和管理Redis的主从复制环境，进行自动failover，即Master挂了后，sentinel自动从从服务器选出一个Master使主从复制集群仍然可以工作，如果Master醒来再次加入集群，只能以从服务器的形式工作。什么是Sentine
使用代理实现Hibernate Dao层自动事务白糖_ DAO spring AOP 框架 Hibernate
都说spring利用AOP实现自动事务处理机制非常好，但在只有hibernate这个框架情况下，我们开启session、管理事务就往往很麻烦。 public void save(Object obj){ Session session = this.getSession(); Transaction tran = session.beginTransaction(); try
maven3实战读书笔记 braveCS maven3
Maven简介是什么？ Is a software project management and comprehension tool.项目管理工具是基于POM概念(工程对象模型) [设计重复、编码重复、文档重复、构建重复，maven最大化消除了构建的重复] [与XP：简单、交流与反馈；测试驱动开发、十分钟构建、持续集成、富有信息的工作区] 功能：
编程之美-子数组的最大乘积 bylijinnan 编程之美
public class MaxProduct { /** * 编程之美子数组的最大乘积 * 题目: 给定一个长度为N的整数数组，只允许使用乘法，不能用除法，计算任意N-1个数的组合中乘积中最大的一组，并写出算法的时间复杂度。 * 以下程序对应书上两种方法，求得“乘积中最大的一组”的乘积——都是有溢出的可能的。 * 但按题目的意思，是要求得这个子数组，而不
读书笔记-2 chengxuyuancsdn 读书笔记
1、反射 2、oracle年-月-日时-分-秒 3、oracle创建有参、无参函数 4、oracle行转列 5、Struts2拦截器 6、Filter过滤器(web.xml) 1、反射 (1)检查类的结构在java.lang.reflect包里有3个类Field,Method,Constructor分别用于描述类的域、方法和构造器。 2、oracle年月日时分秒 s
[求学与房地产]慎重选择IT培训学校 comsci it
关于培训学校的教学和教师的问题,我们就不讨论了,我主要关心的是这个问题培训学校的教学楼和宿舍的环境和稳定性问题我们大家都知道，房子是一个比较昂贵的东西，特别是那种能够当教室的房子... &nb
RMAN配置中通道(CHANNEL)相关参数 PARALLELISM 、FILESPERSET的关系 daizj oracle rman filesperset PARALLELISM
RMAN配置中通道(CHANNEL)相关参数 PARALLELISM 、FILESPERSET的关系转 PARALLELISM --- 我们还可以通过parallelism参数来指定同时"自动"创建多少个通道： RMAN > configure device type disk parallelism 3 ; 表示启动三个通道，可以加快备份恢复的速度。
简单排序:冒泡排序 dieslrae 冒泡排序
public void bubbleSort(int[] array){ for(int i=1;i<array.length;i++){ for(int k=0;k<array.length-i;k++){ if(array[k] > array[k+1]){
初二上学期难记单词三 dcj3sjt126com sciet
concert 音乐会 tonight 今晚 famous 有名的；著名的 song 歌曲 thousand 千 accident 事故；灾难 careless 粗心的，大意的 break 折断；断裂；破碎 heart 心（脏） happen 偶尔发生，碰巧 tourist 旅游者；观光者 science （自然）科学 marry 结婚 subject 题目；
I.安装Memcahce 1. 安装依赖包libevent Memcache需要安装libevent,所以安装前可能需要执行 Shell代码收藏代码 dcj3sjt126com redis
wget http://download.redis.io/redis-stable.tar.gz tar xvzf redis-stable.tar.gz cd redis-stable make 前面3步应该没有问题，主要的问题是执行make的时候，出现了异常。异常一： make[2]: cc: Command not found 异常原因：没有安装g
并发容器 shuizhaosi888 并发容器
通过并发容器来改善同步容器的性能，同步容器将所有对容器状态的访问都串行化，来实现线程安全，这种方式严重降低并发性，当多个线程访问时，吞吐量严重降低。并发容器ConcurrentHashMap 替代同步基于散列的Map，通过Lock控制。 &nb
Spring Security（12）——Remember-Me功能 234390216 Spring Security Remember Me 记住我
Remember-Me功能目录 1.1 概述 1.2 基于简单加密token的方法 1.3 基于持久化token的方法 1.4 Remember-Me相关接口和实现
位运算焦志广位运算
一、位运算符Ｃ语言提供了六种位运算符： & 按位与 | 按位或 ^ 按位异或 ~ 取反 << 左移 >> 右移 1. 按位与运算按位与运算符"&"是双目运算符。其功能是参与运算的两数各对应的二进位相与。只有对应的两个二进位均为1时，结果位才为1 ，否则为0。参与运算的数以补码方式出现。例如：9&am
nodejs 数据库连接 mongodb mysql liguangsong mongodb mysql node 数据库连接
1.mysql 连接 package.json中dependencies加入 "mysql":"~2.7.0" 执行 npm install 在config 下创建文件 database.js
java动态编译 olive6615 java HotSpot jvm 动态编译
在HotSpot虚拟机中，有两个技术是至关重要的，即动态编译(Dynamic compilation)和Profiling。 HotSpot是如何动态编译Javad的bytecode呢？Java bytecode是以解释方式被load到虚拟机的。HotSpot里有一个运行监视器，即Profile Monitor,专门监视
Storm0.9.5的集群部署配置优化 roadrunners 优化 storm.yaml
nimbus结点配置（storm.yaml）信息： # Licensed to the Apache Software Foundation (ASF) under one # or more contributor license agreements. See the NOTICE file # distributed with this work for additional inf
101个MySQL 的调节和优化的提示 tomcat_oracle mysql
　1. 拥有足够的物理内存来把整个InnoDB文件加载到内存中——在内存中访问文件时的速度要比在硬盘中访问时快的多。　　2. 不惜一切代价避免使用Swap交换分区 – 交换时是从硬盘读取的，它的速度很慢。　　3. 使用电池供电的RAM（注：RAM即随机存储器）。　　4. 使用高级的RAID（注：Redundant Arrays of Inexpensive Disks，即磁盘阵列
zoj 3829 Known Notation(贪心) 阿尔萨斯 ZOJ
题目链接：zoj 3829 Known Notation 题目大意：给定一个不完整的后缀表达式，要求有2种不同操作，用尽量少的操作使得表达式完整。解题思路：贪心，数字的个数要要保证比∗的个数多1，不够的话优先补在开头是最优的。然后遍历一遍字符串，碰到数字+1，碰到∗-1,保证数字的个数大于等1，如果不够减的话，可以和最后面的一个数字交换位置（用栈维护十分方便），因为添加和交换代价都是1