u013250861

音频-特征提取：①幅度谱（短时傅里叶变换谱/STFT）、②梅尔频谱（mel-spectrogram）、③梅尔倒谱（MFCC）【在梅尔频谱上取对数，做DCT（离散余弦变换）变换，得梅尔倒谱】

原始信号

从音频文件中读取出来的原始语音信号通常称为raw waveform，是一个一维数组，长度是由音频长度和采样率决定，比如采样率Fs为16KHz，表示一秒钟内采样16000个点，这个时候如果音频长度是10秒，那么raw waveform中就有160000个值，值的大小通常表示的是振幅。

一、幅度谱（spectrogram）/ STFT

声音信号是一维信号，直观上只能看到时域信息，不能看到频域信息。

通过傅里叶变换(FT)可以变换到频域，但是丢失了时域信息，无法看到时频关系。为了解决这个问题，产生了很多方法，短时傅里叶变换，小波等都是很常用的时频分析方法。

短时傅里叶变换(STFT)，就是对短时的信号做傅里叶变换。原理如下：对一段长语音信号，分帧、加窗，再对每一帧做傅里叶变换，之后把每一帧的结果沿时间维度堆叠，得到一张图（类似于二维信号），这张图就是声谱图。

librosa.stft(y, *, n_fft=2048, hop_length=None, 
				   win_length=None, window='hann', center=True, dtype=None, pad_mode='constant')

def stft(
    y, # 音频时间序列
    *,
    n_fft=2048,
    hop_length=None,
    win_length=None,
    window="hann",
    center=True,
    dtype=None,
    pad_mode="constant",
)

参数：
y：音频时间序列
n_fft：FFT窗口大小，n_fft=hop_length+overlapping
hop_length：帧移。
spectrum = np.abs(librosa.stft(frame, n_fft=self.nfft))，未指定hop_length时，则默认win_length / 4
spectrum = np.abs(librosa.stft(frame, n_fft=self.nfft, hop_length=len(frame)))时，如果帧移长度小于傅里叶变换点数，librosa.stft输出为hop_length+1
spectrum = np.abs(librosa.stft(frame, n_fft=self.nfft, hop_length=self.nfft))时，无论win_length设置为帧长还是nfft，librosa.stft输出都只有一帧。
最后得出结论librosa.stft的输出帧数为speech_length // hop_length + 1
win_length：每一帧音频都由window()加窗。窗长win_length，然后用零填充以匹配n_fft。
默认win_length=n_fft。
window：字符串，元组，数字，函数 shape =（n_fft, )
窗口（字符串，元组或数字）
窗函数，例如scipy.signal.hanning
长度为n_fft的向量或数组
center：bool
如果为True，则填充信号y，以使帧 D [:, t]以y [t * hop_length]为中心
如果为False，则D [:, t]从y [t * hop_length]开始
dtype：D的复数值类型。默认值为64-bit complex复数
pad_mode：如果center = True，则在信号的边缘使用填充模式。默认情况下，STFT使用reflection padding
返回：一个复数矩阵使得D(f,t) STFT矩阵 shape =（1 + nfft/2，t）其中：
n_fft/2是因为实数FFT信号具有对称性，我们只需要去一般的数据分析即可，全部返回有数据冗余。
n_frames: n_frames = (speech_len) // hop_len + 1。具体可以画图，信号处理之前首先需要padding, padding之后分帧，画图可以看到，真正与帧数有关系的，是hop_len。

参数详解：

y：时间序列，通常是音频信号，表示为浮点值的一维 numpy.ndarray。 y[t]对应样本在t处的波形幅度；幅度通常是作为最初拾取音频的麦克风或接收器设备周围压力变化的函数来测量的。y.shape = (64000,) 表示在整个音频中仅在一个通道（单声道）上录制了 64000(total_number_of_samples) 样本；
n_fft：FFT窗口大小， $n_fft = hop_length + overlapping \text{n\_fft = hop\_length + overlapping}$ ；
win_length：每一帧音频都由window()加窗。窗长为win_length，然后用0进行padding来匹配n_fft；默认 $win_length = n_fft \text{win\_length = n\_fft}$ ；
hop_length：帧移，如果不指定，则默认 $hop_length = win_length / 4 \text{hop\_length = win\_length / 4}$ ；
window：使用的窗函数名称；
center：布尔值，默认为True；
- 如果为True，则填充信号y，以使帧D $\text{[:,t]}$ 以 $hop_length] \text{y[t * hop\_length]}$ 为中心；
- 如果为False，帧D $\text{[:,t]}$ 以 $hop_length] \text{y[t * hop\_length]}$ 为开始；
dtype：D的复数值类型，默认值为64-bit的complex复数；
pad_mode：如果 center = True，则在信号的边缘使用填充模型；默认情况下，STFT使用 constant padding‘’

1、STFT案例01

import librosa

y, sr = librosa.load("air_conditioner01.wav")  # y.shape = (88200,)  sr = 22050
print("y.shape = {0}; y = {1}".format(y.shape, y))
print("sr = {0}".format(sr))

n_fft = 2048
win_length = n_fft  # 默认
hop_length = win_length // 4  # 默认

D = librosa.stft(y=y, n_fft=n_fft, win_length=win_length, hop_length=hop_length, window='hann', center=True, pad_mode='constant')
print("\nSTFT结果：D.shape = {0}; \nD = {1}".format(D.shape, D))

计算：

频率维度的频率组数： $\cfrac{n\_fft}{2} + 1=\cfrac{2048}{2}+1=1025$ ； $1025$ 是由FFT窗口大小决定的: $2048/2 + 1$ , 这个 $1025$ 是声谱图的纵坐标，也就是他的频率被分成了1025份。
时间维度的帧数： $hop_length + 1 = 88200 512 + 1 = 173 D.shape[1] = \cfrac{\text{样本总采样点数}}{\text{hop\_length}}+1=\cfrac{88200}{512}+1=173$ 【默认： $win_length = n_fft \text{win\_length = n\_fft}$ 、 $hop_length = win_length / 4 \text{hop\_length = win\_length / 4}$ 】； $173$ 是由总的信号长度和窗口大小和帧移决定的，也就是所谓的每一帧，即关于时间的讯息。

打印结果：

y.shape = (88200,); y = [-0.00429636 -0.01181004 -0.01559684 ... -0.02535474 -0.02254227 -0.01510671]

sr = 22050

STFT结果：D.shape = (1025, 173); 
D = [[ 4.0991772e-02+0.0000000e+00j  7.6347418e-02+0.0000000e+00j
   1.0839665e-01+0.0000000e+00j ... -5.6880221e-02+0.0000000e+00j
  -4.0515706e-01+0.0000000e+00j -1.2604758e+00+0.0000000e+00j]
 [ 4.9349996e-03+2.2843832e-02j -6.5081336e-02-1.5483504e-03j
  -6.4895572e-03+8.5287131e-02j ...  8.6759105e-02+4.4598781e-02j
  -1.3626395e-01-3.5130250e-01j  1.2085854e+00-4.9212924e-01j]
 [-3.1803373e-02+6.2021937e-02j  1.8289314e-01-2.9990083e-02j
  -2.6830113e-01-2.0388773e-02j ...  2.5796932e-01-3.4968770e-01j
   8.2743064e-02-4.2531621e-01j -9.6776468e-01+1.2715183e+00j]
 ...
 [ 6.0332339e-04-2.8250517e-07j -3.0159805e-04+3.6464758e-07j
  -1.6187376e-07-1.8996661e-07j ... -1.5264601e-07-2.4162804e-07j
  -7.4535434e-04-8.3691225e-04j  2.4131173e-03+2.7090199e-03j]
 [-6.0317205e-04+8.0741266e-08j -4.6025047e-08+3.0144685e-04j
  -8.3143476e-08+1.5921461e-07j ... -1.0696644e-07+1.1993595e-07j
   4.5853498e-04-1.0226711e-03j -3.3102881e-03-1.4843964e-03j]
 [ 6.0308439e-04+0.0000000e+00j  3.0151531e-04+0.0000000e+00j
  -1.1964966e-07+0.0000000e+00j ...  2.1211932e-07+0.0000000e+00j
   1.1208834e-03+0.0000000e+00j  3.6279499e-03+0.0000000e+00j]]

Process finished with exit code 0

2、STFT案例02

import numpy as np
import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load("air_conditioner01.wav")  # y.shape = (88200,)  sr = 22050
print("y.shape = {0}; \ny = {1}".format(y.shape, y))
print("\nsr = {0}".format(sr))

n_fft = 2048
win_length = n_fft  # 默认
hop_length = win_length // 4  # 默认

D = librosa.stft(y=y, n_fft=n_fft, win_length=win_length, hop_length=hop_length, window='hann', center=True, pad_mode='constant')
print("\nD.shape = {0}; \nD = {1}".format(D.shape, D))

D_abs = np.abs(D)  # 4.9349996e-03+2.2843832e-02j ----> 2.3370814e-02【a + bj ----> sqrt(a^2 + b^2)】
print("\nD_abs.shape = {0}; \nD_abs = {1}".format(D_abs.shape, D_abs))

D_pow = D_abs ** 2  # 4.9349996e-03+2.2843832e-02j ----> 5.4619490e-04【a + bj ----> a^2 + b^2】
print("\nD_pow.shape = {0}; \nD_pow = {1}".format(D_pow.shape, D_pow))

# 作图01
fig, ax = plt.subplots()
img = librosa.display.specshow(data=D_pow, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.2f')
ax.set(title='STFT spectrogram')
fig.show()

D_dB = librosa.power_to_db(D_pow)  # 能量转换为分贝
print("\nD_dB.shape = {0}; \nD_dB = {1}".format(D_dB.shape, D_dB))

# 作图02
fig, ax = plt.subplots()
img = librosa.display.specshow(data=D_dB, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.0f dB')
ax.set(title='STFT(dB) spectrogram')
fig.show()

# 作图03【y_axis='linear'】
fig, ax = plt.subplots()
img = librosa.display.specshow(data=D_dB, x_axis='time', y_axis='linear', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.0f dB')
ax.set(title='STFT(dB) spectrogram')
fig.show()

打印结果：

y.shape = (88200,); 
y = [-0.00429636 -0.01181004 -0.01559684 ... -0.02535474 -0.02254227 -0.01510671]

sr = 22050

D.shape = (1025, 173); 
D = [[ 4.0991772e-02+0.0000000e+00j  7.6347418e-02+0.0000000e+00j
   1.0839665e-01+0.0000000e+00j ... -5.6880221e-02+0.0000000e+00j
  -4.0515706e-01+0.0000000e+00j -1.2604758e+00+0.0000000e+00j]
 [ 4.9349996e-03+2.2843832e-02j -6.5081336e-02-1.5483504e-03j
  -6.4895572e-03+8.5287131e-02j ...  8.6759105e-02+4.4598781e-02j
  -1.3626395e-01-3.5130250e-01j  1.2085854e+00-4.9212924e-01j]
 [-3.1803373e-02+6.2021937e-02j  1.8289314e-01-2.9990083e-02j
  -2.6830113e-01-2.0388773e-02j ...  2.5796932e-01-3.4968770e-01j
   8.2743064e-02-4.2531621e-01j -9.6776468e-01+1.2715183e+00j]
 ...
 [ 6.0332339e-04-2.8250517e-07j -3.0159805e-04+3.6464758e-07j
  -1.6187376e-07-1.8996661e-07j ... -1.5264601e-07-2.4162804e-07j
  -7.4535434e-04-8.3691225e-04j  2.4131173e-03+2.7090199e-03j]
 [-6.0317205e-04+8.0741266e-08j -4.6025047e-08+3.0144685e-04j
  -8.3143476e-08+1.5921461e-07j ... -1.0696644e-07+1.1993595e-07j
   4.5853498e-04-1.0226711e-03j -3.3102881e-03-1.4843964e-03j]
 [ 6.0308439e-04+0.0000000e+00j  3.0151531e-04+0.0000000e+00j
  -1.1964966e-07+0.0000000e+00j ...  2.1211932e-07+0.0000000e+00j
   1.1208834e-03+0.0000000e+00j  3.6279499e-03+0.0000000e+00j]]

D_abs.shape = (1025, 173); 
D_abs = [[4.0991772e-02 7.6347418e-02 1.0839665e-01 ... 5.6880221e-02
  4.0515706e-01 1.2604758e+00]
 [2.3370814e-02 6.5099753e-02 8.5533671e-02 ... 9.7550981e-02
  3.7680408e-01 1.3049406e+00]
 [6.9700614e-02 1.8533567e-01 2.6907471e-01 ... 4.3454534e-01
  4.3329009e-01 1.5979135e+00]
 ...
 [6.0332345e-04 3.0159828e-04 2.4958049e-07 ... 2.8580573e-07
  1.1207029e-03 3.6279366e-03]
 [6.0317205e-04 3.0144685e-04 1.7961662e-07 ... 1.6070609e-07
  1.1207634e-03 3.6278698e-03]
 [6.0308439e-04 3.0151531e-04 1.1964966e-07 ... 2.1211932e-07
  1.1208834e-03 3.6279499e-03]]

D_pow.shape = (1025, 173); 
D_pow = [[1.6803254e-03 5.8289282e-03 1.1749834e-02 ... 3.2353594e-03
  1.6415225e-01 1.5887991e+00]
 [5.4619490e-04 4.2379778e-03 7.3160087e-03 ... 9.5161935e-03
  1.4198132e-01 1.7028699e+00]
 [4.8581758e-03 3.4349307e-02 7.2401196e-02 ... 1.8882965e-01
  1.8774031e-01 2.5533276e+00]
 ...
 [3.6399919e-07 9.0961521e-08 6.2290423e-14 ... 8.1684910e-14
  1.2559751e-06 1.3161924e-05]
 [3.6381653e-07 9.0870202e-08 3.2262130e-14 ... 2.5826448e-14
  1.2561105e-06 1.3161439e-05]
 [3.6371080e-07 9.0911477e-08 1.4316039e-14 ... 4.4994604e-14
  1.2563796e-06 1.3162020e-05]]

D_dB.shape = (1025, 173); 
D_dB = [[-27.746067  -22.344112  -19.299683  ... -24.900776   -7.8475313
    2.01069  ]
 [-32.626526  -23.728413  -21.357258  ... -20.215368   -8.477688
    2.3118148]
 [-23.13527   -14.64082   -11.402542  ...  -7.2392983  -7.2644243
    4.071065 ]
 ...
 [-57.84667   -57.84667   -57.84667   ... -57.84667   -57.84667
  -48.806805 ]
 [-57.84667   -57.84667   -57.84667   ... -57.84667   -57.84667
  -48.80697  ]
 [-57.84667   -57.84667   -57.84667   ... -57.84667   -57.84667
  -48.80677  ]]

Process finished with exit code 0

3、STFT参数详解

"""Short-time Fourier transform (STFT).

The STFT represents a signal in the time-frequency domain by
computing discrete Fourier transforms (DFT) over short overlapping
windows.

This function returns a complex-valued matrix D such that

- ``np.abs(D[..., f, t])`` is the magnitude of frequency bin ``f``
  at frame ``t``, and

- ``np.angle(D[..., f, t])`` is the phase of frequency bin ``f``
  at frame ``t``.

The integers ``t`` and ``f`` can be converted to physical units by means
of the utility functions `frames_to_sample` and `fft_frequencies`.

Parameters
----------
y : np.ndarray [shape=(..., n)], real-valued
    input signal. Multi-channel is supported.

n_fft : int > 0 [scalar]
    length of the windowed signal after padding with zeros.
    The number of rows in the STFT matrix ``D`` is ``(1 + n_fft/2)``.
    The default value, ``n_fft=2048`` samples, corresponds to a physical
    duration of 93 milliseconds at a sample rate of 22050 Hz, i.e. the
    default sample rate in librosa. This value is well adapted for music
    signals. However, in speech processing, the recommended value is 512,
    corresponding to 23 milliseconds at a sample rate of 22050 Hz.
    In any case, we recommend setting ``n_fft`` to a power of two for
    optimizing the speed of the fast Fourier transform (FFT) algorithm.

hop_length : int > 0 [scalar]
    number of audio samples between adjacent STFT columns.

    Smaller values increase the number of columns in ``D`` without
    affecting the frequency resolution of the STFT.

    If unspecified, defaults to ``win_length // 4`` (see below).

win_length : int <= n_fft [scalar]
    Each frame of audio is windowed by ``window`` of length ``win_length``
    and then padded with zeros to match ``n_fft``.

    Smaller values improve the temporal resolution of the STFT (i.e. the
    ability to discriminate impulses that are closely spaced in time)
    at the expense of frequency resolution (i.e. the ability to discriminate
    pure tones that are closely spaced in frequency). This effect is known
    as the time-frequency localization trade-off and needs to be adjusted
    according to the properties of the input signal ``y``.

    If unspecified, defaults to ``win_length = n_fft``.

window : string, tuple, number, function, or np.ndarray [shape=(n_fft,)]
    Either:

    - a window specification (string, tuple, or number);
      see `scipy.signal.get_window`
    - a window function, such as `scipy.signal.windows.hann`
    - a vector or array of length ``n_fft``

    Defaults to a raised cosine window (`'hann'`), which is adequate for
    most applications in audio signal processing.

    .. see also:: `filters.get_window`

center : boolean
    If ``True``, the signal ``y`` is padded so that frame
    ``D[:, t]`` is centered at ``y[t * hop_length]``.

    If ``False``, then ``D[:, t]`` begins at ``y[t * hop_length]``.

    Defaults to ``True``,  which simplifies the alignment of ``D`` onto a
    time grid by means of `librosa.frames_to_samples`.
    Note, however, that ``center`` must be set to `False` when analyzing
    signals with `librosa.stream`.

    .. see also:: `librosa.stream`

dtype : np.dtype, optional
    Complex numeric type for ``D``.  Default is inferred to match the
    precision of the input signal.

pad_mode : string or function
    If ``center=True``, this argument is passed to `np.pad` for padding
    the edges of the signal ``y``. By default (``pad_mode="constant"``),
    ``y`` is padded on both sides with zeros.
    If ``center=False``,  this argument is ignored.

    .. see also:: `numpy.pad`

Returns
-------
D : np.ndarray [shape=(..., 1 + n_fft/2, n_frames), dtype=dtype]
    Complex-valued matrix of short-term Fourier transform
    coefficients.

See Also
--------
istft : Inverse STFT
reassigned_spectrogram : Time-frequency reassigned spectrogram

Notes
-----
This function caches at level 20.

Examples
--------
>>> y, sr = librosa.load(librosa.ex('trumpet'))
>>> S = np.abs(librosa.stft(y))
>>> S
array([[5.395e-03, 3.332e-03, ..., 9.862e-07, 1.201e-05],
       [3.244e-03, 2.690e-03, ..., 9.536e-07, 1.201e-05],
       ...,
       [7.523e-05, 3.722e-05, ..., 1.188e-04, 1.031e-03],
       [7.640e-05, 3.944e-05, ..., 5.180e-04, 1.346e-03]],
      dtype=float32)

Use left-aligned frames, instead of centered frames

>>> S_left = librosa.stft(y, center=False)

Use a shorter hop length

>>> D_short = librosa.stft(y, hop_length=64)

Display a spectrogram

>>> import matplotlib.pyplot as plt
>>> fig, ax = plt.subplots()
>>> img = librosa.display.specshow(librosa.amplitude_to_db(S,
...                                                        ref=np.max),
...                                y_axis='log', x_axis='time', ax=ax)
>>> ax.set_title('Power spectrogram')
>>> fig.colorbar(img, ax=ax, format="%+2.0f dB")
"""

二、梅尔频谱（melspectrogram）

人耳能听到的频率范围是20-20000HZ，但是人耳对HZ单位不是线性敏感，而是对低HZ敏感，对高HZ不敏感，将HZ频率转化为梅尔频率，则人耳对频率的感知度就变为线性。

例如如果我们适应了1000Hz的音调，如果把音调频率提高到2000Hz，我们的耳朵只能觉察到频率提高了一点点，根本察觉不到频率提高了一倍。

将普通频率转化到Mel频率的公式是：

下图是HZ到Mel的映射关系图，由于二者为log关系，在频率较低时，Mel随HZ变化较快；当频率较高时，曲线斜率小，变化缓慢。

在Mel频域内，人对音调的感知度为线性关系。

举例来说，，则人耳听起来两者的音调也如果两段语音的Mel频率相差两倍相差两倍。

使用python的librosa音频处理库可以轻松实现：

librosa.feature.melspectrogram(*, y=None, sr=22050, S=None, n_fft=2048, 
								  hop_length=512, win_length=None, window='hann', 
								  center=True, pad_mode='constant', power=2.0, **kwargs)

def melspectrogram(
    y=None,
    sr=22050,
    S=None,
    n_fft=2048,
    hop_length=512,
    win_length=None,
    window="hann",
    center=True,
    pad_mode="reflect",
    power=2.0,
    **kwargs,
):

y：输入时域下的音频信号。shape= （n，）；
sr：采样频率；
n_fft：FFT窗口个数，默认2048；
hop_length：连续帧之间的采样数，默认512；
window：使用加窗的类型，默认为汉宁窗；
n_mels：返回结果的Mel bands数量（number of Mel bands to generate），默认128；
return：梅尔频谱【shape=(…, n_mels, t)】；

1、梅尔频谱（melspectrogram）-案例01【直接从y计算】

import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load("air_conditioner01.wav")  # y.shape = (88200,)  sr = 22050
print("y.shape = {0}; y = {1}".format(y.shape, y))
print("\nsr = {0}".format(sr))

n_fft = 2048
win_length = n_fft  # 默认
hop_length = win_length // 4  # 默认
n_mels = 80  # 默认 128

S = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=n_fft, win_length=win_length, hop_length=hop_length, n_mels=n_mels)
print("\nMel频谱--结果：S.shape = {0}; \nS = {1}".format(S.shape, S))

# 作图01
fig, ax = plt.subplots()
img = librosa.display.specshow(S, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.2f')
ax.set(title='Mel-frequency spectrogram')
fig.show()

S_dB = librosa.power_to_db(S=S)
print("\nMel频谱-dB能量谱---结果：S_dB.shape = {0}; \nS_dB = {1}".format(S_dB.shape, S_dB))

# 作图02
fig, ax = plt.subplots()
img = librosa.display.specshow(S_dB, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.0f dB')
ax.set(title='Mel-frequency(dB) spectrogram')
fig.show()

# 作图03
fig, ax = plt.subplots()
img = librosa.display.specshow(S_dB, x_axis='time', y_axis='linear', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.0f dB')
ax.set(title='Mel-frequency(dB) spectrogram')
fig.show()

打印结果：

y.shape = (88200,); y = [-0.00429636 -0.01181004 -0.01559684 ... -0.02535474 -0.02254227 -0.01510671]

sr = 22050

结果：S.shape = (80, 173); 
S = [[2.1666234e-02 2.1649336e-02 1.9549519e-01 ... 7.8533143e-01  1.4178720e+00 7.7648085e-01]
 [1.6606098e-01 2.8194109e-01 7.3984677e-01 ... 5.2415431e-01  6.8600404e-01 4.3208513e-01]
 [1.0179499e-01 1.3825276e-01 2.5464761e-01 ... 6.1250693e-01  2.6014277e-01 1.3389401e-01]
 ...
 [6.9146539e-05 3.5622125e-04 1.7929733e-04 ... 4.7781770e-05  4.5372890e-05 3.5324701e-05]
 [3.0970994e-05 8.2059370e-05 3.9507086e-05 ... 2.9160121e-05  2.3678922e-05 1.5615517e-05]
 [2.1788853e-06 5.2237228e-06 3.3358933e-06 ... 1.7063500e-06  2.1911462e-06 2.4011656e-06]]

Process finished with exit code 0

2、梅尔频谱（melspectrogram）-案例02【从STFT的结果进一步计算】

import numpy as np
import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load("air_conditioner01.wav")  # y.shape = (88200,)  sr = 22050
print("y.shape = {0}; y = {1}".format(y.shape, y))
print("\nsr = {0}".format(sr))

n_fft = 2048
win_length = n_fft  # 默认
hop_length = win_length // 4  # 默认
n_mels = 80  # 默认 128

D = librosa.stft(y=y, n_fft=n_fft, win_length=win_length, hop_length=hop_length, window='hann', center=True, pad_mode='constant')
print("\nSTFT结果：D.shape = {0}; \nD = {1}".format(D.shape, D))

D_abs = np.abs(D)  # 4.9349996e-03+2.2843832e-02j ----> 2.3370814e-02【a + bj ----> sqrt(a^2 + b^2)】
print("\nD_abs.shape = {0}; \nD_abs = {1}".format(D_abs.shape, D_abs))

D_pow = D_abs ** 2  # 4.9349996e-03+2.2843832e-02j ----> 5.4619490e-04【a + bj ----> a^2 + b^2】
print("\nD_pow.shape = {0}; \nD_pow = {1}".format(D_pow.shape, D_pow))

S = librosa.feature.melspectrogram(S=D_pow, sr=sr, n_mels=n_mels)
print("\nMel频谱--结果：S.shape = {0}; \nS = {1}".format(S.shape, S))

# 作图01
fig, ax = plt.subplots()
img = librosa.display.specshow(S, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.2f')
ax.set(title='Mel-frequency spectrogram')
fig.show()

S_dB = librosa.power_to_db(S=S)
print("\nMel频谱-能量谱---结果：S_dB.shape = {0}; \nS_dB = {1}".format(S_dB.shape, S_dB))

# 作图02
fig, ax = plt.subplots()
img = librosa.display.specshow(S_dB, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.0f dB')
ax.set(title='Mel-frequency(dB) spectrogram')
fig.show()

# 作图03
fig, ax = plt.subplots()
img = librosa.display.specshow(S_dB, x_axis='time', y_axis='linear', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.0f dB')
ax.set(title='Mel-frequency(dB) spectrogram')
fig.show()

打印结果：

y.shape = (88200,); y = [-0.00429636 -0.01181004 -0.01559684 ... -0.02535474 -0.02254227 -0.01510671]

sr = 22050

STFT结果：D.shape = (1025, 173); 
D = [[ 4.0991772e-02+0.0000000e+00j  7.6347418e-02+0.0000000e+00j
   1.0839665e-01+0.0000000e+00j ... -5.6880221e-02+0.0000000e+00j
  -4.0515706e-01+0.0000000e+00j -1.2604758e+00+0.0000000e+00j]
 [ 4.9349996e-03+2.2843832e-02j -6.5081336e-02-1.5483504e-03j
  -6.4895572e-03+8.5287131e-02j ...  8.6759105e-02+4.4598781e-02j
  -1.3626395e-01-3.5130250e-01j  1.2085854e+00-4.9212924e-01j]
 [-3.1803373e-02+6.2021937e-02j  1.8289314e-01-2.9990083e-02j
  -2.6830113e-01-2.0388773e-02j ...  2.5796932e-01-3.4968770e-01j
   8.2743064e-02-4.2531621e-01j -9.6776468e-01+1.2715183e+00j]
 ...
 [ 6.0332339e-04-2.8250517e-07j -3.0159805e-04+3.6464758e-07j
  -1.6187376e-07-1.8996661e-07j ... -1.5264601e-07-2.4162804e-07j
  -7.4535434e-04-8.3691225e-04j  2.4131173e-03+2.7090199e-03j]
 [-6.0317205e-04+8.0741266e-08j -4.6025047e-08+3.0144685e-04j
  -8.3143476e-08+1.5921461e-07j ... -1.0696644e-07+1.1993595e-07j
   4.5853498e-04-1.0226711e-03j -3.3102881e-03-1.4843964e-03j]
 [ 6.0308439e-04+0.0000000e+00j  3.0151531e-04+0.0000000e+00j
  -1.1964966e-07+0.0000000e+00j ...  2.1211932e-07+0.0000000e+00j
   1.1208834e-03+0.0000000e+00j  3.6279499e-03+0.0000000e+00j]]

D_abs.shape = (1025, 173); 
D_abs = [[4.0991772e-02 7.6347418e-02 1.0839665e-01 ... 5.6880221e-02
  4.0515706e-01 1.2604758e+00]
 [2.3370814e-02 6.5099753e-02 8.5533671e-02 ... 9.7550981e-02
  3.7680408e-01 1.3049406e+00]
 [6.9700614e-02 1.8533567e-01 2.6907471e-01 ... 4.3454534e-01
  4.3329009e-01 1.5979135e+00]
 ...
 [6.0332345e-04 3.0159828e-04 2.4958049e-07 ... 2.8580573e-07
  1.1207029e-03 3.6279366e-03]
 [6.0317205e-04 3.0144685e-04 1.7961662e-07 ... 1.6070609e-07
  1.1207634e-03 3.6278698e-03]
 [6.0308439e-04 3.0151531e-04 1.1964966e-07 ... 2.1211932e-07
  1.1208834e-03 3.6279499e-03]]

D_pow.shape = (1025, 173); 
D_pow = [[1.6803254e-03 5.8289282e-03 1.1749834e-02 ... 3.2353594e-03
  1.6415225e-01 1.5887991e+00]
 [5.4619490e-04 4.2379778e-03 7.3160087e-03 ... 9.5161935e-03
  1.4198132e-01 1.7028699e+00]
 [4.8581758e-03 3.4349307e-02 7.2401196e-02 ... 1.8882965e-01
  1.8774031e-01 2.5533276e+00]
 ...
 [3.6399919e-07 9.0961521e-08 6.2290423e-14 ... 8.1684910e-14
  1.2559751e-06 1.3161924e-05]
 [3.6381653e-07 9.0870202e-08 3.2262130e-14 ... 2.5826448e-14
  1.2561105e-06 1.3161439e-05]
 [3.6371080e-07 9.0911477e-08 1.4316039e-14 ... 4.4994604e-14
  1.2563796e-06 1.3162020e-05]]

Mel频谱--结果：S.shape = (80, 173); 
S = [[2.1666234e-02 2.1649336e-02 1.9549519e-01 ... 7.8533143e-01
  1.4178720e+00 7.7648085e-01]
 [1.6606098e-01 2.8194109e-01 7.3984677e-01 ... 5.2415431e-01
  6.8600404e-01 4.3208513e-01]
 [1.0179499e-01 1.3825276e-01 2.5464761e-01 ... 6.1250693e-01
  2.6014277e-01 1.3389401e-01]
 ...
 [6.9146539e-05 3.5622125e-04 1.7929733e-04 ... 4.7781770e-05
  4.5372890e-05 3.5324701e-05]
 [3.0970994e-05 8.2059370e-05 3.9507086e-05 ... 2.9160121e-05
  2.3678922e-05 1.5615517e-05]
 [2.1788853e-06 5.2237228e-06 3.3358933e-06 ... 1.7063500e-06
  2.1911462e-06 2.4011656e-06]]

Mel频谱-能量谱---结果：S_dB.shape = (80, 173); 
S_dB = [[-16.642166  -16.645554   -7.0886393 ...  -1.0494702   1.5163702
   -1.0986925]
 [ -7.797324   -5.4984164  -1.3085821 ...  -2.8054085  -1.6367333
   -3.644307 ]
 [ -9.922736   -8.593262   -5.9406037 ...  -2.12889    -5.8478827
   -8.7323885]
 ...
 [-41.602295  -34.4828    -37.46426   ... -43.20738   -43.432037
  -44.519215 ]
 [-45.090446  -40.85872   -44.03325   ... -45.352104  -46.25638
  -48.064438 ]
 [-56.617657  -52.8202    -54.76788   ... -57.67932   -56.593285
  -56.195778 ]]

Process finished with exit code 0

参数详解


"""Compute a mel-scaled spectrogram.

If a spectrogram input ``S`` is provided, then it is mapped directly onto
the mel basis by ``mel_f.dot(S)``.

If a time-series input ``y, sr`` is provided, then its magnitude spectrogram
``S`` is first computed, and then mapped onto the mel scale by
``mel_f.dot(S**power)``.

By default, ``power=2`` operates on a power spectrum.

Parameters
----------
y : np.ndarray [shape=(..., n)] or None
    audio time-series. Multi-channel is supported.

sr : number > 0 [scalar]
    sampling rate of ``y``

S : np.ndarray [shape=(..., d, t)]
    spectrogram

n_fft : int > 0 [scalar]
    length of the FFT window

hop_length : int > 0 [scalar]
    number of samples between successive frames.
    See `librosa.stft`

win_length : int <= n_fft [scalar]
    Each frame of audio is windowed by `window()`.
    The window will be of length `win_length` and then padded
    with zeros to match ``n_fft``.

    If unspecified, defaults to ``win_length = n_fft``.

window : string, tuple, number, function, or np.ndarray [shape=(n_fft,)]
    - a window specification (string, tuple, or number);
      see `scipy.signal.get_window`
    - a window function, such as `scipy.signal.windows.hann`
    - a vector or array of length ``n_fft``

    .. see also:: `librosa.filters.get_window`

center : boolean
    - If `True`, the signal ``y`` is padded so that frame
      ``t`` is centered at ``y[t * hop_length]``.
    - If `False`, then frame ``t`` begins at ``y[t * hop_length]``

pad_mode : string
    If ``center=True``, the padding mode to use at the edges of the signal.

    By default, STFT uses zero padding.

power : float > 0 [scalar]
    Exponent for the magnitude melspectrogram.
    e.g., 1 for energy, 2 for power, etc.

**kwargs : additional keyword arguments
    Mel filter bank parameters.

    See `librosa.filters.mel` for details.

Returns
-------
S : np.ndarray [shape=(..., n_mels, t)]
    Mel spectrogram

See Also
--------
librosa.filters.mel : Mel filter bank construction
librosa.stft : Short-time Fourier Transform

Examples
--------
>>> y, sr = librosa.load(librosa.ex('trumpet'))
>>> librosa.feature.melspectrogram(y=y, sr=sr)
array([[3.837e-06, 1.451e-06, ..., 8.352e-14, 1.296e-11],
       [2.213e-05, 7.866e-06, ..., 8.532e-14, 1.329e-11],
       ...,
       [1.115e-05, 5.192e-06, ..., 3.675e-08, 2.470e-08],
       [6.473e-07, 4.402e-07, ..., 1.794e-08, 2.908e-08]],
      dtype=float32)

Using a pre-computed power spectrogram would give the same result:

>>> D = np.abs(librosa.stft(y))**2
>>> S = librosa.feature.melspectrogram(S=D, sr=sr)

Display of mel-frequency spectrogram coefficients, with custom
arguments for mel filterbank construction (default is fmax=sr/2):

>>> # Passing through arguments to the Mel filters
>>> S = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=128,
...                                     fmax=8000)

>>> import matplotlib.pyplot as plt
>>> fig, ax = plt.subplots()
>>> S_dB = librosa.power_to_db(S, ref=np.max)
>>> img = librosa.display.specshow(S_dB, x_axis='time',
...                          y_axis='mel', sr=sr,
...                          fmax=8000, ax=ax)
>>> fig.colorbar(img, ax=ax, format='%+2.0f dB')
>>> ax.set(title='Mel-frequency spectrogram')
"""

三、梅尔倒谱（MFCC）

MFCC的全部组成其实是由：

N维MFCC参数:
- $\cfrac{N}{3}$ 维MFCC系数/coefﬁcients
- $\cfrac{N}{3}$ 维一阶差分参数/first-order derivatives
- $\cfrac{N}{3}$ 维二阶差分参数/second-order derivatives
帧能量（此项可根据需求替换）

librosa.feature.mfcc(*, y=None, sr=22050, S=None, n_mfcc=20, dct_type=2, norm='ortho', lifter=0, **kwargs)

def mfcc(
    y=None, 
    sr=22050, 
    S=None, 
    n_mfcc=20, 
    dct_type=2, 
    norm="ortho", 
    lifter=0, 
    **kwargs
):

y：输入时域下的音频信号
sr：采样频率
n_mfcc：返回mfcc特征的数量
dct_type：DCT（离散余弦变换）的类型，默认为2
return：返回mfcc特征序列

这里主要设置sr和n_mfcc（你要提取特征的个数）

1、MFCC案例01【直接从y计算】

import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load("air_conditioner01.wav")  # y.shape = (88200,)  sr = 22050
print("y.shape = {0}; y = {1}".format(y.shape, y))
print("\nsr = {0}".format(sr))

sr = 22050  # 默认 22050
n_mfcc = 20  # 默认 20
dct_type = 2  # 默认 2
norm = "ortho"  # 默认 "ortho"
lifter = 0  # 默认 0

mfccs = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc, dct_type=dct_type, norm=norm, lifter=lifter)
print("\nMFCC--结果：mfccs.shape = {0}; \nmfccs = {1}".format(mfccs.shape, mfccs))

# 作图
fig, ax = plt.subplots()
img = librosa.display.specshow(mfccs, x_axis='time', y_axis='mel', sr=sr, fmax=8000, ax=ax)
fig.colorbar(img, ax=ax, format='%+2.2f')
ax.set(title='MFCC')
fig.show()

打印结果：

y.shape = (88200,); y = [-0.00429636 -0.01181004 -0.01559684 ... -0.02535474 -0.02254227 -0.01510671]

sr = 22050

MFCC--结果：mfccs.shape = (20, 173); 
mfccs = [[-287.29108   -247.5219    -249.65224   ... -253.60095   -254.32365  -275.92908  ]
 [ 129.54532    118.740906   129.125     ...  154.39165    151.97177   145.18109  ]
 [ -30.826519   -26.77267    -32.930305  ...  -33.37476    -32.383087   -30.955482 ]
 ...
 [   9.547321    12.869612    13.215841  ...    7.2535534    7.9752393     6.778652 ]
 [   4.4144416    4.8549047    7.463866  ...    3.3326685    3.2872963     5.157634 ]
 [   2.9182916    4.505571     5.7218165 ...    4.5183744    1.9608486    -1.0470729]]

Process finished with exit code 0

2、MFCC案例02【从melspectrogram的结果进一步计算】

import numpy as np
import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load("air_conditioner01.wav")  # y.shape = (88200,)  sr = 22050
print("y.shape = {0}; y = {1}".format(y.shape, y))
print("\nsr = {0}".format(sr))

n_fft = 2048
win_length = n_fft  # 默认
hop_length = win_length // 4  # 默认
n_mels = 80  # 默认 128

S = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=n_fft, win_length=win_length, hop_length=hop_length, n_mels=n_mels)
print("\nMel频谱--结果：S.shape = {0}; \nS = {1}".format(S.shape, S))

S_dB = librosa.power_to_db(S=S, ref=np.max)
print("\nMel频谱-dB能量谱---结果：S_dB.shape = {0}; \nS_dB = {1}".format(S_dB.shape, S_dB))

mfccs = librosa.feature.mfcc(S=S_dB)
print("\nMFCC---结果：mfccs.shape = {0}; \nmfccs = {1}".format(mfccs.shape, mfccs))

# 作图
fig, ax = plt.subplots(nrows=2, sharex=True)

img = librosa.display.specshow(S_dB, x_axis='time', y_axis='mel', fmax=8000, ax=ax[0])
fig.colorbar(img, ax=[ax[0]], format='%+2.0f dB')
ax[0].set(title='Mel spectrogram')
ax[0].label_outer()

img = librosa.display.specshow(mfccs, x_axis='time', y_axis='mel', fmax=8000, ax=ax[1])
fig.colorbar(img, ax=[ax[1]], format='%+2.0f dB')
ax[1].set(title='MFCC')
ax[1].label_outer()

fig.show()

3、参数详解

"""Mel-frequency cepstral coefficients (MFCCs)

.. warning:: If multi-channel audio input ``y`` is provided, the MFCC
    calculation will depend on the peak loudness (in decibels) across
    all channels.  The result may differ from independent MFCC calculation
    of each channel.

Parameters
----------
y : np.ndarray [shape=(..., n,)] or None
    audio time series. Multi-channel is supported..

sr : number > 0 [scalar]
    sampling rate of ``y``

S : np.ndarray [shape=(..., d, t)] or None
    log-power Mel spectrogram

n_mfcc : int > 0 [scalar]
    number of MFCCs to return

dct_type : {1, 2, 3}
    Discrete cosine transform (DCT) type.
    By default, DCT type-2 is used.

norm : None or 'ortho'
    If ``dct_type`` is `2 or 3`, setting ``norm='ortho'`` uses an ortho-normal
    DCT basis.

    Normalization is not supported for ``dct_type=1``.

lifter : number >= 0
    If ``lifter>0``, apply *liftering* (cepstral filtering) to the MFCCs::

        M[n, :] <- M[n, :] * (1 + sin(pi * (n + 1) / lifter) * lifter / 2)

    Setting ``lifter >= 2 * n_mfcc`` emphasizes the higher-order coefficients.
    As ``lifter`` increases, the coefficient weighting becomes approximately linear.

**kwargs : additional keyword arguments
    Arguments to `melspectrogram`, if operating
    on time series input

Returns
-------
M : np.ndarray [shape=(..., n_mfcc, t)]
    MFCC sequence

See Also
--------
melspectrogram
scipy.fftpack.dct

Examples
--------
Generate mfccs from a time series

>>> y, sr = librosa.load(librosa.ex('libri1'))
>>> librosa.feature.mfcc(y=y, sr=sr)
array([[-565.919, -564.288, ..., -426.484, -434.668],
       [  10.305,   12.509, ...,   88.43 ,   90.12 ],
       ...,
       [   2.807,    2.068, ...,   -6.725,   -5.159],
       [   2.822,    2.244, ...,   -6.198,   -6.177]], dtype=float32)

Using a different hop length and HTK-style Mel frequencies

>>> librosa.feature.mfcc(y=y, sr=sr, hop_length=1024, htk=True)
array([[-5.471e+02, -5.464e+02, ..., -4.446e+02, -4.200e+02],
       [ 1.361e+01,  1.402e+01, ...,  9.764e+01,  9.869e+01],
       ...,
       [ 4.097e-01, -2.029e+00, ..., -1.051e+01, -1.130e+01],
       [-1.119e-01, -1.688e+00, ..., -3.442e+00, -4.687e+00]],
      dtype=float32)

Use a pre-computed log-power Mel spectrogram

>>> S = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=128,
...                                    fmax=8000)
>>> librosa.feature.mfcc(S=librosa.power_to_db(S))
array([[-559.974, -558.449, ..., -411.96 , -420.458],
       [  11.018,   13.046, ...,   76.972,   80.888],
       ...,
       [   2.713,    2.379, ...,    1.464,   -2.835],
       [   2.712,    2.619, ...,    2.209,    0.648]], dtype=float32)

Get more components

>>> mfccs = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=40)

Visualize the MFCC series

>>> import matplotlib.pyplot as plt
>>> fig, ax = plt.subplots(nrows=2, sharex=True)
>>> img = librosa.display.specshow(librosa.power_to_db(S, ref=np.max),
...                                x_axis='time', y_axis='mel', fmax=8000,
...                                ax=ax[0])
>>> fig.colorbar(img, ax=[ax[0]])
>>> ax[0].set(title='Mel spectrogram')
>>> ax[0].label_outer()
>>> img = librosa.display.specshow(mfccs, x_axis='time', ax=ax[1])
>>> fig.colorbar(img, ax=[ax[1]])
>>> ax[1].set(title='MFCC')

Compare different DCT bases

>>> m_slaney = librosa.feature.mfcc(y=y, sr=sr, dct_type=2)
>>> m_htk = librosa.feature.mfcc(y=y, sr=sr, dct_type=3)
>>> fig, ax = plt.subplots(nrows=2, sharex=True, sharey=True)
>>> img1 = librosa.display.specshow(m_slaney, x_axis='time', ax=ax[0])
>>> ax[0].set(title='RASTAMAT / Auditory toolbox (dct_type=2)')
>>> fig.colorbar(img, ax=[ax[0]])
>>> img2 = librosa.display.specshow(m_htk, x_axis='time', ax=ax[1])
>>> ax[1].set(title='HTK-style (dct_type=3)')
>>> fig.colorbar(img2, ax=[ax[1]])
"""

四、梅尔频谱（melspectrogram） v.s. 梅尔倒谱（MFCC）

To get MFCC, compute the DCT on the mel-spectrogram. The mel-spectrogram is often log-scaled before.

MFCC is a very compressible representation, often using just 20 or 13 coefficients instead of 32-64 bands in Mel spectrogram.

The MFCC is a bit more decorrelarated, which can be beneficial with linear models like Gaussian Mixture Models.

With lots of data and strong classifiers like Convolutional Neural Networks, mel-spectrogram can often perform better.

梅尔频谱（melspectrogram）与梅尔倒谱（MFCC）的区别：

（melspectrogram）梅尔频谱的提取过程：输入语音信号->预加重->分针->加窗->FFT（傅里叶变换）->Mel滤波器-> 直接输出的一个结果
（MFCC）梅尔倒谱的提取过程：输入语音信号->预加重->分针->加窗->FFT（傅里叶变换）->Mel滤波器->对数运算->DCT(离散预先变换)->MFCC
从MFCC的API可以看出来，MFCC是在梅尔频谱（melspectrogram）基础上得来的，即：在梅尔频谱上取对数，做DCT变换，就得到了梅尔倒谱。

# -- Mel spectrogram and MFCCs -- #
def mfcc(y=None, sr=22050, S=None, n_mfcc=20, **kwargs):
    if S is None:
        S = logamplitude(melspectrogram(y=y, sr=sr, **kwargs))

    return np.dot(filters.dct(n_mfcc, S.shape[0]), S)

给定原始的音频信号，通过melspectrogram（）函数提取梅尔频谱，然后通过DCT离散余弦变换得到梅尔倒谱系数。（总之一句话，在梅尔频谱上取对数，做DCT变换，就得到了梅尔倒谱）

五、Mel滤波器

Mel滤波器对应了频率提高之后人耳会迟钝的客观规律：

Mel滤波器在人声的信号处理上有着广泛的使用；
但是如果应用到非人声上，就会丢失很多高频信息；所以如果处理非人声，则一般不用梅尔频谱（melspectrogram）或梅尔倒谱（MFCC），而用STFT；

参考资料：
声谱图，梅尔语谱，倒谱，梅尔倒谱系数
【Day6】窗涵式,n_fft ,hop_length 到底什麽意思啊？
零基础入门语音识别: 一文详解MFCC特征（附python代码）
论文笔记：语音情感识别（四）语音特征之声谱图，log梅尔谱，MFCC，deltas
声谱图，梅尔谱图
librosa 语音库（二）STFT 的实现
梅尔频谱和梅尔倒谱的初次理解和使用
音频特征提取——常用音频特征
深度学习之语音识别-音频基础知识、声谱图（Spectrogram）
声谱图
STFT和声谱图，梅尔频谱（Mel Bank Features）与梅尔倒谱（MFCCs）
语谱图（四） Mel spectrogram 梅尔语谱图
librosa 语音库（三） librosa.feature. 中的 spectrogram 与 melspectrogram
librosa.feature.melspectrogram 的形状(Shape of librosa.feature.melspectrogram)
librosa.feature.melspectrogram()
【librosa】音频特征提取
信号处理基础——傅里叶变换与短时傅里叶变换
语音信号加窗分帧是起什么作用
声学特征（二） MFCC特征原理
MFCC
librosa–学习笔记（2）(频谱特性 Spectral representations)

你可能感兴趣的:(Audio,音视频,深度学习,人工智能)

OpenAI 函数调用功能入门 AI火箭 chatgpt openai
Javascript版Langchain入门作者：AI小火箭的HB我是AI小火箭的HB，我探索和写作人工智能和语言交叉点的所有事物，范围从LLM，聊天机器人，语音机器人，开发框架，以数据为中心的潜在空间等。介绍LangChain是一个开源Python库，用于构建由大型语言模型（LLM）支持的应用程序。它提供了一个框架，将LLM与其他数据源（如互联网或个人文件）连接起来，允许开发人员将多个命令链接在
基于Python增加抖音视频播放量的代码 sh_moranliunian 蜘蛛侠网络爬虫后端 python 爬虫
一、思路通过发送HTTP请求来实现这一功能。代码主要功能的简要介绍：1.`get_ttwid`：这个函数用于获取`ttwid`，它是通过向字节跳动的接口发送POST请求，并从响应的cookie中提取`ttwid`值。2.`get_web_id`：这个函数用于获取`web_id`，它是通过向某个API发送POST请求，并从响应中提取`web_id`。3.`get_ms_token`：这个函数生成一个
Deepseek 对种猪市场会带来哪些影响？百态老人笔记大数据人工智能
DeepSeek对种猪市场的影响可以从以下几个方面进行分析：1.提高生产效率与降低成本根据，DeepSeek已经被用于养猪场中分析饲料配比，从而将猪的育肥周期从6个月缩短至5个月，并降低了15%的成本。这表明DeepSeek在优化养殖流程和提高生产效率方面具有显著作用，能够帮助养猪场降低运营成本，提升经济效益。2.推动智能化养殖技术的应用和提到，深度学习技术（如YOLOv5模型）已经被应用于生猪的
Python语言的安全开发慕璃嫣包罗万象 golang 开发语言后端
Python语言的安全开发引言在信息技术迅速发展的今天，网络安全问题愈发凸显。随着Python语言的广泛应用，尤其是在数据分析、人工智能、Web开发等领域，其安全问题越来越受到重视。Python作为一门高效且易于学习的编程语言，虽然在开发过程中为我们提供了很多便利，但如果忽视了安全性，将可能导致严重的安全漏洞和数据泄露等问题。因此，本文将围绕Python语言的安全开发展开讨论，重点分析常见的安全问
获取PPT中的MSO格式图片报错 ♢.＊ ppt python
亲爱的小伙伴们，在求知的漫漫旅途中，若你对深度学习的奥秘、Java与Python的奇妙世界，亦或是读研论文的撰写攻略有所探寻，那不妨给我一个小小的关注吧。我会精心筹备，在未来的日子里不定期地为大家呈上这些领域的知识宝藏与实用经验分享。每一个点赞，都如同春日里的一缕阳光，给予我满满的动力与温暖，让我们在学习成长的道路上相伴而行，共同进步✨。期待你的关注与点赞哟！image.ext的报错ValueEr
知识图谱技术剖析 ♢.＊人工智能知识图谱大数据
亲爱的小伙伴们，在求知的漫漫旅途中，若你对深度学习的奥秘、Java与Python的奇妙世界，亦或是读研论文的撰写攻略有所探寻，那不妨给我一个小小的关注吧。我会精心筹备，在未来的日子里不定期地为大家呈上这些领域的知识宝藏与实用经验分享。每一个点赞，都如同春日里的一缕阳光，给予我满满的动力与温暖，让我们在学习成长的道路上相伴而行，共同进步✨。期待你的关注与点赞哟！一、引言在当今数字化信息爆炸的时代，如
Deepseek技术浅析（一）爱研究的小牛 AIGC—概述大模型 AIGC 人工智能深度学习自然语言处理
DeepSeek是北京深度求索人工智能基础技术研究有限公司推出的人工智能技术品牌，专注于大语言模型（LLM）的研发与应用。其技术涵盖了从模型架构、训练方法到应用部署的多个层面，展现出强大的创新能力和应用潜力。以下将详细介绍DeepSeek的核心技术、工作原理以及具体实现方式。一、核心技术1.大语言模型（LLM）DeepSeek的核心产品是自研的大语言模型，其主要特点包括：(1)基于Transfor
启元世界（Inspir.ai）技术浅析（一）爱研究的小牛 AIGC—游戏制作人工智能机器学习 AIGC 深度学习
启元世界（Inspir.ai）作为全球领先的通用人工智能平台公司，自2017年成立以来，一直致力于通过人工智能技术提升产业效能和生活体验。公司汇聚了来自全球顶尖公司和高等学府的技术专家，专注于深度强化学习、推荐算法以及机器学习系统平台等前沿领域，并成功将人工智能技术应用于数字娱乐、智能决策和机器人等多个领域。一、核心技术启元世界在人工智能领域取得了多项突破性进展，其核心技术涵盖了以下几个方面：1.
Lumen5——AI视频制作，提取关键信息生成带有视觉效果的视频爱研究的小牛 AIGC—视频人工智能 AIGC 深度学习
一、Lumen5介绍Lumen5是一款基于人工智能的自动化视频制作平台，专为非专业用户设计，帮助其将博客、文章、新闻等文字内容快速转换为视频。Lumen5的目标是简化视频制作流程，让内容创作者、市场营销人员、社交媒体团队等无需视频制作经验即可轻松制作吸引观众的高质量视频。二、Lumen5的主要功能文字转视频Lumen5最具特色的功能是通过AI自动将文本转化为视频。用户可以输入一段文字或直接粘贴文章
python神经网络框架有哪些,python调用神经网络模型小明技术分享 python 神经网络深度学习
人工智能Python深度学习库有哪些由于Python的易用性和可扩展性，众多深度学习框架提供了Python接口，其中较为流行的深度学习库如下：第一：CaffeCaffe是一个以表达式、速度和模块化为核心的深度学习框架，具备清晰、可读性高和快速的特性，在视频、图像处理方面应用较多。Caffe中的网络结构与优化都以配置文件形式定义，容易上手，无须通过代码构建网络;网络训练速度快，能够训练大型数据集与S
人工智能的前景与未来就业市场：机遇、挑战与社会影响苹果酱0567 面试题汇总与解析 java 开发语言中间件 spring boot 后端
随着科技的飞速发展，人工智能（AI）已经逐渐渗透到我们生活的方方面面，它不仅引领着技术革新的浪潮，更在无声中重塑着我们的就业市场和社会结构。站在这个时代的交汇点上，我们不禁要问：人工智能将如何影响我们的未来就业市场？它带来的究竟是机遇还是挑战？回望过去，每一次科技革命都伴随着就业市场的剧烈震荡。而今，人工智能作为第四次工业革命的核心驱动力，正以前所未有的速度改变着劳动力市场的格局。从自动化生产线上
Python实现复原毫米波雷达呼吸波形的示例 go5463158465 python 算法机器学习 python 开发语言
以下是一个使用Python实现复原毫米波雷达呼吸波形的示例，该示例将涉及模型算法在重建损失和KL（Kullback-Leibler）损失之间的平衡问题。我们将使用深度学习中的变分自编码器（VAE）作为模型来进行呼吸波形的复原，因为VAE可以很好地处理重建和潜在空间分布的问题。步骤概述数据准备：生成或加载毫米波雷达的呼吸波形数据。定义VAE模型：包括编码器和解码器。定义损失函数：结合重建损失和KL损
对话系统(Chatbots) 原理与代码实例讲解 AI天才研究院 AI大模型企业级应用开发实战大数据AI人工智能计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
1.背景介绍1.1对话系统的发展历程对话系统，又称聊天机器人(Chatbots)，是模拟人类对话的计算机程序。从早期的基于规则的系统到如今基于深度学习的智能体，对话系统经历了漫长的发展历程。第一阶段：基于规则的系统(1960s-1990s)早期的对话系统主要基于预先定义的规则和模板。例如，ELIZA(1966)是一个模拟心理治疗师的程序，通过模式匹配和关键词识别来生成回复。这些系统只能处理有限的对
如何使用深度学习中的 Transformer 算法进行视频目标检测 go5463158465 python 算法深度学习 python 开发语言
以下将介绍如何使用深度学习中的Transformer算法进行视频目标检测，并给出一个复现相关论文思路及示例代码。这里以DETR（End-to-EndObjectDetectionwithTransformers）为基础进行说明，它是将Transformer引入目标检测领域的经典论文。步骤概述环境准备：安装必要的库，如PyTorch、torchvision等。数据准备：使用公开的视频目标检测数据集，
探索SakuraLLM：轻小说与Galgame翻译的新纪元蒋素萍Marilyn
探索SakuraLLM：轻小说与Galgame翻译的新纪元SakuraLLM适配轻小说/Galgame的日中翻译大模型项目地址:https://gitcode.com/gh_mirrors/sa/SakuraLLM在人工智能的浪潮中，SakuraLLM以其独特的魅力和强大的功能，成为了日中翻译领域的一颗璀璨明星。本文将深入介绍SakuraLLM项目，分析其技术特点，探讨其应用场景，并揭示其与众不同
大模型问答机器人的智能化程度 AI大模型应用之禅 AI大模型与大数据 java python javascript kotlin golang 架构人工智能
大模型、问答机器人、智能化程度、自然语言处理、深度学习、Transformer模型、知识图谱、推理能力、对话系统1.背景介绍近年来，人工智能技术取得了飞速发展，特别是深度学习的兴起，为自然语言处理（NLP）领域带来了革命性的变革。其中，大模型问答机器人作为一种新型的智能交互系统，凭借其强大的语言理解和生成能力，在客服、教育、娱乐等领域展现出广阔的应用前景。问答机器人是指能够理解用户自然语言问题并给
SpringBoot中运行Yolov5程序 eqa11 spring boot YOLO 后端
文章目录SpringBoot中运行Yolov5程序一、引言二、环境搭建1、SpringBoot项目创建2、YOLOv5环境配置三、SpringBoot与YOLOv5集成1、创建Python服务2、SpringBoot调用Python服务四、使用示例1、创建控制器五、总结SpringBoot中运行Yolov5程序一、引言在人工智能领域，目标检测是一个热门且实用的技术。YOLOv5作为目标检测算法中的
大语言模型原理与工程实践：残差连接与层归一化 AI大模型应用之禅 AI大模型与大数据计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
1.背景介绍随着自然语言处理（NLP）的发展，深度学习在过去几年中取得了令人瞩目的成果。其中，循环神经网络（RNN）和卷积神经网络（CNN）在图像和文本分类、语义角色标注、机器翻译等领域表现出色。然而，这些网络在训练过程中经常遭遇梯度消失和梯度爆炸的问题。为了解决这些问题，我们引入了残差连接（ResidualConnections）和层归一化（BatchNormalization）来改善模型性能。
阿里巴巴Qwen团队发布AI模型，可操控PC和手机新加坡内哥谈技术人工智能深度学习语言模型学习
每周跟踪AI热点新闻动向和震撼发展想要探索生成式人工智能的前沿进展吗？订阅我们的简报，深入解析最新的技术突破、实际应用案例和未来的趋势。与全球数同行一同，从行业内部的深度分析和实用指南中受益。不要错过这个机会，成为AI领域的领跑者。点击订阅，与未来同行！订阅：https://rengongzhineng.io/这周，科技界的目光几乎都被DeepSeek的R1模型吸引，但阿里巴巴并没有袖手旁观。1月
对比DeepSeek、ChatGPT和Kimi的学术写作摘要能力 AIWritePaper官方账号 DeepSeek AIWritePaper ChatGPT 人工智能 chatgpt llama 数据分析论文阅读
摘要摘要是文章的精华，通常在200-250词左右。要包括研究的目的、方法、结果和结论。让AI工具作为某领域内资深的研究专家，编写摘要需要言简意赅，直接概括论文的核心，为读者提供快速了解的窗口。下面我们使用DeepSeek、ChatGPT4以及Kimi辅助编写摘要。提示词：你现在是一名[计算机理论专家]，研究方向集中在[人工智能、大模型、数据挖掘等计算机相关方向]。我现在需要撰写一篇围绕[人工智能在
Transformer架构的GPU并行和之前的NLP算法并行有什么不同？ AI大模型学习不迷路 transformer 自然语言处理大模型深度学习 NLP LLM 大语言模型
1.什么是GPU并行计算？GPU并行计算是一种利用图形处理单元（GPU）进行大规模并行数据处理的技术。与传统的中央处理单元（CPU）相比，GPU拥有更多的核心，能够同时处理数千个线程，这使得GPU在处理高度并行的任务时表现出色。在深度学习中，GPU并行计算被广泛应用于训练神经网络，加速模型训练过程。在2017年之前，自然语言处理（NLP）领域的研究者们通常会从头开始训练模型，那时能够利用GPU进行
计算机视觉：解锁未来智能的钥匙及其代码实践我的运维人生计算机视觉人工智能运维开发技术共享
计算机视觉：解锁未来智能的钥匙及其代码实践在当今这个数据爆炸的时代，计算机视觉作为人工智能的一个重要分支，正以前所未有的速度推动着科技的边界。它不仅让机器“看懂”世界，更在自动驾驶、医疗影像分析、智能制造、安防监控等众多领域展现出巨大的应用潜力。本文将深入探讨计算机视觉的核心技术、最新进展，并通过一个具体的代码案例，展示如何在实践中应用这些技术，旨在为读者提供一个理论与实践相结合的全面视角。一、计
ImportError: DLL load failed while importing _rust: 找不到指定的程序的解决方案爱编程的喵喵 Python基础课程 python ImportError DLL load failed _rust 解决方案
大家好，我是爱编程的喵喵。双985硕士毕业，现担任全栈工程师一职，热衷于将数据思维应用到工作与生活中。从事机器学习以及相关的前后端开发工作。曾在阿里云、科大讯飞、CCF等比赛获得多次Top名次。现为CSDN博客专家、人工智能领域优质创作者。喜欢通过博客创作的方式对所学的知识进行总结与归纳，不仅形成深入且独到的理解，而且能够帮助新手快速入门。本文主要介绍了ImportError:DLLloa
《向量数据库指南》——MoE应用：解锁深度学习新境界的钥匙大禹智库《实战AI智能体》《向量数据库指南》深度学习人工智能向量数据库大禹智库低代码 MoE模型
在深度学习的广阔天地里，混合专家（MoE）模型如同一把锐利的钥匙，正逐步解锁着各种复杂应用场景的新境界。作为大禹智库的向量数据库高级研究员，同时也是《向量数据库指南》的作者，我深感MoE模型在推动AI技术向前发展中所扮演的重要角色。今天，我将带大家深入探讨MoE模型在自然语言处理、计算机视觉以及多模态学习等领域的应用，并巧妙引导大家通过《向量数据库指南》获取更多干货和深度实战经验。一、自然语言处理
小南每日 AI 资讯 | 国产AI之光DeepSeek暴击硅谷？？？ | 25/01/29 小南AI学院人工智能
1.中国AI模型震惊硅谷：DeepSeek为何一夜火出圈？国产AI大模型DeepSeek迅速崛起，引发硅谷关注。2.中国银行支持AI产业：1万亿元金融扶持助推智能化升级中国银行宣布提供1万亿元资金支持人工智能产业链发展，助力智能化升级。3.国产AI大模型DeepSeek惊艳全球：游戏科学冯骥称其为“国运级别科技成果”DeepSeek的AI模型引起全球关注，游戏科学的冯骥高度评价其意义。4.AI产业
【我的阅读】【nature |ai4science】Scientific discovery in the age of artificial intelligence【人工智能时代的科学发现】算法研究员【AI 4 Science】人工智能
相关资料：https://www.nature.com/articles/s41586-023-06221-2#Sec15文章目录Abstract摘要Conclusion结论Abstract摘要Artificialintelligence(AI)isbeingincreasinglyintegratedintoscientificdiscoverytoaugmentandaccelerateres
Hugging Face挑战DeepSeek，AI开源竞赛升级！新加坡内哥谈技术人工智能深度学习语言模型学习
每周跟踪AI热点新闻动向和震撼发展想要探索生成式人工智能的前沿进展吗？订阅我们的简报，深入解析最新的技术突破、实际应用案例和未来的趋势。与全球数同行一同，从行业内部的深度分析和实用指南中受益。不要错过这个机会，成为AI领域的领跑者。点击订阅，与未来同行！订阅：https://rengongzhineng.io/DeepSeek的R1推理模型刚刚引发全球轰动，开源AI界的“顶流”HuggingFac
LLM based Single Agent System AGI大模型与大数据研究院大数据AI人工智能计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
LLM-BasedSingleAgentSystem:ANewEraofIntelligentAutomation关键词：大语言模型，单智能体系统，强化学习，自然语言处理，智能自动化1.背景介绍近年来，随着深度学习技术的快速发展，大语言模型(LLM)在自然语言处理(NLP)领域取得了突破性进展。LLM凭借其强大的语言理解和生成能力，正在改变着人们与信息交互的方式。同时，人工智能领域的另一个重要研究
DeepSeek：硅谷AI格局的拐点？新加坡内哥谈技术人工智能深度学习语言模型学习
每周跟踪AI热点新闻动向和震撼发展想要探索生成式人工智能的前沿进展吗？订阅我们的简报，深入解析最新的技术突破、实际应用案例和未来的趋势。与全球数同行一同，从行业内部的深度分析和实用指南中受益。不要错过这个机会，成为AI领域的领跑者。点击订阅，与未来同行！订阅：https://rengongzhineng.io/本周，硅谷迎来了一个令人大跌眼镜的现实：打造先进人工智能模型，可能远没有想象中那么高深莫
AI常见的算法纠结哥_Shrek 人工智能算法
人工智能（AI）中常见的算法分为多个领域，如机器学习、深度学习、强化学习、自然语言处理和计算机视觉等。以下是一些常见的算法及其用途：1.机器学习(MachineLearning)监督学习(SupervisedLearning)线性回归(LinearRegression)：用于预测连续值，如房价预测。逻辑回归(LogisticRegression)：用于分类问题，如垃圾邮件检测。支持向量机(SVM)
对股票分析时要注意哪些主要因素？会飞的奇葩猪股票分析云掌股吧
　　众所周知，对散户投资者来说，股票技术分析是应战股市的核心武器，想学好股票的技术分析一定要知道哪些是重点学习的，其实非常简单，我们只要记住三个要素：成交量、价格趋势、振荡指标。一、成交量　　大盘的成交量状态。成交量大说明市场的获利机会较多，成交量小说明市场的获利机会较少。当沪市的成交量超过150亿时是强市市场状态，运用技术找综合买点较准；
【Scala十八】视图界定与上下文界定 bit1129 scala
Context Bound，上下文界定，是Scala为隐式参数引入的一种语法糖，使得隐式转换的编码更加简洁。隐式参数首先引入一个泛型函数max，用于取a和b的最大值 def max[T](a: T, b: T) = { if (a > b) a else b } 因为T是未知类型，只有运行时才会代入真正的类型，因此调用a >
C语言的分支——Object-C程序设计阅读有感 darkblue086 apple c 框架 cocoa
自从1972年贝尔实验室Dennis Ritchie开发了C语言，C语言已经有了很多版本和实现，从Borland到microsoft还是GNU、Apple都提供了不同时代的多种选择，我们知道C语言是基于Thompson开发的B语言的，Object-C是以SmallTalk-80为基础的。和C++不同的是，Object C并不是C的超集，因为有很多特性与C是不同的。 Object-C程序设计这本书
去除浏览器对表单值的记忆周凡杨 html 记忆 autocomplete form 浏览
&n
java的树形通讯录 g21121 java
最近用到企业通讯录，虽然以前也开发过，但是用的是jsf，拼成的树形，及其笨重和难维护。后来就想到直接生成json格式字符串，页面上也好展现。 // 首先取出每个部门的联系人 for (int i = 0; i < depList.size(); i++) { List<Contacts> list = getContactList(depList.get(i
Nginx安装部署 510888780 nginx linux
Nginx ("engine x") 是一个高性能的 HTTP 和反向代理服务器，也是一个 IMAP/POP3/SMTP 代理服务器。 Nginx 是由 Igor Sysoev 为俄罗斯访问量第二的 Rambler.ru 站点开发的，第一个公开版本0.1.0发布于2004年10月4日。其将源代码以类BSD许可证的形式发布，因它的稳定性、丰富的功能集、示例配置文件和低系统资源
java servelet异步处理请求墙头上一根草ｊａｖａ异步返回ｓｅｒｖｌｅｔ
servlet3.0以后支持异步处理请求，具体是使用AsyncContext ，包装httpservletRequest以及httpservletResponse具有异步的功能， final AsyncContext ac = request.startAsync(request, response); ac.s
我的spring学习笔记8-Spring中Bean的实例化 aijuans Spring 3
在Spring中要实例化一个Bean有几种方法： 1、最常用的（普通方法） <bean id="myBean" class="www.6e6.org.MyBean" /> 使用这样方法，按Spring就会使用Bean的默认构造方法，也就是把没有参数的构造方法来建立Bean实例。（有构造方法的下个文细说） 2、还
为Mysql创建最优的索引 annan211 mysql 索引
索引对于良好的性能非常关键，尤其是当数据规模越来越大的时候，索引的对性能的影响越发重要。索引经常会被误解甚至忽略，而且经常被糟糕的设计。索引优化应该是对查询性能优化最有效的手段了，索引能够轻易将查询性能提高几个数量级，最优的索引会比较好的索引性能要好2个数量级。 1 索引的类型 (1) B-Tree 不出意外，这里提到的索引都是指 B-
日期函数百合不是茶 oracle sql 日期函数查询
ORACLE日期时间函数大全 TO_DATE格式(以时间:2007-11-02 13:45:25为例) Year: yy two digits 两位年显示值:07 yyy three digits 三位年显示值:007
线程优先级 bijian1013 java thread 多线程 java多线程
多线程运行时需要定义线程运行的先后顺序。线程优先级是用数字表示，数字越大线程优先级越高，取值在1到10，默认优先级为5。实例： package com.bijian.study; /** * 因为在代码段当中把线程B的优先级设置高于线程A,所以运行结果先执行线程B的run()方法后再执行线程A的run()方法 * 但在实际中，JAVA的优先级不准，强烈不建议用此方法来控制执
适配器模式和代理模式的区别 bijian1013 java 设计模式
一.简介适配器模式：适配器模式（英语：adapter pattern）有时候也称包装样式或者包装。将一个类的接口转接成用户所期待的。一个适配使得因接口不兼容而不能在一起工作的类工作在一起，做法是将类别自己的接口包裹在一个已存在的类中。 &nbs
【持久化框架MyBatis3三】MyBatis3 SQL映射配置文件 bit1129 Mybatis3
SQL映射配置文件一方面类似于Hibernate的映射配置文件，通过定义实体与关系表的列之间的对应关系。另一方面使用<select>,<insert>,<delete>，<update>元素定义增删改查的SQL语句，这些元素包含三方面内容 1. 要执行的SQL语句 2. SQL语句的入参，比如查询条件 3. SQL语句的返回结果
oracle大数据表复制备份个人经验 bitcarter oracle 大表备份大表数据复制
前提：数据库仓库A（就拿oracle11g为例）中有两个用户user1和user2,现在有user1中有表ldm_table1,且表ldm_table1有数据5千万以上，ldm_table1中的数据是从其他库B（数据源）中抽取过来的，前期业务理解不够或者需求有变，数据有变动需要重新从B中抽取数据到A库表ldm_table1中。
HTTP加速器varnish安装小记 ronin47 http varnish 加速
上午共享的那个varnish安装手册，个人看了下，有点不知所云，好吧~看来还是先安装玩玩！苦逼公司服务器没法连外网，不能用什么wget或yum命令直接下载安装，每每看到别人博客贴出的在线安装代码时，总有一股羡慕嫉妒“恨”冒了出来。。。好吧，既然没法上外网，那只能麻烦点通过下载源码来编译安装了！ Varnish 3.0.4下载地址： http://repo.varnish-cache.org/
java-73-输入一个字符串，输出该字符串中对称的子字符串的最大长度 bylijinnan java
public class LongestSymmtricalLength { /* * Q75题目：输入一个字符串，输出该字符串中对称的子字符串的最大长度。 * 比如输入字符串“google”，由于该字符串里最长的对称子字符串是“goog”，因此输出4。 */ public static void main(String[] args) { Str
学习编程的一点感想 Cb123456 编程感想 Gis
写点感想，总结一些，也顺便激励一些自己.现在就是复习阶段，也做做项目. 本专业是GIS专业，当初觉得本专业太水，靠这个会活不下去的，所以就报了培训班。学习的时候，进入状态很慢，而且当初进去的时候，已经上到Java高级阶段了，所以.....，呵呵，之后有点感觉了，不过，还是不好好写代码，还眼高手低的，有
[能源与安全]美国与中国 comsci 能源
现在有一个局面：地球上的石油只剩下N桶，这些油只够让中国和美国这两个国家中的一个顺利过渡到宇宙时代，但是如果这两个国家为争夺这些石油而发生战争，其结果是两个国家都无法平稳过渡到宇宙时代。。。。而且在战争中，剩下的石油也会被快速消耗在战争中，结果是两败俱伤。。。在这个大
SEMI-JOIN执行计划突然变成HASH JOIN了的原因分析 cwqcwqmax9 oracle
甲说： A B两个表总数据量都很大，在百万以上。 idx1 idx2字段表示是索引字段 A B 两表上都有 col1字段表示普通字段 select xxx from A where A.idx1 between mmm and nnn and exists (select 1 from B where B.idx2 =
SpringMVC-ajax返回值乱码解决方案 dashuaifu Ajax springMVC response 中文乱码
SpringMVC-ajax返回值乱码解决方案一：（自己总结，测试过可行） ajax返回如果含有中文汉字，则使用：（如下例：） @RequestMapping(value="/xxx.do") public @ResponseBody void getPunishReasonB
Linux系统中查看日志的常用命令 dcj3sjt126com OS
因为在日常的工作中，出问题的时候查看日志是每个管理员的习惯，作为初学者，为了以后的需要，我今天将下面这些查看命令共享给各位 cat tail -f 日志文件说明 /var/log/message 系统启动后的信息和错误日志，是Red Hat Linux中最常用的日志之一 /var/log/secure 与安全相关的日志信息 /var/log/maillog 与邮件相关的日志信
[应用结构]应用 dcj3sjt126com PHP yii2
应用主体应用主体是管理 Yii 应用系统整体结构和生命周期的对象。每个Yii应用系统只能包含一个应用主体，应用主体在入口脚本中创建并能通过表达式 \Yii::$app 全局范围内访问。补充: 当我们说"一个应用"，它可能是一个应用主体对象，也可能是一个应用系统，是根据上下文来决定[译：中文为避免歧义，Application翻译为应
assertThat用法 eksliang JUnit assertThat
junit4.0 assertThat用法一般匹配符1、assertThat( testedNumber, allOf( greaterThan(8), lessThan(16) ) ); 注释： allOf匹配符表明如果接下来的所有条件必须都成立测试才通过，相当于“与”（&&） 2、assertThat( testedNumber, anyOf( g
android点滴2 gundumw100 应用服务器 android 网络应用 OS HTC
如何让Drawable绕着中心旋转？ Animation a = new RotateAnimation(0.0f, 360.0f, Animation.RELATIVE_TO_SELF, 0.5f, Animation.RELATIVE_TO_SELF,0.5f); a.setRepeatCount(-1); a.setDuration(1000); 如何控制Andro
超简洁的CSS下拉菜单 ini html Web 工作 html5 css
效果体验：http://hovertree.com/texiao/css/3.htmHTML文件： <!DOCTYPE html> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <title>简洁的HTML+CSS下拉菜单-HoverTree</title>
kafka consumer防止数据丢失 kane_xie kafka offset commit
kafka最初是被LinkedIn设计用来处理log的分布式消息系统，因此它的着眼点不在数据的安全性（log偶尔丢几条无所谓），换句话说kafka并不能完全保证数据不丢失。尽管kafka官网声称能够保证at-least-once，但如果consumer进程数小于partition_num，这个结论不一定成立。考虑这样一个case，partiton_num=2
@Repository、@Service、@Controller 和 @Component mhtbbx DAO spring bean prototype
@Repository、@Service、@Controller 和 @Component 将类标识为Bean Spring 自 2.0 版本开始，陆续引入了一些注解用于简化 Spring 的开发。@Repository注解便属于最先引入的一批，它用于将数据访问层 (DAO 层 ) 的类标识为 Spring Bean。具体只需将该注解标注在 DAO类上即可。同时，为了让 Spring 能够扫描类
java 多线程高并发读写控制误区 qifeifei java thread
先看一下下面的错误代码，对写加了synchronized控制，保证了写的安全，但是问题在哪里呢？ public class testTh7 { private String data; public String read(){ System.out.println(Thread.currentThread().getName() + "read data "
mongodb replica set(副本集)设置步骤 tcrct java mongodb
网上已经有一大堆的设置步骤的了，根据我遇到的问题，整理一下，如下：首先先去下载一个mongodb最新版，目前最新版应该是2.6 cd /usr/local/bin wget http://fastdl.mongodb.org/linux/mongodb-linux-x86_64-2.6.0.tgz tar -zxvf mongodb-linux-x86_64-2.6.0.t
rust学习笔记 wudixiaotie 学习笔记
1.rust里绑定变量是let，默认绑定了的变量是不可更改的，所以如果想让变量可变就要加上mut。 let x = 1; let mut y = 2; 2.match 相当于erlang中的case，但是case的每一项后都是分号，但是rust的match却是逗号。 3.match 的每一项最后都要加逗号，但是最后一项不加也不会报错，所有结尾加逗号的用法都是类似。 4.每个语句结尾都要加分