算法学习者

语音识别综述 awesome-speech-recognition-speech-synthesis-papers

https://github.com/zzw922cn/awesome-speech-recognition-speech-synthesis-papers

awesome-speech-recognition-speech-synthesis-papers

automatic speech recognition/speech synthesis paper roadmap, including HMM, DNN, RNN, CNN, Seq2Seq, Attention

Introduction

Automatic Speech Recognition has been investigated for several decades, and speech recognition models are from HMM-GMM to deep neural networks today. It's very necessary to see the history of speech recognition by this awesome paper roadmap. I will cover papers from traditional models to nowadays popular models, not only acoustic models or ASR systems, but also many interesting language models.

Paper List

Automatic Speech Recognition

An Introduction to the Application of the Theory of Probabilistic Functions of a Markov Process to Automatic Speech Recognition(1982), S. E. LEVINSON et al. [pdf]
A Maximum Likelihood Approach to Continuous Speech Recognition(1983), LALIT R. BAHL et al. [pdf]
Heterogeneous Acoustic Measurements and Multiple Classifiers for Speech Recognition(1986), Andrew K. Halberstadt. [pdf]
Maximum Mutual Information Estimation of Hidden Markov Model Parameters for Speech Recognition(1986), Lalit R. Bahi et al. [pdf]
A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition(1989), Lawrence R Rabiner. [pdf]
Phoneme recognition using time-delay neural networks(1989), Alexander H. Waibel et al. [pdf]
Speaker-independent phone recognition using hidden Markov models(1989), Kai-Fu Lee et al. [pdf]
Hidden Markov Models for Speech Recognition(1991), B. H. Juang et al. [pdf]
Connectionist Speech Recognition: A Hybrid Approach(1994), Herve Bourlard et al. [pdf]
A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)(1997), J.G. Fiscus. [pdf]
Review of Tdnn (time Delay Neural Network) Architectures for Speech Recognition(2014), Masahide Sugiyamat et al. [pdf]
Framewise phoneme classification with bidirectional LSTM and other neural network architectures(2005), Alex Graves et al. [pdf]
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks(2006), Alex Graves et al. [pdf]
The kaldi speech recognition toolkit(2011), Daniel Povey et al. [pdf]
Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition(2012), Ossama Abdel-Hamid et al. [pdf]
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition(2012), George E. Dahl et al. [pdf]
Deep Neural Networks for Acoustic Modeling in Speech Recognition(2012), Geoffrey Hinton et al. [pdf]
Sequence Transduction with Recurrent Neural Networks(2012), Alex Graves et al. [pdf]
Deep convolutional neural networks for LVCSR(2013), Tara N. Sainath et al. [pdf]
Improving deep neural networks for LVCSR using rectified linear units and dropout(2013), George E. Dahl et al.[pdf]
Improving low-resource CD-DNN-HMM using dropout and multilingual DNN training(2013), Yajie Miao et al. [pdf]
Improvements to deep convolutional neural networks for LVCSR(2013), Tara N. Sainath et al. [pdf]
Machine Learning Paradigms for Speech Recognition: An Overview(2013), Li Deng et al. [pdf]
Recent advances in deep learning for speech research at Microsoft(2013), Li Deng et al. [pdf]
Speech recognition with deep recurrent neural networks(2013), Alex Graves et al. [pdf]
Convolutional deep maxout networks for phone recognition(2014), László Tóth et al. [pdf]
Convolutional Neural Networks for Speech Recognition(2014), Ossama Abdel-Hamid et al. [pdf]
Combining time- and frequency-domain convolution in convolutional neural network-based phone recognition(2014), László Tóth. [pdf]
Deep Speech: Scaling up end-to-end speech recognition(2014), Awni Y. Hannun et al. [pdf]
End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results(2014), Jan Chorowski et al. [pdf]
First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs(2014), Andrew L. Maas et al. [pdf]
Long short-term memory recurrent neural network architectures for large scale acoustic modeling(2014), Hasim Sak et al. [pdf]
Robust CNN-based speech recognition with Gabor filter kernels(2014), Shuo-Yiin Chang et al. [pdf]
Stochastic pooling maxout networks for low-resource speech recognition(2014), Meng Cai et al. [pdf]
Towards End-to-End Speech Recognition with Recurrent Neural Networks(2014), Alex Graves et al. [pdf]
Attention-Based Models for Speech Recognition(2015), Jan Chorowski et al. [pdf]
Analysis of CNN-based speech recognition system using raw speech as input(2015), Dimitri Palaz et al. [pdf]
Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks(2015), Tara N. Sainath et al. [pdf]
Deep convolutional neural networks for acoustic modeling in low resource languages(2015), William Chan et al.[pdf]
Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition(2015), Chao Weng et al. [pdf]
Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition(2015), Hasim Sak et al.[pdf]
Lexicon-Free Conversational Speech Recognition with Neural Networks(2015), Andrew L. Maas et al. [pdf]
Online Sequence Training of Recurrent Neural Networks with Connectionist Temporal Classification(2015), Kyuyeon Hwang et al. [pdf]
Advances in All-Neural Speech Recognition(2016), Geoffrey Zweig et al. [pdf]
Advances in Very Deep Convolutional Neural Networks for LVCSR(2016), Tom Sercu et al. [pdf]
End-to-end attention-based large vocabulary speech recognition(2016), Dzmitry Bahdanau et al. [pdf]
Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention(2016), Dong Yu et al. [pdf]
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin(2016), Dario Amodei et al. [pdf]
End-to-end attention-based distant speech recognition with Highway LSTM(2016), Hassan Taherian. [pdf]
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning(2016), Suyoun Kim et al.[pdf]
Listen, attend and spell: A neural network for large vocabulary conversational speech recognition(2016), William Chan et al. [pdf]
Latent Sequence Decompositions(2016), William Chan et al. [pdf]
Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks(2016), Tara N. Sainath et al. [pdf]
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition(2016), Suyoun Kim et al. [pdf]
Segmental Recurrent Neural Networks for End-to-End Speech Recognition(2016), Liang Lu et al. [pdf]
Towards better decoding and language model integration in sequence to sequence models(2016), Jan Chorowski et al. [pdf]
Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition(2016), Yanmin Qian et al. [pdf]
Very Deep Convolutional Networks for End-to-End Speech Recognition(2016), Yu Zhang et al. [pdf]
Very deep multilingual convolutional neural networks for LVCSR(2016), Tom Sercu et al. [pdf]
Wav2Letter: an End-to-End ConvNet-based Speech Recognition System(2016), Ronan Collobert et al. [pdf]
WaveNet: A Generative Model for Raw Audio(2016), Aäron van den Oord et al. [pdf]
Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech(2017), Michael Neumann et al. [pdf]
An enhanced automatic speech recognition system for Arabic(2017), Mohamed Amine Menacer et al. [pdf]
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM(2017), Takaaki Hori et al. [pdf]
A network of deep neural networks for distant speech recognition(2017), Mirco Ravanelli et al. [pdf]
An online sequence-to-sequence model for noisy speech recognition(2017), Chung-Cheng Chiu et al. [pdf]
An Unsupervised Speaker Clustering Technique based on SOM and I-vectors for Speech Recognition Systems(2017), Hany Ahmed et al. [pdf]
Building DNN acoustic models for large vocabulary speech recognition(2017), Andrew L. Maas et al. [pdf]
Direct Acoustics-to-Word Models for English Conversational Speech Recognition(2017), Kartik Audhkhasi et al.[pdf]
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments(2017), Zixing Zhang et al. [pdf]
English Conversational Telephone Speech Recognition by Humans and Machines(2017), George Saon et al. [pdf]
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA(2017), Song Han et al. [pdf]
Deep LSTM for Large Vocabulary Continuous Speech Recognition(2017), Xu Tian et al. [pdf]
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling(2017), Hairong Liu et al.[pdf]
Multichannel End-to-end Speech Recognition(2017), Tsubasa Ochiai et al. [pdf]
Multi-task Learning with CTC and Segmental CRF for Speech Recognition(2017), Liang Lu et al. [pdf]
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition(2017), Tara N. Sainath et al. [pdf]
Optimizing expected word error rate via sampling for speech recognition(2017), Matt Shannon. [pdf]
Residual Convolutional CTC Networks for Automatic Speech Recognition(2017), Yisen Wang et al. [pdf]
Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition(2017), Jaeyoung Kim et al. [pdf]
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition(2017), Suyoun Kim et al. [pdf]
Reducing Bias in Production Speech Models(2017), Eric Battenberg et al. [pdf]

Speech Synthesis

Signal estimation from modified short-time Fourier transform(1993), Daniel W. Griffin et al. [pdf]
A fast Griffin-Lim algorithm(2013), Nathanael Perraudin et al. [pdf]
First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention(2016), Wenfu Wang et al. [pdf]
Recent Advances in Google Real-Time HMM-Driven Unit Selection Synthesizer(2016), Xavi Gonzalvo et al. [pdf]
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model(2016), Soroush Mehri et al. [pdf]
WaveNet: A Generative Model for Raw Audio(2016), Aäron van den Oord et al. [pdf]
Deep Voice: Real-time Neural Text-to-Speech(2017), Sercan O. Arik et al. [pdf]
Deep Voice 2: Multi-Speaker Neural Text-to-Speech(2017), Sercan Arik et al. [pdf]
Tacotron: Towards End-to-End Speech Synthesis(2017), Yuxuan Wang et al. [pdf]

Language Modelling

Class-Based n-gram Models of Natural Language(1992), Peter F. Brown et al. [pdf]
A Neural Probabilistic Language Model(2000), Yoshua Bengio et al. [pdf]
Discriminative n-gram language modeling(2007), Brian Roark et al. [pdf]
Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition(2016), Xie Chen et al. [pdf]

Contact Me

For any questions, welcome to send email to :[email protected]. Thanks!

第八周的点滴记录 aefb1b6871b5
1.由于各种主客观原因，这周开了初一年级的第一次家长会，我们一班的家长来得不多，但是方便了一对一的沟通。在家长会开始前播放了皮克斯《包包包》给家长们看，引起部分家长的共鸣，差点泪目。2.在六班开始了dailyspeech，按照学号轮流每天课前三分钟内的英语演讲.我给孩子们提供了获取演讲灵感的途径，例如TED,Chinadaily，留学征途等。让我印象深刻的是汤义讲了最近高校流行的payforpra
VITS 源码解析2-模型概述迪三 #NN_Audio 音频人工智能
VITs是文本到语音(Text-to-Speech,TTS)任务中最流行的技术之一，其实现思路是将文本语音信息融合到了HiFiGAN潜空间内,通过文本控制HiFiGAN的生成器，输出含文本语义的声音。VITs主要以GAN的方式训练,其生成器G是SynthesizerTrn，判别器D是MPD。VITS的判别器几乎和HiFiGAN一样，生成器则融合了文本、时序、声音三大类模型1.文件概述模型部分包含三
QT DAY4 _shenash qt 开发语言
作业：要求做一个闹钟clock.proQT+=coreguitexttospeechmain.cpp#include"widget.h"intmain(intargc,char*argv[]){QApplicationa(argc,argv);Widgetw;w.show();returna.exec();}widget.cpp#include"widget.h"Widget::Widget(QW
FunASR 语音识别系统概述瑞雪兆我心语音识别人工智能
FunASR（AFundamentalEnd-to-EndSpeechRecognitionToolkit）是一个基础的语音识别工具包，提供多种功能，包括语音识别（ASR）、语音端点检测（VAD）、标点恢复（PR）、语言模型（LM）、说话人分离等。项目源地址1语音识别（ASR）参考语音交互：聊聊语音识别-ASR（万字长文）语音识别技术（AutomaticSpeechRecognition,ASR）
使用PyTorch实现的DeepSpeech模型: 强大的语音识别利器毕艾琳
使用PyTorch实现的DeepSpeech模型:强大的语音识别利器deepspeech.pytorchSpeechRecognitionusingDeepSpeech2.项目地址:https://gitcode.com/gh_mirrors/de/deepspeech.pytorch在今天的数字化世界中，语音识别技术已成为人机交互的关键组成部分。deepspeech.pytorch是一个由Sea
Peter Pan 33 Mr_Oldman
Therecametohimapresentimentofhisearlydissolution(death).ItwasasifPeter'sterribleoathhadboardedtheship.Hookfeltagloomydesiretomakehisdyingspeech,lestpresentlythereshouldbenotimeforit.{1}"BetterforHook,
MB-iSTFT-VITS 项目教程邱敬镇
MB-iSTFT-VITS项目教程MB-iSTFT-VITSLightweightandHigh-FidelityEnd-to-EndText-to-SpeechwithMulti-BandGenerationandInverseShort-TimeFourierTransform项目地址:https://gitcode.com/gh_mirrors/mb/MB-iSTFT-VITS项目介绍MB-
推荐项目：VITS2 Chinese - 轻松转化你的中文语音至文本傅尉艺Maggie
推荐项目：VITS2Chinese-轻松转化你的中文语音至文本VITS2-ChineseVITS2forChinesespeech|最新VITS2中文语音合成项目地址:https://gitcode.com/gh_mirrors/vi/VITS2-Chinese项目介绍VITS2Chinese是一个针对中文语音的自动转文字工具，它简化了传统语音识别的复杂流程，让用户只需上传音频文件，就能一键完成语
探索MB-iSTFT-VITS：一款高效的语音合成工具张姿桃Erwin
探索MB-iSTFT-VITS：一款高效的语音合成工具MB-iSTFT-VITSLightweightandHigh-FidelityEnd-to-EndText-to-SpeechwithMulti-BandGenerationandInverseShort-TimeFourierTransform项目地址:https://gitcode.com/gh_mirrors/mb/MB-iSTFT-V
探索未来之声：GLaDOS 文本转语音引擎卓桢琳Blackbird
探索未来之声：GLaDOS文本转语音引擎项目地址:https://gitcode.com/R2D2FISH/glados-tts在科技的无垠领域中，有一项创新正悄然改变我们与机器交流的方式——GLaDOSText-to-speech(TTS)VoiceGenerator。想象一下，拥有一个如同游戏《传送门》中的智能助手般独特且富有表现力的声音，这不再仅仅是科幻梦想。GLaDOSTTS，基于神经网络
AI语音机器人：通过 Azure Speech 实现类人类的交互全云在线allcloudonline 人工智能机器人 azure
语音对话的重要性在竞争日益激烈的客户互动领域，人工智能语音对话正成为重中之重。随着数字参与者的崛起，组织认识到语音机器人的强大力量，它是一种自然而直观的沟通方式，可以提供类似人类的体验，深度吸引用户，并让他们从竞争对手中脱颖而出。无缝客户服务、个性化协助和即时信息访问的需求推动了对高质量语音交互的需求不断增长。此外，随着公司努力保留和扩大收入，跨越语言障碍接触更多样化的客户群变得至关重要，这使得多
Python知识点：如何使用Python实现语音识别超哥同学 Python系列 python 语音识别 xcode 编程面试
要在Python中实现语音识别，你可以使用SpeechRecognition库，它是一个功能强大的库，能够识别音频中的语音并将其转换为文本。下面是一个简单的示例代码，展示如何使用这个库进行语音识别。步骤1：安装依赖库首先，你需要安装SpeechRecognition库和pyaudio库。你可以使用以下命令安装这些库：pipinstallSpeechRecognitionpipinstallpyau
开源语音转文本Speech-to-Text大模型实战之Wav2Vec篇码上飞扬 Wav2Vec 文本转语音人工智能 Speech-to-Text
前言近年来，语音转文本（Speech-to-Text,STT）技术取得了长足的进步，广泛应用于各种领域，如语音助手、自动字幕生成、智能客服等。本文将详细介绍如何利用开源语音转文本大模型进行实战，从模型选择、环境搭建、模型训练到实际应用，带您一步步实现语音转文本功能。一、模型选择目前，市面上有许多优秀的开源语音转文本模型可供选择，其中一些流行的模型包括：DeepSpeech：由Mozilla开源，基
逆水行舟，不进则退舒乔终身成长
昨天例会做了一篇speech，匆匆写稿，完全没有rehearsal。结果嘛，当然不尽人意。首先，noticeablenerviness。紧张偶尔会有，但都controllable，毕竟按我的马龄，也算是个老司机，可被人看出，也是头一遭。原因嘛，也不言而喻。最近半年，参会频率和发言频率都太低，练习太少，至今还未完全适应ZOOMmeeting。此外，昨天一天其实没有什么安排，可却拖延到下午四点来钟才开
AdventureCreator学习笔记13：脸部表情 AlpacasKing
BlendShape设置在模型上添加Shapeable脚本，可以添加表情组，然后在表情组里可以添加表情。BlendShape设置表情设置在NPC脚本上，还需要添加表情，名字与表情组相同。表情设置使用表情可以在ActionList中添加新的Action，也可以直接写在对话文本中。使用表情嘴型设置对话时根据内容嘴型可以相应变化。首先在ACGameEditor的Speech选项卡开启Lipsyncing
ITTS, VALL-E,soundstorm 0010000100 PyTorch 人工智能
ITTS,VALL-E,andSoundStormarealladvancedtechnologiesandmodelsrelatedtospeechandaudioprocessing.ITTS(InteractiveText-to-Speech):ITTSusuallyreferstoaText-to-Speech(TTS)systemthatallowsinteractivecontrolo
探索创新语音识别：IMS Toucan - 你的智能语音解决方案班歆韦Divine
探索创新语音识别：IMSToucan-你的智能语音解决方案IMS-ToucanText-to-SpeechToolkitoftheSpeechandLanguageTechnologiesGroupattheUniversityofStuttgart.Objectivesofthedevelopmentaresimplicity,modularity,controllabilityandmulti
WebKit的语音交互新篇章：Web Speech API深度解析 2401_85742452 前端 webkit 交互
WebKit的语音交互新篇章：WebSpeechAPI深度解析随着技术的进步，人机交互的方式正在不断演变。WebSpeechAPI作为现代Web技术的一部分，为浏览器提供了语音识别和语音合成的能力。这项API在WebKit中的支持为开发者带来了创建具有语音交互功能的Web应用的可能性。本文将详细介绍WebKit对WebSpeechAPI的支持，并提供实际的代码示例。一、WebSpeechAPI简介
【机器学习】Whisper：开源语音转文本（speech-to-text）大模型实战 LDG_AGI AI智能体研发之路-模型篇机器学习 whisper 人工智能语音识别实时音视频 python transformer
目录一、引言二、Whisper模型原理2.1模型架构2.2语音处理2.3文本处理三、Whisper模型实战3.1环境安装3.2模型下载3.3模型推理3.4完整代码3.5模型部署四、总结一、引言上一篇对ChatTTS文本转语音模型原理和实战进行了讲解，第6次拿到了热榜第一。今天，分享其对称功能（语音转文本）模型：Whisper。Whisper由OpenAI研发并开源，参数量最小39M，最大1550M
使用pyttsx3实现文字转语音静候光阴语音识别语音识别人工智能
专栏总目录该方法不需要生成音频文件，可以直接输出声音。但是，声音比较生硬，不自然。只能说是一种比较方便实现的文字转语音简单方案一、安装pyttsx3安装命令：pipinstallpyttsx3二、代码执行后，即可听到转换后的声音importpyttsx3#创建文字转语音函数deftext_to_speech(text):#初始化语音引擎engine=pyttsx3.init()#设置语音速度eng
iOS Unity导出的工程基础百度语音合成SDK问题 Fintecher
没有基础百度语音合成sdk前,Unity可以正常播放声音。集成后百度语音合成sdk能够播放声音，但是Unity却不能播放声音。研究了一个多小时，终于在百度语音的sdk中发现一点蛛丝马迹。配置百度sdk的时候讲AVSession管理关闭[[BDSSpeechSynthesizersharedInstance]setSynthParam:@(0)forKey:BDS_SYNTHESIZER_PARAM
WhisperFusion：具有超低延迟无缝对话功能的AI系统语音之家智能语音人工智能语音识别语言模型
WhisperFusion基于WhisperLive和WhisperSpeech的功能而构建，在实时语音到文本管道之上集成了大型语言模型Mistral(LLM)。LLM和Whisper都经过优化，可作为TensorRT引擎高效运行，从而最大限度地提高性能和实时处理能力。WhiperSpeech是通过torch.compile进行优化的。特征实时语音转文本：利用OpenAIWhisperLive将口
Python实现视频转音频、音频转文本的最佳方法 AICodeThunder 问答教程 python 音视频开发语言
文章目录Python实现视频转音频和音频转文字视频转音频步骤1：导入moviepy库步骤2：选择视频文件步骤3：创建VideoFileClip对象步骤4：提取音频步骤5：保存音频文件音频转文字步骤1：导入SpeechRecognition库步骤2：选择音频文件步骤3：创建Recognizer对象步骤4：读取音频文件步骤5：将音频转换为文字步骤6：打印转换结果完整代码Python实现视频转音频和音频
2022-08-04 CET-6 Catholi_costudy
Vocabularyingredientn.配料，成分，因素texturen.质感，口感literallyn.字面上的，确实地，简直renewv.延长，使续期，重新开始，更换Phraseakeyingredient关键配料freshingredients新鲜食材anactiveingredient有效成分anessentialingredient基本要素acceptancespeech获奖感言th
Shell Script Jenny测试开发基本能力 linux 运维服务器
foriin`kubectlgetpod--namespacespeech-cn-lapp=locust|awk'{if(NR>1)}{print$1}'`dokubectlcp$i:/home/locust/output.txt./01019/$i.txtdone
扩展语音识别系统：增强功能与多语言支持 GT开发算法工程师语音识别人工智能
一、引言在之前的博客中，我们成功构建了一个基于LibriSpeech数据集的英文语音识别系统。现在，我们将对系统进行扩展，增加一些增强功能，并尝试支持多语言识别。二、增加增强功能语音合成--除了语音识别，我们还可以增加语音合成（Text-to-Speech,TTS）功能，将文本转换为语音输出。这可以使得我们的系统不仅仅是一个转录工具，还能够进行语音交互。情感分析--通过结合情感分析模型，我们可以识
ASR 概述墨鸦_Cormorant Python ASR
前言随着企业加强了与客户的线上沟通，企业越发依赖于虚拟助手、聊天机器人以及其他的语音技术，以实现与客户的高效互动。这几类人工智能，都是依赖于自动语音识别技术，简称为ASR。ASR涉及到将语音转换为文本，促使计算机理解人类语言并与人类对话。语音识别技术，也被称为自动语音识别AutomaticSpeechRecognition（ASR），其目标是将人类的语音中的词汇内容转换为计算机可读的输入，例如按键
中文语音标注工具FunASR（语音识别）我要用代码向我喜欢的女孩表白语音识别人工智能
全称AFundamentalEnd-to-EndSpeechRecognitionToolkit（一个语音识别工具）可能大家用过whisper（openAi），它【标注英语的确很完美】，【但中文会出现标注错误】或搞了个没说的词替换上去，所以要人工核对，麻烦。FunASR作用:能【准确】识别语音，并转成【文字、标出声调】他的原理，就不讲了，俺是搞大数据的，python这东西就勉强能写个爬虫和接口，机
掌上新闻随心播控，HarmonyOS SDK助力新浪新闻打造精致易用的资讯服务新体验
原生智能是HarmonyOSNEXT的核心亮点之一，依托HarmonyOSSDK丰富全面的开放能力，开发者只需通过几行代码，即可快速实现AI功能。新浪新闻作为鸿蒙原生应用开发的先行者之一，从有声资讯入手，将基于SpeechKit朗读控件上线听新闻功能，让新浪新闻充分发挥HarmonyOS原生智能优势，为鸿蒙用户带来更加智能易用的新闻即时获取体验。朗读控件，更智能、更精致、更易用HarmonyOSS
windows 环境实现文字转语音。咕噜咕噜_87bc
目前有很多提供语音合成的SDK，比如科大讯飞，百度，腾讯云等。其实windowspowershell里自带语音合成。例如：Add-Type-AssemblyNameSystem.speech;$speak=New-ObjectSystem.Speech.Synthesis.SpeechSynthesizer;$speak.Rate=1;//朗读速度$speak.SetOutputToWaveFil
关于旗正规则引擎中的MD5加密问题何必如此 jsp MD5 规则加密
一般情况下，为了防止个人隐私的泄露，我们都会对用户登录密码进行加密，使数据库相应字段保存的是加密后的字符串，而非原始密码。在旗正规则引擎中，通过外部调用，可以实现MD5的加密，具体步骤如下： 1.在对象库中选择外部调用，选择“com.flagleader.util.MD5”，在子选项中选择“com.flagleader.util.MD5.getMD5ofStr({arg1})”； 2.在规
【Spark101】Scala Promise/Future在Spark中的应用 bit1129 Promise
Promise和Future是Scala用于异步调用并实现结果汇集的并发原语，Scala的Future同JUC里面的Future接口含义相同，Promise理解起来就有些绕。等有时间了再仔细的研究下Promise和Future的语义以及应用场景，具体参见Scala在线文档：http://docs.scala-lang.org/sips/completed/futures-promises.html
spark sql 访问hive数据的配置详解 daizj spark sql hive thriftserver
spark sql 能够通过thriftserver 访问hive数据，默认spark编译的版本是不支持访问hive，因为hive依赖比较多，因此打的包中不包含hive和thriftserver,因此需要自己下载源码进行编译，将hive，thriftserver打包进去才能够访问，详细配置步骤如下： 1、下载源码 2、下载Maven,并配置此配置简单，就略过
HTTP 协议通信周凡杨 java httpclient http 通信
一：简介 HTTPCLIENT，通过JAVA基于HTTP协议进行点与点间的通信！二：代码举例测试类： import java
java unix时间戳转换 g21121 java
把java时间戳转换成unix时间戳： Timestamp appointTime=Timestamp.valueOf(new SimpleDateFormat("yyyy-MM-dd HH:mm:ss").format(new Date())) SimpleDateFormat df = new SimpleDateFormat("yyyy-MM-dd hh:m
web报表工具FineReport常用函数的用法总结（报表函数）老A不折腾 web报表 finereport 总结
说明：本次总结中，凡是以tableName或viewName作为参数因子的。函数在调用的时候均按照先从私有数据源中查找，然后再从公有数据源中查找的顺序。 CLASS CLASS(object):返回object对象的所属的类。 CNMONEY CNMONEY(number,unit)返回人民币大写。 number:需要转换的数值型的数。 unit:单位，
java jni调用c++ 代码报错墙头上一根草 java C++jni
# # A fatal error has been detected by the Java Runtime Environment: # # EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x00000000777c3290, pid=5632, tid=6656 # # JRE version: Java(TM) SE Ru
Spring中事件处理de小技巧 aijuans spring Spring 教程 Spring 实例 Spring 入门 Spring3
Spring 中提供一些Aware相关de接口，BeanFactoryAware、 ApplicationContextAware、ResourceLoaderAware、ServletContextAware等等，其中最常用到de匙ApplicationContextAware.实现ApplicationContextAwaredeBean，在Bean被初始后，将会被注入 Applicati
linux shell ls脚本样例 annan211 linux linux ls源码 linux 源码
#! /bin/sh - #查找输入文件的路径 #在查找路径下寻找一个或多个原始文件或文件模式 # 查找路径由特定的环境变量所定义 #标准输出所产生的结果通常是查找路径下找到的每个文件的第一个实体的完整路径 # 或是filename :not found 的标准错误输出。 #如果文件没有找到则退出码为0 #否则即为找不到的文件个数 #语法 pathfind [--
List,Set,Map遍历方式 (收集的资源,值得看一下) 百合不是茶 list set Map遍历方式
List特点：元素有放入顺序，元素可重复 Map特点：元素按键值对存储，无放入顺序 Set特点：元素无放入顺序，元素不可重复（注意：元素虽然无放入顺序，但是元素在set中的位置是有该元素的HashCode决定的，其位置其实是固定的） List接口有三个实现类：LinkedList，ArrayList，Vector LinkedList：底层基于链表实现，链表内存是散乱的，每一个元素存储本身
解决SimpleDateFormat的线程不安全问题的方法 bijian1013 java thread 线程安全
在Java项目中，我们通常会自己写一个DateUtil类，处理日期和字符串的转换，如下所示： public class DateUtil01 { private SimpleDateFormat dateformat = new SimpleDateFormat("yyyy-MM-dd HH:mm:ss"); public void format(Date d
http请求测试实例（采用fastjson解析） bijian1013 http 测试
在实际开发中，我们经常会去做http请求的开发，下面则是如何请求的单元测试小实例，仅供参考。 import java.util.HashMap; import java.util.Map; import org.apache.commons.httpclient.HttpClient; import
【RPC框架Hessian三】Hessian 异常处理 bit1129 hessian
RPC异常处理概述 RPC异常处理指是，当客户端调用远端的服务，如果服务执行过程中发生异常，这个异常能否序列到客户端？如果服务在执行过程中可能发生异常，那么在服务接口的声明中，就该声明该接口可能抛出的异常。在Hessian中，服务器端发生异常，可以将异常信息从服务器端序列化到客户端，因为Exception本身是实现了Serializable的
【日志分析】日志分析工具 bit1129 日志分析
1. 网站日志实时分析工具 GoAccess http://www.vpsee.com/2014/02/a-real-time-web-log-analyzer-goaccess/ 2. 通过日志监控并收集 Java 应用程序性能数据(Perf4J) http://www.ibm.com/developerworks/cn/java/j-lo-logforperf/ 3.log.io 和
nginx优化加强战斗力及遇到的坑解决 ronin47 nginx 优化
　　　先说遇到个坑，第一个是负载问题，这个问题与架构有关，由于我设计架构多了两层，结果导致会话负载只转向一个。解决这样的问题思路有两个：一是改变负载策略，二是更改架构设计。　　　由于采用动静分离部署，而nginx又设计了静态，结果客户端去读nginx静态，访问量上来，页面加载很慢。解决：二者留其一。最好是保留apache服务器。　　　来以下优化：　　　
java-50-输入两棵二叉树A和B，判断树B是不是A的子结构 bylijinnan java
思路来自： http://zhedahht.blog.163.com/blog/static/25411174201011445550396/ import ljn.help.*; public class HasSubtree { /**Q50. * 输入两棵二叉树A和B，判断树B是不是A的子结构。例如，下图中的两棵树A和B，由于A中有一部分子树的结构和B是一
mongoDB 备份与恢复开窍的石头 mongDB备份与恢复
Mongodb导出与导入 1: 导入/导出可以操作的是本地的mongodb服务器,也可以是远程的. 所以,都有如下通用选项: -h host 主机 --port port 端口 -u username 用户名 -p passwd 密码 2: mongoexport 导出json格式的文件
[网络与通讯]椭圆轨道计算的一些问题 comsci 网络
如果按照中国古代农历的历法，现在应该是某个季节的开始，但是由于农历历法是3000年前的天文观测数据，如果按照现在的天文学记录来进行修正的话，这个季节已经过去一段时间了。。。。。也就是说，还要再等3000年。才有机会了，太阳系的行星的椭圆轨道受到外来天体的干扰，轨道次序发生了变
软件专利如何申请 cuiyadll 软件专利申请
软件技术可以申请软件著作权以保护软件源代码，也可以申请发明专利以保护软件流程中的步骤执行方式。专利保护的是软件解决问题的思想，而软件著作权保护的是软件代码（即软件思想的表达形式）。例如，离线传送文件，那发明专利保护是如何实现离线传送文件。基于相同的软件思想，但实现离线传送的程序代码有千千万万种，每种代码都可以享有各自的软件著作权。申请一个软件发明专利的代理费大概需要5000-8000申请发明专利可
Android学习笔记 darrenzhu android
1.启动一个AVD 2.命令行运行adb shell可连接到AVD,这也就是命令行客户端 3.如何启动一个程序 am start -n package name/.activityName am start -n com.example.helloworld/.MainActivity 启动Android设置工具的命令如下所示： # am start -
apache虚拟机配置，本地多域名访问本地网站 dcj3sjt126com apache
现在假定你有两个目录，一个存在于 /htdocs/a，另一个存在于 /htdocs/b 。现在你想要在本地测试的时候访问 www.freeman.com 对应的目录是 /xampp/htdocs/freeman ,访问 www.duchengjiu.com 对应的目录是 /htdocs/duchengjiu。 1、首先修改C盘WINDOWS\system32\drivers\etc目录下的
yii2 restful web服务[速率限制] dcj3sjt126com PHP yii2
速率限制为防止滥用，你应该考虑增加速率限制到您的API。例如，您可以限制每个用户的API的使用是在10分钟内最多100次的API调用。如果一个用户同一个时间段内太多的请求被接收，将返回响应状态代码 429 (这意味着过多的请求)。要启用速率限制, [[yii\web\User::identityClass|user identity class]] 应该实现 [[yii\filter
Hadoop2.5.2安装——单机模式 eksliang hadoop hadoop单机部署
转载请出自出处：http://eksliang.iteye.com/blog/2185414 一、概述 Hadoop有三种模式单机模式、伪分布模式和完全分布模式，这里先简单介绍单机模式，默认情况下，Hadoop被配置成一个非分布式模式，独立运行JAVA进程，适合开始做调试工作。二、下载地址 Hadoop 网址http:
LoadMoreListView+SwipeRefreshLayout（分页下拉）基本结构 gundumw100 android
一切为了快速迭代 import java.util.ArrayList; import org.json.JSONObject; import android.animation.ObjectAnimator; import android.os.Bundle; import android.support.v4.widget.SwipeRefreshLayo
三道简单的前端HTML/CSS题目 ini html Web 前端 css 题目
使用CSS为多个网页进行相同风格的布局和外观设置时，为了方便对这些网页进行修改，最好使用（）。http://hovertree.com/shortanswer/bjae/7bd72acca3206862.htm 在HTML中加入<table style=”color:red; font-size:10pt”>，此为（）。http://hovertree.com/s
overrided方法编译错误 kane_xie override
问题描述：在实现类中的某一或某几个Override方法发生编译错误如下： Name clash: The method put(String) of type XXXServiceImpl has the same erasure as put(String) of type XXXService but does not override it 当去掉@Over
Java中使用代理IP获取网址内容（防IP被封，做数据爬虫） mcj8089 免费代理IP 代理IP 数据爬虫 JAVA设置代理IP 爬虫封IP
推荐两个代理IP网站： 1. 全网代理IP：http://proxy.goubanjia.com/ 2. 敲代码免费IP：http://ip.qiaodm.com/ Java语言有两种方式使用代理IP访问网址并获取内容，方式一，设置System系统属性 // 设置代理IP System.getProper
Nodejs Express 报错之 listen EADDRINUSE qiaolevip 每天进步一点点学习永无止境 nodejs 纵观千象
当你启动 nodejs服务报错： >node app Express server listening on port 80 events.js:85 throw er; // Unhandled 'error' event ^ Error: listen EADDRINUSE at exports._errnoException (
C++中三种new的用法 _荆棘鸟_ C++new
转载自：http://news.ccidnet.com/art/32855/20100713/2114025_1.html 作者: mt 其一是new operator，也叫new表达式；其二是operator new，也叫new操作符。这两个英文名称起的也太绝了，很容易搞混，那就记中文名称吧。new表达式比较常见，也最常用，例如： string* ps = new string("
Ruby深入研究笔记1 wudixiaotie Ruby
module是可以定义private方法的 module MTest def aaa puts "aaa" private_method end private def private_method puts "this is private_method" end end