李开复、洪小文、黄学东关系

1、Sphinx是我的博士论文,当时CMU主流团队叫做Angel,Sphinx开始时,就是我一个人做的。后来,洪小文刚进入博士班,帮着我做,有些code是他写的。论文里的想法、实验都是我自己做的,但是因为洪小文有贡献,所以在后来出版的一些文章上,除了我和导师的名字,我也有挂他的名字。

毕业后,Sphinx被立项成为CMU主流,我负责这个项目两年,黄学东是我雇佣的博士后,另外有三、四位成员,包括洪小文。

两年后,我离开CMU,加入苹果,黄学东成为这个项目的负责人。

在这件事情上,不要想太多,大家说的都没有错误。Sphinx既是我的博士论文,也是后来组织的名称。

匿名  2011-7-17
2、SPHINX是李開復在1988發表的博士論文主題(http://portal.acm.org/citation.cfm?id=914540),而洪小文是89才開始出現在之後發表的數篇論文上的,所以看得出來原始想法是李開復的。但後來李開復寫過一本SPHINX的書 "Automatic speech recognition: the development of the SPHINX system",開頭的謝辭(http://goo.gl/tPgR8)有提到洪小文跟他從一開始就有緊密的合作。從洪的Linkedin上可以看出他是86年進CMU,也就是說他們的確可能在那時就開始合作了,只是沒出現在88年的幾篇SPHINX論文上。
(在學術界每個實驗室有不同的論文掛名方式,所以有可能洪有參與實作,但沒出什麼主意所以就不放在論文上。)

匿名  2011-7-17
3、先自己贡献一个答案吧。
从CMU Sphinx的wiki看:
Sphinx

Sphinx is a continuous-speech, speaker-independent recognition system making use of hidden Markov acoustic models (HMMs) and an n-gram statistical language model. It was developed by Kai-Fu Lee.
Sphinx featured feasibility of continuous-speech, speaker-independent
large-vocabulary recognition, the possibility of which was in dispute at
the time (1986). Sphinx is of historical interest only; it has been
superseded in performance by subsequent versions. An archival article describes the system in detail.
一代是李开复开发的。
Sphinx 2

A fast performance-oriented recognizer, originally developed by Xuedong Huang at Carnegie Mellon and released as Open source with a BSD-style license on SourceForge by Kevin Lenzo
at LinuxWorld in 2000. Sphinx 2 focuses on real-time recognition
suitable for spoken language applications. As such it incorporates
functionality such as end-pointing, partial hypothesis generation,
dynamic language model switching and so on. It is used in dialog systems
and language learning systems. It can be used in computer based PBX
systems such as Asterisk.
Sphinx 2 code has also been incorporated into a number of commercial
products. It is no longer under active development (other than for
routine maintenance). Current real-time decoder development is taking
place in the Pocket Sphinx project. An archival article describes the system.
二代是黄学东开发的。二代开始开源。
不了解洪小文的参与时间。

不过学术上面的角色或者细节,不懂,懂行的入,讲讲。

匿名  2011-7-17
4、查ACM和DBLP的记录,貌似89年开始有李开复和洪小文对sphinx的论文:

Lee, K., Hon, H., and Hwang, M. 1989. Recent progress in the SPHINX Speech Recognition system. In Proceedings of the Workshop on Speech and Natural Language (Philadelphia, Pennsylvania, February 21 - 23, 1989). Association for Computational Linguistics, Stroudsburg, PA, 125-130. DOI= http://dx.doi.org/10.3115/100964.100973

Huang, X. D., Hon, H. W., and Lee, K. F. 1989. Large-vocabulary speaker-independent continuous speech recognition with semi-continuous hidden Markov models. In Proceedings of the Workshop on Speech and Natural Language (Cape Cod, Massachusetts, October 15 - 18, 1989). Association for Computational Linguistics, Stroudsburg, PA, 276-279. DOI= http://dx.doi.org/10.3115/1075434.1075480

Hon, H., Lee, K., and Weide, R. 1989. Towards speech recognition without vocabulary-specific training. In Proceedings of the Workshop on Speech and Natural Language (Cape Cod, Massachusetts, October 15 - 18, 1989). Association for Computational Linguistics, Stroudsburg, PA, 271-275. DOI= http://dx.doi.org/10.3115/1075434.10754

李开复应该在88年开始在cmu做AP,而sphinx是李开复的phd阶段的工作,所以貌似洪小文不能算作共同创始人?

你可能感兴趣的:(工作,System,dialog,performance,出版,generation)