李琳山名譽教授的個人資料 - Profile of Lin-shan Lee

李琳山 Lin-shan Lee

國立臺灣大學電機工程學系 名譽教授
國立台灣大學資訊工程學系 教授
國立台灣大學電信工程學研究所 教授
國立台灣大學資訊與多媒體研究所 教授
Emeritus Professor, Department of Electrical Engineering, National Taiwan University
Professor, Department of Computer Science and Information Engineering, National Taiwan University
Professor, Graduate Institute of Communication Engineering, National Taiwan University
Professor, Graduate Institute of Networking and Multimedia, National Taiwan University

主要研究領域:

數位語音處理

Major Research Areas:

Computer Processing of Speech Signals

研究領域摘要:

1.語音辨識核心技術:

語音訊號之新特徴、語音辨識之新模型或新架構、雜訊及通道效應處理、聲學模型之調適及精緻化、語言模型之調適及精緻化、自發性語音處理、中英夾雜之雙語語音處理、韻律及聲調模型等。

2.網路環境下語音辨識之智慧型應用:

語音瞭解、對話模型及系統、語音資訊之語意分析、語音資訊搜尋、語音資訊摘要、語音資訊重組、語音資訊之關鍵詞擷取、語音合成、分散式語音技術等。

Research Summary:

1. Core technologies for speech recognition:

New features for speech signals, new models and new frameworks for speech recognition, handling noise and channel effect, improved acoustic modeling and adaptation, improved language modeling and adaptation, spontaneous speech processing, Mandarin-English bilingual speech processing, prosody and tone modeling, etc.

2. Intelligent applications of speech recognition vender network environment:

Speech understanding, spoken dialog modeling and systems, semantic analysis, voice-based information retrieval, speech information summarization and distillation, spoken document understanding and organization, spoken key term extraction, speech synthesis, distributed speech processing technologies, etc.

Photo of Lin-shan Lee

代表性著作 Selected Publication

  1. Cheng-Tao Chung, Lin-shan Lee, “Unsupervised Discovery of Structured Acoustic Tokens with Applications to Spoken Term Detection,” IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 26, No. 2, pp.394-405, Feb. 2018
  2. Cheng-Tao Chung, Cheng-Yu Tsai, Chia-Hsiang Liu, Lin-shan Lee, “Unsupervised Iterative Deep Learning of Speech Features and Acoustic Tokens with Applications to Spoken Term Detection,” IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 23, No. 10, pp.1914-1928, Oct. 2017
  3. Lin-shan Lee, James Glass, Hung-yi Lee, Chun-an Chan, “Spoken Content Retrieval - Beyond Cascading Speech Recognition with Text Retrieval,” IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 23, No. 9, pp. 1389-1420, Sept. 2015
  4. Ching-Feng Yeh, Lin-shan Lee, “An Improved Framework for Recognizing Highly Imbalanced Bilingual Code-Switched Lectures with Cross-Language Acoustic Modeling and Frame-Level Language Identification,” IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 23, No. 7, pp. 1144-1159, Jul. 2015
  5. Yow-Bang Wang, Lin-shan Lee, “Supervised Detection and Unsupervised Discovery of Pronunciation Error Patterns for Computer-Assisted Language Learning,” IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 23, No. 3, pp. 564-579, Mar. 2015
  6. Pei-hao Su, Chuan-hsun Wu, Lin-shan Lee, “A Recursive Dialogue Game for Personalized Computer-Aided Pronunciation Training,” IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 23, No. 1, pp. 127-141, Jan. 2015
  7. Hung-yi Lee, Sz-Rung Shiang, Ching-Feng Yeh, Yun-Nung Chen, Yu Huang, Sheng-Yi Kong, Lin-shan Lee, “Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 22, No. 5, pp. 883-898, May 2014
  8. Hung-yi Lee, Lin-shan Lee, “Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk over Acoustic Similarity Graphs,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 22, No. 1, pp. 80-94, Jan. 2014
  9. Chia-Yu Wan, Lin-Shan Lee, “Histogram-Based Quantization for Robust and/or Distributed Speech Recognition,” IEEE Transactions on Audio, Speech and Language Processing, Volume 16, Issue 4, pp.859–873, May 2008
  10. Ming-Yi Tsai, Fu-Chiang Chou, Lin-shan Lee, “Pronunciation Modeling with Reduced Confusion for Mandarin Chinese Using A Three-stage Framework,” IEEE Transactions on Audio, Speech and Language Processing, Vol.15, No.2, pp.661-675, Feb. 2007
  11. Gwo-Hwa Ju, Lin-shan Lee, “A Perceptually Constrained GSVD-based Approach for Enhancing Speech Corrupted by Colored Noise,” IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No.1, pp.119-134, Jan. 2007
  12. Lin-shan Lee, Sheng-yi Kong, Yi-cheng Pan, Yi-sheng Fu, Yu-tsun Huang, “Multi-layered Summarization of Spoken Document Archives by Information Extraction and Semantic Structuring,” Interspeech Conference, International Speech Communication Association (ISCA), pp. 1539-1542, Pittsburgh, USA, Sept. 2006
  13. Jeih-weih Hung and Lin-shan Lee, “Optimization of Temporal Filters for Constructing Robust Features in Speech Recognition,” IEEE Transactions on Speech and Audio Processing, Vol.14, No.3, pp.808-832, May 2006
  14. Lin-shan Lee and Berlin Chen, “Spoken Document Understanding and Organization,” IEEE Signal Processing Magazine, Vol. 22, No.5, pp.42-60, Sept. 2005
  15. Yu Tsao, Shang-ming Lee and Lin-shan Lee, “Segmental Eigenvoice with Delicate Eigenspace for Improved Speaker Adaptation,” IEEE Transactions on Speech and Audio Processing, Vol.13, No.3, pp.399-411, May 2005