Zhehuai Chen's homepage

About Me

Zhehuai Chen 陈哲怀

My CV is available HERE.

Research Scientist at Google NYC.

Ph.D. in Computer Science, Shanghai Jiao Tong University.
Supervised by Prof. Kai Yu.

Room 3-520, SEIEE Building, SJTU
800 Dongchuan Road, Shanghai 200240, China

Email: chenzhehuai@foxmail.com
Personal Page: https://chenzhehuai.github.io/
http://www.douban.com/people/chenzhehuai/

Selected Publications

Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur, Incremental Lattice Determinization for WFST Decoders, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Sentosa, Singapore, 2019. [Best Paper Nomination]

Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael Seltzer, Christian Fuegen, Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR, 20th Annual Conference of the International Speech Communication Association (InterSpeech), Graz, Austria, 2019. [bibtex]

Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael Seltzer, Christian Fuegen, End-to-end Contextual Speech Recognition using Class Language Models and a Token Passing Decoder, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019. [pdf] [bibtex]

Zhehuai Chen, Wenlu Zheng, Yongbin You, Yanmin Qian, Kai Yu. Label Synchronous Decoding for Speech Recognition. Chinese Journal of Computers, 2019. [bibtex]

Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur, A GPU-based WFST Decoder with Exact Lattice Generation, 19th Annual Conference of the International Speech Communication Association (InterSpeech), 2018. [Best Paper Nomination] [pdf] [bibtex]

Zhehuai Chen, Linguistic Search Optimization for Deep Learning Based LVCSR, in Doctoral Consortium, 19th Annual Conference of the International Speech Communication Association (InterSpeech), 2018. [pdf] [bibtex]

Zhehuai Chen, Jasha Droppo, Sequence Modeling in Unsupervised Single-channel Overlapped Speech Recognition, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, 2018. [pdf] [slide] [bibtex]

Zhehuai Chen, Qi Liu, Hao Li, Kai Yu, On Modular Training of Neural Acoustics-to-word Model for LVCSR, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, 2018. [pdf] [slide] [bibtex]

Zhehuai Chen, Yanmin Qian, Kai Yu, Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting. Speech Communication, vol. 102, 100-111, 2018. [pdf] [bibtex]

Zhehuai Chen, Jasha Droppo, Jinyu Li, Wayne Xiong, Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 1, pp. 184-196, Jan. 2018. doi: 10.1109/TASLP.2017.2765834. [pdf] [bibtex]

Zhehuai Chen, Yanmin Qian, and Kai Yu. A unified confidence measure framework using auxiliary normalization graph, IScIDE, 2017.

Zhehuai Chen, Yimeng Zhuang and Kai Yu, Confidence Measures for CTC-based Phone Synchronous Decoding, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, USA, 2017: 4850-4854. [student travel grant] [pdf] [slide] [bibtex]

Zhehuai Chen, Yimeng Zhuang, Yanmin Qian and Kai Yu, Phone Synchronous Speech Recognition with CTC Lattices, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 1, pp. 86-97, Jan. 2017. doi: 10.1109/TASLP.2016.2625459. [pdf] [bibtex]

Zhehuai Chen, Wei Deng, Tao Xu and Kai Yu. Phone Synchronous Decoding with CTC Lattice. 17th Annual Conference of the International Speech Communication Association (InterSpeech), San Francisco, USA, 2016: 1923-1927. [student travel grant] [pdf] [poster] [bibtex]

Zhehuai Chen and Kai Yu, An Investigation of Implementation and Performance Analysis of DNN Based Speech Synthesis System. 12th IEEE International Conference on Signal Processing (ICSP), Hangzhou, 2014: 577-582. [pdf] [slide] [bibtex]

Selected Talks

2018 Google PhD Fellowship Summit poster [pdf]

2018 Interspeech review [pdf]

2018 JHU intern review [pdf]

2018 End-to-end speech recognition review [pdf]

2018 ICASSP review [pdf]

2017 MSR intern review [pdf]

2017 ICASSP review [pdf]

2016 Interspeech review [pdf]

2014 ICSP review [pdf]