AI Speech Recognition: The Complete Guide to Hands-On Applications with Kaldi

Authors: 陳果果, 都家宇, 那興宇

Publisher: 深智

Publication date: September 21, 2020

ISBN: 9789865501525

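Ever since "Hey Siri" and "OK Google", we have grown used to controlling devices by voice: voice input replaces the keyboard, and Google reads articles aloud for you. You are probably curious how these speech systems are built.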

This book centers on Kaldi. It covers data processing for corpora such as LibriSpeech and explains the triphone architecture in full.

Speech modeling: language models and n-gram models.

Feature engineering: alignment, the transition model, GMMs, and more.

Graph construction and decoding: OpenFst, WFSTs, and related techniques.

Deep learning modeling: nnet, nnet2, and nnet3.

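The most widely used applications, voice search and wake-word detection, also get a full hands-on treatment. Speaker recognition, the voice counterpart of face recognition, is implemented with PLDA, i-vectors, and x-vectors, and the currently popular task of language identification is covered as well. The result is arguably the best handbook for going deep into speech engineering.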
