An Illustrated Guide to DeepSeek Technology (圖解DeepSeek技術)

Authors: Jay Alammar [Saudi Arabia], Maarten Grootendorst [Netherlands]

Translators: Li Bojie (李博傑), Meng Jiaying (孟佳穎)

Publisher: Posts & Telecom Press (人民郵電出版社)

June 1, 2025

ISBN: 9787115674616

This book explains the technology underlying DeepSeek in accessible language, supported by a large number of illustrations.

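The book consists of three chapters and an appendix. Chapter 1 analyzes in detail the paradigm shift behind reasoning large language models, namely the move from "train-time compute" to "test-time compute". Chapter 2 explains the architecture of DeepSeek-R1: Mixture of Experts (MoE). Chapter 3 walks through the detailed training process of DeepSeek-R1 and its core techniques, including GRPO-based reinforcement learning. The appendix covers the DeepSeek Open Source Week event.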

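This book is intended for practitioners working with large models and for readers interested in the technology underlying them. Its many illustrations make complex techniques simple, clear, and easy to follow, making it a valuable reference for learning about large model technology.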

