画像はありません

選択したカラーの画像がありません。
カラー：

このビデオを見るには、次をダウンロード Flash Player

著者をフォロー

Xuedong Huang

フォロー

Spoken Language Processing: A Guide to Theory, Algorithm and System Development ハードカバー – 2001/4/25

英語版 Hsiao-Wuen Huang, Xuedong Acero, Alex Hon (著)

5.0 9個の評価

すべての形式と版を表示

このページの読み込み中に問題が発生しました。もう一度試してください。

Remarkable progress is being made in spoken language processing, but many powerful techniques have remained hidden in conference proceedings and academic papers, inaccessible to most practitioners. In this book, the leaders of the Speech Technology Group at Microsoft Research share these advances -- presenting not just the latest theory, but practical techniques for building commercially viable products.KEY TOPICS:Spoken Language Processing draws upon the latest advances and techniques from multiple fields: acoustics, phonology, phonetics, linguistics, semantics, pragmatics, computer science, electrical engineering, mathematics, syntax, psychology, and beyond. The book begins by presenting essential background on speech production and perception, probability and information theory, and pattern recognition. The authors demonstrate how to extract useful information from the speech signal; then present a variety of contemporary speech recognition techniques, including hidden Markov models, acoustic and language modeling, and techniques for improving resistance to environmental noise. Coverage includes decoders, search algorithms, large vocabulary speech recognition techniques, text-to-speech, spoken language dialog management, user interfaces, and interaction with non-speech interface modalities. The authors also present detailed case studies based on Microsoft's advanced prototypes, including the Whisper speech recognizer, Whistler text-to-speech system, and MiPad handheld computer.MARKET:For anyone involved with planning, designing, building, or purchasing spoken language technology.

この商品に関する問題を報告する

本の長さ

1008ページ
言語

英語
出版社

Prentice Hall
発売日

2001/4/25
寸法

18.42 x 4.45 x 23.5 cm
ISBN-10

0130226165
ISBN-13

978-0130226167
すべての詳細を表示

商品の説明

著者について

XUEDONG HUANG is founder and head of the Speech Technology Group at Microsoft Research. He received his Ph.D. from the University of Edinburgh. He is an IEEE Fellow.

ALEX ACERO and HSIAO-WUEN HON are Senior Researchers at Microsoft Research and Senior Members of IEEE. Both received doctorates from Carnegie Mellon University.

Foreword by Dr. Raj Reddy, Carnegie Mellon University

登録情報

出版社 ‏ : ‎ Prentice Hall; 第1版 (2001/4/25)
発売日 ‏ : ‎ 2001/4/25
言語 ‏ : ‎ 英語
ハードカバー ‏ : ‎ 1008ページ
ISBN-10 ‏ : ‎ 0130226165
ISBN-13 ‏ : ‎ 978-0130226167
寸法 ‏ : ‎ 18.42 x 4.45 x 23.5 cm

Amazon 売れ筋ランキング: - 577,944位洋書 (洋書の売れ筋ランキングを見る)
- - 137位Computer Hardware DSPs
- - 480位Natural Language Processing Software
- - 813位Theory of Computing

カスタマーレビュー:
5.0 9個の評価

著者について

著者をフォローして、新作のアップデートや改善されたおすすめを入手してください。

Xuedong Huang

Brief content visible, double tap to read full content.

Full content visible, double tap to read brief content.

著者の本をもっと発見したり、よく似た著者を見つけたり、著者のブログを読んだりしましょう

カスタマーレビュー

5つのうち5つ

9グローバルレーティング

星5つ

100%

星4つ

0%

星3つ

0%

星2つ

0%

星1つ

0%

評価はどのように計算されますか?

この商品をレビュー

カスタマーレビューを書く

他の国からのトップレビュー

すべてのレビューを日本語に翻訳

calvinnme

A classic and comprehensive resource

2006年12月12日にアメリカ合衆国でレビュー済み

Amazonで購入

This book is a comprehensive overview of most of the major topics associated with speech processing. Divided into five main sections, the book is well structured with a clear division of concerns. The title, "Spoken Language Processing", may be misleading to some as language processing topics only accounts for one section of the book.

The first two sections cover the fundamental theories that should be understood before embarking in-depth into a study of speech processing. This may seem an obvious approach but many texts do not follow this pattern making their use as reference tomes limited. Separating background theory from its use is also useful in that it allows a rigorous approach to its description. Too often texts give a hurried imprecise overview of theories used before launching into a long and complex use of the theory; losing the reader instantly in a quagmire of formulae.

The first two sections of the book deals with background material, material that the reader should at least understand the key concepts of. The first section concentrates on speech in general (including production and perception), probability and statistics, and pattern classification. These last two topics mentioned are both important parts of the book and are dealt with in their own chapters. Both are well written with the right amount of explanation and background. Much of the remainder of the book expects at least some familiarity with the material presented here. These chapters, like all chapters in the book finish with a section entitled, "Historical Perspective and Further Reading". The inclusion of recommended further reading, in addition to the vast number of references appearing in each chapter, make the book as a whole a very good starting point for any work in speech processing.

The second section concerns itself with the DSP topics which relate to speech processing. In this section the reader will find everything from FFTs to multi-rate signal processing and speech signal representations to speech coding. Again the section is well written and the reader is not forced to refer to other texts to understand what is written. If a topic is not expanded upon here then it is an indication that is not dealt further in any great depth in the remainder of the book.

The third section of the book covers speech recognition and is probably the section which will find most use with many readers. This section is very thorough in its treatment of the subject. It starts immediately with a discussion of Hidden Markov Models which is almost exclusively the method employed in the pattern matching stage of speech recognition. Any algorithms that are mentioned are also detailed which really make the book useful. In fact algorithms are presented throughout the book making it a practical reference as much as a theoretical one. This is important because there is a big jump from understanding theory to being able to implement an algorithm to exploit that theory. Other topics covered include an excellent chapter on environmental robustness with one of the best discussions of microphones I have seen. Language modelling and search algorithms are given a thorough treatment. I would like to have seen more detailed information on front-end processing and endpoint detection, as this remains a critical stage of the recognition process. Perhaps the level of detail reflects the fact that this is currently a hot research topic with potential for significant advancement.

Section four, on text-to-speech processing, is a good overview of the field and better than any book I've seen on the subject. It shows numerous block diagrams of what you need to build such a system and gives numerous algorithms in pseudocode. It also dedicates a subsection to each block of the text-to-speech system block diagram, discussing in detail what you would need to do to implement that particular block. Since much of the individual blocks have been discussed earlier in the book, it refers you back to specific earlier sections for details.

The fifth section is a short one on entire systems and shows some case studies, concentrating on what Microsoft was doing at the time this book was published, since that is where the authors' research came from. I would highly recommend that anyone anticipating getting into speech processing have a copy of this classic nearby.

6人のお客様がこれが役に立ったと考えています

レビューを日本語に翻訳する

J. Myers

Great overview of all things speech in one tome.

2016年1月9日にアメリカ合衆国でレビュー済み

Amazonで購入

A good comprehensive overview of all things needed to know speech processing as of 2001. Once-in-a-decade book. Makes a great senior- or graduate-level textbook.

Errata: p.38 "ax" schwa sound is FIRST syllable of "ago", not second. High/(neutral)/low and front/(neutral)/back cannot be four binary systems, because it's impossible to have a [+high, +low] or a [+front, +back] vowel. They are two trinary systems. You can represent them by four binary systems if you want to pervert the notation, but you should not call them that.

レビューを日本語に翻訳する

katydid

A tour de force

2005年8月23日にアメリカ合衆国でレビュー済み

Amazonで購入

What a wonderful book. Whether you are a computer scientist or mathematician with limited exposure to the discipline of speech processing, or alternatively you are a dedicated expert in this field, you will find everything you are looking for in this book. For two weeks, I couldn't put this thing down. And that's an extraordinary testimony to a book that's 800+ pages of technical detail. If you want a high level understanding of how speech processing works, or if you want to dig in and build your own speech engine, everything you need is right here.

5人のお客様がこれが役に立ったと考えています

レビューを日本語に翻訳する

John Conde

Five Stars

2015年3月10日にアメリカ合衆国でレビュー済み

Amazonで購入

Quick shipment, flawless textbook

レビューを日本語に翻訳する

レビューをすべて見る

Amazon Advertising 商品の露出でお客様の関心と反応を引き出す	Audible（オーディブル）「聴く」読書会員なら聴き放題	アマゾンウェブサービス（AWS）クラウドコンピューティングサービス	Amazonアウトレット訳あり商品をお手頃価格で販売

Amazonビジネス（法人購買）請求書払い法人価格・数量割引	AmazonGlobal 65か国/地域以上への海外配送がより簡単に	Shopbop 世界中の厳選されたファッションアイテム

著者をフォロー

Spoken Language Processing: A Guide to Theory, Algorithm and System Development ハードカバー – 2001/4/25

商品の説明

著者について

登録情報

著者について

Xuedong Huang

カスタマーレビュー

この商品をレビュー

上位レビュー、対象国： 日本

他の国からのトップレビュー

上位レビュー、対象国：日本