Speech Recognition Books


Books-Under-Review-->Computers-->Algorithms-->Speech Recognition-->7
Related Subjects:
More Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89
Speech Recognition Books sorted by Average customer review: high to low .

Speech Recognition
Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison
Published in Paperback by Center for the Study of Language and Inf (1999-12-01)
Author:
List price: $35.00
New price: $29.50
Used price: $23.00

Average review score:

Still a useful source of information
Helpful Votes: 18 out of 18 total.
Review Date: 2001-08-07
This book, originally published in 1983, was reissued in 1999, no doubt because of the importance of genetic sequencing in recent years. What is neat about the book is it shows how algorithms from one field can be applied to solve problems in another, possibly totally disparate field, one example being computational linguistics and sequence algorithms in computational biology.

A general overview of sequence comparison is given in chapter 1 with applications to molecular biology, human speech, computer science, coding theory, gas chromotography, and bird songs discussed. The author discusses how deletion-insertion, compression-expansion, and substitution are employed in sequence comparison. Different metrics are introduced, such as the Levenshtein distance. Dynamic programming, which pretty much dominates the book, is introduced here also.

Part 1 of the book discusses sequence comparison in molecular biology. The use of dynamic programming is emphasized and its importance continues to this day. The advantages of using the dynamic programming method are outlined, and it is shown how to find the substring in a longer sequence with most optimum agreement to a shorter sequence. In addition, given an RNA molecule with a known nucleotide sequence, methods are discussed for predicting the way different parts of the molecule will bond to each other. These methods are based on dynamic programming. Mathematicians considering doing research on or about entering the field will profit from the section on the biological background. The treatment of RNA secondary structures is excellent.

In part 2, the emphasis is on speech processing and what is called "time-warping", which is a technique for comparing functions by altering the time axis. An interesting application is given to the comparison of bird songs. An algorithm is given for adjusting the time scales for two songs to arrange them in the most optimal alignment. In addition, the differences between compression and expansion and deletion and insertion are discussed in this part.

In part 3, a modified Smith-Waterman algorithm is employed to find similar portions in two sequences. Called local alignment in computational biology, it is shown in detail how to define the recurrences for the alignment and how to keep track of the pointers for backtracking. This part also generalizes the operations of substitution and Levenshtein distance. In addition, the strategy of doing sequence comparison by allowing transpositions is discussed. Such a strategy entails a generalized concept of trace, wherein trace lines can intersect each other, leading to entangling of the traces into knots or plaids. The usual dynamic programming techniques must then be extended to deal with these complications. One particular algorithm for this is discussed, called CELLAR, which involves the construction of a directed graph whose paths correspond to admissible sequences of generalizations of traces, called cuts. The computational complexity of this algorithm is discussed. In addition, an O(n^2/logn) algorithm is given for computing string-edit distances.

The last part of the book deals with studying comparisons between random sequences. Combinatorial arguments are used to derive upper bounds on the expected length of the longest common subsequences of two random sequences. Other miscellaneous results dealing with comparing common subsequences of two random sequences are given.

Speech Recognition
Topic Detection and Tracking: Event-based Information Organization (The Kluwer International Series on Information Retrieval, Volume 12) (The Information Retrieval Series)
Published in Kindle Edition by Springer (2002-02-01)
Author:
List price: $175.00
New price: $132.25

Average review score:

Good book and very TECHNICAL too
Helpful Votes: 1 out of 1 total.
Review Date: 2006-06-19
I was introduced to James Allan when i start doing my little investigation for my research. I've just finished the first 4 chapters. Absolutely a good book for those who wants to know about TDT, event-based information organization and the techniques involved to detect an event from the news streams. The book will be really beneficial though for folks who's in IR too :) It's a compilation of technical papers and maybe the TDT webpage might help in order to get the rough idea on what is TDT. Indeed the technical part will be the algorithm, approach and technique.
Can't wait to read somemore.

Speech Recognition
Voice Recognition (Artech House Telecommunications Library)
Published in Hardcover by Artech House Publishers (1997-10-01)
Authors: Richard L. Klevans and Robert D. Rodman
List price: $128.00
New price: $54.90
Used price: $47.00

Average review score:

Voice Recognition theory and application
Helpful Votes: 0 out of 0 total.
Review Date: 2001-08-31
Klevans and Rodman did an excellent job at presenting the theory and application of voice recognition. The book goes into just the right amount of detail that a subject of this nature requires. Also, various models and techniques for dealing with speech recognition were explored, with the pros and cons of each carefully stated. It was certainly a pleasure to have read this book, and the new ideas/techniques that were presented, have influenced me to further my research in speech recognition.

Speech Recognition
The VoiceXML Handbook: Understanding and Building the Phone-Enabled Web
Published in Paperback by CMP Books (2001-04)
Author: Bob Edgar
List price: $39.95
New price: $19.99
Used price: $19.89

Average review score:

An Excellent Overview
Helpful Votes: 0 out of 0 total.
Review Date: 2001-08-05
I am director of user interface design at Audiopoint in Fairfax, Virginia -- a voice portal/voice technology company. I work mostly in human factors, not programming. This book is exactly what I've been looking for, because it gives, I feel, an excellent overview of the many kinds of systems, software, and hardware that are involved in the work I do. I agree with another reviewer, who said that the author clearly states that not all the examples will work in every case, for various reasons. The great plus for me is that, even though my training is not for the most part in technology, I could still understand the book. The author takes you, usually, from the very beginning, and gives you the big, simple picture, which is crucial to have fixed firmly in your mind. This book is making it much easier for me to understand our IT people and talk with techies, and visualize various products which I'd like to see our company launch. So I give this book top rating....

An indispensable "how to" reference
Helpful Votes: 1 out of 1 total.
Review Date: 2001-10-15
In The VoiceXML Handbook: Understanding And Building The Phone-Enabled Web, computer telephony expert Bob Edgar takes the reader through a step-by-step introduction through all the features of VoiceXML (including VoiceXML 2.0). Readers will learn about Graphical Web Browsing, HTML, and HTTP; Telecommunications; Computer Telephony; Voice Recognition and Text-to-Speech; and XML. Also provided are a VoiceXML Tutorial and instructions on using Voice Browsers to crated Phone-Enabled Web Sites. The VoiceXML Handbook is an indispensable "how to" reference for anyone who needs to enhance their website with telephony-enabled technology and ability.

Not so informative.
Helpful Votes: 1 out of 1 total.
Review Date: 2001-09-20
I can find more information on the internet on this subject than reading this book. In fact I turned to the internet while reading this book for answers to the questions this book failed to answer.

VoiceXML for very beginners
Helpful Votes: 4 out of 4 total.
Review Date: 2001-12-12
The book is a general overview of telephony application and a thin introduction to VoiceXML. It covers important matters in a very rapid and unprecise way. It contains even errors in the examples.

Not very informative
Helpful Votes: 5 out of 5 total.
Review Date: 2001-07-31
As a VoiceXML developer, I looked forward to this book. I was disappointed. Too much time was spent speculating on Version 2.0 and not enough time explaining Version 1.0. If you are looking to learn VoiceXML this is not the book.

Speech Recognition
Speech Coding: A Computer Laboratory Textbook
Published in Paperback by Wiley (1995-12-18)
Authors: Thomas P. Barnwell, Kambiz Nayebi, and Craig H. Richardson
List price:
New price: $8.00
Used price: $7.99

Average review score:

Good starting point for learning about speech coding
Helpful Votes: 0 out of 0 total.
Review Date: 2007-01-09
It is surprising to see such a large amount of useful and well-presented information in a tiny volume like this. Although intended as a supplemental book emphasizing computer experiments, much theory is included, even if presented in a cursory manner. There are exercises and projects included for hands-on experimentation with the algorithms that are quite helpful. Unfortunately, it is too bad that the software does not work as promised, as this does subtract noticeably from the total learning experience. Note that this book is 12 years old, so you're not getting the latest developments in the field, but it is a good starting point for learning about speech coding. I would say if you can get it at a used book price, it is probably still worth the investment. It makes a good companion to the more modern "Speech Coding Algorithms: Foundation and Evolution of Standardized Coders" by Chu. The following is the table of contents.

DSPLAB: The DSP Laboratory Software.
Quantization: PCM and APCM.

Waveform Coding with Fixed Prediction.

Pitch-excited Linear Predictive Vocoder.

Waveform Coding with Adaptive Prediction.

Analysis-by-Synthesis LPC.

Subband Coding.

Projects.

Appendices.

Bibliography.

Index.

I gave this book only three stars. However, if the software had worked correctly, I would have probably given it four or five stars.

Shame about the software ..
Helpful Votes: 0 out of 0 total.
Review Date: 2002-01-17
This is an excellent introduction to speech coding, but like the other reviewers, I failed to get the software to play out speech files (using soundblaster boards of various vintages - it always failed).

The text is very good and I would like to employ this in my teaching, but inability to hear the results is very frustrating.

Great book. Poorly executed software.
Helpful Votes: 1 out of 1 total.
Review Date: 1998-10-09
Another reviewer has commented on difficulty playing the sound files. I experienced the same problem. I found a work around using some third party software, but I expected better from Georgia Tech.

I expected trouble free software. I didn't get it. Your milage may vary.

An excellent learning source for Speech Processing
Helpful Votes: 2 out of 2 total.
Review Date: 1998-04-27
This book is written from a practical perspective and this makes it a pleasure to learn about speech coding. Practical exercises provided in the book are can be done with the software provided (2 disks) and this makes learning fun. As a beginer to DSP and espicially to speech processing, I have found most books very technical and difficult to follow without the guidance of a technical person. This book cuts through the technical jargon and gets to the heart of the matter. There are real audio files on which Digital speech processing can be performed, however i found that with my computer, i could not play/hear the sound files. The graphical user interface is straightforward and not difficult to use. I'm glad I bought this book.

Speech Recognition
Digital Speech: Coding for Low Bit Rate Communication Systems (Wiley Series in Communication and Distributed Systems)
Published in Paperback by John Wiley & Sons Inc (1995-02)
Author: A. M. Kondoz
List price: $245.00
New price: $26.00
Used price: $23.68

Average review score:

Useful as a reference
Helpful Votes: 1 out of 1 total.
Review Date: 2000-08-30
Not very theoretically robust; many issues are touched on the surface without detail explainations. Some topics are clearly not pragmatic but yet the author make a big deal about them (like his own published method of LSF quantization). Organizations and structures are not my favorite as well. However, the book does provide references to the sources where the readers can refer to for the necessary information.

Very useful
Helpful Votes: 2 out of 2 total.
Review Date: 1999-01-03
This is the best modern times book about the subject. However part of this qualification comes from the fact it has practically no competitors. Detailed books on speech coding are very rare. Although it deals with almost every aspect of speech coding, it does so with at times awkward notation, introduction of unexplained equation terms, using of non introduced terminology and at times formulas with obvious errors. I believe a second corrected edition should be necessary. Also in such a hypotetic second edition, I hope many explanations should be made clearer, otherwise for a beginner in the field supplemental references would be needed, and the price of this one is already high.

A perfect assistant for low bit rate speech coding engineers
Helpful Votes: 4 out of 4 total.
Review Date: 1998-06-19
This book covers subjects lpc parameter quantization, coding standards, algorithms (MPLPC, RELP, VSELP briefly and CELP, LD-CELP and MBE in a very detailed manner). Error Control in speech transmission and system application aspects like DTX and echo cancellors are also reviewed. For the ones working on CELP and MBE based coders, this book does not only supply all the needed theory, but also practical issues even for real time applications on DSP processors.

Speech Recognition
Voice Application Development with VoiceXML
Published in Paperback by Sams (2001-08-20)
Authors: Rick Beasley, Kenneth Michael Farley, John O'Reilly, Leon Squire, and Kenneth Farley
List price: $49.99
New price: $7.99
Used price: $2.45

Average review score:

Top notch
Helpful Votes: 5 out of 9 total.
Review Date: 2001-08-29
This is an outstanding book with useful information and clear explanations. It doesn't just cover the VoiceXML spec but also shows you how to use it in IVR-replacement systems and other real applications. If you really need to build stuff with voicexml then you need this book. You can tell in reading the book that these authors have *actually done* voice application development and aren't just guessing at how it ought to work. I'm looking forward to the online voicexml 2.0 appendix, too.

Not what I was looking for.
Helpful Votes: 9 out of 10 total.
Review Date: 2001-10-26
This book isn't a good book to use if you're looking to develop VoiceXML applications. This book delves too far into proper software engineering and doesn't give enough information about the VoiceXML standard (doesn't touch vxml 2.0). I wish I would have purchased the wrox version that covers voiceXML.

Speech Recognition
Dragon NaturallySpeaking for Dummies
Published in Paperback by For Dummies (1999-10-21)
Authors: David C. Kay and Doug Muder
List price: $29.99
New price: $19.87
Used price: $18.99

Average review score:

Dragon Speech Recognition software
Helpful Votes: 0 out of 1 total.
Review Date: 2008-06-16
This software is highly recommended by one of the leading Rehab Centers. My daughter, who is paralyzed from the neck down, has used it,with the proper equipment, to send E-Mail messages and to make telephone calls.

Pointless keeping in print
Helpful Votes: 10 out of 11 total.
Review Date: 2007-06-24
This book refers to version 4 of the Dragon Naturally Speaking program. The current version is number 9, and is very significantly different from version 4. I don't know why this book is being offered for sale. Do not buy it.

Perfectly Useless
Helpful Votes: 34 out of 35 total.
Review Date: 2006-08-03
This book is seven years old. It was written for version 4.0. We are now up to 9.0.

Avoid it.

Outdated
Helpful Votes: 4 out of 4 total.
Review Date: 2007-10-12
Great book if you have the very old version 4. Dragon naturally Speaking is up to version 9.5 and has changed drastically since this book was printed.

This book is out of date
Helpful Votes: 8 out of 8 total.
Review Date: 2007-04-09
This book used to be useful but is now eight years out of date.

To learn how to use the latest version of Dragon, check out the Dragon NaturallySpeaking 9 "Video Guide" and "QuickStart." They are CD-ROMs that play on your computer. I am the author of both of these titles. They are available on Amazon:

Dragon Naturally Speaking 9 Preferred QuickStart
Dragon NaturallySpeaking 9 Video Guide: Complete Three-Volume Set

Speech Recognition
Voice & Data Internetworking
Published in Paperback by McGraw-Hill Companies (1999-09-17)
Author: Gilbert Held
List price: $55.00
New price: $0.01
Used price: $0.01

Average review score:

His concept of IP was a nightmare
Helpful Votes: 0 out of 0 total.
Review Date: 2002-09-12
The author's background explanation of IP was a nightmare.

His insistance that UDP is a member of the TCP/IP protocol suite is such a skewed perspective that it made me wonder if he had any competence with IP at all and made me suspicious of the remainder of the material. If he held this viewpoint for some reason (historical, or otherwise and not simply complete ignorance of the subject matter) I somehow missed that.

The editing was atrocious, which didn't help at all.

The author provides his e-mail to give comments and feedback, which I tried to do when I found that I had so many notes and errata that I was having trouble writing them all down in the book's jacket. However the e-mail addresses given for the author are no longer working. (Perhaps wisely!)

The first 3 chapters were a garbled intro to IP.

The sections on speech production and digitization in Chapter 4 were very good. Chapter 5 on telephony was a good chapter. Chapter 6 included a nice simple explanation of SIP.

Chapter 7 was about VoIP equipment which is probably OBE.

Chapter 8 was a complete joke. Explaining ping and traceroute, which should probably have been covered in the beginning sections on IP, but which most likely was boiler plate from other publications by this author.

I got this book free from Cisco and it was worth every penny.

This book is not for you unless you're a high-school student
Helpful Votes: 0 out of 8 total.
Review Date: 2000-01-09
This book trys to use simple language to explain many complicated topics, and it fails to expain well. The author is knowledgable on telephone operations, but other than that, don't expect to learn anything from this book. I want to learn more about VOIP/ATM, H.323, CELP, but I feel very very disappointed after read through the whole book. If you're a design engineer, don't expect to learn angthing from it. If you're a high school student, or a household wife, this may be a good book.

Excellent book, covered a lot of key implementation areas
Helpful Votes: 1 out of 1 total.
Review Date: 1999-11-06
I was impressed that the author covered many key implementation issues throughout the book. This was really helpfull as I'm in the process of developing an integrated solution for my company.

Great book with many application notes
Helpful Votes: 1 out of 1 total.
Review Date: 1999-10-28
I found this book to be a most practical guide to implementing voice over an IP network. In the book the author shows over 20 techniques, many of which are not obvious till you think about them that shows how to make voice a reality. I especially appreciated the authors insight concerning reconfiguring a router's access list. Its obvious the author really knows networking and shows his real life examples through numerous examples in the book.

A very practical book
Helpful Votes: 6 out of 6 total.
Review Date: 2000-01-15
I normally do not write reviews but when I noted a person said this book was for a 'housewife' and gave it one star I thought something was amiss. I found this book to be very practical, with numerous examples that show 'how to do it.' I purchased other books on this topic and found them to be filled with standards literature from the ITU. Thus, I was perplexed how someone could rate this book with one star when its significantly better that the others that spend most pages repeating ITU specifications.

Speech Recognition
Computer Speech Technology (Artech House Signal Processing Library)
Published in Hardcover by Artech House Publishers (1999-02)
Author: Robert D. Rodman
List price: $33.00
New price: $10.00
Used price: $1.07

Average review score:

Not worth the money
Helpful Votes: 0 out of 0 total.
Review Date: 2002-02-27
I tried to use this book for a survey course on computer speech technology, but I couldn't rely on it for much. My students cursed me for making them waste their money on this book. It doesn't talk about the material at a sufficiently high level or with enough detail to satisfy even non-mathematical college students seeking a general overview. One wonders who the intended audience is? The preface says it is professionals in the field of speech (other than engineers), and I can testify it certainly doesn't work for students. The chapters that do go over some of the mathematical underpinnings of speech processing do it in the most unsuccessful of non-mathematical ways, which is to say equations are presented but are not properly explained for those who don't already know what they are about. There is really very little meat here to chew, on a subject that deserves more. Stick to the engineering books if you actually want to know anything about this subject. There is still no alternative but to learn all the math.

Computer Speech Technology - A great way to start
Helpful Votes: 5 out of 5 total.
Review Date: 2000-07-26
If you are not into FFT's and HMM's, but are interested in a slightly less mathematical explanation of Automatic Speech Recognition, this is for you. Interesting, and an easy read, watch out though, the next book you get will go math on you fast. This is a great book for non-engineers. You will eveltually out-grow its simple explanations - but that does not make a poor source. The book does not talk down to the novice - in the finiest tradtion of Carl Popper! - Well Done


Books-Under-Review-->Computers-->Algorithms-->Speech Recognition-->7
Related Subjects:
More Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89