Machine Learning Books


Books-Under-Review-->Computers-->Artificial Intelligence-->Machine Learning-->12
Related Subjects: Case-Based Reasoning Companies Mailing Lists Conferences Research Groups Software Datasets Publications
More Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153
Machine Learning Books sorted by Average customer review: high to low .

Machine Learning
Pattern Classification (2nd Edition)
Published in Hardcover by Wiley-Interscience (2000-10)
Authors: Richard O. Duda, Peter E. Hart, and David G. Stork
List price: $140.00
New price: $78.00
Used price: $50.00

Average review score:

Great product & service
Helpful Votes: 0 out of 4 total.
Review Date: 2007-09-21
This was my first purchase from amazon and I was totally impressed by the quality of the product and the service! I would buy again from the same seller and recommend others to do the same.

A Very Bad Sequel
Helpful Votes: 10 out of 12 total.
Review Date: 2007-03-08
I have now used this book 3 times for a class. While the 1st edition did a nice job of covering the material in its time, the additions to in the 2nd addition are a disaster. What the book has going for it is that it at least lists the necessary material for such a course in the table of contents. However, all the additional material is poorly explained at best. The problem sets are too few and the ones that are included are generally weak.

I have tried to use this book, but after constant student complaints and my own difficulty with the text, I have finally concluded that the problem lies with the text and not with the users.

I think an indicator of problems was the large number of errors in the first printing; large here is an understatement. Even in later additions, the 4th, the size of the errata is huge. I think this is indicative of the authors' attention to detail and seriousness in preparation. I have found similar errors and ambiguities in the associate Computer Manual.

The bottom line is that this book has seen its final appearance in our curriculum. I would use any other text, even an older one.

There is simply not enough room or time to point out all the problems with this text. Do yourself a favor if considering this text for a class. Don't bother.

Terrible Problems
Helpful Votes: 2 out of 2 total.
Review Date: 2008-04-09
I am not sure how this book gets consistently high marks. I am using this text for a graduate level course. While it does a decent job covering most of the topics, it has some glaring flaws.

For one the Homework Problems it provides are not really representative of what you're learning in the text. Almost all of the problems revolve around proofs, as opposed to using the concepts in practice. You can seemingly have a good grasp on the material, yet spend hours trying to solve each of the problems they provide for that particular section. My entire class has complained, and even my professor has admitted that even he isn't sure sometimes how they expect you to solve some of the problems.

Secondly, there are very few example problems demonstrated in the text, so the reader doesn't really get to see the concepts in action so to speak.

Also, there is a typo or error on almost every other page, sometimes even on important formulas.

Overall, I'd have to think there are better books out there. If this truly is "the best there is" as some reviewers claim, God help the field of Pattern Recognition.

excellent revision of a classical text on statistical pattern recognition
Helpful Votes: 20 out of 20 total.
Review Date: 2008-01-24
The 1973 book by Duda and Hart was a classic. It surveyed the literature on pattern classification and scene analysis and provided the practitioner with wonderful insight and exposition of the subject. In the intervening 28 years the field has exploded and there has been an enormous increase in technical approaches and applications.
With this in mind the authors and their new coauthor David Stork go about the task of providing a revision. True to the goals of the original the authors undertake to describe pattern recognition under a variety of topics and with several available methods to cover each topic. Important new areas are covered and old but now deemed less significant are dropped. Advances in statistical computing and computing in general also dictate the topics. So although the authors are the same and the title is almost the same (note that scene analysis is dropped from the title) it is more like an entirely new book on the subject rthan a revision of the old. For a revision, I would expect to see mostly the same chapters with the same titles and only a few new chapters along with expansion of old chapters.

Although I view this as a new book, that is not necessarily bad. In fact it may be viewed as a strength of the book. It maintains the style and clarity of the original that we all loved but represents the state-of-the-art in pattern recognition at the beginning of the 21st Century.

The original had some very nice pictures. I liked some of them so much that I used them with permission in the section on classification error rate estimation in my bootstrap book. This edition goes much further with beautiful graphics including many nice three-dimensional color pictures like the one on the cover page.

The standard classical material is covered in the first five chapters with new material included (e.g. the EM algorithm and hidden markov models in Chapter 3). Chapter 6 covers multilayer neural networks (a totally new area). Nonmetric methods including decision trees and the CART methodology are covered in Chapter 8. Each chapter has a large number of relevant references and many homework exercises and computer exercises.

Chapter 9 is "Algorithm-Independent Machine Learning" and it includes the wonderful "No Free Lunch" theorem (Theorem 9.1), a discussion of the minimum desciption length principle, overfitting issues and Occam's razor, bias - variance tradeoffs,resampling method for estimation and classifier evaluation, and ideas about combining classifiers.

Chapter 10 is on unsurpervised learning and clustering. In addition to the traditional techniques covered in the first edition the authors include the many advances in mixture models.

I was particularly interested in that part of Chapter 9. There is good coverage of the topics and they provide a number of good references. However, I was a bit disappointed with the cursory treatment of bootstrap estimation of classification accuracy (section 9.6.3 on pages 485 - 486). I particularly disagree with the simplistic statement "In practice, the high computational complexity of bootstrap estimation of classifier accuracy is rarely worth possible improvements in that estimate (Section 9.5.1)". On the other hand, the book is one of the first to cover the newer and also promising resampling approaches called "Bagging" and "Boosting" that these authors seem to favor.

Davison and Hinkley's bootstrap text is mentioned for its practical applications and guidance for bootstrapping. The authors overlook Shao and Tu which offers more in the way of guidance. Also my book provides some guidance for error rate estimation but is overlooked.

My book also illustrate the limitations of the bootstrap. Phil Good's book provides guidance and is mentioned by the authors. But his book is very superficial and overgeneralized with respect to guiding practitioners. For these reasons I held back my enthusiasm and only gave this text four stars.

Stick with the first edition
Helpful Votes: 5 out of 5 total.
Review Date: 2007-11-19
I used the first edition of this book in a class on pattern recognition back in 1998. That old first edition did a great job of explaining the different aspects of pattern recognition as they were generally taught when the first edition came out in 1969. However, over the next 30 years the field expanded enough that a second edition was required. I purchased it, expecting an expanded version that went over the details as well as the first edition, and boy was I wrong. This second edition just glosses over the details of modern pattern classification techniques and doesn't show sufficient examples or even motivation for you to "get it". It's almost like the entire book is an introduction. I'm accustomed to the first chapter of a technical book being an overview that doesn't tell you much, but not the entire book. The only thing the second edition has to offer are slicker illustrations. My advice is find a copy of the first edition. It is very well put together. If you need additional material on subjects the first edition doesn't cover well, then go find more modern books specifically on those subjects. You may spend more money but at least you'll learn something.

Machine Learning
Principles of Data Mining (Adaptive Computation and Machine Learning)
Published in Hardcover by The MIT Press (2001-08-01)
Authors: David J. Hand, Heikki Mannila, and Padhraic Smyth
List price: $65.00
New price: $42.96
Used price: $43.94

Average review score:

It shows me many examples
Helpful Votes: 1 out of 5 total.
Review Date: 2005-04-08
Even if it is bad as all the gentlemen said, I think at least it gives me many examples which are not mentioned in other books before.

Very, Very, Very Bad Book !
Helpful Votes: 12 out of 38 total.
Review Date: 2004-12-22
I am a professional in the field of data mining (over 10 years experience). I am always taking classes and reading on the subject. This book was a required text in a grad class I was taking in the evenings. I was excited because I had heard so many good things about it.

The book is indescribably bad.

It is bad if you want theory.
It is bad if you want practical advice.
It is just plain bad.

BAD! BAD! BAD!

Do yourself a favor and go out and get the books by Gordon Linoff et. al. (Mastering Data Mining and Data Mining Techniques). I believe that Amazon will sell you both for not much more money than if you buy this book. Either one of those books is better than this! (I recommend buying both, you won't be sorry!)

SAVE YOUR MONEY, AVOID THIS BOOK !!!

finally a good statistical and computer science perspective on data mining
Helpful Votes: 20 out of 20 total.
Review Date: 2008-01-23
This book is not an introductory text. Anyone interested in a particular topic should consult the preface of the text to find out what it is about. The negative reviewers were not fair to the authors on that score. Had they read the preface they would have found out (1) how the authors define data mining, (2) that they see it as a subject with an important mix of statistical methodology and computer science and (3) that it is intended as an advanced undergraduate or first year graduate text on the topic.
They also provide a very well organized structure for the text that is well described in the preface. It consists of three parts. Chapter 1 is an essential introduction that is informative to everyone. Chapters 2 through 4 go through basic statistical ideas that statisticians would be very familiar with and others could view as a refresher. The authors have experience teaching this course to engineering and science majors and have found that many of these students unfortunately do not have the prerequisite statistical inference ideas and need this material covered in the course.

Chapters 5 through 8 cover the components of data mining algorithms and the remaining chapters deal with the details of the tasks and algorithms.

The book features a further reading section at the end of each chapter that provides a very nice guide to the useful and most significant relevant literature. The author's have done a very good job at this. One mistake I found was a reference to Miller (1980). I think this was intended to be a reference to the seocnd edition fo Rupert Miller's text "Simultaneous Statistical Inference" which was published in 1981 by Springer-Verlag but the full citation is missing from the list of references in the back of the book.

This book deserves 5 stars because it does what it intends to do. It presents the field of data mining in a clear way covering topics on classfication and kernel methods expertly. David Hand has published a great deal on these techniques including many fine books.

Mannila and Smyth bring to the text the computer science perspective. There is much useful material on optimization methods and computational complexity.

Statistical modeling and issues of the "curse of dimensionality" and the "overfitting problem" are key issues that this text emphasizes and expertly addresses.

The only thing the text misses is details on specific algorithms. But I do not grade them down for that because it was not their intention. They emphasize methodology and issues and that is the most critical thing a practitioner needs to know first before embarking on his own attack at mining data.

The text does provide most of the current important methods. Although Vapnik's work is mentioned and his two books are referenced there is very little discussion of support vector machines and the use of Vapnik-Chervonenkis classes and dimension in data mining. The new book by Hastie, Tibshirani and Friedman goes into much greater detail on specific algorithms include some only briefly discussed in this text (e.g. support vector machines). The support vector approach is also nicely treated in "Learning with Kernels" by Scholkopf and Smola.

I highly recommend this book for anyone interested in data mining. It is a great reference source and an eloquent text to remind you of the pitfalls of thoughtless mining or "data-dredging". It also has many nice practical examples and some interesting success stories on the application of data mining to specific problems.

make sure you are right audience
Helpful Votes: 4 out of 4 total.
Review Date: 2005-12-02
It's not that this is a bad book, but you have to make sure you are right audience. The book offers very high-level overviews on various techniques of data mining, but it is almost impossible to learn how to really implement them. Since there are no exercises after each chapter you probably already know who the target audience of the book are.

Good book for overall breadth of alogrithms..
Helpful Votes: 7 out of 9 total.
Review Date: 2004-08-15
Very good book for a general overview of data mining algorithms. Covers a wide variety of DM approaches.. however lacks concrete examples to clear concepts thoroughly. I especially liked Chapter-5 which gives a general framework to look at any DM algorithm. This clears confusion created by so many diverse algorithms with overlapping concepts and applications.

Machine Learning
Bioinformatics (Adaptive Computation and Machine Learning)
Published in Hardcover by The MIT Press (1998-02-13)
Authors: Pierre Baldi and Søren Brunak
List price: $52.95
New price: $23.00
Used price: $5.00

Average review score:

Could have been a great one.
Helpful Votes: 12 out of 12 total.
Review Date: 2003-12-13
This book is decidedly a mix: some very good information, combined with some very puzzling omissions and uneven editing.

First, the good. The description of stochastic context free grammars is the best I've seen. I don't know any other reference that even hint at how to use generative grammars to evaluate likelihoods. Once they caught my interest, though, the authors did not carry through with training and evaluation algorithms I could really use. I suspect that parts of the information are there, but I'll have to go back over their opaque notation again to work out just what they've given and just what's been left out.

This same pattern - an interesting introduction with missing or mysterious development - recurs throughout the book. The discussion on clustering and phylogeny goes the same way: a number of techniques are mentioned but not developed. The authors mention a tree drawing problem, not just building the tree's topology, but ordering the branches for the most informative rendering. Again, a critical topic and one that most authors miss - in the end, these authors miss it, too, by mentioning but not filling in the idea.

Their discussion of neural nets suffers badly from the authors' partial presentation. Evaluation of network output for a given input is relatively straightforward, and they present it in some detail. Training the net is the real problem, though, and is given less than a page.

Baldi and Brunak give more of the fundamentals than most authors. For example, they explain the maximum entropy principle well enough that I'll use it in lots of other areas. They give some coverage to topics of intermediate complexity, such as the forward and backward algorithms for HMM training. Finally, they fizzle out at the higher levels of complexity - the Baum-Welch algorithm could have followed from the forward and backward methods, but is left as a reference to another book.

There is some good here, especially in the fundamentals behind important techniques. The discussions I wanted - the more avanced topics, in forms I can use - are often weak, missing, or impenetrable. Just a bit more work, clearly within the authors' capability, would have made this a landmark reference.

An excellent book.
Helpful Votes: 2 out of 10 total.
Review Date: 2001-10-23
Very well written, clear, and self-contained. The authors provide a masterly treatment of machine learning methods (neural networks, hidden markov models, etc.) and their applications to fundamental problems in sequence analyis and biology. The book goes all the way from first principles to advanced research topics and should be valuable for both students and researchers. Second edition has many new topics, including DNA microarrays. Requires some concentration but mathematical details are summarized in the appendices. I strongly recommend it for anyone with an interest in bioinformatics and/or machine learning.

A very bad book. A colection of references w/o explanations
Helpful Votes: 31 out of 35 total.
Review Date: 2001-09-19
I just bought this book and am COMPLETEly disappointed with it.
Here is why. The book is badly written, hard to read and follow. Although it is said that this is a book is for " many readers", it is really for those who have already known all the algorithms. It is simply impossible to learn the algorithms from this book. The chapter on neural network is a few pages. It provieds a few equations for backpropagation. That is it! It is pretty much true for every thing else. Equations, hard to understand sentences, abbreviations with no explnantions, tons of citations everywhere. A book should strive to explain, and not to cite what other papers and go look there all the time. I suspect the few good reviews here are from the authors themselves.

I have a good programming background. I also read some papers on neural network and hidden markov models, This book is a lot worse than anything I have read in explaining the stuff. Very disappointed. Save your money and get something else.

the worst book I have ever read
Helpful Votes: 4 out of 4 total.
Review Date: 2005-11-06
Just a collection of formulae, in an unclear way. Once we tried to use it in our seminar of bioinformatics, but after a few chapters we had to give it up for its bad writing. I could not find any reason to buy it or read it.

Terrible
Helpful Votes: 5 out of 5 total.
Review Date: 2006-03-16
I'm a graduate student, reading a lot of bioinformatics materials. This is by far the worst text I've read on the subject. Poorly explained, poorly edited. Poor.

Machine Learning
Kernel Methods for Pattern Analysis
Published in Hardcover by Cambridge University Press (2004-06-28)
Authors: John Shawe-Taylor and Nello Cristianini
List price: $88.00
New price: $70.40
Used price: $66.98

Average review score:

Quite Useful
Helpful Votes: 0 out of 0 total.
Review Date: 2008-07-13
A very useful book and quite a nice read. I bought the book after reading a few chapters. Even now, an year after my grad school, I still read this. A good reference.

Nice print, no mistakes, MATLAB code. You get everything on Kernel Methods, from theory to implementation. A perfect book and helped me a lot in my research.

Nice introduction, but no more
Helpful Votes: 12 out of 17 total.
Review Date: 2006-08-11
Well, at first I was petrified to find a book that sounded like it deeply explores the subject of kernel methods. But all in all, it did not quite achieve what I hoped for. As a practical approach, when it comes to implementation, it serves nicely as a reference. The deeper mathematical roots of kernels (especially when it comes to measure theory and functional analysis) are not dealt with at all or just scratched at the very surface. The notation is sometimes awkward, mentioning for example the representation of an object in a given vector space with respect to the basis. And: Too much copied and pasted from the former book about SVMs. Basically, reading papers of Carmeli, Aronszajn and others will give you a much deeper insight into the subject.

A Useful Reference on Kernel Methods
Helpful Votes: 23 out of 27 total.
Review Date: 2005-02-21
The book is divided into 3 parts. The theory is all in part I,
the rest of the book is a cook-book with plenty of matlab code.
The website contains most of the same code + data online. Readable, complete.

coherent and accessible reference, ready-to-use algorithms
Helpful Votes: 26 out of 29 total.
Review Date: 2005-02-22
This work presents a coherent overview of an important field in machine learning. The unifying framework of kernel methods has proven state of the art results and the community has been waiting for a book like this to make both theory and practice of kernel methods accesssible for readers of all different backgrounds (researchers, students, practioners from both academia and industry, ...).

It is theoretically well-founded, the resulting algorithms are well-explained and made accessible for practioners by providing pseudo-code and online, ready-to-use matlab code.

This book nicely complements the previous, yellow book, written by the same authors. Indeed, after "getting into the field" by reading the accessible introduction to support vector machines (SVMs), it was clear to me that SVMs was only an example of a signifcantly larger framework, i.e., kernel methods. The blue book is the reference book about that larger framework I have been waiting for since then. I particularly like the way the book is set up, making clear the modular, flexible approach in kernel methods.

Sloppy
Helpful Votes: 3 out of 11 total.
Review Date: 2006-11-29
Sloppy language, sloppy definitions, sloppy proofs.
Constant repetitions do not add any clarity either.

Machine Learning
Bayesian Artificial Intelligence (Chapman & Hall/Crc Computer Science and Data Analysis)
Published in Hardcover by Chapman & Hall/CRC (2003-09-25)
Authors: Kevin B. Korb and Ann E. Nicholson
List price: $89.95
New price: $62.60
Used price: $62.60

Average review score:

Very good introduction in causal Modeling
Helpful Votes: 1 out of 4 total.
Review Date: 2006-03-09
The book by Korb and Nicholson is very readable and structured. Starting with some background information in statistics it comes directly to the major topic of the book - bayesian networks. The theory thereof is nicely evolved and applied to small examples to demonstrate its usage. Each chapter finishes with a short summary and bibliographical notes for further readings.

In my opinion this book is well written and the chosen examples are insightful. What I do not like is part three of the book which is devoted to case studies and praktical examples. If this space had been used for the first two parts by providing more details, e.g., for the discussion of path models (which is given but only short), this book could be even great on a more advanced level. In this form it is very good as an introduction in Bayesian Networks and related topics like the larger class of causal models.

Excellent Introductory Text
Helpful Votes: 16 out of 17 total.
Review Date: 2004-12-16
It is difficult to assess a review without understanding the biases of the reviewer. I fall under the category of researcher/practitioner when it comes to reasoning with graphical models. I am familiar with and make use of several books and papers on this topic in my work. Of the set of standard references (Pearl, Jensen, Neapolitan, Jordan, Cowell et al., Borgelt & Kruse) the text by Korb and Nicholson (K&N) stands out in terms of its clarity and accessibility. Does the book have everything one would ever want to know about Bayesian inference? Not by a long shot. Is it, however, a good place to start? Definitely. The basic concepts are presented relatively completely and with clarity. I consistently recommend K&N over other alternatives to colleagues new to the field. Is there a chasm separating concept and algorithm in the book? I don't think there is, especially relative to other references. With tools such as Kevin Murphy's BNT, or Netica available on the Web, it seems to me that providing a solid conceptual framework becomes paramount for a textbook such as this. I believe K&N succeed admirably in this sense. Why four stars and not five? Even for an introductory text such as K&N, it would be nice to have more development of some concepts such as causality, context specific independence, or loss of independence in dynamic nets. Although it won't be your last book on reasoning with graphical models, K&N should probably be your first.

Bayesian Networks for Undergrads and Practicioners
Helpful Votes: 19 out of 20 total.
Review Date: 2004-01-12
Despite its name "Bayesian Artificial Intelligence" covers Bayesian network (BN) techniques only. Other Bayesian techniques useful for AI are not treated.
The content is divided in three main sections: (1) The basics of probabilistic reasoning with BNs, (2) Causal discovery (finding BNs from data), and (3) "Knowledge engineering".
The first part covers the fundamental concepts and algorithms around BNs and (simple) decision networks. It is well-written and clear, but readers who are not totally new to the field might find only little new information (e.g., loopy belief propagation, continuous densities, large decision networks, etc. are not covered).
The second part is on how to deduce causal relationships from observational data. Constrained-based and Bayesian approaches are covered, but on a rather general level. I am not sure how easy it is to implement the algorithms from the descriptions provided. When it comes to details of the algorithms, proofs, or mathematical background the authors very often refer to the literature due to "lack of space". From a practical standpoint, it is unfortunate that the different methods are compared to each other only superfiscially. For instance, one method presented performs a large number of statistical tests; one would expect that this requires large amounts of data in order to avoid false positive results. Is this a problem? With questions like these the reader is often left alone.
I am not competent to talk about part three (knowledge engineering), so I end with my general impression of the book: I would have appreciated if the authors had treated some the algorithms in greater detail and had spent a few pages on advanced concepts and current research directions. On the other hand, some information provided could have easily been left out. (For instance, how to download and install certain software packages from the internet, Kevin Murphy's well-known survey on BN software packages, screenshots of user dialogs, etc. just eat pages. Providing the URLs to the corresponding sites on the internet is completely sufficient, and the information there is more likely to be up-to-date.) The saved pages could then be spent on information which is not readily available elsewhere.
To summarize: The book provides a mostly well-written general overview of the basic concepts and could serve as a first introduction to the field. However, it leaves the reader often alone when it comes to the mathematical background, potential practical pittfalls, or advanced algorithms.

Machine Learning
Efficient and Accurate Parallel Genetic Algorithms (Genetic Algorithms and Evolutionary Computation 1) (Genetic Algorithms and Evolutionary Computation)
Published in Kindle Edition by Springer (2000-12-15)
Author: Erick Cantu-Paz
List price: $189.00
New price: $151.20

Average review score:

Misleading title
Helpful Votes: 1 out of 5 total.
Review Date: 2003-05-29
One would have thought from the title of the book that it would contain "... Parallel Genetic Algorithms", however it does not. The book covers statistical analyses of algorithms. The reader is meant to find other references that contain algorithms. It is expensive for such a small book. The author talks 'around' the algorithms, without going into the necessary details to write your own parallel genetic algorithms. There are many plots of evolution, so the author must of had access to the algorithms. This book could have been so much more valuable if it contained the algorithms, rather than just talk about them.

Still the best analysis around.
Helpful Votes: 2 out of 2 total.
Review Date: 2004-06-23
I think the previous reviewer got it wrong: It is very clear from the editorial introduction and from the table of contents that this book is about the ANALYSIS of parallel genetic algorithms, not about their implementation. The problem here is that implementing these algorithms is relatively easy, but configuring them is very complicated because there are many parameters. The work of Cantu-Paz---excellently summarized in this little book---is the best analysis available for the different types of parallel genetic algorithms.

I recommend this book wholeheartedly.

Thorough review and new results
Helpful Votes: 6 out of 7 total.
Review Date: 2001-07-28
Genetic algorithms are easy to parallelize, but they are difficult to control. In a very concise manner, this book presents some theoretical results derived by the author that show how to make parallel genetic algoritms work for many problems and different architectures.

The book has a lot of new theory that is easy to follow and gives recommendations to make parallel genetic algorithms work well in many circumstances. Although the theory makes many simplyfying assumptions, the examples in the book demonstrate that the models are very accurate and the recommendations made in the book seem very reasonable.

Machine Learning
Artificial Intelligence: Opposing Viewpoints (Great Mysteries Series)
Published in Library Binding by Greenhaven Press (1990-10)
Author: Erik Belgum
List price: $18.96
Used price: $0.01

Average review score:

Too outdated
Helpful Votes: 0 out of 0 total.
Review Date: 2000-12-31
This book has an excellent overview of the history of artificial intelligence (AI), but unfortunately it is too outdated now to be of real use to students. There is a short history of AI , and the development of computers, and an explanation of the turing test that is used as a test of AI. Although this book is touted as part of a series on opposing viewpoints, there is inadequate information on the pros and cons of AI. A few charts should have been added to clarify the key points especially since it is a children's book. You will learn some things by reading this book, such as the early history of computers, but it will leave you wondering what has happened in this ever changing field in the past 10 years.

Pretty good...
Helpful Votes: 0 out of 1 total.
Review Date: 2000-04-12
This book is a well written introduction to the field of AI. Although it lacks some detail, such book would be ideal for a late grade school or middle school student doing a report on AI. Athough this book is short, it manages to cram a lot of very important information into such a small space.

Machine Learning
Views into the Chinese Room: New Essays on Searle and Artificial Intelligence
Published in Paperback by Oxford University Press, USA (2002-09-26)
Author:
List price: $43.95
New price: $40.86
Used price: $29.95

Average review score:

Trick philosophy
Helpful Votes: 1 out of 22 total.
Review Date: 2004-07-13
The human brain evolved to assist the survival of its owner while the owner navigated the dangerous jungles and forests of ancient times. Its ability to extract patterns from the information provided by the retina and optic nerve is quite phenomenal. The process by which your brain is recognizing my words and understanding my meaning is astounding.

Yet if you are asked to act like a computer by reading numbers, moving paper tape, erasing things and following instructions given on the paper tape, you will prove to be one of the slowest computers in the world. The original word `computer' referred to a man sitting in a room with paper, pencil and eraser. These human `computers' were replaced by machines a long time ago because they are too slow.

In summary, humans are fast and intelligent at being humans but slow at being computers. In the Chinese Room Argument, John Searle states that although we have a human mind which could otherwise be used to understand Chinese, this particular human mind does not in fact understand it. Given this stipulation, the human mind's ability to process language cannot be used and the only method of "understanding Chinese" is left to the "Chinese room" which consists of a computer run by the very slowest of CPUs, the human being sans abacus, sans calculator, sans silicon chips and sans hope.

The Chinese Room Argument is a trick argument that proves nothing. The computer room is so slow that it cannot ever think or understand Chinese. On the other hand, this doesn't say anything about whether a high-speed computer with the memory and processing power of the human brain might one day speak and understand Chinese quite well.

Ignore the previous comments on "trick philosophy"
Helpful Votes: 10 out of 10 total.
Review Date: 2005-04-24
The Chinese Room Argument (CRA) has nothing to do with the speed of computers or any future developments in artifical intelligence (at least as understood as following from Turing). The CRA is a purely formal argument intended to refute the claim that computers (defined as Turing machines) can think, or can understand, or are minds solely by virtue of their formal description. (This claim is the essence of "computationalism," after Turing's original formulation.) The CRA is that: 1) Syntax is not semantics. 2) The implemented synatactical or formal program of a computer is not sufficient to generate semantics. 3) Minds have semantics. 4) Therefore, computers (so defined) are not minds/cannot think/do not understand because they are not sufficient to generate semantics.

For example, the concepts we employ to think and the words we use to speak have meanings. But there is nothing in computationalism as syntax that has any meaning whatsoever. Whatever meaning an implemented formal program has results from its being programmed or interpreted by us. Syntax (e.g., a computer program) has no causal powers. Whatever causal powers computers have (e.g., to fly airplanes) results from our programming and our assigning interpretations to the electrical charge insides a chip, not from the program in itself.

The chapters in Views Into the Chinese Room attack different aspects of the CRA. But they address it as an argument that stands or falls on the truth of the premises and the validity of the inference, not on engineering questions such as the speed of computers, which are irrelevant. Searle believes that there are, in fact, thinking machines -- we human beings are biological machines that think. And he believes that there also could be artificially made machines that think. The CRA is meant to show only that an implemented computer program by itself cannot generate mental content or semantic content.

For a clear explanation of the CRA, see chapter 15 of this book, by Stevan Harnad, the editor of The Behavioral and Brain Sciences, where Searle's original paper appeared twenty years ago. Do not rely on reviewers who do not understand the argument in the first place.

Machine Learning
Bayesian Networks and Decision Graphs (Information Science and Statistics)
Published in Hardcover by Springer (2002-05-31)
Author: Finn V. Jensen
List price: $84.95
Used price: $49.00

Average review score:

Good Book
Helpful Votes: 0 out of 0 total.
Review Date: 2006-03-01
For an introduction to the subject, this book is unequivocal in my experience with the literature. Great read that has propelled me forward into combining a bayesian network with a physical model to approach a very complex sediment transport problem.

A very good introduction to Bayesian networks
Helpful Votes: 14 out of 14 total.
Review Date: 2003-06-14
I am very pleased to have found a book that gives a modern, sound, and self-contained introduction to Bayesian networks. The only prerequisite is basic knowledge of probability. This makes sense because a Bayesian network is essentially a directed graph whose vertex set is a collection of random variables, while an edge from one variable X to another variable Y represents a belief that X has a causative effect on Y. For example, X could be the pregnancy status of a cow, while Y could be a blood test administered to the cow. Vertex Y would contain a contingency table that reflects the conditional probability of Y in terms of X. The author does well in explaining this, as well as adequately treating many of the practical issues surrounding Bayesian networks, such as design issues, network learing and tuning, and some basic algorithms (e.g. bucket elimination and junction trees) that aid in the efficient updating of variable probabilities due to new evidence that may instantiate or change the distribution of one or more variables.
The author also provides a good introduction to decision graphs, a close relative of Bayesian networks.

The aspect of Bayesian networks that I find most attractive is the fact that there is a "rational" way of designing a network, based on hypothesis, informational, and mediating variables, and their "causal" relationships. Unlike neural networks in which one is almost forced to guess the appropriate structure of the network, every node in a Bayesian network correpsonds with a state or quantity that can be measured either directly or indirectly through other variables. Thus, changes in a system model should only induce local changes in a Bayesian network, where as system changes might require the design and training of an entirely new neural network.

Another aspect of Bayesian networks that I find very compelling is the way in which they seem quite amendable to learning and the presentation of new evidence. This is true since knowledge updating is done locally (through variables), while the effects of those changes are witnessed globally through appropriate belief-updating algorithms.

On the downside, it should be noted that the operation of belief-updating is in general NP-hard, thus there exists a valid concern about the computational efficiency of Bayesian networks. Contrast this with the fact that once a nueral network has been trained, it is quite easy to compute. One would hope that these concerns will subside with more research, for the above mentioned benefits of Bayesian networks leads me to believe that these networks will have quite an influence on the future directions of machine learning.

Although this book will not go down in history as the definitive reference for Bayesian networks, it serves as a good conduit for explaining this quite interesting area of learning at a time when such few complete and modern references exist.

Not worth the money
Helpful Votes: 3 out of 11 total.
Review Date: 2002-12-31
Chapter 1 is a nice introduction to probability. Chapter 2 is readable. Chapter 3 is poorly presented, and you feel sad for having wasted so much money on a book with only one intelligible chapter.

Accessible introduction to Bayesian Networks
Helpful Votes: 33 out of 33 total.
Review Date: 2003-01-21
Among currently available introduction to Bayesian networks (also known as Bayes Net, Bayesian Belief Nets), this book is probably one of the most accessible. The book is divided into part I and II. Part I is intended for BN users (practitioners) and Part II more towards BN developers and researchers, as it contains algorithmic introduction of BN.

Prerequisites of the book as stated in the preface include Graph Theory and Calculus, both at introductory level. I personally did not have exposure to Graph theory, but I was able to understand most of the material without any help. Necessary probability theory is developed, but basic probability knowledge is also a prerequisite to digest the material to a reader without prior exposure of Probability as it shapes the core of the material in the book.

The strength of this text is in Part I where the author provides several examples to illustrate use of Bayesian Networks, Influence Diagrams and other models. I find it useful Influence Diagram as an extension of Bayesian Networks.

Most answers to Exercises at the end of each chapter are provided at the author's homepage, except answers of the last chapter. Answers that require graphical modeling software are also provided in Hugin format. (Hugin Lite can be downloaded from Hugin site.)

The downsides are that writing of the text is somewhat awkward, obscuring readers from understanding, that model building chapter could have been discussed more thoroughly, that material in Learning is barely present, and that definitions are sometimes not introduced upon the first encounter but they appear later in chapters. More different and complex examples could have been discussed to illustrate the material. Note: the author provides a page for Learning at his homepage.

Although this is an introduction to Bayesian Networks and Influence Diagrams, a reader should be equipped with some level of abstract thinking in order to digest the material.

This book is suitable for self-study. It has motivations for the uninitiated. References are provided at the end of the book and I was able to find some of them online. A notable is "A tutorial on Learning with Bayesian Networks" by Heckerman, to fill in the part of Learning in this book.

Other books at this level from users' perspective are:
Edwards, Introduction to Graphical Modeling (Utilizes software MIM.)
Clemen, et al., Making Hard Decisions (Uses Palisade Decision Tools suite. The book discusses Influence Diagrams but not Bayesian Networks.)

Further studies after completion of this book include:
Cowell, et al., Probabilistic Networks and Expert Systems
Lauritzen, Graphical Models
Pearl, Probabilistic Reasoning in Intelligent Systems
Pearl, Causality

A lot about very little
Helpful Votes: 5 out of 16 total.
Review Date: 2003-05-06
The book covers many topics, but doesn't really cover them well. I would not recommend this book. I have learned litte from it.

Machine Learning
Machines (Make It Work!)
Published in School & Library Binding by Thomson Learning (1994-07)
Authors: Wendy Baker and Andrew Haslam
List price: $15.95
New price: $41.61
Used price: $1.10

Average review score:

Too frustrating!!!
Helpful Votes: 1 out of 1 total.
Review Date: 2007-09-07
I homeschool and bought this book with great enthusiasm. All three projects we attempted were a complete bust. We spent hours collecting all the parts needed, then cutting and preparing. The directions do not go into enough detail and you don't realize, until you have to start over, that something was left out or assumed that you understood. After several attempts and finally getting the projects built, they wouldn't work anyway. My son was off building his own parachute(which worked great) while I was trying to get anything I could find in my kitchen to go up the Archimedes screw!!There must be better resources out there than this completely frustrating book.

What a great book! It should be back in print!
Helpful Votes: 6 out of 7 total.
Review Date: 2001-10-24
My son has checked this book out of the school library so many times he has it memorized! Great fun at home -- inexpensive science projects that really work and are fun to do --- even for Mom! The Rocket created from a 2 liter soft drink bottle and powered by water and a bicycle pump is great. Other favorites include the Archimedes' Screw that moves popcorn and the Rubber Band Powered Boat. Bright, colorful, lots of pictures, easy to understand and easy to build --even for the science challenged!


Books-Under-Review-->Computers-->Artificial Intelligence-->Machine Learning-->12
Related Subjects: Case-Based Reasoning Companies Mailing Lists Conferences Research Groups Software Datasets Publications
More Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153