Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Statistical and machine-learning data mining : techniques for better predictive modeling and analysis of big data (3rd ed.)
Ratner B., CRC Press, Inc., Boca Raton, FL, 2017. 696 pp. Type: Book (978-1-498797-60-3)
Date Reviewed: Apr 12 2019

There are numerous books about statistics, from short recipe collections to larger theoretical works. There are texts devoted to data science and data mining, the newest expressions of data analysis, which focus on computational techniques. Ratner addresses the recent growth of data science and data mining in his new chapter 2, and ends up merging those terms back into the parent discipline of statistics. Having dealt with terminology, he goes on to present and analyze the statistical practice and problems of today, while managing to be both personable and entertaining.

The text weighs in at 44 chapters and roughly 650 pages. I’m not going to list and describe every chapter; you can find the table of contents for both the second edition and this edition using Amazon’s “Look Inside” program.

While the first edition was a bestseller, reviewers of the second edition commented that it seemed disjointed and suffered from a lack of algorithms. Ratner added 13 new chapters to produce this third edition, and states in the preface his goals: extend the core material; improve the writing and continuity; and provide his statistical subroutines, now available for download. This edition should therefore be considered both a corrected and extended version of the second edition.

In addition to chapter 2’s examination of terminology, other added chapters delve into specialized applications of data mining, like market segmentation and dealing with missing data. Also included are chapters on “Art, Science, Numbers, and Poetry” and “Opening the Dataset: A Twelve-Step Program for Dataholics” and a primer on text mining. The author even includes a chapter of his favorite statistical subroutines (written for SAS, but understandable enough to translate to other formats), as well as links to more code and examples.

The text is obviously a labor of love from a dedicated statistician. It may not be organized properly for a class textbook--the author assumes statistical knowledge, prefers to investigate difficult applications without accepted solutions, and has no interest in delivering a systematic understanding of statistics--but teachers would definitely find excerpts to cite. Given the casual writing style, the nonacademic tone, and the author’s desire to explore unsolved problems of his own choosing, readers will either love it or hate it. All practitioners should take a look; it would be a shame to miss a potential favorite.

Reviewer:  Bayard Kohlhepp Review #: CR146529 (1907-0264)
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
Data Mining (H.2.8 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Data Mining": Date
Feature selection and effective classifiers
Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article
May 1 1999
Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article
Jul 1 1998
Predictive data mining
Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)
Feb 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy