Description
Probability and Statistics for Data Science: Math + R + Data covers "math stat"-distributions, expected value, estimation etc.-but takes the phrase "Data Science" in the title quite seriously:
* Real datasets are used extensively.
* All data analysis is supported by R coding.
* Includes many Data Science applications, such as PCA, mixture distributions, random graph models, Hidden Markov models, linear and logistic regression, and neural networks.
* Leads the student to think critically about the "how" and "why" of statistics, and to "see the big picture."
* Not "theorem/proof"-oriented, but concepts and models are stated in a mathematically precise manner.
Prerequisites are calculus, some matrix algebra, and some experience in programming.
Norman Matloff is a professor of computer science at the University of California, Davis, and was formerly a statistics professor there. He is on the editorial boards of the Journal of Statistical Software and The R Journal. His book Statistical Regression and Classification: From Linear Models to Machine Learning was the recipient of the Ziegel Award for the best book reviewed in Technometrics in 2017. He is a recipient of his university's Distinguished Teaching Award.
About the Author
Norman Matloff is a professor of computer science at the University of California, Davis, and was formerly a statistics professor there. He is on the editorial boards of the Journal of Statistical Software and The R Journal. His book Statistical Regression and Classification: From Linear Models to Machine Learning was the recipient of the Ziegel Award for the best book reviewed in Technometrics in 2017. He is a recipient of his university's Distinguished Teaching Award.
Reviews
"I quite like this book. I believe that the book describes itself quite well when it says: Mathematically correct yet highly intuitive...This book would be great for a class that one takes before one takes my statistical learning class. I often run into beginning graduate Data Science students whose background is not math (e.g., CS or Business) and they are not ready...The book fills an important niche, in that it provides a self-contained introduction to material that is useful for a higher-level statistical learning course. I think that it compares well with competing books, particularly in that it takes a more "Data Science" and "example driven" approach than more classical books."
~Randy Paffenroth, Worchester Polytechnic Institute
"This text by Matloff (Univ. of California, Davis) affords an excellent introduction to statistics for the data science student...Its examples are often drawn from data science applications such as hidden Markov models and remote sensing, to name a few... All the models and concepts are explained well in precise mathematical terms (not presented as formal proofs), to help students gain an intuitive understanding."
~CHOICE
Book Information
ISBN 9781138393295
Author Norman Matloff
Format Paperback
Page Count 412
Imprint CRC Press
Publisher Taylor & Francis Ltd
Weight(grams) 540g