Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python, Edition 2

· "O'Reilly Media, Inc."
4.5 (8 reviews)
Ebook, 368 pages

About this ebook

Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what’s important and what’s not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you’re familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you’ll learn:

  • Why exploratory data analysis is a key preliminary step in data science
  • How random sampling can reduce bias and yield a higher-quality dataset, even with big data
  • How the principles of experimental design yield definitive answers to questions
  • How to use regression to estimate outcomes and detect anomalies
  • Key classification techniques for predicting which categories a record belongs to
  • Statistical machine learning methods that "learn" from data
  • Unsupervised learning methods for extracting meaning from unlabeled data

Ratings and reviews

ANALIZA ADENA ORIO (IZAY ORIO)
February 11, 2023
THE PROTOCOL IS WHAT WILL KILL MY GMAIL. SKIN CANCER CAUSED BY THE SATELLITE IS WHAT WILL KILL ME. PROPOFOL IS WHAT KILLED MICHAEL JACKSON, BECAUSE OF MONEY. THERE IS NO DIFFERENCE BETWEEN THE TWO OF US. INVOLVED: MONEY, SOCIETY, GOV., MEDICINE
Nilkanth Raval
September 21, 2020
The print version is available at 1400/-. The price of the book is very, very high.
3 people found this review helpful
Anil Das
June 14, 2021
AÀA BOSS NETWORK
2 people found this review helpful

About the author

Peter Bruce is the Founder and Chief Academic Officer of the Institute for Statistics Education at Statistics.com, which offers about 80 courses in statistics and analytics, roughly half of which are aimed at data scientists. He has authored or co-authored several books in statistics and analytics. He earned his bachelor's degree at Princeton and master's degrees at Harvard and the University of Maryland.

Andrew Bruce, Principal Research Scientist at Amazon, has over 30 years of experience in statistics and data science in academia, government, and business. The co-author of Applied Wavelet Analysis with S-PLUS, he earned his bachelor's degree at Princeton and his PhD in statistics at the University of Washington.

Peter Gedeck, Senior Data Scientist at Collaborative Drug Discovery, specializes in the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Co-author of Data Mining for Business Analytics, he earned PhDs in chemistry from the University of Erlangen-Nürnberg in Germany and in mathematics from Fernuniversität Hagen, Germany.
