Week 4: Probability and Statistics in Data Science

 


In Chapter 5 of the Data Toolkit, I read about Probability and Statistics for data science. These are fundamental concepts that are a massive part of data science. These concepts help data scientists understand and interpret data, make predictions and evaluate performance. 

Descriptive statistics are numerical and have value. Examples of these are mean, median, mode, range, variance and standard deviation. 

Probability distributions describe the likelihood of different outcomes for a variable. Some of these include the gaussian, binomial, and poison distributions. An example of this would be describing a natural  phenomena, such as someone's height.


Comments

Popular posts from this blog

Week 5 - BALT 4363 - Probability and Statistics for Data Science

Week 2: BALT 4396 - Cleaning Data with Python Libraries

Week 6 - Coding through Python and AI