AveragingIn many ways, averaging is the most important application of probability. People think in terms of averages. "What was the average on the midterm", "his batting average is .302". But "Bob is an average Joe" is not a place we want to go. There are many different types of averaging. Mode and median are two kinds you may have heard of, but here we'll concentrate on taken the arithmetic mean. For the moment, we'll use average and mean synonymously to denote summing up the data points and dividing by the total number. The distinction between different kinds of averaging will become crucial when we discuss statistics.
So let's talk about how this works. Bob could average the
blades of grass that he measured. You just add up all the data for their
lengths and divide by the number of data points. In more mathematical notation,
if you have
This notation
If you performed this sort of average on Bob's data
you'd get something fairly close, but not exactly This leads to a subtle but important distinction between this kind of averaging and an "expectation value". I'm not going dwell on the precise definition of that term, but explain how it differs from just taking the kind of average we did above in eqn. 1.13. An expectation value is an extreme form of averaging. It's what you'd get if you averaged over a virtually infinite number of blades of grass. In this case it'd be 6. This is in some sense, the true value of the average for a population. The difference between this value and the averaging described above, is that there are deviations away from the true value. This is inherent in anything involving probability. With a finite number of measurements you're never going to be able to determine anything precisely.
Subsections
Josh Deutsch 2009-03-05 |