To compare \(y_n\) to \(L\), we need to explicitly reduce to expressions involving \(|x_n - L|\) because that's the only thing we know must be small. One approach looks like this:
\[{}\left| \frac{x_1+\cdots+x_n}{n} - L \right|{}\]
\[{}= \left| \frac{x_1+\cdots+x_n}{n} - \frac{nL}{n} \right|{}\]
\[{}= \left| \frac{(x_1-L)+\cdots+(x_n-L)}{n} \right|{}\]
\[{}\leq \frac{1}{n} (|x_1-L| + \cdots + |x_n - L|){}\]
The good news: For large \(n\), the terms \(|x_n - L|\) are very small.
The bad news: Not all of the indices in our expression are large.
The good news: As we let \(n\) get extremely large, the proportion of terms that are small grows to be almost all of them, and that's enough for us to proceed.
The steps must be carefully arranged in the proper order:
Fix an \(\epsilon > 0\).
We know that there is some \(N_0\) such that \(|x_n - L| < \epsilon/2\) when \(n \geq N_0\).
We know that convergent sequences are bounded. This means that there is some constant \(C\) such that \(|x_n - L| \leq C\) for all \(n\), not just \(n \geq N_0\). Without loss of generality, \(C > 0\).
Take some value of \(n\) much larger than \(N_0\) to be determined:
\[{}\frac{1}{n} \sum_{j=1}^n |x_j - L|{}\]
\[{}= \frac{1}{n} \sum_{j=1}^{N_0-1} |x_j - L| {}\]
\[{}+ \frac{1}{n} \sum_{j=N_0}^n |x_j - L|{}\]
\[{}< \frac{1}{n} \sum_{j=1}^{N_0-1} C + \frac{1}{n} \sum_{j=N_0}^n \frac{\epsilon}{2}{}\]
\[{}\leq \frac{(N_0-1)C}{n} + \frac{n-N_0}{n} \frac{\epsilon}{2}{}\]
\[{}\leq \frac{(N_0-1)C}{n} + \frac{\epsilon}{2}.{}\]
Constrain
\[n \geq \max\{N_0, 2(N_0-1)C \epsilon^{-1}\}.\]
All
\(n\) in this range are big enough for the derivation above to be valid. They're also big enough that the first term is no greater than
\(\epsilon/2\).