The error term εi is not noise to be ignored.
It captures everything that affects Y other than X: ability, family background, luck, measurement error.
Key assumption: E[εi | Xi] = 0. The error has mean zero, conditional on X.
This says the omitted factors are, on average, unrelated to X. It is a strong assumption, and one we will spend much of this course learning to worry about.