Students are constantly describing their features as functions of x only ...
So when we introduce conditional probabilities, we need a simple lesson that includes features that depend only on x, so that we can point out that these have no effect on p(y | x), and ask why not.