
This is great advice. A related issue I see is when you engineer a new feature that can't be 100% accurate because the source data is spotty, but you intuitively expect it to help the classifier anyway whenever it is present. If that feature's importance in the trained model then turns out really high, you think you've done something great. But in the end you've built a model that simply detects the presence of your new feature, which you already knew wasn't reliable because the source data it's derived from is spotty. So you've accomplished precisely nothing.
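A minimal sketch of that failure mode, if it helps make it concrete (assuming scikit-learn; the synthetic data and all names here are illustrative, not from any real project): the feature's value carries no signal, only its presence does, yet the model scores fine and ranks the feature highly.

    # Diagnostic: if the model scores the same when the engineered feature
    # is replaced by a bare "is it present?" flag, the model has learned
    # presence (a missingness pattern), not the feature's actual content.
    import numpy as np
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import cross_val_score

    rng = np.random.default_rng(0)
    n = 5000
    y = rng.integers(0, 2, n)

    # Spotty source data: the feature exists more often for positives,
    # and its value carries no information beyond that presence.
    present = rng.random(n) < np.where(y == 1, 0.8, 0.3)
    feature = np.where(present, rng.normal(size=n), np.nan)

    X_feature = np.nan_to_num(feature, nan=-999.0).reshape(-1, 1)
    X_flag = present.astype(float).reshape(-1, 1)

    clf = RandomForestClassifier(n_estimators=100, random_state=0)
    print("engineered feature:", cross_val_score(clf, X_feature, y, cv=5).mean())
    print("presence flag only:", cross_val_score(clf, X_flag, y, cv=5).mean())
    # Near-identical scores => the "important" feature is just a presence mask.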


OP here, glad you liked it!

The thing you're talking about definitely happens heaps as well, because of a fundamental mental blind spot we have. I'd love to hear any more stories you've got along these lines. The psychology of what makes a machine learning project succeed really interests me, and I don't mean platitudes about openness and transparency.

I'm really tempted to write another post specifically about the sort of thing in your example: narrative fallacies in machine learning. Because we operate in the unknown, we tend to string the evidence we have together into a nice, appealing story.


It would be unusual not to check the model's performance at predicting the target variable, which would show whether the derived feature is actually useful. Roughly, something like the sketch below.
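A hedged sketch of that check as a cross-validated ablation (assuming scikit-learn; the function name and the column index argument are made up for illustration): score the model with the candidate column, then with it dropped, and see whether the lift is real.

    import numpy as np
    from sklearn.ensemble import GradientBoostingClassifier
    from sklearn.model_selection import cross_val_score

    def score_with_and_without(X, y, col, cv=5):
        # Cross-validated accuracy with the candidate column, then with it
        # dropped. If the gap is ~0, the derived feature isn't adding value,
        # no matter how high its feature importance looks.
        clf = GradientBoostingClassifier(random_state=0)
        with_feat = cross_val_score(clf, X, y, cv=cv).mean()
        without_feat = cross_val_score(clf, np.delete(X, col, axis=1), y, cv=cv).mean()
        return with_feat, without_feat

Though note that if the feature's presence itself correlates with the label (as in the example upthread), the ablation can still show a "lift" that comes from the missingness pattern rather than the feature's content, so it's worth running the presence-flag comparison too.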



