Diagnosing biased feature importance scores in tree-based models