SKIP TO CONTENT
Harvard Business Review Logo

If Your Data Is Bad, Your Machine Learning Tools Are Useless

April 2, 2018
Alan Schein Photography/Getty Images

Poor data quality is enemy number one to the widespread, profitable use of machine learning. While the caustic observation, “garbage-in, garbage-out” has plagued analytics and decision-making for generations, it carries a special warning for machine learning. The quality demands of machine learning are steep, and bad data can rear its ugly head twice — first in the historical data used to train the predictive model and second in the new data used by that model to make future decisions.

Partner Center