We should rename the current ranking evaluator something else (AUC Evalautor?), and add in a new evaluator that does actual ranking metrics: - P@N - H@N - MRR - CMRR