Physicist by training, ML researcher by trade, firmly convinced that a closer look at our evaluation practices will lead us to Valinor.