GPU inference benchmark