retailers; data sets; datasets; Scala; Python API; consumer behaviour; Apache Spark