diff --git a/README.md b/README.md
index 128b4987044580d5b9310b41955013984ed18c65..c804d36b2eafa2d3ac49f449ad54679db815318d 100644
--- a/README.md
+++ b/README.md
@@ -33,6 +33,12 @@ Recent developments in Quantum Computing (QC) have paved the way for an enhancem
 
 ### Praparation of the binary classification problem
 
+The binary classification problem is constructed from the SemCity Toulouse multispectral benchmark dataset, which is publicly available 👉 https://doi.org/10.5194/isprs-annals-V-5-2020-109-2020
+
+More information about the dataset can be found in the publication below:
+
+R. Roscher, M. Volpi, C. Mallet, L. Drees, and J. D. Wegner, “SemCity Toulouse: a benchmark for building instance segmentation in satellite images,” ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. V-5-2020, pp. 109–116, 2020.
+