New publication about cropland classification
We are glad to announce a new publication, "The impact of training class proportions on binary cropland classification", by our colleagues from the Earth and Life Institute, Environmental Sciences, Université catholique de Louvain (Belgium).
The ground truth data sets required to train supervised classifiers are usually collected so as to maximize the number of samples under time, budget and accessibility constraints. Yet, the performance of machine learning classifiers is, among other factors, sensitive to the class proportions of the training set. In this letter, the joint effect of the number of calibration samples and the class proportions on the accuracy was systematically quantified using two state-of-the-art machine learning classifiers (random forests and support vector machines). The analysis was applied in the context of binary cropland classification and focused on two contrasted agricultural landscapes. Results showed that the classifiers were more sensitive to class proportions than to sample size, though sample size had to reach 2,000 pixels before its effect leveled off. Optimal accuracies were obtained when the training class proportions were close to those actually observed on the ground. Then, the synthetic minority over-sampling technique (SMOTE) was implemented to artificially regenerate the native class proportions in the training set. This resampling method led to an increase of the accuracy of up to 30%. These results have direct implications for (i) informing data collection strategies and (ii) optimizing classification accuracy. Though derived for cropland mapping, the recommendations are generic to the problem of binary classification.
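To illustrate the resampling idea mentioned above, here is a minimal sketch of SMOTE-style oversampling in plain Python. It is not the authors' implementation: the function names (`samples_needed`, `smote`), the toy feature values, the class counts and the target share of 25% are all hypothetical and chosen only for illustration. The sketch first works out how many synthetic minority samples are needed to reach a given class proportion, then creates them by interpolating between a minority sample and one of its nearest minority neighbours.

```python
import math
import random


def samples_needed(n_minority, n_majority, target_minority_share):
    """Synthetic minority samples required to reach the target share.

    Solves (n_min + x) / (n_min + x + n_maj) = t for x.
    """
    needed = (target_minority_share * n_majority
              / (1 - target_minority_share)) - n_minority
    return max(0, round(needed))


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between a randomly
    chosen minority point and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (excluding itself)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append(
            tuple(b + gap * (o - b) for b, o in zip(base, other))
        )
    return synthetic


# Hypothetical training set: 5 cropland pixels (minority) and 30
# non-cropland pixels, each described by two made-up features.
cropland = [(0.61, 0.30), (0.58, 0.34), (0.66, 0.28), (0.63, 0.35), (0.59, 0.31)]
n_non_cropland = 30

# Assume, for illustration only, that cropland covers 25% of the landscape.
n_new = samples_needed(len(cropland), n_non_cropland, 0.25)
synthetic_cropland = smote(cropland, n_new, k=3)
print(n_new, len(cropland) + len(synthetic_cropland))
```

In practice one would more likely rely on an established implementation (for example in the imbalanced-learn package) rather than this hand-written version; the sketch only shows the interpolation principle and the arithmetic behind matching a target class proportion.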
For more details visit: https://doi.org/10.1080/2150704X.2017.1362124
Waldner, F., Jacques, D.C., Löw, F. (2017) The impact of training class proportions on binary cropland classification. Remote Sensing Letters 8:1123–1132.
Source of image (map): Dr. Fabian Löw, own analysis.