Supervised classification (Tutorial)

From AWF-Wiki
(Difference between revisions)
Jump to: navigation, search
(Image statistics)
 
(70 intermediate revisions by one user not shown)
Line 1: Line 1:
= Classification with Orfeo Toolbox =
+
=Per pixel classification with QGIS and OTB processing plugin=
== Image statistics ==
+
In '''OTB Version 7.0.0''' the requirements for the input training vector data are very restrictive because of some bugs.
* Type into the search box of the Windows taskbar: {{typed|text=mapla.bat}}. Click on mapla.bat to open  Monteverdi Application Launcher.
+
** File format needs to be Shapefile (GPKG does not work).
* In the search engine of mapla, type {{typed|text=ComputeImagesStatistics}} and double click '''ComputeImagesStatistics'''.
+
** The column containing the class label needs to integer (whole number). '''Data type Integer 64bit does not work!'''
* Specify a multispectral image as Input Image: the Sentinel-2 image ''Subset_S2A_MSIL2A_20170619T_MUL_BOA.tif ''
+
** Any other columns with other data types (real or text (string) with umlauts or blank are not correctly imported which leads to failure of the modul TrainImagesClassifier.  
* Specify directory and name for the XML Output image. Specify the extension '''.xml''' for this file.
+
A work around is to make a copy of your training data and delete all other columns than the class label column
* Click on {{button|text=Execute}}.
+
[[File:Qgis_ComputeImagesStatistics.png|500px]]
+
 
+
== Train image classifier ==
+
# Add the training areas as vector polygon file ''lab05_training_input.shp'' into QGIS.  
+
# Open {{mitem|text=Orfeo Toolbox --> TrainImageClassifier (libsvm)}} to use the Support Vector Machine SVM algorithm (see figure '''B''').
+
# Set ''Subset_S2A_MSIL2A_20170619T_MUL_BOA.tif'' as {{button|text=Input image list}}.
+
# Set ''lab05_training_input.shp'' as {{button|text=Input vector list}}.
+
# Set ''Subset_S2A_MSIL2A_20170619T_MUL_BOA.tif.xml'' as {{button|text=Input XML image statistics file}}.
+
# Set {{button|text=Name of discrimination field}} to ''C_ID'' (C_ID refers to the column that contains the LUC code).
+
# Save the {{button|text=Output confusion matrix}} as ''ConfusionMatrixSVM.csv''.
+
# Save the {{button|text=Output model}} as ''SVM.model''.
+
# Calculation of accuracies :<br/> Open ''ConfusionMatrixSVM.csv'' in LibreOffice or MS Excel and calculate overall, producer and consumer accuracies.
+
 
+
== Classification==
+
# Open {{mitem|text=Orfeo Toolbox --> Image Classification}} (see figure '''C''').
+
# Set ''Subset_S2A_MSIL2A_20170619T_MUL_BOA.tif'' as {{button|text=Input image}}.
+
# Set ''SVM.model'' as {{button|text=Model file}}.
+
# Set ''Subset_S2A_MSIL2A_20170619T_MUL_BOA.xml'' as {{button|text=Statistical file}}.
+
# Save the {{button|text=Output image}} as ''su_svm.tif''.
+
# Evaluate classification results.
+
## Add the classification result ''su_svm.tif'' to QGIS.
+
## Right click ''su_svm.tif'' in the [[TOC]] and select {{mitem|text=Properties --> Style --> Style --> Load Style}}.
+
## Load ''lab05_MinDist.qml''.
+
 
+
== Compute a confusion matrix with independent reference data ==
+
# Open {{mitem|text=Orfeo Toolbox --> ComputeConfusionMatrix (Vector)}}.
+
# Set ''su_svm.tif'' as {{button|text=Input image}}.
+
# Set ''lab05_validation.shp'' as {{button|text=Input reference vector data}}.
+
# Set {{button|text=Field name}} to ''C_ID''.
+
  
 +
All these issues have been fixed in '''OTB Version 7.1.0'''!
  
 +
==Training phase==
 +
* In the search engine of Processing Toolbox, type {{typed|text=TrainImages}} and open '''TrainImagesClassifer'''.
 +
* In the {{button|text=Input Image List}} select a (or optional: several) multispectral images: '''mosaic_S2A_MSIL2A_20200521T102031_N0214_R065_T32UNC_UNB_UPC.tif'''.
 +
* In the {{button|text=Input Vector Data List}} choose a vector polygon file with training areas: '''lucc_training_2020-05-21.shp'''.
 +
* In the {{button|text=Validation Vector Data List}} and choose a vector polygon file with an independent sample of validation areas: leave empty.
 +
* Training and validation sample ratio: {{typed|text=0.25}}. This is a 4-fold cross-validation with split 0.75 per cent  for training and 0.25 per cent for testing.
 +
* Type {{typed|text=Code}} in the {{button|text=Field Name}} text field.
 +
* Choose Support Vector Machine Classifer {{button|text=libsvm}} from the drop down list.
 +
* SVM Model Type is {{button|text=csvc}}
 +
* User defined input centroids: Type any character, no need for a filename. This field is required and but ignored (a bug in the OTB plugin!)
 +
* The SVM Kernel Type is {{button|text=Linear}}.
 +
* Switch checkbox {{button|text=Parameters optimization}} '''off'''. The optimization results in a higher accuracy but takes much time (> 1 hour computation).
 +
* In the {{button|text=Output model}} specify an model file: e.g. '''svm.model'''
 +
* Define an output file for {{button|text=Output confusion matrix or contingency table}} (e.g.'''svm_matrix.csv''').
 +
* Click {{button|text=Run}}.
 +
* Click on the {{button|text=Log}} tab and inspect the '''model quality''' measures: Precision, Recall, F-score and Kappa index.
 +
[[File:qgis_otb_trainimages.png|400px]]
  
 +
==Classification phase==
 +
* In the search engine of Processing Toolbox, type {{typed|text=ImageClassifier}} and double click '''ImageClassifier'''.
 +
* Set ''mosaic_S2A_MSIL2A_20200521T102031_N0214_R065_T32UNC_UNB_UPC.tif'' as {{button|text=Input image}}.
 +
* Set {{button|text=Input _mask}} to blank (top of drop-down list).
 +
* Set '''lucc_svm.model''' as {{button|text=Model file}}.
 +
* Set the number of classes in your model: this is the number of unique classes in your training file.
 +
* Set {{button|text=Output pixel type}} to '''uint16'''
 +
* Save the {{button|text=Output image}} as '''svm_classification.tif'''.
 +
* Uncheck Confidence map: ''Open output file after running algorithm''.
 +
* Uncheck Probability map: ''Open output file after running algorithm''.
 +
* Add ''svm_classification.tif'' to QGIS canvas.
 +
* Find the prepared style file '''rast_classifcation.qml''' in the Tutorial data of Workshop 05.
 +
* Right click ''svm_classification.tif'' in the [[TOC]] and select {{mitem|text=Properties --> Symbology --> Style --> Load Style}}.
 +
* Select the style file '''svm_classification.tif'''. {{button|text=OK}}.
 +
[[File:qgis_otb_imageclassifier.png|400px]]
  
 +
==Postprocessing==
 +
This application uses a majority filter with a circular structure element.
 +
* In the search engine of Processing Toolbox, type {{typed|text=regularization}} and double click '''ClassificationMapRegularization'''.
 +
* Define {{button|text=Input classification image}} as an classification output raster.
 +
* Set {{button|text=Structuring element radius}} to {{typed|text=2}} pixels.
 +
* Set {{button|text=Output pixel type}} to {{button|text=uint16}}.
 +
* Save the {{button|text=Output regularizd image}} as '''svm_classification_majority.tif'''.
 +
* Click {{button|text=Run}}.
 +
[[File:qgis_otb_majority.png|400px]]
  
 +
=Per pixel classification with OTB standalone=
 +
==Training phase==
 +
* Type into the search box of the Windows taskbar: {{typed|text=mapla.bat}}. Click on mapla.bat to open  Monteverdi Application Launcher.
 +
* In the search engine of mapla, type {{typed|text=TrainImages}} and double click '''TrainImagesClassifer'''.
 +
* In the {{button|text=Input Image List}} click on {{button|text=+}} and select a (or optional: several) multispectral images: '''Subset_S2A_MSIL2A_20170619T_MUL.tif '''.
 +
* In the {{button|text=Input Vector Data List}} choose a vector polygon file with training areas: '''lucc_training_input.shp'''.
 +
* Activate the checkbox {{button|text=Validation Vector Data List}} and choose a vector polygon file with an independent sample of validation areas: '''lucc_validation.shp'''
 +
* In the {{button|text=Output model}} specify an output model file: e.g. '''lucc_svm.model'''
 +
* Activate the checkbox and save the {{button|text=Output confusion matrix or contingency table}} as '''lucc_svm_confusion.csv'''.
 +
* In the {{button|text=Bound sample number by minimum}} field type {{typed|text=1}}.
 +
* Set the {{button|text=training and validation sample ratio}} to {{typed|text=0 }}. (0 = all training data).
 +
* Mark  ''C_ID''  in the {{button|text=Field containing the class integer label}} (C_ID refers to the column that contains the LUC code in the training and validation vector file).
 +
* Choose {{button|text=LibSVM classifier}} from the drop down list as Classifier to use for the training.
 +
* The SVM Kernel Type is {{button|text=Linear}}.
 +
* The SVM Model Type is {{button|text=C support vector classification}}.
 +
* Switch the Parameters optimization to {{button|text=on}}.
 +
* Check user defined seed and enter an integer value.
 +
* Click on {{button|text=Execute}}.
 +
* Click on the {{button|text=Log}} tab and inspect the quality measures: Precision, Recall, F-score and Kappa index.
 +
[[File:otb_trainimages.png|500px]]
  
 +
==Classification phase==
 +
* In the search engine of mapla, type {{typed|text=ImageClassifier}} and double click '''ClassifcationMapRegularization'''
 +
* Set ''Subset_S2A_MSIL2A_20170619T_MUL.tif'' as {{button|text=Input image}}.
 +
* Set '''SVM.model''' as {{button|text=Model file}}.
 +
* Save the {{button|text=Output image}} as '''svm_classification.tif'''.
 +
[[File:otb_imageclassifier.png|500px]]
 +
* Evaluate classification results:
 +
* Add ''svm_classification.tif'' to QGIS canvas.
 +
* Download the style file '''classifcation.qml''' from Stud.IP.
 +
* Right click ''svm_classification.tif'' in the [[TOC]] and select {{mitem|text=Properties --> Style --> Style --> Load Style}}.
 +
* Select the style file '''\lucc\classification.qml'''. {{button|text=OK}}.
 +
* Open the text file '''lucc_svm_confusion.csv''' with LibreOffice Calc or MS Excel and calculate overall, user, producer accuracy and kappa index.
  
[[Category:QGIS Tutorial]]
+
==Postprocessing==
 +
This application uses a majority with a circular structure element.
 +
* In the search engine of mapla, type, {{typed|text=regular}} and double click '''ClassifcationMapRegularization'''.
 +
* Define {{button|text=Input classification image}} as ans classification output raster.
 +
* Save the {{button|text=Output regularizd image}} as '''svm_classification_majority.tif'''.
 +
* Set {{button|text=Structuring element radius}} to {{typed|text=3}} pixels.
 +
* Set {{button|text=Output pixel type}} to {{button|text=uint8}}.
 +
* Click {{button|text=Execute}}.
 +
[[File:otb_majority.png|400px]]

Latest revision as of 14:12, 1 December 2020

Contents

[edit] Per pixel classification with QGIS and OTB processing plugin

In OTB Version 7.0.0 the requirements for the input training vector data are very restrictive because of some bugs.

    • File format needs to be Shapefile (GPKG does not work).
    • The column containing the class label needs to integer (whole number). Data type Integer 64bit does not work!
    • Any other columns with other data types (real or text (string) with umlauts or blank are not correctly imported which leads to failure of the modul TrainImagesClassifier.

A work around is to make a copy of your training data and delete all other columns than the class label column

All these issues have been fixed in OTB Version 7.1.0!

[edit] Training phase

  • In the search engine of Processing Toolbox, type TrainImages and open TrainImagesClassifer.
  • In the Input Image List select a (or optional: several) multispectral images: mosaic_S2A_MSIL2A_20200521T102031_N0214_R065_T32UNC_UNB_UPC.tif.
  • In the Input Vector Data List choose a vector polygon file with training areas: lucc_training_2020-05-21.shp.
  • In the Validation Vector Data List and choose a vector polygon file with an independent sample of validation areas: leave empty.
  • Training and validation sample ratio: 0.25. This is a 4-fold cross-validation with split 0.75 per cent for training and 0.25 per cent for testing.
  • Type Code in the Field Name text field.
  • Choose Support Vector Machine Classifer libsvm from the drop down list.
  • SVM Model Type is csvc
  • User defined input centroids: Type any character, no need for a filename. This field is required and but ignored (a bug in the OTB plugin!)
  • The SVM Kernel Type is Linear.
  • Switch checkbox Parameters optimization off. The optimization results in a higher accuracy but takes much time (> 1 hour computation).
  • In the Output model specify an model file: e.g. svm.model
  • Define an output file for Output confusion matrix or contingency table (e.g.svm_matrix.csv).
  • Click Run.
  • Click on the Log tab and inspect the model quality measures: Precision, Recall, F-score and Kappa index.

Qgis otb trainimages.png

[edit] Classification phase

  • In the search engine of Processing Toolbox, type ImageClassifier and double click ImageClassifier.
  • Set mosaic_S2A_MSIL2A_20200521T102031_N0214_R065_T32UNC_UNB_UPC.tif as Input image.
  • Set Input _mask to blank (top of drop-down list).
  • Set lucc_svm.model as Model file.
  • Set the number of classes in your model: this is the number of unique classes in your training file.
  • Set Output pixel type to uint16
  • Save the Output image as svm_classification.tif.
  • Uncheck Confidence map: Open output file after running algorithm.
  • Uncheck Probability map: Open output file after running algorithm.
  • Add svm_classification.tif to QGIS canvas.
  • Find the prepared style file rast_classifcation.qml in the Tutorial data of Workshop 05.
  • Right click svm_classification.tif in the TOC and select Properties --> Symbology --> Style --> Load Style.
  • Select the style file svm_classification.tif. OK.

Qgis otb imageclassifier.png

[edit] Postprocessing

This application uses a majority filter with a circular structure element.

  • In the search engine of Processing Toolbox, type regularization and double click ClassificationMapRegularization.
  • Define Input classification image as an classification output raster.
  • Set Structuring element radius to 2 pixels.
  • Set Output pixel type to uint16.
  • Save the Output regularizd image as svm_classification_majority.tif.
  • Click Run.

Qgis otb majority.png

[edit] Per pixel classification with OTB standalone

[edit] Training phase

  • Type into the search box of the Windows taskbar: mapla.bat. Click on mapla.bat to open Monteverdi Application Launcher.
  • In the search engine of mapla, type TrainImages and double click TrainImagesClassifer.
  • In the Input Image List click on + and select a (or optional: several) multispectral images: Subset_S2A_MSIL2A_20170619T_MUL.tif .
  • In the Input Vector Data List choose a vector polygon file with training areas: lucc_training_input.shp.
  • Activate the checkbox Validation Vector Data List and choose a vector polygon file with an independent sample of validation areas: lucc_validation.shp
  • In the Output model specify an output model file: e.g. lucc_svm.model
  • Activate the checkbox and save the Output confusion matrix or contingency table as lucc_svm_confusion.csv.
  • In the Bound sample number by minimum field type 1.
  • Set the training and validation sample ratio to 0. (0 = all training data).
  • Mark C_ID in the Field containing the class integer label (C_ID refers to the column that contains the LUC code in the training and validation vector file).
  • Choose LibSVM classifier from the drop down list as Classifier to use for the training.
  • The SVM Kernel Type is Linear.
  • The SVM Model Type is C support vector classification.
  • Switch the Parameters optimization to on.
  • Check user defined seed and enter an integer value.
  • Click on Execute.
  • Click on the Log tab and inspect the quality measures: Precision, Recall, F-score and Kappa index.

Otb trainimages.png

[edit] Classification phase

  • In the search engine of mapla, type ImageClassifier and double click ClassifcationMapRegularization
  • Set Subset_S2A_MSIL2A_20170619T_MUL.tif as Input image.
  • Set SVM.model as Model file.
  • Save the Output image as svm_classification.tif.

Otb imageclassifier.png

  • Evaluate classification results:
  • Add svm_classification.tif to QGIS canvas.
  • Download the style file classifcation.qml from Stud.IP.
  • Right click svm_classification.tif in the TOC and select Properties --> Style --> Style --> Load Style.
  • Select the style file \lucc\classification.qml. OK.
  • Open the text file lucc_svm_confusion.csv with LibreOffice Calc or MS Excel and calculate overall, user, producer accuracy and kappa index.

[edit] Postprocessing

This application uses a majority with a circular structure element.

  • In the search engine of mapla, type, regular and double click ClassifcationMapRegularization.
  • Define Input classification image as ans classification output raster.
  • Save the Output regularizd image as svm_classification_majority.tif.
  • Set Structuring element radius to 3 pixels.
  • Set Output pixel type to uint8.
  • Click Execute.

Otb majority.png

Personal tools
Namespaces

Variants
Actions
Navigation
Development
Toolbox
Print/export