- Open Access
Integrating artificial neural network and classical methods for unsupervised classification of optical remote sensing data
EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 165 (2012)
A novel system named unsupervised multiple classifier system (UMCS) for unsupervised classification of optical remote sensing data is presented. The system is based on integrating two or more individual classifiers. A new dynamic selection-based method is developed for integrating the decisions of the individual classifiers. It is based on competition distance arranged in a table named class-distance map (CDM) associated to each individual classifier. These maps are derived from the class-to-class-distance measures which represent the distances between each class and the remaining classes for each individual classifier. Three individual classifiers are used for the development of the system, K-means and K-medians clustering of the classical approach and Kohonen network of the artificial neural network approach. The system is applied to ETM + images of an area North to Mosul dam in northern part of Iraq. To show the significance of increasing the number of individual classifiers, the application covered three modes, UMCS@, UMCS#, and UMCS*. In UMCS@, K-means and Kohonen are used as individual classifiers. In UMCS#, K-medians and Kohonen are used as individual classifiers. In UMCS*, K-means, K-medians and Kohonen are used as individual classifiers. The performance of the system for the three modes is evaluated by comparing the outputs of individual classifiers to the outputs of UMCSs using test data extracted by visual interpretation of color composite images. The evaluation has shown that the performance of the system with all three modes outrages the performance of the individual classifiers. However, the improvement in the class and average accuracy for UMCS* was significant compared to the improvements made by UMCS@, and UMCS#. For UMCS*, the accuracy of all classes were improved over the accuracy achieved by each of the individual classifiers and the average improvements reached (4.27, 3.70, and 6.41%) over the average accuracy achieved by K-means, K-medians and Kohonen respectively. These improvements correspond to areas of 3.37, 2.92 and 5.1 km2 respectively. While the average improvements achieved by UMCS@ and UMCS#, respectively, compared to their individual classifiers were (0.77 and 2.79%) and (0.829 and 2.92%) which correspond to (0.61 and 2.2 km2) and (0.65 and 2.3 km2) respectively.
Unsupervised classification of remotely sensed data is a technique of classifying image pixels into classes based on statistics without pre-defined training data. This means that the technique is of potential importance when training data representing the available classes is not available. Unsupervised classification is also important for providing a preliminary overview of image classes and more often it is used in the hybrid approach of image classification[1, 2]. Several methods of unsupervised classification using classical or neural network approaches have been developed and used consistently in the field of remote sensing. The most commonly used of the classical approach is K-means clustering algorithm while Kohonen network is the most commonly used one of the artificial neural network approach. So far many research works have conducted to improve the accuracy of the unsupervised classifiers. Examples of these works are the use of Kohonen classifier as a pre-stage to improve the results of clustering algorithms such as agglomerative hierarchical clustering, K-means and threshold-based clustering algorithms[5–7]. In those works one algorithm was used as a pre-stage to improve the classification results of another algorithm. That is, the final decision is made according to only one classifier’s decision. Methods involving a simultaneous use of more than one classifier in the so-called multiple classifier system (MCS) which is very common in the approach of supervised classification have not been conducted in the unsupervised classification of optical remote sensing data. See for example[8–10] for some of the MCS schemes form supervised classification of remote sensing data. The idea of MCS is based on performing more than two classifiers and integrating their decisions according to some prior or posterior knowledge concerning the output classes to reach the final decision. Prior knowledge is estimated from training data concerning the output classes while posterior knowledge, in general, represents the outputs of the individual classifiers. The operation of integration is done in one of two strategies, either by combining the outputs of the individual classifiers or by selecting one of the individual classifiers outputs. Many methods of integration have been developed for the implementation of MCS in the supervised approach of classification. Examples of combined-based methods of integration are the majority voting rule, which assigns the label scored by majority of the classifiers to the test sample and Belief function, which is knowledge-based method. It is based on the probability estimation provided by the confusion matrix derived from training data set. Examples of the dynamic classifier selection-based method of integration are classifier rank (CR) approach, which takes the decision of the classifier that correctly classifies most of the training samples neighboring the test sample and the local ranking (LR) which is based on ranking the individual classifiers for each class according to the mapping accuracy (MA) of the classes.
In this article, an integrated system of unsupervised classification named unsupervised multiple classifier system (UMCS) is developed using individual classifiers from two different approaches, traditional (classical) and artificial neural network. The system is based on new integration method of the dynamic classifier selection-based type. This method is based on class-distance maps (CDM) for the individual classifiers as the measure upon which the final decision is selected. The CDM of each individual classifier is generated from the measure of Euclidean distances between each class and the remaining classes of that individual classifier, named here as the class-to-class distance measurement (CCDM).
The remaining parts of the article are organized as follows: In the following section, the proposed system is described and detailed explanations of its major modules are given. In section “Results”, the results of applying the system to ETM + images are shown and discussed. In section “Posterior interpretation of output classes”, posterior interpretation of the classification outputs is done. In section “Individuals and multiple classifiers comparison”, comparisons between the output results are made. In section “Evaluation of system performance”, the performance of the system is evaluated and finally some concluding remarks are given in the last section.
(UMCS); the proposed system
In this article, the proposed system of classification is called UMCS to be differentiated from the multiple classifier system (MCS) which is common in supervised classification. It is designed to host three individual unsupervised classifiers and can be adapted to any number on individual classifiers. The scheme of the system for three individual classifiers is shown in Figure 1. Each of the three classifiers, K-means, K-medians and Kohonen is implemented using multi-spectral images yielding three output images. These three output images are then entered to a color unification algorithm (CUA) in order to achieve class-to-class correspondence in the three output images. Finally, the three output images of the (CUA) are integrated using CDM generated from the Euclidean distance measurements between each class and the remaining classes within the classifier, named as (CCDM). The algorithms of color unification and classifier integration method are given in the following sections.
In most cases the order of classes resulting from different approaches of unsupervised classification are affected by the way of performing the operation of clustering and the order of data presented to the process of clustering. For instance, in the Kohonen network, the training phase usually starts by giving the initial weights which control the order of the outcome classes. Therefore in order to implement the proposed system, the corresponding classes in the individual classifiers must have the same order. To achieve this step, an algorithm named CUA is developed. The aim of this algorithm is to reorder the classes an all classifiers in order to assign same color to the three nearest classes of the three classifiers. This is done by fixing the order of classes in one classifier as a reference and reordering the classes of the other two classifiers. This algorithm requires the determination of the Euclidean distance between the center of each class in the referenced classifier and the centers of all classes in each of the other two classifiers. The nearest two classes each from one classifier are given the same order (color) of the current class in the reference classifier. Then the operation is repeated until the ordering of all classes in the three classifiers is reached. The algorithm does not require re-calculation of the class centers since these centers are calculated during the implementation of the classifiers. In K-means and K-medians classifiers, the last mean vectors and median vectors upon which the classifier have reached the convergence state represent the centers of the classes. In Kohonen classifier, the weight vectors to the output neurons are taken to be the centers of the classes. The procedures of the algorithm are:
Read the centers of the classes for the three classifiers and set the class number i = 0.
Increase class number i = i + 1.
Calculate the Euclidean distance between the mean vector of Ci from the reference classifier and the mean vectors of all output classes in the other two classifiers.
Dim is the Euclidean distance between class Ci from the reference classifier and class Pm from the second classifier.
Din is the Euclidean distance between class Ci from the reference classifier and class Qn from the third classifier.
||.|| represents the norm operator.
Exchange class order.
Exchange the class order of the second classifier:
Temp = P j
P j = P i
P i = Temp
Exchange the class order of the third classifier:
Temp = Q l
Q l = Q i
Q i = Temp
Check the convergence of the algorithm.
if (i < k)
Go to step 2
Go to step 6
Integration method by CCDM
As it was mentioned in the introduction, several methods of integrating the outputs of different classifiers are available. These methods were designed for MCS of the supervised type and they require a priori knowledge which most often can be estimated from the training data. For UMCS, the training data are not available and therefore this a priori knowledge cannot be obtained. The method of majority voting may be the only one which can be used to integrate the outputs of unsupervised classifiers since it only requires the final decisions of the three classifiers. However, this rule is influenced by the degree of correlation among the errors made by individual classifiers. When these errors are correlated (all classifiers produce incorrect but similar outputs) it leads to incorrect decision and when these errors are uncorrelated (each classifier produces a unique output) it leads to failure,.
In this article, a new method of integration is introduced. It is categorized as a selection-based approach and does not need prior knowledge. It requires a posterior knowledge which can be obtained from the outputs of the three classifiers. This posterior knowledge is the within classifier CCDM which is the measure of Euclidean distance between each class and all of the remaining classes within each individual classifier. This CCDM is then used to generate a table having N columns and N-1 rows, where N is the number of classes. The elements under each column represent the distances, stored in ascending way, from the class of that column to all of the remaining classes. For each individual classifier one CDM is generated.
The procedures of implementing the algorithm are given below for UMCS made from three classifiers. It consists of two parts. In the first part, the CDM is generated. In the second part, the process of selecting the final decision is performed. The algorithm can easily be adapted to any number of classifiers. The flowchart of the algorithm is given in Figure 2.
Generation of CDM
Calculate CCDM from all the classes in each classifier using the following equation:(1)
i = 1,,,,N, j = 1,,,,N, i ≠ j and N is the number of classes in each classifier.
||.|| represents the norm operator.
Ci is the mean vector of ith class and Cj is the mean vector of jth class, both from the same classifier.
For N classes there will be N-1 distances associated with each class.
Generate CDM by sorting the values of CCDM in an ascending way to be used for competition between the individual classifiers. Let these competition distances be represented as Di,j,k, where;
i is a subscript refers to the individual classifier, i = 1,,,M and M is the total number of individual classifiers involved in the UMCS.
j refers to the current class, j = 1,,,N and N is the number of classes which is the same for all individual classifiers.
k refers to the position of the distances after being sorted in an ascending way. That is, the minimum of all will be at the top of column with position k = 1 and the maximum of all will be at the bottom of column with position k = N-1. During the process of decision selection, these distances at position k = 1 will be compared and at tie cases the distances at the next position k = 2 will be compared and so on until tie break is achieved. If the tie case continued to appear until the last position, which is a rarely occurred case, then it is broken by assigning the class produced by one of the individual classifier arbitrarily. This way of comparison makes the algorithm effective for tie cases as there will almost a zero chance for the occurrence of tie while comparing all the competitive distances. Table 1 shows a model of CDM for classifier i. In this table, each column holds the competition distances associated to each class, starting from class 1 to class N. UMCS of three individual classifiers requires three of these CDM and during the operation of decision selection there will be a competition between these distances in three column each of one CDM.
Read the three output images produced by the three individual classifiers pixel by pixel. The values of these pixels represent the class numbers produced by the three classifiers. Let these class numbers are: O, P and Q. Then perform the following statements:
if (O = P and O = Q) then
Assign class O to the pixel of the final output image
Perform the operation of integrating the decisions made by the three classifiers.
Performing the process of final decision selection
Set record number k to a value of one (the position of the first distance in each of the columns under the output classes) and set three flags (f1, f2, f3) each to a value of one.
Read the distances associated to these classes at position k (D1,O,k, D2,P,k, D3,Q,k).
Compute competitive distances (d1, d2, d3) as follows:
d1 = f1* D1,O,k
d2 = f2* D2,P,k
d3 = f3* D3,Q,k
Check the values of these competitive distances to perform one of the following cases for final decision:
Case 1: If these competitive distances are alternatively different, then assign the class with maximum competitive distance to the pixel of the final output image.
Case 2: If any two of these competitive distances have the same value and this value is lower than the competitive distance of the other class, then assign the other class to the pixel of the final output image.
Case 3 (Tie-Break): If the competitive distances for the three classes are all the same, then increase k by one and go to step 2 to read the next associated distances. If the tie case remained unbroken, then assign one of the classes arbitrarily to the pixel of the final output image.
Case 4 (Tie-Break): If the competitive distances of any two classes have same value and this value is greater than the competitive distance of the other class, then discard the other class from competition by resetting its flag (f) to zero and increase k by one then go to step 2. Here, the competition will remain between two output classes as far as the tie is not broken and if k value reached the last record at a position number equals (m - 1) where m is the total number of classes without achieving tie-break, then assign one of the classes arbitrarily to the pixel of the final output image, otherwise assign the class with maximum competitive distance to the pixel of the final output image.
The system is applied to ETM + image of an area north to Mosul dam in the northern part of Iraq. The image size is 296 × 296 square pixels which is equivalent to an area of 78.85 km2. Standard Kohonen network with R = 0 was used (the weights of only the winner neuron are updated). The number of neurons in the input layer was chosen to be 6 which is the number of the available bands of ETM + (band1, band2, band3, band4, band5 and band7). The number of neurons in the output layer was chosen to be 8 which are the same as the output classes in K-means and K-medians. However in practice, training Kohonen network usually needs wise determination of the learning rate and the number of cycles. In this article, different values of learning rate and cycle numbers were tried. Consistent results were reached by using initial learning rate of 0.7 with a decrement of (0.7/500) at each next cycle where the number of cycles is taken to be 500. Kohonen neural network with this structure is supposed to be closest to K-means than any of the other structures of Kohonen neural network. K-medians clustering is a variation of K-means, however mathematically medians are calculated instead of means,.
The selection of standard Kohonen neural network and the K-medians as being closely related to K-means clustering was done in order to show that, to what extend these classifiers can produce different results and to what extend the application of UMCS can be appreciable when individual classifiers of divers differences are chosen.
The system is applied in three modes using different number and combinations of individual classifiers in order to show the influence of increasing the number of individual classifiers on the system accuracy. In the first mode (UMCS@), K-means and Kohonen were used as two individual classifiers. In the second mode (UMCS#), K-medians and Kohonen were used as two individual classifiers. In the third mode (UMCS*), K-means, K-means and Kohonen were used as three individual classifiers. Figure 3, shows the classification results of K-means, K-means, Kohonen and the three multiple classifiers UMCS@, UMCS#, and UMCS*. In unsupervised classification usually the number of classes is chosen either arbitrarily or according to the available knowledge of the study area. Here, this number was chosen to be 8 after visual inspection of the color composite images made from different combinations of the available bands.
To show as to what extend the individual classifiers in each MCS agreed or disagreed in their decisions are given in Table 2. In this table, the percentages of pixels and their equivalent areas for which all the individual classifiers produced the same and different decisions for the three MCS (UMCS@, UMCS#, and UMCS*) are shown. According to this table the number of pixels for which the individual classifiers have given different results in the case of UMCS* is greater than those in UMCS@ and UMCS#. This is an expected result given the fact that increasing the number of individual classifiers will makes more chances of these classifiers first to give different results and second to produce uncorrelated errors,.
Posterior interpretation of output classes
In unsupervised classification the cover types that represent the output classes must be identified after the classification. Here, this interpretation was done by comparing the results of the individual classifiers and the multiple classifiers visually to the color composite images of the available bands. Two color composite images were generated using the combinations (band4, band3, band2 as RGB) and (band7, band4, band1 as RGB), Figure 4. First, the interpretation of these color composite images was implemented by comparing the colors in these two color composite images to the spectral properties of the cover types. This is one of the most commonly used methods for remote sensing data interpretation,. Table 3 shows the identities of the output classes after interpretation.
Individuals and multiple classifiers comparison
To visualize the differences between the outputs of the six classifiers, five areas were localized in rectangles of different colors. These differences can be illustrated for the area in black rectangle. Figure 5 is the zoomed image of the black rectangles for the six classifiers. In the product of K-means the area of this rectangle is dominated by blue and yellow colors, which correspond to (Dry Gray Soil) and (Wet Red Soil) cover types respectively. However, the area of blue color within this rectangle for K-means and K-medians are almost the same. In the product of Kohonen, two more colors appeared in this rectangle, the green and some patches of red colors which correspond to (Dry Red Soil) and (Less Wet Red Soil). These variations in the colors within this rectangle indicate that the three individual classifiers can produce different results for the same area. Looking at this rectangle in the UMCS products shows that these colors have been distributed differently for the three UMCS products. For instance in UMCS@ and UMCS* products, the colors and their distributions are almost the same as in K-means. This indicates that the competition between the blue and yellow colors of K-means product on one side and the green, magenta and red colors of the K-medians and Kohonen products on the other side was in favor of K-means classifier. This can be checked by looking at the CDM of the three individual classifiers, Table 4. This table shows that the competitive distance of blue in K-means is higher than the competitive distance of green color in Kohonen, therefore blue color will be the winner and will appear in the output of the MCS. On the other hand, the competition distance of yellow colors of the K-means map is greater than the competitive distance of magenta and red colors in the maps of K-medians and Kohonen therefore, in the output of MCS UMCS* yellow color will be the winner. The same rule can be applied to areas within the other rectangles with the aid of CDM of the individual classifiers.
Evaluation of system performance
The performance of the system was evaluated by selecting test data from the two color composite images representing the eight classes. The locations of these test data samples were shown as rectangles in the color composite of (Band4, band3, band2 as RGB) of Figure 4. For each class the rectangle is shown in the same color of that class. The numbers of the selected pixels for the classes 1 to 8 respectively were 320, 400, 400, 400, 200, 220, 420 and 260). This data is then entered to each of the individual classifier (K-means, K-medians and Kohonen) as well as to each of the multiple classifiers (UMCS@, UMCS# and UMCS*).
The MA was measured since this measurement takes into account the pixels that are falsely classified. The confusion matrices of the six classifiers are given in Table 5. In this table, the diagonal elements represent the number of pixels that are correctly classified (P corr ), the off-diagonal elements in the row of the class represent the number of pixels that are incorrectly classified to other classes, known as omission error (P om )and the off-diagonal elements in the column of the class represent pixels that are falsely classified to the current class, known as commission error (P com ). The MA of the eight classes for each classifier is calculated using the following equation:
Table 6 shows these mapping accuracies for the six classifiers. It can be seen that the MA of all classes are improved by UMCS*, while the MA for some classes were improved and for others were decreased by UMCS@ and UMCS# classifiers. Table 7 shows the improvements in the class and average MA made by each of the multiple classifiers over their belonging individual classifiers. The best improvements were achieved by UMCS* over each of the individual classifiers. The amounts of these improvements are 4.27, 3.70 and 6.41% over each of K-means, K-medians and Kohonen classifiers respectively. These improvements are equivalent to areas of 3.37, 2.92 and 5.1 km2. Whereas the average improvements made by UMCS@ and UMCS#, over each of the individual classifiers were much less. For UMCS@ the improvement over K-means and Kohonen were 0.77 and 2.79% (equivalent to areas of 0.61 and 2.2 km2) and for UMCS# the improvement over K-medians and Kohonen were 0.829 and 2.92% (equivalent to areas of 0.65 and 2.3 km2). However, for individual classes the maximum improvement achieved by UMCS* reached 8.51, 7.80 and 10.46% over each of K-means, K-medians and Kohonen classifiers respectively. While the improvements made by UMCS@ over K-means and Kohonen were 6.57 and 6.59% respectively and by UMCS# over K-medians and Kohonen were 4.83 and 6.44% respectively. These improvements indicate that when more classifiers are used in the multiple classifier system better improvement can be achieved. This concluded result has also been approved for the supervised multiple classifier system,.
Unsupervised classifiers (K-means, K-medians and Kohonen) representing two different approaches, classical and artificial neural network, they were integrated in a MCS the application of the system to satellite images has shown that the three classifiers may produce different results despite of being considered as closely related. The CDM, which is generated from the CCDM, is shown to be effective measure for competition during the process of classifier output integration. The way of using the record of this map makes the chance of tie cases occurrence as less as possible. For the area used in this study no tie cases have occurred. The implementation of the system does not need training data; however, the test data derived after classification are necessary only for the evaluation of the system performance. The application of the system using three individual classifiers achieved better performance than its application with only two individual classifiers. This indicates that contributing more individual classifiers will make the made-off MCS more efficient.
However, this study may represent the beginning but important step in the direction of multiple classifier approach for unsupervised classification of remote sensing data and lots of work may bb required to put the approach steps further.
Kamusoko C, Aniya M: Hybrid classification of Landsat data and GIS for land use/cover change analysis of the Bindura district. Zimbabwe. Int. J. Remote. Sens. 2009, 30(1):97-115.
Kumar U, Raja SK, Mukhopadhyay C, Ramachandra TV: Hybrid Bayesian Classifier for Improved Classification Accuracy. IEEE Trans. Geosci. Remote. Sens. Let. 2011, 8(3):474-477.
Palubinskas G, Datcu M, Pac R: Clustering algorithms for large sets of heterogeneous remote sensing data. IEEE Geosci. Remote. Sens. 1999, 3: 1591-1593. Proceedings of International Symposium, IGARSS ‘99
Mather PM, Tso B, Koch M: An Evaluation of Landsat TM Spectral data and SAR-Derived Textural Information for Lithological Discrimination in the Red Sea Hills. Sudan, INT. J. Remote. Sens. 1998, 19(4):587-607.
Goncalves ML, Netto MLA, Costa JAF, Zullo Júnior J: An unsupervised method of classifying remotely sensed images using Kohonen self organizing maps and agglomerative hierarchical clustering methods. Int. J. Remote. Sens. 2008, 29(11):3171-3207. 10.1080/01431160701442146
Anthony F, Iliyana D, Klein AG, Jensen JR: Self-Organizing Map-based Applications in Remote Sensing, Self-Organizing Maps. In InTech. Edited by: Matsopoulos GK. , ; 2010. Available from: http://www.intechopen.com/articles/show/title/self-organizing-map-based-applications-in-remote-sensing
Awad M: An Unsupervised Artificial Neural Network Method for Satellite Image Segmentation. Int. Arab. J. Inf. Technol. 2010, 7(2):199-205.
Tahir AK: A multiple Classifier System for supervised classification of remotely sensed data. J. Univ. Duhok 2011, 14(1):260-273.
Maulik U, Chakraborty D: A Robust Multiple Classifier System for Pixel Classification of Remote Sensing Images. J. Fundam. Inf. 2010, 101(4):286-304.
El-Melegy MT, Ahmed SM: Neural networks in multiple classifier system for remote sensing image classification. Studies in Fuzziness and soft computing 2007, 210: 65-94. 10.1007/978-3-540-38233-1_3
Smits PC: Multiple classifier systems for supervised remote sensing image classification based on dynamic classifier selection. IEEE Trans. Geosci. Remote. Sens. 2002, 40(4):801-813. 10.1109/TGRS.2002.1006354
Sabourin M, Mitiche A, Thomas D, Nagy G: Classifier Combination of hand printed digit recognition, Proceeding of Second International Conference on Document Analysis and Recognition. IEEE conference publications, Tsukuba Saenie City, Japan; 1993:163-166.
Bradley PS, Mangasarian OL, Street WN: Clustering via Concave Minimization. In Advances in Neural Information Processing Systems. Edited by: Mozer MC, Jordan MI, Petsche T. MIT Press, Cambridge, MA; 1997:368-374.
Benediktsson JA, Chanussot J, Fauvel M: Multiple classifier systems in remote sensing: from basics to recent developments, Proceedings of the 7th international conference on Multiple classifier systems. Springer-Verlag, Berlin, Heidelberg; 2007:501-512.
This study was supported by the University of Duhok / Faculty of Science as a part of the scientific research plan of the Physics Department for the year 2010 / 2011.
The author declares that he has no competing interests.
About this article
Cite this article
Tahir, A.A. Integrating artificial neural network and classical methods for unsupervised classification of optical remote sensing data. EURASIP J. Adv. Signal Process. 2012, 165 (2012). https://doi.org/10.1186/1687-6180-2012-165
- Individual Classifier
- Mapping Accuracy
- Reference Classifier
- Unsupervised Classification
- Artificial Neural Network Approach