Open Access

Development of band ratioing algorithms and neural networks to detection of oil spills using Landsat ETM+ data

EURASIP Journal on Advances in Signal Processing20122012:107

https://doi.org/10.1186/1687-6180-2012-107

Received: 11 July 2011

Accepted: 10 May 2012

Published: 10 May 2012

Abstract

Accurate knowledge of the spatial extents and distributions of an oil spill is very impor-tant for efficient response. This is because most petroleum products spread rapidly on the water surface when released into the ocean, with the majority of the affected area becoming covered by very thin sheets. This article presents a study for examining the feasibility of Landsat ETM+ images in order to detect oil spills pollutions. The Landsat ETM+ images for 1st, 10th, 17th May 2010 were used to study the oil spill in Gulf of Mexico. In this article, an attempt has been made to perform ratio operations to enhance the feature. The study concluded that the bands difference between 660 and 560 nm, division at 660 and 560 and division at 825 and 560 nm, normalized by 480 nm provide the best result. Multilayer perceptron neural network classifier is used in order to perform a pixel-based supervised classification. The result indicates the potential of Landsat ETM+ data in oil spill detection. The promising results achieved encourage a further analysis of the potential of the optical oil spill detection approach.

Keywords

oil spillsmulti-spectralLandsat ETM+neural networksremote sensing

Introduction

Oil spills are causing serious damage to marine and coastal ecosystem. It is estimated that 0.25% of world oil production ends up in the ocean. However, the main contribution of oil pollution originating from transportation activities still originates not from ship accidents, but from routine ship operations like tank washing and engine effluent discharges [1]. Timely and accurate detection of oil slicks help to monitor oil spills and manage coastal resources. The detection of oil spills can efficiently be improved by the use of satellite images which are characterized by suitable special resolution for this purpose [2]. Remote sensing techniques include radar, microwave, infrared, and visible sensors [3]. However, most of oil spills are estimated by Synthetic Aperture Radar (SAR) data [3], which are limited by revisit frequency and coverage.

The exploitation of optical satellite images allows for large areas monitoring and remote zones control, providing more frequent information if compared to the use of SAR images only. Moreover, the possibility of detecting oil spills by optical satellite sensors has been demonstrated [46].

It is the optical properties and chemistry (OH, CN, and CH bonds) of oil that makes it detectable by remote sensing techniques in marine environment [1].

From an optical perspective, the effect of an oil layer on water can be described by two main processes:
  1. (1)

    The specular reflection of light off the surface of the oil layer, and more importantly

     
  2. (2)

    The absorption of up-welling light from the water column by the overlying oil layer [5, 7].

     

The contrast of sea areas polluted by oil depends on the form of the oil substance. It is negative, positive, or zero with film, and only positive with suspensions [5].

In the visible region of the electromagnetic spectrum, oil has a higher surface reflectance than water [8]. So, reliable identification of oil spill by remote sensing is only possible if the difference in measured radiance between an oil covered and a clean surface is greater than the background variability.

In the literature, the two main methods of statistical and neural network-based classification have been proposed for detecting oil spills. Solberg et al. and Topouzelis et al. [911] detected oil slicks from SAR images by combining a statistical model with a rule-based approach. Del Frate et al. [12] detected oil spills based on a multilayer perceptron (MLP) neural network using ERS images. However, to the best of the authors' knowledge, automatic oil spill detection by neural networks in optical data is not reported in the literature. We observe that in the case of optical data, where a spectral signature for each pixel is available, a pixel-based algorithm can be designed. This is not possible with SAR where the detection is usually implemented starting from a single-band acquisition. This means that a preliminary image segmentation is necessary to extract the dark spot and an object classification to distinguish between actual oil spill and look-alikes is then applied.

In this article, an attempt has been made to perform ratio operations to enhance the oil spots, extracting the features by an MLP neural network, and demonstrate the potential of Landsat ETM+ data in oil spill monitoring in the Gulf of Mexico based on the optical properties of oil slicks detected by neural network.

Methods

Landsat ETM+ images for the area from 29 N to 27 N and 87 W to 90 W for Gulf of Mexico, acquired in 1st, 10th, and 17th May 2010 were used to study the previously known oil spills [13]. Landsat ETM+ has eight bands which may be combined in various ways by assigning one band to each of the three visible channels: red, green, and blue, to create a false color image.

Subimages containing oil spills were extracted using the Area of Interest tool of ENVI (Environment for Visualizing Images) software to spare disk space and to make classification and image interpretation more expedient and focused.

The data were then projected in Geo (lat/long) projection and WGS84 datum. Further the image was exported into *.TIFF format for further analysis (slick detection, feature extraction, spectral enhancement, and filtering).

The stripes in Landsat ETM+ data (caused by the Scan Line Corrector in the ETM+ instrument failed On May 31, 2003) have been removed by a developed model which identifies stripe positions based on edge-detection and applies line-tracing algorithms. Pixels not affected by striping are used to construct spline functions describing spatial gray level distributions of an image. Detected stripes are corrected by replacing the pixels with more reasonable gray values computed from constructed spline functions.

To produce optimal contrast and variation for color composition of those individual bands, ratio operations were applied to the images with various band combinations. For the classification test, MLPs have been considered, which have been found to have the best suited topology for pixel level classifications [14].

These are feedforward networks, where the input flows only in one direction to the output, and each neuron of a layer is connected to all neurons of the successive layer but has no feedback to neurons in the previous layers [14]. The individual neuron is the elemental building block of each layer, and it is mainly characterized by its activation function. An activation function, φ(v), defines the output of a neuron in terms of the linear combination of inputs, v. There are different kinds of activation functions: the threshold function, the piecewise-linear function, and the sigmoid function [14]. The most commonly used is the sigmoid function. An example of the sigmoid function is the logistic function, defined by Equation (2) where a > 0 is the slope parameter;
ϕ V = 1 ( 1 + e - av )
(1)
The activation functions defined in Equation. (1) range from 0 to +1. It is sometimes desirable to have the activation function range from -1 to +1, in which case the activation function assumes an anti-symmetric form with respect to the origin. For the corresponding form of a sigmoid function, we may use the hyperbolic tangent function, defined by
C V = tanh V
(2)

Allowing an activation function of the sigmoid type to assume negative values as prescribed by Equation (2) has analytic benefits [14]. In our study, the use of hyperbolic tangent function provided better results than those obtained with the sigmoid function of Equation (1).

The net was trained using the back propagation algorithm, which uses a gradient search technique and iteratively adjusts the weight coefficients in the network to minimize an error function equal to the mean square difference between the desired and the actual net output [14].

Result and discussion

The earlier studies carried out by Hu et al. [6] reported that the shorter wavelengths were more sensitive to optical signature of oil; therefore, Landsat ETM+ data in spectral bands B1 (480 nm), B2 (560 nm), B3 (660 nm), B4 (825 nm) have been used in this study [6].

As it is already mentioned, the refractive index of oil is greater than that of sea water, but there are possibilities of masking the data, while performing atmospheric corrections [15]. Therefore, in order to identify the oil spill area, just geometrically corrected data have been used.

To identify the spectral signature of oil spill, a spectral profile along transect (A) was plotted. The visual interpretation in individual bands did not produce significant signature of oil (Figure 1). In order to identify the specific signatures of oil spill by enhancing the contrast, band ratio operations were performed. The ratios were computed using 65 different combinations of bands.
Figure 1

Profile along transect A (1 st May 2010, Original Image).

To determine which combination showed the best contrast between the surface slick and surrounding clear ocean water, we extracted data along several transect lines for each band combination [4]. The ratio combinations that showed excellent contrast (about -0.2, defined as (Rs - Rc)/Rc where "Rs" and "Rc" stand for slick luminance and clear water luminance, whereas the contrast of the original bands are about 0.07) and the most significant for retrieving oil spill are given below:
RS 1 = B 4 / B 2 / B 1
RS 2 = B 3 / B 2 / B 1
RS 3 = B 3 / B 2 / B 1

These combinations can be interpreted based on spectral signature of oil and water background in landsat ETM+. Oil and water background show steadily increasing reflectance spectrum between wavelength 475-675 nm, while they have different level of absorption in 675-800 nm, thus, information of oil spill could be extracted by band ratio of B4/B2 and B3/B2 [8]. There is a decrease in water background reflection from 550-750 nm; so to get some information on background the difference of bands 3 and 2 can be applied. Additionally, the difference between bands 3 and 2 is useful to visualize the data in RGB format.

The shorter wavelength at blue (B1) is normally found to be contaminated with the signatures of biogenic materials. Therefore, for removing biogenic materials effects, the results were normalized with respect to 480 nm [15].

Color-composite images can be created by assigning RS1, RS2, and RS3 values, computed from the Landsat ETM+ data into red, green, and blue channels, respectively. The algorithm developed for oil spill was applied for the data of Gulf of Mexico on 1st, 10th, and 17th May 2010.

Figure 2a presents the ratio image of Gulf of Mexico at May 1st, 2010. The oil spill signatures were very well identified along transect A (Figure 3).
Figure 2

Band ratioing and classification results. (a) Color composite of band ratioing for Landsat ETM+ data collected May 1st, 2010, in the Gulf of Mexico. (b) NN classification result of image (a). (c) Result of thresholding method applied on image (a).

Figure 3

Profile along transect A (1 st May 2010, Image output of band rationing operation).

When thick oil spots are present in clear, deep ocean water, strong yellow and orange tones in the color-composite image are indicative of areas containing them (Figure 2a). These tones result from the relatively higher values of RS1 and RS2 and relatively lower values for RS3 for oil spot as compared to clear, deep ocean water (Figure 3). Blue and purple tones indicate the absence of oil or insufficient quantity or thickness to cause spectral differences associated with oil in spectral bands.

Once the best ratios to discriminate between the pixels contain oil and clear water have been determined, an MLP algorithm for pixel classification has been designed. The result has been compared with the result of Multiband thresholding algorithm which is the combined result of AND operation for thresholded RS1 and RS3 bands.

Several attempts have been made to properly select the number of units to be considered in the hidden layers. The pixels for train/test the net are 80,152 pixels which have been extracted from one image (10th May 2010, Gulf of Mexico) (Figure 4b). The training sets contain 60% and the test sets contain 40% of all pixels which are not belonging to the training set. Pixel selection for train/test set has been done randomly and repeated six times; so the presented results of root mean square error (RMSE) errors are the average of these repetitions for each topology (Figure 5). The topology 3-4-2 has been finally chosen for its good performance in terms of classification accuracy, RMSE, and training time. The number of about 10,000 training cycles was sufficient to get the network learned. The described topology is reported in Figure 6. The input of the net consists of RS1, RS2, and RS3 bands and the output providing classified pixels in terms of oil spill or others and one MLP NNs has been used for classifying all images.
Figure 4

Band ratioing and classification results. (a) Subimages of color composite of band ratioing for Landsat ETM+ data collected May 17th, 2010, in the Gulf of Mexico. (b) Subimages of color composite of band ratioing for Landsat ETM+ data collected May 10th, 2010, in the Gulf of Mexico, used for training the NN. (c) NN classification result of image (a). (d) NN classification result of image (b).

Figure 5

RMSE errors for different NN topologies.

Figure 6

Feedforward topology for an NN.

Accuracy assessment has been carried out considering the other two images (1st May 2010, 17th May 2010). For both of them 5,000 pixels have randomly been selected and then labeling made by visual interpretation. As it has been shown in Table 1, the overall rate of the accurately classified pixels in image collected at 1st May 2010 is 97%, whereas the accuracy of the same image segmented by thresholding method is 43%. Also in the second test image (17th May 2010) the performance of the neural classification is satisfactory with an overall accuracy of 95%.
Table 1

The error matrix of classified images shows the accuracy assessment

Image

Well classified Pixels

Misclassified pixels

Accuracy %

ETM+ (1th, May 2010)

4850

150

97%

ETM+ (10th, May 2010)

4950

50

99%

ETM+ (17th, May 2010)

4750

250

95%

In Figures 2b and 7b, some subareas of image collected at May 1st 2010 have been shown, where the better advantage of NN can clearly be observed based on visual interpretation.
Figure 7

Band ratioing and classification results. (a) Color composite of band ratioing for Landsat ETM+ data collected May 1st, 2010, in the Gulf of Mexico. (b) NN classification result of image (a). (c) Result of thresholding method applied on image (a).

The neural network simulator (SNNS) developed at the University of Stuttgart, Stuttgart, Germany, has been used for the classification algorithm implementation and proved to be a high level and reliable software package [16].

Conclusion

This article presents and demonstrates the potential of Landsat ETM+ to automatically detect and extract oil spill in marine environment. The band ratioing approach used in this study was found to work well for the identification of potential hydrocarbon contaminants in water. It has been observed that the bands difference between bands 3 (660 nm) and 2 (560 nm), division ratio of bands 3 (660 nm) and 2 (560 nm) and division ratio of bands 4 (825 nm) and 2 (560 nm) normalized by band 1 (480 nm) were most suitable to retrieve oil spill.

MLPs with different topologies have been applied to a set of features describing oil spill characteristics in order to perform a supervised classification. Best performances are obtained using an MLP neural network with 3:4:2 topology trained by the standard backpropagation algorithm. This algorithm has been designed for Landsat ETM+ data but it can be applied on different multispectral data (WorldView2, Spot, etc.).

The largest challenge in detection of oil spills in SAR images is accurate discrimination between oil spills and look-alikes. Natural films cannot always be properly distinguished from oil spills based on a SAR image alone but additional information can be derived from optical sensors. Future oil spill systems should incorporate oil spill information from multisensory studies.

Declarations

Authors’ Affiliations

(1)
Information, Systems and Production Department, Tor Vergata University

References

  1. Ornitz BE, Champ MA: Oil Spills First Principles:Prevention and Best Response. Elsevier, Amsterdam; 2002:653.Google Scholar
  2. Kvenvolden KA, Cooper CK: Natural seepage of crude oil into the marine environment. Geo-Marine Lett 2003, 23(3):140-146. 10.1007/s00367-003-0135-0View ArticleGoogle Scholar
  3. Brekke C, Solberg AHS: Oil spill detection by satellite remote sensing. Remote Sens Environ 2005, 95(1):1-13. 10.1016/j.rse.2004.11.015View ArticleGoogle Scholar
  4. Hu C, Li X, Pichel WG, Muller-Kerger E: Detection of natural oil slicks in the NW Gulf of Mexico using MODIS imagery. Geophys Res Lett 2009, 36(1):L01604.View ArticleGoogle Scholar
  5. Otremba Z, Piskozub J: Modelling of the optical contrast of an oil film on a sea surface. Opt Exp 2001, 9(8):411-416. 10.1364/OE.9.000411View ArticleGoogle Scholar
  6. Hu C, et al.: MODIS detects oil spills in Lake Maracaibo, Venezuela. Eos Trans AGU 2003, 84(33):313-317.View ArticleGoogle Scholar
  7. Wettle M, et al.: Assessing the effect of hydrocarbon oil type and thickness on a remote sensing signal: a sensitivity study based on the optical properties of two different oil types and the HYMAP and Quickbird sensors. Remote Sens Environ 2009, 113(9):2000-2010. 10.1016/j.rse.2009.05.010View ArticleGoogle Scholar
  8. Svejkovsky J, Muskat J, Mullin J: Adding a multispectral aerial system to the oil spill response Arsenal. Sea Technol 2009, 50(8):17.Google Scholar
  9. Solberg AHS, Brekke C, Husoy PO: Oil spill detection in Radarsat and Envisat SAR images. IEEE Trans Geosci Remote Sens 2007, 45(3):746-755.View ArticleGoogle Scholar
  10. Solberg AHS, Dokken ST, Solberg R: Automatic detection of oil spills in Envisat, Radarsat and ERS SAR images. Geoscience and Remote Sensing Symposium, IGARSS '03. Proceeding. IEEE International 2003.Google Scholar
  11. Topouzelis KN: Oil spill detection by SAR images: dark formation detection, feature extraction and classification algorithms. Sensors 2008, 8(10):6642-6659. 10.3390/s8106642View ArticleGoogle Scholar
  12. Del Frate F, et al.: Neural networks for oil spill detection using ERS-SAR data. IEEE Trans Geosci Remote Sens 2000, 38(5):2282-2287. 10.1109/36.868885View ArticleGoogle Scholar
  13. USGS2010. [http://hdds.usgs.gov/hdds/]
  14. Bishop CM: Neural Networks for Pattern Recognition. Oxford University Press, Oxford; 1995.MATHGoogle Scholar
  15. Srivastava H, Singh T: Assessment and development of algorithms to detection of oil spills using MODIS data. J Indian Soc Remote Sens 2010, 38(1):161-167. 10.1007/s12524-010-0007-9MathSciNetView ArticleGoogle Scholar
  16. Zell A, et al.: SNNS Stuttgart neural network simulator user manual. Inst. Parallel Distributed High Performance Syst., Univ. Stuttgart, Stuttgart, Germany; 1995.Google Scholar

Copyright

© Taravat and Del Frate; licensee Springer. 2012

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.