- Open Access
Enhancement of acoustic tomography using spatial and frequency diversities
© Mansour; licensee Springer. 2012
- Received: 28 July 2011
- Accepted: 13 June 2012
- Published: 24 October 2012
This article introduces several contributions to enhance an important application such as acoustic tomography (AT), using mainly the spatial and spectral diversities of underwater acoustic signals. Due to their inherited properties, (i.e. spareness, non-stationarity or cyclostationarity, wide-band frequency range, wide range of power, etc.), the process of underwater acoustic signals becomes a real challenge for many scientists and engineers who are involved in studies related to the ocean. For various applications, these studies require huge and daily information. AT techniques remain fast and cheap ways to obtain such data. Nowadays, active acoustic tomography (AAT), is communally used to generate powerful and repetitive acoustic sources. Recently, researchers have been attracted by an alternative way, called passive acoustic tomography (PAT), which uses acoustic opportune signals of their environment. PAT techniques are mainly used for ecological, economical and other reasons such as military applications. With PAT, no signal is emitted; therefore, problems become more challenging. The number and positions of existent sources are unknown, and sensors measure mixtures of available sources. Algorithms based on time or frequency domains are widely deployed to classify, identify, and study received signals in AAT applications. For PAT, researchers employ multiple sensors in order to add an extra dimension, (such as space). This article focuses on approaches used in space along with time or frequency to extract information, improve performances, and simplify the overall architecture. This article explains the use of signal processing and statistical approaches to solve problems raised using PAT and discusses the experimental results. The review of the literature offers a big variety of algorithms to deal with classic AAT problems. Therefore, only problems related to PAT have been considered herein.
- Underwater acoustic applications
- Passive acoustic tomography
- Multipath channel
- Sparseness or non-stationary signals
- Independent component analysis
- High order statistics
Oceans cover more than 70% of the earth surface, roughly containing 97% of all our water supply and playing a major role in global climate regulation and economical systems.
Acoustic tomography (AT) is used in many civil or military applications such as: mapping underwater surfaces, oceanographical, meteorological applications, (to measure the temperature, the salinity, the motion and the depth of the water), to improve sonar technology, as well as other applications. Many algorithms have been developed to deal with active acoustic tomography (AAT).
Emitting sources similar to natural sounds or noises: a set of artificial signals imitating natural sounds, (whales, dolphins, etc.), or noises, (waves, ships, etc.), are generated. The main advantage of such approach remains in the control of the sources and their positions, (similar to active methods). In order to achieve this, researchers could imitate the time-frequency signature of nature signals. However, this method is not totaly discreet as the generated signals may have different high order statistics (HOS), instantaneous power, or frequency than original signals. Besides, artificial signals generally can be characterized by specific patterns, (periodicity, time or statistical coherence, fixed positions, and deterministic motions, etc.). These specific patterns can be used to unmask hidden emitted signals.
Using natural signals: by completely relying on existing natural signals, a PAT system with a high discreet level can be achieved. However, main drawbacks of such system are the lack of information, (number, positions or natures of sources, etc).
Applying hybrid systems: by mixing the previous two strategies, better performances and good discretion levels could be achieved. However, that will results in more complex emitter-receiver systems.
On the one hand, it seems that the 2nd strategy is the more attractive one, (completely discreet systems and no emitters). On the other hand, the problems raised in this case are more challenging because of the total lack of information about the sources. In order to reduce the complexity of this problem, we investigate several advanced signal processing techniques and statistical approaches. In fact, let us assume that we are able to estimate the number of the sources, separate the sources form their mixing observed signals, and evaluate their statistical properties. In this case, the identification of the channel could be, also, investigated. Therefore, remaining PAT problems become very similar to AAT.
The article’s primary purpose is to discuss the preprocess observed mixed signals to extract maximum information about the sources, then, we can apply classic algorithms to deal with residual problems. This article is organized as follows: Section “Acoustic oceanic tomography”, describes AAT and PAT, briefly; Section “Assumption and background, contains the assumptions and mathematical models; Section “Pre-processing systems”, presents the preliminary studies; Section “Adaptive HOS estimators, proposes new HOS estimators, in order to enhance the spatial diversity of original sources; Section “Spatial diversity and independence discrimination criteria, discusses several criteria, so as to exploit the spatial or the spectral diversity of our signals; Section “Blind separation of observed acoustic signals”, presents independent component analysis, (ICA), algorithms to separate mixed observed signals; Section “Experimental results”, shows experimental results; and Section “Conclusion”, presents the conclusions.
Acoustic tomography’s goal is to get a fast and cheap monitoring of water mass and sub-bottom characteristics. This monitoring requires an inversion 2-step procedure. First, estimate the acoustic properties, (such as the water column sound speed profile, 3D structure of internal tides in water masses, geo-acoustic parameters of the seafloor), from the measurement of a known propagated acoustic waveform between fixed sources and receivers. Second, infer some ocean physical parameters from these estimated acoustic characteristics.
Active acoustic tomography
To perform oceanic tomography, an active acoustic emission is propagated between an emitter and a set of receivers on an horizontal track of about 10 km long. Frequencies involved in tomography range is from 30 Hz to a few kHz, whereas, power range is from 180 to 220 dB.
First works in tomography have been only considered deep water channel, (depth deeper than 1 km). In this case and in order to estimate underwater acoustic transmission channel parameters, acoustic refraction is the main physical phenomenon which should be considered.
In the mid 1990s, scientists have extend their interests to shallow water, (i.e. depth less than 300 m),.
In shallow water, an acoustic propagation encounters numerous interactions with the sea surface and the sea floor. Therefore, new techniques had to be developed such as ‘matched field processing’ in and the ‘matched impulse response processing’ in. In their applications, a single input multiple output, (SIMO), configuration is used to extract channel information.
To get efficient results in a SIMO configuration, a large number of sensors should be used which means increasing the experimental setting. To tackle the last problem and using frequency diversity, researchers proposed “matched impulse response processing” methods. In the last case, a wide band signal should be emitted, but a single distant hydrophone could be enough as a receiver. The main idea of such technique consists of estimating the channel impulse response by applying maximum likelihood or matched filter estimations on the known emitted and received signals. Once the channel response filter has been estimated, other features such as time delay or magnitude of arrivals could be extracted. The last features could be used in order to estimate water column and sub-bottom properties.
Passive acoustic tomography
Active acoustic tomography strongly relies on the possibility to emit powerful acoustic signals in the ocean. Major problems can arise. powerful emissions need a heavy power supply which can drastically limit the efficiency of autonomous monitoring systems, thereby causing drastic harm to marine mammals and disturbing their behavior. Finally in a warfare context, some constraints about covertness may exit in the acoustic process. To overcome these problems, the concept of PAT has recently emerged in the community.
Passive acoustic tomography consists in estimating acoustic properties by using natural opportunity sources present in the channel at the time of interest without using active emission. Surface noise created by breaking waves, ship noise, and marine mammal calls are three kinds of opportunistic sources which are under the scope of passive tomography.
The main drawbacks of PAT are the lack of information about the number, positions, and nature of emitted signals. With more than two sources many actual tomography algorithms can’t give satisfactory results. Many others don’t work well or at all when the emitted signals are wide band signals. Some algorithms take into consideration the position of acoustic sound emitters. Typically, in real world PAT applications, underwater acoustic signals are generated by various moving sources whose number and positions are hardly, (or impossible), to be identified, (as in the case of shoal of fish or wave noises). It is obvious that PAT is a quite difficult technique requiring substantial effort in signal processing to tackle the unknowns of source position and emitted waveform as well as to separate the sources present simultaneously in the channel before switching them toward a dedicated blind inversion processor.
In PAT applications, the sources are obviously signals of opportunities which have various properties such as spatial diversity, different probability density functions (pdf), different temporal or spectral structures, different time-frequency signatures, etc. These properties can be used at different level of the separation stage. However, in PAT applications, simple and cheap systems are often used which means that linear multi-sensor antenna are not recommended. Mainly, for this reason, ICA algorithms will be of great importance to reach our goal. ICA algorithms can successfully handle multi-input multi-output (MIMO) channel.
In a previous work, an extensive experimental study has been conducted in order to classify and characterize many recorded anthropogenic signals, (made by human activities as boats, ships, or submarine noises, etc.), and natural signals, (mainly animals sounds or natural noises, such as waves etc.). According to that study, one can add to the above mentioned features, the following ones:
Recorded signals are affected by a background ocean noise which can be considered as an additive white Gaussian noise (AWGN).
Some signals have a very weak kurtosis.
Almost all of the signals are non-stationary signals with more or less cyclic behavior as boat noises.
Natural signals are very sparse ones and artificial ones are very noisy.
The above mentioned properties have been considered to select appropriate ICA algorithms.
Underwater acoustic channel
Underwater sounds are produced by natural or artificial phenomena through forced mass injection leading to inhomogeneous wave equations which can be converted to frequency domain. The frequency-domain wave equation is called the Helmholtz’s equation which gives us an underwater sound propagation model. A general solution of the Helmholtz’s equation is very difficult to obtain. Therefore, researchers use simplified propagation models, (such as the ray theory, the mode theory, the parabolic model, the hybrid model, etc.), according to their applications. The choice of a propagation model depends on many parameters such as wave frequency, the depth of the sea, etc. In our case, (shallow water, i.e. the channel depth is about few hundred meters), our frequency range is from 300 to 10 KHz, the ray theory was the more appropriate propagation model.
The above equation is an empirical relationship satisfied when 0 ≤ T ≤ 30, 30 ≤ S ≤ 40, and D ≤ 8000. In shallow deep underwater channels, (depth less than 300 m), where emitters and receivers are not so close to the water surface nor to the bottom and the distances among emitters and receivers are less than 3 Km, the sound speed could be approximated by a constant.
The reflected acoustic waves on the bottom of the propagation channel depend on many parameters such as the composition and the geometrical properties of the bottom.
The reflected acoustic waves on the top of the propagation channel, i.e. the water surface, depend, also, on many parameters such as the wind, the wave frequency as well as the swell properties. For this reason, the water surface can’t be considered as a flat surface. Therefore, the direction of a reflected acoustic wave is dispersed in the space. However in average term, reflected acoustic waves can be considered as obtained by a flat surface with absorption coefficients. In our model, a flat surface is considered and random coefficients are added to characterize other unknown parameters.
where, (in kHz), T is the water temperature, (°C), S = 3.5% is the water salinity, (in the ocean S ≈ 35g/l), P w is the water pressure, (in kg/m2), A = 2.34 ∗ 10−6and B = 3.38 ∗ 10−6.
From physical point of view, an acoustic ray represents a propagation trajectory of an emitted signal between the source, (emitter), and the receiver. In many cases, the channel depth is limited in size which means that the propagation is multi-rays. Each ray may be bent by refraction if the sound speed is a function of depth and range. Ray trajectories and sound speed profile allow us to compute propagation times. In addition ray trajectories, water attenuation, boundaries roughness and sub-bottom properties allow us to compute the signal magnitude.
From a computational view point, ray trajectory is computed by solving the ‘Eikonal equation’ but signal magnitude is obtained as a result of ‘Transport equation’. As general and analytical solutions of Eikonal and transport equations do not exist, researchers use approximate and simulated results.
Here H(i) denotes the q × p real constant matrix corresponding to the impulse response of the channel at time i and S(n−i) is the source vector at time (n−i).
As it was mentioned before that the processing of acoustic signals is a very challenging problem. To enhance our processing algorithms, pre and post processing systems have been proposed.
Pre- & post-processing
Our sources are bounded in frequency domain. Therefore, a low-pass filter was extremely helpful for us to reduce the impact of the AWGN and, then, achieve better performances. It is worth mentioning that only three tested algorithms have given satisfactory results. These three algorithms, (for further details see the following references[19–23]), were dedicated to separate non-stationary sources (audio or music signals). The last two algorithms[22, 23], which be called in the following SOS and Parra and Alvino, are implemented in frequency domain using discrete frequency adapted filters.
Estimation of source number
It is obvious that the number of sources is an input parameter. ICA algorithms can cope with an overestimate number of sources, (extra separated signals should be residual noises). However, an underestimation of that number can affect seriously overall performances. For this reason, a rough estimation of that number should be considered. To roughly estimate the source number, few approaches have been considered and briefly discussed. Hereinafter, the channel is assumed overdetermined, (i.e. q > p).
Here Σ S stands for the unknown and invertible diagonal covariance matrix of the statistically independent sources. For noise free channel, the rank of Σ X becomes equal to the rank of Σ S otherwise the number of sources.
With an AWGN channel, Σ X becomes a full rank matrix. Without loss of generality, let us assume that noise components have the same variance, then, the q singular values λ i of Σ X will have different values except the last q − p ones. Normally, the first p singular values are linked to signal space and the last q − p ones are related to the noise space. In order to apply this method, one should deal with two problems: How can we estimate the covariance matrix of non-stationary signals an what is the optimal threshold between the two sets of singular values? The estimation of covariance matrices has been conducted over slippery estimation windows, see Section “Adaptive HOS estimators’. Concerning the threshold, it can be easily set when the signal to noise ratio (SNR) is relatively high. Unfortunately, the SNR is our case is not high enough, (i.e. SNR > 2 dB). Therefore, different thresholds have been considered:
If q > p + 5, one can easily set a threshold as the limit between two sets of singular values. This approach requires a very good SNR and q >> p.
To improve the first approach, normalized singular values have been considered, (i.e. λ i have been divided by the maximum λ i ). Experimental results showed that a threshold can be easily set using normalized singular values when SNR is higher than 10 dB and the signatures of sources are relatively the same, (the signature of the i th source on the j th sensor is the power received by that sensor from that source. Therefore, the signature of a source depends on the source power and the channel parameters.). The last two assumptions can’t be, always, satisfied in our application.
Another method was considered: first, the singular values λ i should be sorted in descending order; second, sorted λ i should be divided by λ2. Finally, the number of sources is considered as the number of normalized λ i > ε, where ε depends on SNR. Experimentally, we obtained satisfactory results for SNR higher than 4 dB and ε > 0.1.
By considering that the signals are close to Gaussian ones, one can use Akaike’s information criterion, (AIC), to set the threshold. Even though the gaussianity assumption is a strong one, (underwater acoustic signals are very strong non-stationary signals which can not be considered as gaussian signals), Karhunen et al. shows that obtained results are still satisfactory.
where stands for the extended signal vector and T N (H) is the Sylvester matrix which is full rank under some mild assumptions.
where Hs+is the hypothesis that number of sources is higher than s and the threshold T s should be set so that the allowable probability of false alarm can be achieved.
A main advantage of the last algorithm comparing to previous approaches is that this algorithm can be applied even though the noise are spatially correlated and that it can give a confidence level for the estimated number. The main drawback is the computational effort. In fact, with 2N + 1 receivers, one can only estimate a source number up to N. In the following, we consider that the number of sources is already estimated.
In order to exploit spatial diversity, many blind or semi-blind separation; or identification algorithms uses HOS, in time or frequency domain. For this reason, the estimation of cross cumulants and moments up to the fourth order have been investigated in this section, further details are given in Appendix 1.
k4(X) is a consistent biased estimator of Cum4(X). In previous studies, we proposed and compared estimators for auto-cumulants of second and fourth orders. Here, we propose new adaptive HOS estimators for fourth order cross-cumulants which can be applied on underwater acoustic signals which are non-stationary signals.
where 0 < λ < 1 is a forgotten factor. To evaluate the performances of last estimators, some simulations have been conducted using a non-stationary zero-mean signals. For example, let S(n) be a non-stationary signal that consists of four parts:
S1 is an uniform signal in [-1, 1] with 8,000 samples.
S2 is Gaussian with unit variance and 5,000 samples.
S3 is an uniform signal in [-2, 2] with 3,000 samples.
S4 is Gaussian with a standard deviation and 4,000 samples.
Using S, two other signals have been generated X(n) = S(n) and Y(n) = S3(n), (it is clear that x i and y i are i.i.d and that x i depends on y i ). Using the definition of cumulants and the properties S, we can prove that:
For uniform parts,, here a is the maximum amplitude.
For the Gaussian parts, Cum31(X,Y) = 6σ6.
Finally, x N and y N in Equation (15) have been replaced by their average over a small estimation window, (10 to 50 samples). The above proposed estimators can be improved by considering non iid samples. However, in the last case, a stochastic model with transition probability should be considered. The last statement is beyond the scoop of this manuscript and it will be considered in a future study. Hereinafter, HOS are estimated at different stages using the estimators described in this section.
where s2(n) represents a mixture of all sources except the first one s1(n). The filter is a residual separation filter. The separation is considered achieved when the norm of the residual error h2(z)∗s2(n) becomes much less than the one of the separated signal h1(z)∗s1(n). In addition, the identification or classification of underwater acoustic signals is extraordinarily difficult step because these signals are non-stationary and non-intelligible sparse signals with low variable kurtosis. In this context, the classification of ICA algorithms according to the separation quality becomes a difficult and important task.
The following discrimination criteria can be optimized to maximize the spatial diversity or the independence among estimated signals. At the same time, they can be very useful to quantify the separation achievement. In the last case, these criteria are called performance indices.
To apply the crosstalk, one should have original sources. Therefore, this performance index cannot be applied in real situation where sources are unknown. However it is very useful in simulations.
It is well known that sources can be separated from a convolutive mixture up to a permutation and up to a scalar filter. Therefore, the last definition D r is useless for the BSS convolutive mixture, see Equation (16), since it doesn’t take into consideration the power ratio between the filtered version of the signal ξ1 = h1(z)∗s1(n) and the residual error h2(z)∗s2(n).
Our experimental results show that for a low order channel filter, (<20), this performance index can be used efficiently. When the order of channel is larger than 20, computing time becomes a big issue.
where U = (u1,…,u n ) T is a random vector and P U (V) (resp. are the joint, (resp. marginal), PDF. In the context of BSS problem, the joint and the marginal PDF are unknown but they can be estimated.
Here is the joint PDF estimator and is the marginal PDF estimator. Good results have been obtained with stationary signals, but we couldn’t get similar results for underwater acoustic signals.
Here h is an integrable function from to which satisfies the following two conditions:
h is a non zero almost everywhere and a positive function.
For analytical FCF Φ(Ω), h should be positive around zero and vanish elsewhere.
Here is a square integrable kernel function that its Fourier transform should be non zero almost everywhere and is a scale factor, (i.e. a positive function only depends on the PDF of X i ).
Square Gaussian Kernel
Inverse of Square Gaussian Kernel second derivative function
In our experimental studies, the best results were obtained using the Gaussian Kernel. In fact, the Gaussian Kernel gives the largest possible difference between the quadratic independence measure applied on a vector A with i.i.d uniformly independent components and the quadratic independence measure applied on a vector B = MA, here M is a full rank mixing matrix. The main drawback of such performance index is the important computing time.
Non-linear Kernel decorrelation
We call Cov(X,Y) and Var(X) respectively the covariance and the variance of X and Y . It is worth mentioning that is a vectorial space of all functions applied from to which contents all Fourier transform basis, (i.e. the exponential functions exp(jwx), with). means the independence between X and Y .
Using different kernels, Gaussian, Polynomial and Hermite functions, the NL-decorrelation is applied on source and mixed signals
NL-decorrelation of sources
NL-decorrelation of mixed signals
i.i.d uniform PDF
Kernel ‘Gaussian’ − 23.4
Kernel ‘Gaussien’ −5.8319
Kernel ‘poly’ − 25.5
Kernel ‘poly’ 8.1
Kernel ‘hermite’ − 22.4
Kernel ‘hermite’ −20.4
Four acoustic signals
Kernel ‘poly’ − 33.4
Kernel ‘poly’ 3.2
Kernel ‘poly’ −14.9817
Four acoustic signals
Kernel ‘poly’ − 31.3
Kernel ‘poly’ 8.8
Kernel ‘poly’ −13.2
Simplified non-linear decorrelation
‘Gauss’: Gaussian kernel.
‘poly’: 6th order polynomial Kernel which its coefficients are the components of an unitary vector.
‘atan’: Saturation kernel using arc-tangent function.
‘tanh’: Saturation kernel using hyperbolic tangent function.
Simplified NL-decorrelation applied on source and mixed signals using different kernels
NL-decorrelation of sources
NL-decorrelation of mixed signals
Kernel ‘Gaussian’ − 66.3211
Kernel ‘Gaussian’ −40.6513
Kernel ‘poly’ −49.2054
Kernel ‘poly’ −6.6205
Kernel ‘atan’ −63.2202
Kernel ‘atan’ −0.0802
Kernel ‘tanh’ −52.5625
Kernel ‘tanh’ 0.1597
Four acoustic signals
Kernel ‘atan’ −40.7142
Kernel ‘atan’ 1.5864
Kernel ‘atan’ −31.8532
Four acoustic signals
Kernel ‘tanh’ −86.6931
Kernel ‘tanh’ 1.0391
Kernel ‘tanh’ −57.4885
Independence measure based on the FCF
where g is an adequately chosen function, is the approximation of the score function of X, and Φ(X) is the PDF of zero mean and unite variance Gaussian signal. Our experimental studies show that the computing time is the main drawback of this performance index. We should mention that for stationary signals, this performance index is consistent. Unfortunately, the last intersting property is useless in our application since the acoustic signals are non-stationary signals.
Here k(t,s) is a bounded estimation window. Our experimental studies show that:
The obtained values depend on original sources. This inconvenient is common to previous performance indices.
For beta random variables, good results have been obtained. On the other hand, we noticed bad results for uniform random signals.
For acoustic signals, we noticed good results for instantaneous mixture and bad ones for convolutive mixtures.
Computing time is crucial.
where Γ = (Perfc(X i ,X j )) and. Good results have been obtained using this performance index on instantaneous or convolutive mixture of acoustic signals. However, the computing time is relatively important.
In previous study, we implemented and tested some instantaneous ICA algorithms. According to that study, good results, at least in instantaneous mixture of acoustic underwater signals, can be obtained using ICA algorithms based on HOS or dedicated to non-stationary signals. The algorithms discussed in this section have been selected according to our previous study.
In real applications of PAT, hydrophones could record mixed signals. In order to apply classic AAT algorithms, one should, first, separate the recorded mixed signals. It was mentioned that in PAT applications, MIMO configuration is quite possible. In this case, the sources could be generated and recorded at different locations. This spatial diversity could be translated into statistical independence. Since the early of 1990s, ICA, has been considered as a set of important signal processing tools[40–42]. By assuming that the unknown p emitted signals, (i.e. sources), are statistically independent from each other, ICA consists on retrieving a set of independent signals, (output signals), from the observation of unknown mixtures of the p sources. It was proved that the output signals can be the sources up to a factor, (or filter), scale and up to a permutation.
Due to long and sparse impulse response of acoustic underwater channels and acoustic underwater signals’ features, (i.e. non-stationary, close to Gaussian, sparseness, etc.), see Section “Assumption and background’, many ICA algorithms couldn’t achieve the separation of sources in our application. Every selected and implemented algorithm has been evaluated using the following steps: we, first, used the same, (or similar), signals to the ones originally proposed by the authors of that algorithm. Second, an algorithm should be run over some simulated scenarios using a set of non-stationary signals, (normally speech signals), in memoryless or simple convolutive channels. Algorithms that give good, (or at least satisfactory), results in the first two stages have been selected in our project.
Best experimental results were obtained using two frequency domain ICA Algorithms[22, 23] based on the minimization of second order statistics criteria in frequency-domain. These two algorithms exploit the spatial and the spectral diversity of the original signals. In the following, the major tested algorithms are briefly described.
Blind estimation of time delay
It is worth to be mentioned that the authors proposed in another version of their algorithm. However, we didn’t implemented the latest version of the algorithm, for the simple reason that the first version of algorithm didn’t give satisfactory results in our application. In fact, underwater acoustic channel is more complex than the model considered by the authors.
In the early 1990s, Nguyen and Jutten[45–47] were the first to propose an ICA algorithm to separate a convolutive mixture of speech signals. The first version of the algorithm consists on the minimization of a cost function as the mathematical expectation of an odd nonlinear function evaluated over the estimated signals. Later on, they proposed another cost function as the sum of fourth order cross-cumulants. To prevent a matrix invertible problem, they proposed a recursive structure which can only deal with a mixture of two sources. The latest constraint can be easily avoided by using our recursive system proposed in. In addition, the algorithms proposed by Nugyen et al. can be, easily, implemented and they have been used to separate speech signals. For these reasons, we decided to implement these algorithms.
In addition to different versions originally proposed by the authors, we implemented hybrid structures, (i.e. a minimization of cost function based on a weighted sum of their different cost functions). Unfortunately, our experimental studies show that the algorithm, in all implemented versions, is not helpful to reach our goal. In fact the performance of the separation were not satisfactory due to the particularity of our application. It is worth mentionning that the convergence of the algorithm was a critical point in many cases.
Natural gradient applied to entropy maximization
In order to characterize and localize the developing of material defects, acoustic emission analysis (AEA) is used. To improve the performance of their AEA, Kosel et al. have processed observed signals by using an ICA algorithm proposed earlier by Amari and Cardoso based on the natural gradient minimization algorithm proposed in, and introduced independently by Cardoso and Laheld under the name of relative gradient.
In the context of our project, many simulations have been conducted. According to our experimental studies, these algorithms can render good results for stationary signals and for relatively short channel filters, (i.e. low order filters). Unfortunately, divergence problems or non satisfactory results were often observed when the signals were sparse non stationary ones and the channel filter was very long as in our application.
Blind separation of non stationary signals
H(z) is a full rank stable filter matrix and it has no zero on the unit circle.
The sources are zero-mean non-stationary signals.
The sources have different auto-covariance r i (n,m) = E(s i (n)s i (n−m)) which should be a time function.
where stands for time delay and R Y (n) = E(Y(n)Y T (n)).
The convergence needed a huge number of samples. Besides, obtained results were not always satisfactory. The performances of the algorithm depended on the source signals as well as the transmission channel. The algorithm was a time and memory consuming.
A frequency domain method for BSS of convolutive audio mixture (SOS)
where X im (w) is the Fourier transform of the observed signals, and J is the number of estimated windows such that L J < L m and J L J > L m .
where F R (w,m) and F I (w,m) are the real and the imaginary parts of Equation (36). Finally, the minimization was done using a conjugate gradient algorithm.
Convolutive blind separation of non-stationary sources
Using the structure proposed in Figure2, many simulations have been conducted. Generally, over 500,000–1,000,000 samples were needed to achieve the separation. The original sources were sampled at 44 KHz. In almost all the simulations, the separation of artificial or natural signals have been successfully achieved. In these simulations, we have set the channel depth between 100 to 500 m, the distances among the sources or the sensors were among 30 to 100 m, the distances among the different sources and the divers sensors are from 1,500 to 2,500 m, the number of sensors is strictly higher than the number of sources.
In this article, several signal processing contributions applied on real world application such as the PAT, have been presented. Many simulations have been conducted and experimental studies showed the necessity of considering pre-processing and post processing of the observed signals in order to achieve properly the separation of the sources.
Many algorithms have been implemented and tested. However, few algorithms which are dedicated to the separation of non-stationary signals, give us satisfactory results. In a real scenario of warfare applications, the use of any ICA algorithm becomes very challenging. In fact, many ICA algorithms can not achieve satisfactory results when:
Most of the signals are close to Gaussian ones.
Sources have very inhomogeneous power, (the power ratio can be up to a dozen of dB).
SNR can be very limited depending on operational situations.
Even though ICA algorithms can handle convolutive mixtures. However, in our applications, the channel filter orders can be up to few thousand. At the same time, such a filter is a very sparse one. In fact, just few filter parameters do not vanish.
Our future work consists on developing an ICA algorithm which can use other features of acoustic signals such as sparseness along with non-stationarity, etc.
Appendix 1: HOS estimators
where is the estimator of the r th order moment at the k th iteration.
Appendix 2: adaptive unbiased estimator of 4th order cumulant
A part of this work was supported by the French Military Center for Hydrographic & Oceanographic Studies, (SHOM i.e. Service Hydrographique et Océanographique de la Marine, Centre Militaire d’Océanographie).
- Gervaise C, Quinquis A, Martins N: Time frequency approach of blind study of acoustic submarine channel and source recognition. In Physics in Signal and Image Processing, PSIP 2001. Marseille, France; January 2001.Google Scholar
- Munk W, Worcester P, Wunsch C: Ocean Acoustic Tomography. Cambridge University Press, Cambridge,; 1995.View ArticleGoogle Scholar
- Baggeroer AB, Kuperman WA, Mikhalevsky PN: An overview of matched field methods in ocean acoustics. IEEE J. Oceanic Eng 1993, 18: 4.View ArticleGoogle Scholar
- Chapman NR, Lindsay CE: Matched-field inversion for geoacoustic model parameters in shallow water. IEEE J. Oceanic Eng 1996, 21: 4.View ArticleGoogle Scholar
- Hermand JP: Broad-band geoacoustic inversion in shallow water from waveguide impulse response measurements on a single hydrophone: theory and experimental results. IEEE J. Oceanic Eng 1999, 24: 1.View ArticleGoogle Scholar
- Michalopoulou ZH: Estimating the impulse response of ocean: correlation versus deconvolution, in Inverse problems in underwater acoustics. Springer, Paris and Milan and Barcelone,; 2001.Google Scholar
- Gervaise C, Vallez S, Ioana O, Staphan Y, Simard Y: Passive acoustic tomography: review, new concepts and application using marine mammals. J. Mar. Biol. Assoc. U. K 2007, 87: 5-10. 10.1017/S0025315407054872View ArticleGoogle Scholar
- Martins N, Jesus S, Gervaise C, Quinquis A: A time-frequency approach to blind deconvolution in multipath underwater channels. In Proceedings of International Conference on Acoustics Speech and Signal Processing 2002, ICASSP 2002. Orlando, Florida, USA; 13–17 May 2002.Google Scholar
- Gaucher D, Gervaise C: Feasibility of passive oceanic acoustic tomography: a Cramer Rao bounds approach. In Oceans 2003 Marine Technology and Ocean Science Conference. San Diego, USA; 22–26 September 2003. pp. 56–60Google Scholar
- Gaucher D, Gervaise C, LE Flock H: Contributions to passive acoustic oceanic tomography. In 7me Journes d’Acoustique Sous-Marine. Brest, France;Google Scholar
- Mansour A, Jutten C: What should we say about the kurtosis. IEEE Signal Process. Lett December 1999, 6(2):321-322.View ArticleGoogle Scholar
- Jensen FB, Kuperman WA, Porter MB, Schmidt H: Computational ocean acoustics. Springer-Verlag, New York, London, Tokyo,; 2000.MATHGoogle Scholar
- Etter P: Recent advances in underwater acoustic modelling and simulation. J. Sound Vib 2001, 240(2):351-383. 10.1006/jsvi.2000.3212View ArticleGoogle Scholar
- Etter P: Underwater acoustic modeling principles, techniques and applications. Elsevier, New York,; 1991.MATHGoogle Scholar
- Lurton X: Introduction to underwater acoustics principles and applications. Springer, London,; 2002.Google Scholar
- Brekhovskikh LM, Lysanov YP: Fundamentals of ocean acoustics. Springer Verlag, New York,; 2003.MATHGoogle Scholar
- Shulkin M, Marsh HW: Sound absorption in sea water. J. Acoustical Soc. Am 1962, 134: 864-865.View ArticleGoogle Scholar
- Etter PC: Underwater acoustic modeling and simulation. Spon Press Editor, London, UK,; 2003.View ArticleMATHGoogle Scholar
- Kawamoto M, Matsuoka K, Ohnishi N: A method of blind separation for convolved non-stationary signals. Neurocomputing 1998, 22: 157-171. 10.1016/S0925-2312(98)00055-1View ArticleMATHGoogle Scholar
- Kawamoto M, Kardec Barros A, Mansour A, Matsuoka K, Ohnishi N: Real world blind separation of convolved non-stationary signals. In First International Workshop on Independent Component Analysis and signal Separation (ICA99). Edited by: Cardoso JF, Jutten Ch, loubaton Ph. Aussois, France; 11–15 January 1999. pp. 347–352Google Scholar
- Rahbar K, Reilly J: Blind separation of convolved sources by joint approximate diagonalization of cross-spectral density matrices. In Proceedings of International Conference on Acoustics Speech and Signal Processing 2001, ICASSP 2001. Salt Lake City, Utah, USA; May 7–11 2001.Google Scholar
- Rahbar K, Reilly J: A frequency domain method for blind source separation of convolutive audio mixtures. IEEE Trans. Speech Audio Process 2005, 13(5):832-844.View ArticleGoogle Scholar
- Parra L, Alvino CV: Convolutive blind separation of non-stationnary sources. IEEE Trans. Speech Audio Process May 2000, 8(3):320-327. 10.1109/89.841214View ArticleGoogle Scholar
- Mansour A, Jutten C, Loubaton Ph: Subspace method for blind separation of sources and for a convolutive mixture model. In European Signal Processing Conference. Elsevier, Triest, Italy; September 1996. pp. 2081–2084Google Scholar
- Kailath T: Linear systems. Prentice Hall, New Jersey,; 1980.MATHGoogle Scholar
- Karhunen J, Cichocki A, Kasprazak W, Pajunen P: On neural blind source separation with noise suppression and redundancy reduction. Int. J. Neural Syst April 1997, 8(2):219-237.View ArticleGoogle Scholar
- Mansour A: A mutually referenced blind multiuser separation of convolutive mixture algorithm. Signal Process November 2001, 81(11):2253-2266.View ArticleMATHGoogle Scholar
- Chen W, Reilly JP, Wong KM: Detection of the number of signals in noise with banded covariance matrices. IEE Proc- Radar, sonar and Navogation October 1996, 143(5):289-294.View ArticleGoogle Scholar
- Kendall M, Stuart A: The advanced theory of statistics: Design and analysis, and time-series. Charles Griffin & Company Limited, London,; 1961.MATHGoogle Scholar
- Martin A, Mansour A: Comparative study of high order statistics estimators. In International Conference on Software, Telecommunications and Computer Networks. Split (Croatia), Dubrovnik (Croatia), Venice (Italy); October 10–13 2004. pp. 511–515Google Scholar
- Tan Y, Wang J, Zurada JM: Nonlinear blind source separation using a radial basis function network. IEEE Trans. Neural Networks January 2001, 12(1):124-134. 10.1109/72.896801View ArticleGoogle Scholar
- Pham D-T: Fast algorithm for estimating mutual information, entropies and score functions. In 4th International Workshop on Independent Component Analysis and blind Signal Separation, ICA2003. Nara, Japan; 1–4 April 2003. pp. 17–22Google Scholar
- Rosenblatt M: A quadratic measure of deviation of two-dimensional density estimates and a test of independence. Ann. Stat 1975, 3(1):1-14. 10.1214/aos/1176342996MathSciNetView ArticleMATHGoogle Scholar
- Kankainen A: Consistent testing of total independence based on empirical characteristic functions. Ph.D. thesis, University Jyvaskyla 1995Google Scholar
- Achard S, Pham D-T, Jutten C: Quadratic dependence measure for nonlinear blind sources separation. In 4th International Workshop on Independent Component Analysis and blind Signal Separation, ICA2003. Nara, Japan; 1–4 April 2003. pp. 263–268Google Scholar
- Bach FR, Jordan MI: Finding clusters in independent component analysis. In 4th International Workshop on Independent Component Analysis and blind Signal Separation, ICA2003. Nara, Japan; 1–4 April 2003. pp. 891–896Google Scholar
- Feuerverger A: A consistent test for bivariate dependence. Int. Stat. Rev 1993, 61(3):419-433. 10.2307/1403753View ArticleMATHGoogle Scholar
- Murata N: Properties of the empirical characteristic function and its application to testing for independence. In Third International Workshop on Independent Component Analysis and signal Separation (ICA2001). San Diego, California, USA; 9–12 December 2001. pp. 295–300Google Scholar
- Mansour A, Gervaise C: ICA applied to passive ocean acoustic tomography. WSEAS Trans. on Acoustics and Music April 2004, 1(2):83-89.Google Scholar
- Cardoso JF, Comon P: Independent component analysis, a survey of some algebraic methods. In International Symposium on Circuits and Systems Conference, volume 2. Atlanta, USA; May 1996. pp. 93–96Google Scholar
- Mansour A, Kardec Barros A, Ohnishi N: Blind separation of sources: Methods, assumptions and applications. IEICE Trans Fundam Electron, Commun and Comput Sci August 2000, E83-A(8):1498-1512.Google Scholar
- Jutten C, Karhunen J: Advances in nonlinear blind source separation. In 4th International Workshop on Independent Component Analysis and blind Signal Separation, ICA2003. Nara, Japan; 1–4 April 2003. pp. 245–256Google Scholar
- Comon P: Independent component analysis, a new concept? Signal Process April 1994, 36(3):287-314.View ArticleMATHGoogle Scholar
- Emile B, Comon P: Estimation of time delays between unknown colored signals. Signal Process 1998, 69: 93-100. 10.1016/S0165-1684(98)00061-9View ArticleGoogle Scholar
- Nguyen Thi L, Jutten C, Caelen J: Separation aveugle de parole et de bruit dans un mlange convolutif. In Actes du XIIIème colloque GRETSI. Juan-Les-Pins, France; September 1991. pp. 737–740Google Scholar
- Nguyen Thi L, Jutten C, Caelen J: Speech enhancement: Analysis and comparison of methods in various real situations. In European Signal Processing Conference. Edited by: Vandewalle J, Boite R, Moonen M, Oosterlinck A. Elsevier, Brussels, Belgium; August 1992. pp. 303–306Google Scholar
- Nguyen Thi L, Jutten C: Blind sources separation for convolutive mixtures. Signal Process 1995, 45(2):209-229. 10.1016/0165-1684(95)00052-FView ArticleMATHGoogle Scholar
- Kardec Barros A, Mansour A, Ohnishi N: Removing artifacts from ECG signals using independent components analysis. NeuroComputing 1999, 22: 173-186.View ArticleMATHGoogle Scholar
- Kosel T, Grabec I, Kosel F: Time delay estimation of acoustic emission signals using ICA. Ultrasonics 2002, 40: 303-306. 10.1016/S0041-624X(02)00111-7View ArticleGoogle Scholar
- Amari SI, Cardoso JF: Blind source separation-semiparametric statistical approach. IEEE Trans. on Signal Process November 1997, 45(11):2692-2700.View ArticleGoogle Scholar
- Amari SI: Neural learning in structured parameter spaces: Natural Riemannian Gradient. In Neural Information Processing System-Natural and Synthetic. San Diego, Colorado, USA; 2–7 December 1996.Google Scholar
- Cardoso JF, Laheld B: Equivariant adaptive source separation. IEEE Trans. Signal Process December 1996, 44(12):3017-3030. 10.1109/78.553476View ArticleGoogle Scholar
- Douglas SC, Cichocki A, Amari SI: Multichannel blind separation and deconvolution of sources with arbitrary distributions, in the book Neural Networks for Signal Processing. In IEEE Workshop on Neural Networks for Signal Processing. New York; September 1997. pp. 436–445Google Scholar
- Cichocki A, Douglas SC, Amari S: Robust techniques for independent component analysis (ICA) with noisy data. NeuroComputating 1998, 22: 113-129. 10.1016/S0925-2312(98)00052-6View ArticleMATHGoogle Scholar
- Matsuoka K, Oya M, Kawamoto M: A neural net for blind separation of nonstationary signals. Neural Networks 1995, 8(3):411-419. 10.1016/0893-6080(94)00083-XView ArticleGoogle Scholar
- Kawamoto M, Matsuoka K, Oya M: Blind separation of sources using temporal correlation of the observed signals. IEICE Trans. Fundam Electron, Commun. Comput Sci April 1997, E80-A(4):111-116.Google Scholar
- Noble B, Daniel JW: Applied linear algebra. Prentice-Hall, New Jersey,; 1988.MATHGoogle Scholar
- McCullagh P: Tensor methods in statistics. Chapman and Hall, London,; 1987.MATHGoogle Scholar
- Shiryayev AN: Probability. Springer Verlag, London,; 1984.View ArticleGoogle Scholar
- Papoulis A: Probability, random variables, and stochastic processes. McGraw-Hill, New York,; 1991.MATHGoogle Scholar
- Kotz S, Johnson NL: Encyclopedia of statistical sciences. University of Amesterdam, Amesterdam,; 1993.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.