Skip to main content

Compressive sampling of swallowing accelerometry signals using time-frequency dictionaries based on modulated discrete prolate spheroidal sequences

Abstract

Monitoring physiological functions such as swallowing often generates large volumes of samples to be stored and processed, which can introduce computational constraints especially if remote monitoring is desired. In this article, we propose a compressive sensing (CS) algorithm to alleviate some of these issues while acquiring dual-axis swallowing accelerometry signals. The proposed CS approach uses a time-frequency dictionary where the members are modulated discrete prolate spheroidal sequences (MDPSS). These waveforms are obtained by modulation and variation of discrete prolate spheroidal sequences (DPSS) in order to reflect the time-varying nature of swallowing acclerometry signals. While the modulated bases permit one to represent the signal behavior accurately, the matching pursuit algorithm is adopted to iteratively decompose the signals into an expansion of the dictionary bases. To test the accuracy of the proposed scheme, we carried out several numerical experiments with synthetic test signals and dual-axis swallowing accelerometry signals. In both cases, the proposed CS approach based on the MDPSS yields more accurate representations than the CS approach based on DPSS. Specifically, we show that dual-axis swallowing accelerometry signals can be accurately reconstructed even when the sampling rate is reduced to half of the Nyquist rate. The results clearly indicate that the MDPSS are suitable bases for swallowing accelerometry signals.

1 Introduction

Continuous monitoring of physiological functions such as swallowing can pose severe constraints on data acquisition and processing systems. Even when sampling physiological signals at low rates (e.g., 250 Hz), we end up with close to a million of samples in the first hour of monitoring. Similar computational burdens are ever-present in telemedicine, and in recent years we have witnessed numerous efforts to deal with this problem. One such effort is to compress the acquired signals immediately upon sampling using various schema (e.g. [1]). The other is to rethink the way we acquire the data, and a number of recent publications have begun looking at this approach (e.g., [25]).

The idea of compressive sensing (CS) has gained considerable attention in recent years. The main idea behind CS is to diminish the number of steps involved when acquiring data by combining sampling and compression into a single step [3, 4]. Specifically, CS enables one to acquire the data at sub-Nyquist rates, and recover it accurately from such sparse samples [3].

In this article, we propose an approach for CS of swallowing accelerometry signals based on a time-frequency dictionary. In particular, the members of the dictionary are recently proposed modulated discrete spheroidal sequences (MDPSS) [6, 7]. The bases within the time-frequency dictionary are obtained by modulation and variation of the bandwidth of discrete prolate spheroidal sequences (DPSS) to reflect the vaying time-frequency nature of many biomedical signals, including the swallowing acclerometry signals considered in this article. Using the proposed approach, we carry out a numerical analysis of synthetic test signals and real swallowing accelerometry signals. The numerical analysis using the synthetic test signals showed that the CS approach based on MDPSS was more accurate than the CS approach based on DPSS (e.g., [7, 8]). Additionally, the analysis of swallowing accelerometry signals showed that we can obtain 90% cross-correlation between the reconstructed signals and the actual signals using only 50% percent of samples. This has been observed for three different types of swallowing tasks.

The article is organized as follows: Section 2 describes swallowing accelerometry and outlines the advantages of this approach for detecting swallowing difficulties. In Section 3, we describe the proposed approach for CS using the time-frequency based dictionary consisting of MDPSS bases. Section 4 reports the data analysis steps that we carried out to obtain the reported results, which are presented in Section 5 along with the discussion of the same results. The conclusions are drawn in Section 6.

2 Swallowing accelerometry

Swallowing (deglutition) is a complex process of transporting food or liquid from the mouth to the stomach consisting of four phases: oral preparatory, oral, pharyngeal, and esophageal [9]. Dysphagic patients (i.e., patients suffering from swallowing difficulty) usually deviate from the well-defined pattern of healthy swallowing. Dysphagia frequently develops in stroke patients, head injured patients, and patients with others with paralyzing neurological diseases [10]. Patients with dysphagia are prone to choking and aspiration (the entry of material into the airway below the true vocal folds) [9]. Aspiration and dysphagia may lead to serious health sequelae including malnutrition and dehydration [11, 12], degradation in psychosocial well-being [13, 14], aspiration pneumonia [15], and even death [16].

The videofluoroscopic swallowing study (VFSS) is used widely in today's dysphagia management and it represent the gold standard for assessment [9, 17]. However, VFSS requires expensive X-ray equipment as well as expertise from speech-language pathologists and radiologists. Hence, only a limited number of institutions can offer VFSS and the procedure has been associated with long waiting lists [18, 19]. In addition, day-to-day monitoring of dysphagia is crucial due to the fact that the severity of dysphagia can fluctuate over time and VFSS is not suitable for such day-to-day monitoring.

Cervical auscultation is a promising non-invasive tool for the assessment of swallowing disorders [20] involving the examination of swallowing signals acquired via a stethoscope or other acoustic and/or vibration sensors during deglutition [21]. Swallowing accelerometry is one such approach and employs an accelerometer as a sensor during cervical auscultation. Swallowing accelerometry has been used to detect aspiration in several studies, which have described a shared pattern among healthy swallow signals, and verified that this pattern is either absent, delayed or aberrant in dysphagic swallow signals [2234].

However, these previous studies used single-axis accelerometers and exclusively monitored vibrations propagated in the anterior-posterior direction at the cervical region. Proper hyolaryngeal movement with precise timing during bolus transit is vital for airway protection in swallowing [9]. Since the motion of the hyolaryngeal structure during swallowing occurs in both anterior-posterior (A-P) and superior-inferior (S-I) directions, the employment of dual-axis accelerometry seems well motivated. Since correlation has been reported between the extent of laryngeal elevation and the magnitude of the A-P swallowing accelerometry signal [35], it is hypothesized that vibrations in the S-I axis also capture useful information about laryngeal elevation. From a physiological stand point, the S-I axis appears to be as worthy of investigation as the A-P axis because the maximum excursion of the the hyolaryngeal structure during swallowing is of similar magnitude in both the anterior and superior directions [36, 37]. Recent contributions have indeed confirmed that dual-axis accelerometers yield more information and enhance analysis capabilities [3843].

2.1 Data

Sample signals used in this article were collected from 408 participants (ages 18-65) over a 3 month period from a public science centre in Toronto, Ontario, Canada. All participants provided written consent and had no documented swallowing disorders. The research ethics boards of the Toronto Rehabilitation Institute and Holland Bloorview Kids Rehabilitation Hospital (both located in Toronto, Ontario, Canada) approved the study protocol.

To collect data from participants, we used a dual-axis accelerometer (ADXL322, Analog Devices), which was attached to the participant's neck (anterior to the cricoid cartilage) using double-sided tape. The axes of acceleration were aligned to the anterior-posterior and superior-inferior directions. Data were band-pass filtered in hardware with a pass band of 0.1-3,000 Hz and sampled at 10 kHz using a custom LabVIEW program running on a laptop computer. With the accelerometer attached, each participant was cued to perform five saliva swallows (i.e., dry swallows), five water swallows by cup with their chin perpendicular to the floor (i.e., wet swallows) and five water swallows in the chin-tucked position. The entire data collection session lasted 15 min per participant.

3 Proposed scheme

Traditional signal processing approaches for sensing and processing of information have relied on the Shannon sampling theorem, which states that a bandlimited signal x(t) can be reconstructed from uniform samples {x(kT s )}:

x ( t ) = k x ( k T s ) sin ( Ω max ( t - k T s ) / π ) Ω max ( t - k T s ) / π
(1)

where T s is the sampling period and Ωmax represents the maximum frequency present in the signal. In other words, the Shannon sampling theorem states that in order to ensure accurate representation and reconstruction of a signal with Ωmax, we should sample it at least at 2Ωmax samples per second (i.e., the Nyquist rate). However, many recent publications have challenged this approach for a number of reasons (e.g., [44, 45]). First, by using the Shannon sampling theorem we rely on bases of infinite support, while we generally reconstruct signal samples in the finite domain [44]. Second, large bandwidth values can severely constraint sampling architectures [45]. Third, even when we consider signals with a relatively low band-width values such as swallowing accelerometry signals, continuous monitoring of swallowing function can produce large number of redundant samples, which severely constraints our processing efforts.

A recently proposed idea of CS resolves some of the aforementioned issues [35]. CS is a method closely related to transform coding, since a transform code converts input signals, embedded in a high-dimensional space, into signals that lie in a space of significantly smaller dimensions (e.g., wavelet and Fourier transforms) [4]. CS approaches are particularly suited for K-sparse signals, i.e., signals that can be represented by significant K coefficients over an N-dimensional basis. Encoding of a K-sparse, discrete-time signal of dimension N is accomplished by computing a measurement vector y that consists of M << N linear projections of the vector x. This can be compactly described via

y = Φ x
(2)

where Φ represents an M × N matrix and is often refer to as the sensing matrix [4]. A natural formulation of the recovery problem is within an norm minimization framework, which seeks a solution to the problem

min x 0 subject to y - Φ x 2 < η
(3)

where η is the expected noise of measurements, ||x||0 counts the number of nonzero entries of x and || • ||2 is the Euclidian norm. Unfortunately, the above minimization is not suitable for many applications as it is NP-hard [46]. To avoid the computational burden, approaches like thresholding, (orthogonal) matching pursuit and basis pursuits have been proposed [46]. In this article, we will focus on the matching pursuit [47].

Given the CS framework, the immediate question is how to define the sensing matrix Φ, that is the bases used in the recovery of the signal. Most commonly used sensing matrices are random matrices with independent identically distributed (i.i.d.) entries formed by sampling either a Gaussian distribution or a symmetric Bernoulli distribution [48]. Previous publications have shown that these matrices can recover the signal with high probability [48]. However, when dealing with biomedical signals, we would like to "precisely" recover the signals (i.e., with a very small error). Therefore, we propose to use a time-frequency dictionary (also known as frames [49]) based on modulated discrete prolate spheroidal sequences (MDPSS).

3.1 Time-frequency dictionaries based on MDPSS

To understand MDPSS, let's begin with a general description of discrete prolate spheroidal sequences (DPSS). Given N such that n = 0, 1, . . . , N −1 and the normalized half-bandwidth, W such that 0 < W < 0.5, the k th DPSS, v k (n, N, W ), is defined as the real solution to the system of equations [50]:

m = 0 N - 1 sin [ 2 π W ( n - m ) ] π ( n - m ) v k ( m , N , W ) = λ k ( N , W ) v k ( n , N , W ) k = 0 , 1 , , N - 1
(4)

with λ k (N, W ) being the ordered non-zero eigenvalues of (4)

λ 0 ( N , W ) > λ 1 ( N , W ) , , λ N - 1 ( N , W ) > 0 .
(5)

Slepian showed that behaviour of these eigenvalues for fixed k and large N is given by

1 - λ k ( N , W ) ~ π k ! 2 14 k + 9 4 α 2 k + 1 4 [ 2 - α ] - ( k + 0 . 5 ) N k + 0 . 5 e - γ N
(6)

where

α = 1 - cos ( 2 π W ) γ = log 1 + 2 ( α ) 2 - α

The first 2NW eigenvalues are very close to 1 while the rest rapidly decays to zero [50]. Interestingly enough, it has been observed that these quantities are also the eigenvalues of N × N matrix C(m, n) [50], where the elements of such a matrix are

C ( m , n ) = sin [ 2 π W ( n - m ) ] π ( n - m ) m , n = 0 , 1 , , N - 1
(7)

and the vector obtained by time-limiting the DPSS, v k (n, N, W ), is an eigenvector of C(m, n). The DPSS are doubly orthogonal, that is, they are orthogonal on the infinite set {−∞, . . . , } and orthonormal on the finite set {0, 1, . . . , N − 1}, that is,

- v i ( n , N , W ) v j ( n , N , W ) = λ i δ i j
(8)
n = 0 N - 1 v i ( n , N , W ) v j ( n , N , W ) = δ i j
(9)

where i, j = 0, 1, . . . , N − 1. The sequences also obey symmetry laws

v k ( n , N , W ) = ( - 1 ) k v k ( N - 1 - n , N , W )
(10)
v k ( n , N , W ) = ( - 1 ) k v N - 1 - k ( N - 1 - n , N , 1 / 2 - W )
(11)

where n = 0, ± 1, ± 2, . . . and k = 0, 1, . . . , N − 1.

If these DPSS are used for signal representation, then usually accurate and sparse representations are obtained when both the DPSS and the signal under investigation occupy the same band (e.g., [6, 51]). However, problems arise when the signal is centered around some frequency o | > 0 and occupies bandwidth smaller than 2W. In such situations, a larger number of DPSS is required to approximate the signal with the same accuracy despite the fact that narrowband signals are more predictable then wider band signals [7, 52]. In order to find a better basis, MDPSS were proposed in [6, 7]. MDPSS are defined as

M k ( N , W , ω m ; n ) = exp ( j ω m n ) v k ( N , W ; n )
(12)

where ω m = 2πf m is a modulating frequency. It is easy to see that MDPSS are also doubly orthogonal, obey the same Equation (4) and are bandlimited to the frequency band [−W + ω m : W + ω m ].

The next question which needs to be answered is how to choose a proper modulation frequency ω m . In the simplest case when the spectrum S(ω) of the signal is confined to a known band [ω1; ω2], i.e.,

S ( ω )  =  0 ω [ ω 1 , ω 2 ] and | ω 1 | < | ω 2 | 0 elsewhere
(13)

then the modulating frequency, ω m , and the bandwidth of the DPSSs are naturally defined by

ω m = ω 1 + ω 2 2
(14)
W = ω 2 - ω 1 2
(15)

as long as both satisfy:

ω m + W < 1 2 .
(16)

However, in practical applications, exact frequency band is known only with a certain degree of accuracy and usually evolves in time. Therefore, only some relatively wide frequency band is expected to be known. In such situations, an approach based on one-band-fits-all may not produce a sparse and accurate approximation of the signal. In order to resolve this problem it was suggested to use a band of bases with different widths to account for time-varying bandwidths [53]. However, such representation once again ignores the fact that the actual signal bandwidth could be much less then 2W dictated by the bandwidth of the DPSS. In order to provide further robustness to the estimation problem we suggest to use of a time-frequency dictionary containing bases which reflect various bandwidth scenarios.

To construct this time-frequency dictionary, it is assumed that an estimate of the maximum frequency is available. The first few bases in the dictionary are the actual traditional DPSS with bandwidth W. Additional bases could be constructed by partitioning the band [−ω; ω] into K subbands with the boundaries of each subband given by [ω k ; ωk+1], where 0 ≤ kK − 1, ωk+1> ω k , and ω0 = −ω, ωK-1= ω. Hence, each set of MDPSS has a bandwidth equal to ωk+1ω k and a modulation frequency equal to ω m = 0.5(ω k + ωk+1).

Obviously, a set of such function again forms a basis of functions limited to the bandwidth [−ω; ω]. While particular partition is arbitrary for every level K ≥ 1, we can chose to partition the bandwidth in any desired way as shown in Figure 1. In this article, we partition the bandwidth in equal blocks, as shown in Figure 1d, to reduce amount of stored pre-computed DPSS. In general, finding the best partitioning approach would be based on a priori knowledge about the phenomenon under investigation. Unless such knowledge is available, there is no strong reason for us to believe that non-uniform approaches shown in Figures 1a-c would yield a better performance than the uniform partitioning scheme shown in Figure 1d without extensive optimization procedures. However, such investigations are beyond the scope of this manuscript.

Figure 1
figure 1

Different approaches to form a 4-band time-frequency dictionary based on MDPSS. (a)-(c) demonstrate non-uniform blocks, while (d) represents a uniform-block division.

3.2 Matching pursuit and MDPSS-based frames

As mentioned at the beginning of Section 3, the CS approaches can be NP-hard, which are not practically viable. Fortunately, efficient algorithms, known generically as matching pursuit [47, 49], can be used to avoid some of the computational burden associated with the CS. The main feature of the algorithm is that when stopped after a few steps, it yields an approximation using only a few basis functions [47]. The matching pursuit decomposes any signal into a linear expansion of waveforms that are selected from a redundant dictionary of functions [47]. It is a general, greedy, sparse function approximation scheme with the squared error loss, which iteratively adds new functions (i.e. basis functions) to the linear expansion. In comparison to a basis pursuit it significantly reduces the computational complexity, since the basis pursuit minimizes a global cost function over all bases present in the dictionary [47]. If the dictionary is orthogonal the method works perfectly. Also, to achieve compact representation of the signal, it is necessary that the atoms are representative of the signal behaviour and that the appropriate atoms from the dictionary are chosen.

The algorithm for the matching pursuit starts with initial approximation for the signal, x ^ , and the residual, R:

x ^ ( 0 ) ( m ) = 0
(17)
R ( 0 ) ( m ) = x ( m )
(18)

where m represent the M time indices that are uniformly or non-uniformly distributed. Then, the matching pursuit builds up a sequence of sparse approximation by reducing the norm of the residue, R = x ^ - x . At stage k, it identifies the dictionary atom that best correlates with the residual and then adds to the current approximation a scalar multiple of that atom, such that

x ^ ( k ) ( m ) = x ^ ( k - 1 ) ( m ) + α k ϕ k ( m )
(19)
R ( k ) ( m ) = x ( m ) - x ^ ( k ) ( m )
(20)

where α k = ( R ( k - 1 ) ( m ) , ϕ k ( m ) ) /|| ϕ k ( m ) | | 2 . The process continues till the norm of the residual R(k)(m) does not exceed required margin of error ε > 0: ||R(k)(m)|| ≤ ε[47].

Here we can consider two stopping approaches. One is based on the idea that the normalized mean square error should be below a certain threshold value, γ:

| | x - x ^ ( k ) | | 2 2 | | x | | 2 2 γ
(21)

An alternative stopping rule can mandate that the number of bases, n B , needed for signal approximation should satisfy n B K. In previous contributions (e.g., [6]), is set equal to 2NW + 1 to compare the performance of the MDPSS-based frames with DPSS.

In either case, a matching pursuit approximates the signal using L bases as

x ( n ) = l = 1 L x ( m ) , ϕ l ( m ) ϕ l ( n ) + R ( L ) ( n )
(22)

where ϕ l are L bases from the dictionary with the strongest contributions.

3.3 Estimation of sampling times

Based on the definition of MDPSS, we are expected to know when the sampling times occur in order to use a proper value of the basis function. However, this assumption is typically not realized and we need to estimate the time location. Therefore, let us assume that the signal

x ( t ) = m = 0 M - 1 x ( t ^ m ) δ ( t - t ^ m ) + n ( t )
(23)

is a superposition of M delta functions with additive noise n(t) resulting from the nonuniform sampling. To estimate t ^ m let us first consider the period extension of the signal:

x ( t ) = k = - X k e j k Ω o t + n ( t )
(24)

where Ω o = 2π/T and the Fourier coefficients are given by

X k = m = 0 M - 1 x ( t ^ m ) e - j k Ω o t ^ m = m = 0 M - 1 x ( t ^ m ) u m k - ( M - 1 ) k ( M - 1 )
(25)

where u m = e - j Ω o t ^ m . The problem is them to find the parameters t ^ m that satisfy the above equation from the noisy nonuniform samples, which can be achieved using the annihilating filter [2, 44, 54]. In particular, if the transfer function of the annihilating filter is defined as

A ( z ) = m = 0 M - 1 ( 1 - u m z - 1 ) = m = 0 M - 1 α m z - m
(26)

then by filtering both sides of Equation (25) using the filter, we get

m = 0 M - 1 α m X k - m = m = 0 M - 1 n = 0 N - 1 x ( t ^ n ) u n k - m α m = m = 0 M - 1 x ( t ^ n ) n = 0 N - 1 u n - m α m u n k
(27)

where the last term is due to u n being a root of A(z). Then, A(z) can be obtained by solving Equation (27) for {α m } (i.e., set Equation (27) equal to zero and solve for filter coefficients). Using the roots of A(z), u m = e - j Ω o t ^ m / T , the nonuniform sampling time are estimated by

t ^ m = - T 2 π j log u m m = 0 , , M - 1
(28)

A thorough description of the procedure can be found in [2, Appendices 1 and 2].

4 Data analysis

Our data analysis consists of two parts. In the first part, we consider the synthetic test signals in order to examine the accuracy of the scheme in well-known conditions. In the second part, we use dual-axis swallowing accelerometry signals to examine how accurately we can recover these signals from sparse samples. In both cases, we will follow the procedure shown in Figure 2.

Figure 2
figure 2

A flow chart for the proposed algorithm.

4.1 Synthetic test signals

To analyze the proposed scheme, we assumed the following test signal:

x ( n ) = i = 1 10 A i sin ( 2 π f i n T s ) + σ ζ ( n )
(29)

where 0 ≤ n < N, T s = 1/256, N = 256, A i is uniformly drawn from random values in 0[2] and f i ~ N(30, 102). ζ(n) represents white Gaussian noise and σ is its standard deviation.

The first experiment consists of maintaining 150 samples equally spaced throughout the signal. The SNR values are varied between 0 and 30 dB in 1-dB increments, while the normalized half-bandwidth W is altered between 0.300 and 0.375 in 0.025 increments. We compared the accuracy of the proposed approach using 7- and 15-band MDPSS-based dictionaries against the CS approach based on DPSS. The accuracy was compared by evaluating the normalized mean square error:

MSE = | | x ( n ) - x ^ ( n ) | | 2 2 | | x ( n ) | | 2 2
(30)

where x(n) is a realization of the signal defined by Equation (29) and x ^ ( n ) represents a recovered signal. The MSE values were obtained using 1,000 realizations. To calculate the recovered signal using the DPSS, we used the following formula

x ^ DPSS ( n ) = U ( n , k ) U ( m , k ) T U ( m , k ) U ( m , k ) T x ( m )
(31)

where A denotes the pseudo-inverse of a matrix; U(n, k) is the matrix containing K bases (i.e., DPSS) and each sequence is of length N; m denotes the time instances at which the samples are available.

In the second experiment, we vary the number of available samples from 50 samples to 200 samples in increments of 10 samples in order to understand how the number of samples affects the overall accuracy of the proposed scheme. The samples are uniformly distributed, and the normalized half-bandwidth is set to 0.30. The lower boundary of 50 samples denotes a very aggressive scheme, as it represents approximately 20% of the original samples. On the other hand, the upper boundary of 200 samples represents a very lenient scheme for compressive sampling since it represents approximately 78% of the original samples. Additionally, we use the following four SNR values: 5, 15, 25 and 35 dB. The accuracy of the proposed CS-approach is examined using a 7- and 15-band MDPSS based dictionaries against the CS-approach based on DPSS. The accuracy metric is the MSE value defined by Equation (30) and 1,000 realizations are used to obtain its values.

The third experiment examines the effects of non-uniform sampling times on the overall performance of the CS-based schemes. In particular, we use 100 non-uniform samples and the SNR values were incremented by 1 dB from 0 to 30 dB. Also, the normalized half-bandwidth is varied in 0.025 increments from 0.30 to 0.375. The accuracy of the proposed approach based on MDPSS is compared against the CS-approach based on DPSS. Specifically, we use 7- and 15-band MDPSS-based time-frequency dictionaries. The accuracy metric is again the MSE value defined by Equation (30). 1,000 realizations are used again to obtain the MSE values, and for each realization new 100 time positions are achieved.

4.2 Swallowing accelerometry signals

Using the proposed scheme, we analyze how accurately we can recover dual-axis swallowing accelerometry signals from sparse samples. Specifically, we assume two different scenarios: only 30% of the original samples are available and only 50% of the original samples are available. In both cases, we examine whether the uniform or non-uniform sub-Nyquist rates have significant effects on the overall effectiveness of the proposed scheme. In this numerical experiment, we use a 10-band MDPSS based dictionary with the normalized half-bandwidth equal to 0.15. To evaluate the effectiveness of the proposed approach when considering dual-axis swallowing accelerometry signals, we adopted performance metrics used in other biomedical applications (e.g., [5, 55, 56]). Those metrics are:

  • Cross-correlation (CC): CC is used to evaluate the similarity between the original and the reconstructed signal, and is defined as:

    CC = n = 1 N ( x ( n ) - μ x ) ( x ^ ( n ) - μ x ^ ) n = 1 N ( x ( n ) - μ x ) 2 n = 1 N ( x ^ ( n ) - μ x ^ ) 2 × 100 %
    (32)

    where x(n) is the original signal and x ^ ( n ) represents a reconstructed signal. In addition, µ x and μ x ^ denote the mean values of x(n) and x ^ ( n ) , respectively.

  • Percent root difference (PRD): PRD measures distortion in reconstructed biomedical signals, and is defined as:

    PRD ( % ) = n = 1 N ( x ( n ) - x ^ ( n ) ) 2 n = 1 N x 2 ( n ) × 100 %
    (33)
  • Root mean square error (RMSE): RMSE also measures distortion and is often beneficial to minimize this metric when finding the optimal approximation of the signal. RMSE is defined as:

    RMSE = n = 1 N ( x ( n ) - x ^ ( n ) ) 2 N
    (34)
  • Maximum error (MAXERR): MAXERR is used to understand the local distortions in the reconstructed signal, and it particularly denotes the largest error between the samples of the original signal and the reconstructed signal. The metric is defined as:

    MAXERR = max ( x ( n ) - x ^ ( n ) )
    (35)

In order to establish statistical significance of our results, a non-parametric inferential statistical method known as the Mann-Whitney test was used [57], which assesses whether observed samples are drawn from a single population (i.e., the null hypothesis). For multi-group testing, the extension of the Mann-Whitney test known as the Kruskal-Wallis was used [58]. A 5% significance was used.

5 Results and discussion

In this section, we present the results of numerical experiments and discuss those results. First, we will discuss the results based on the synthetic test signals. In the second part, we will discuss the results of numerical experiments considering the application of the proposed approach to dual-axis swallowing accelerometry signals.

5.1 Synthetic test signals

The results of the first numerical experiment are shown in Figure 3. Several observations are in order. First, the proposed approach for CS based on the time-frequency dictionary containing MDPSS achieved more accurate signal reconstructions than the CS approach based on DPSS. This can be observed regardless of the initial bandwidth used for discrete prolate sequences. Second, the CS approaches based on both MDPSS and DPSS bases provide similar accuracy at very low SNR values (e.g., SNR < 5 dB), which is consisted with previous publications which showed that the accuracy of CS diminishes as SNR decreases [59].

Figure 3
figure 3

The effects of increasing initial bandwidth of discrete prolate sequences: (a) W = 0.300; (b) W = 0.325; (c) W = 0.350; (d) W = 0.375. The dashed lines denotes MSE obtained with the DPSS; the solid line indicates MSE obtained with a 15-band MDPSS-based dictionary; and the solid line with squares denotes a 7-band MDPSS-based dictionary.

The results of the second simulation are shown in Figure 4. As expected, CS approaches based on MDPSS and DPSS have similar accuracies for a low SNR value (i.e., SNR = 5 dB) as shown in Figure 4a. Both types of bases (i.e., MDPSS and DPSS) are not suitable for accurate representations of random variables, and possibly dictionaries based on random bases would be a more suitable approach for low SNR values. As SNR increases, the MSE decreases for both approaches and the CS approach based on MDPSS obtains higher accuracy. The results also showed that if the percent of available samples is below 30 (i.e., we are acquiring signals at rates that are 30% of the original Nyquist rate), the DPSS and MDPSS based schemes achieve similar accuracy.

Figure 4
figure 4

Increasing number of samples used in CS while altering the SNR values: (a) SNR = 5 dB; (b) SNR = 15 dB; (c) SNR = 25 dB; and (d) SNR = 35 dB. The dashed lines denotes MSE obtained with the DPSS; the solid line indicates MSE obtained with a 15-band MDPSS-based dictionary; and the solid line with squares denotes a 7-band MDPSS-based dictionary.

The results of third numerical experiment are summarized in Figure 5. They clearly depict the advantage of the CS approach based on the MDPSS over the approach based on DPSS even non-uniform sampling is used. For all four considered cases, we achieved more accurate results with MDPSS than with DPSS. Additionally, more accurate results are achieved when we use a 15-band dictionary than the 7-band dictionary. This is in accordance with the previous results shown in Figure 3, which also showed that more comprehensive dictionaries can provide more accurate results due to the fact that they can account for many different time-varying bandwidth scenarios.

Figure 5
figure 5

The effects of random time positions of samples on the accuracy of the proposed scheme while altering the bandwidth of discrete prolate sequences: (a) W = 0.300; (b) W = 0.325; (c) W = 0.350; (d) W = 0.375. The dashed lines denotes MSE obtained with the DPSS; the solid line indicates MSE obtained with a 15-band MDPSS-based dictionary; and the solid line with squares denotes a 7-band MDPSS-based dictionary.

5.2 CS of swallowing accelerometry signals

Tables 1, 2, 3 and 4 depict the results of the numerical analysis when the proposed scheme is applied to dual-axis swallowing accelerometry signals. Sample signals are shown in Figure 6.

Table 1 Performance of the proposed method for recovery of dual-axis swallowing accleremetry signals when considering 30% of samples and a uniform sampling scheme
Table 2 Performance of the proposed method for recovery of dual-axis swallowing accleremetry signals when considering 30% of samples and a non-uniform sampling scheme
Table 3 Performance of the proposed method for recovery of dual-axis swallowing accleremetry signals when considering 50% of samples and a uniform sampling scheme
Table 4 Performance of the proposed method for recovery of dual-axis swallowing accleremetry signals when considering 50% of samples and a non-uniform sampling scheme
Figure 6
figure 6

Sample wet swallow from a healthy participant: (a) the original signal in the A-P direction; (b) the original signal in the S-I direction; (c) the recovered signal in the A-P direction (50% samples, CC = 99.7%); (d) the recovered signal in the S-I direction (50% samples, CC = 99.8%); (e) the error between the original and the recovered signal in the A-P direction; (f) the error between the original and the recovered signal in the S-I direction.

Several observations are in order. First, we achieved very high agreement between the reconstructed data and the original signals with uniformly spread out samples. Statistically higher results were achieved with 50% of samples than with 30% of samples when considering the CCs results (p << 0.01), which resulted in statistically lower errors with 50% of samples when considering the three error metrics (p << 0.01).

Second, statistically worse results have been obtained when using non-uniform (random) sampling times (p << 0.01) in comparison to uniform sampling for both 30% of samples and 50% of samples. This result is expected, as it becomes more challenging to recover the signal accurately with non-uniform samples. Additionally, it is difficult to recover swallowing vibrations accurately, given that these vibrations are short-duration transients. Unless the non-uniform samples capture the behavior of these short-duration transients, a larger recovery error is achieved. However, with 50% of samples, we still obtain very high agreement between the recovered data and the original signals. As a matter of fact, the results obtained with 50% of samples with non-uniform sampling are comparable to the results obtained with 30% of samples when using uniform sampling.

Third, amongst the considered swallowing tasks, dry swallows tend to be recovered most accurately, followed by the wet swallows and lastly by the wet chin down swallows. From a physiological point of view, this is expected since during the dry swallowing manoeuver only small amounts of liquid (i.e., saliva) are swallowed. It is also expected that wet chin down swallows will be more difficult to recover due to the complex maneuvering required during these swallows, which may introduce signal components otherwise not present during the dry and/or wet swallowing tasks.

Therefore, based on the presented results, we can state with high confidence that CS based on the time-frequency dictionary containing MDPSS is suitable scheme for dual-axis swallowing acceleromtry signals. Particularly accurate results have been obtained when we use 50% of samples. We expect that further improvements can be achieved by optimizing the parameters of the recovery process with respect to the considered error metrics.

6 Conclusion

In this article, a CS algorithm for accurate reconstruction of dual-axis swallowing accelerome-try signals from sparse samples was proposed. The proposed algorithm uses a time-frequency dictionary based on MDPSS. The modulating of DPSS was performed in order to account for the time-varying nature of the dual-axis swallowing accelerometry signals. The proposed CS algorithm was tested using both synthetic test signals and swallowing accelerometry signals. In both cases, we achieved very accurate representations with MDPSS, which makes these bases suitable for CS approaches of swallowing accelerometry signals. Specifically, we showed that even when the dual-axis swallowing accelerometry signals were subsampled at by 50% below the Nyquist rate, we still achieved very accurate representations of these signals.

References

  1. Brechet L, Lucas M-F, Doncarli C, Farina D: Compression of biomedical signals with mother wavelet optimization and best-basis wavelet packet selection. IEEE Trans Biomed Eng 2007, 54(12):2186-2192.

    Article  Google Scholar 

  2. Vetterli M, Marziliano P, Blu T: Sampling signals with finite rate of innovation. IEEE Trans Signal Process 2002, 50(6):1417-1428. 10.1109/TSP.2002.1003065

    Article  MathSciNet  Google Scholar 

  3. Donoho DL: Compressed sensing. IEEE Trans Inf Theory 2006, 52(4):1289-1306.

    Article  MathSciNet  Google Scholar 

  4. Dai W, Milenković O: Subspace pursuit for compressive sensing signal reconstruction. IEEE Trans Inf Theory 2009, 55(5):2230-2249.

    Article  Google Scholar 

  5. Poh K-K, Marziliano P: Compressive sampling of EEG signals with finite rate of innovation. EURASIP J Adv Signal Process 2010, 2010: 12. Article ID 183105

    Article  Google Scholar 

  6. Sejdić E, Luccini M, Primak S, Baddour K, Willink T: Channel estimation using dpss based frames. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008. Las Vegas, Nevada, USA; 2008:2849-2852.

    Chapter  Google Scholar 

  7. Oh J, Senay S, Chaparro LF: Signal reconstruction from nonuniformly spaced samples using evolutionary Slepian transform-based POCS. EURASIP J Adv Signal Process 2010, 2010: 12. Article ID 367317

    Article  Google Scholar 

  8. Davenport MA, Wakin MB: Reconstruction and cancellation of sampled multiband signals using discrete prolate spheroidal sequences. In Proc of Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS11). Edinburgh, Scotland, UK; 2011:61.

    Google Scholar 

  9. Logemann JA: Evaluation and Treatment of Swallowing Disorders. 2nd edition. PRO-ED, Austin, Texas, USA; 1998.

    Google Scholar 

  10. Miller AJ: The Neuroscienti c Principles of Swallowing and Dysphagia. Singular Pub-lishing Group, San Diego, USA; 1999.

    Google Scholar 

  11. Curran JE: Nutritional Considerations Dysphagia: Diagnosis and Management. Butterworth-Heinemann, Boston, USA; 1992:255-266.

    Google Scholar 

  12. Smithard DG, O'Neill PA, Park C, Morris J, Wyatt R, England R, Martin DF: Complications and outcome after acute stroke: does dysphagia matter? Stroke 1996, 27(70):1200-1204.

    Article  Google Scholar 

  13. Riensche LL, Lang K: Treatment of swallowing disorders through a multidisciplinary team approach. Educat Gerontol 1992, 18(3):277-284. 10.1080/0360127920180309

    Article  Google Scholar 

  14. Ekberg O, Hamdy S, Woisard V, Wuttge-Hannig A, Ortega P: Social and psychological burden of dysphagia: Its impact on diagnosis and treatment. Dysphagia 2002, 17(2):139-146. 10.1007/s00455-001-0113-5

    Article  Google Scholar 

  15. Miller RM: Clinical Examination for Dysphagia Dysphagia: Diagnosis and Management. Butterworth-Heinemann, Boston, USA; 1992:143-162.

    Google Scholar 

  16. Ding R, Logemann JA: Pneumonia in stroke patients: a retrospective study. Dysphagia 2000, 15(2):51-57.

    Article  Google Scholar 

  17. Tabaee A, Johnson P, Gartner CJ, Kalwerisky K, Desloge RB, Stewart M: Patient-controlled comparison of flexible endoscopic evaluation of swallowing with sensory testing (FEESST) and videofluoroscopy. Laryngoscope 2006, 116(5):821-825. 10.1097/01.mlg.0000214670.40604.45

    Article  Google Scholar 

  18. Ramsey DJC, Smithard DG, Kalra L: Can pulse oximetry or a bedside swallowing assessment be used to detect aspiration after stroke? Stroke 2006, 37(12):2984-2988. 10.1161/01.STR.0000248758.32627.3b

    Article  Google Scholar 

  19. Steele C, Allen C, Barker J, Buen P, French R, Fedorak A, Day S, Lapointe J, Lewis L, MacKnight C, McNeil S, Valentine J, Walsh L: Dysphagia service delivery by speech-language pathologists in Canada: results of a national survey. Canadian J Speech Lang Pathol Audiol 2007, 31(4):166-177.

    Google Scholar 

  20. Cichero JAY, Murdoch BE: The physiologic cause of swallowing sounds: answers from heart sounds and vocal tract acoustics. Dysphagia 1998, 13(1):39-52. 10.1007/PL00009548

    Article  Google Scholar 

  21. Youmans SR, Stierwalt JAG: An acoustic profile of normal swallowing. Dysphagia 2005, 20(3):195-209. 10.1007/s00455-005-0013-1

    Article  Google Scholar 

  22. Reddy NP, Costarella BR, Grotz RC, Canilang EP: Biomechanical measurements to characterize the oral phase of dysphagia. IEEE Trans Biomed Eng 1990, 37(4):392-397. 10.1109/10.52346

    Article  Google Scholar 

  23. Reddy NP, Canilang EP, Casterline J, Rane MB, Joshi AM, Thomas R, Candadai R: Noninvasive accelaration measurements to characterize the pharyngeal phase of swallowing. J Biomed Eng 1991, 13: 379-383. 10.1016/0141-5425(91)90018-3

    Article  Google Scholar 

  24. Prabhu DNF, Reddy NP, Canilang EP: Neural networks for recognition of acceleration patterns during swallowing and coughing. In Proc of the 16th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Engineering Ad-vances: New Opportunities for Biomedical Engineers). Volume 2. Baltimore, MD, USA; 1994:1105-1106.

    Chapter  Google Scholar 

  25. Gupta V, Prabhu DNF, Reddy NP, Canilang EP: Spectral analysis of acceleration signals during swallowing and coughing. Proc of the 16th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Engineering Advances: New Opportunities for Biomedical Engineers) 1994, 2: Baltimore, MD, USA, 1292-1293.

    Article  Google Scholar 

  26. Suryanarayanana S, Reddy NP, Canilang EP: A fuzzy logic diagnosis system for classification of pharyngeal dysphagia. Int J Bio-Med Comput 1995, 38(3):207-215. 10.1016/S0020-7101(05)80002-9

    Article  Google Scholar 

  27. Reddy NP, Thomas R, Canilang EP, Casterline J: Toward classification of dysphagic patients using biomechanical measurements. J Rehabil Res Dev 1994, 31(4):335-344.

    Google Scholar 

  28. Reddy NP, Katakam A, Gupta V, Coppenger J, Simcox D, Marmon C, Canilang EP, Stephenson L, Barengo R, England E, Gavula J, Royed V, Freshwater B, Whit-lock M, Hooverman M: Noninvasive measurement of dysphagia: simultaneous acceleration measurements during videofluorography. In Proc of the 18th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Bridging Disciplines for Biomedicine). Volume 1. Amsterdam, Netherlands; 1996:142-143.

    Chapter  Google Scholar 

  29. Joshi AC, Reddy NP: Fractal analysis of acceleration signals due to swallowing. In Proc of the First Joint BMES/EMBS Conference. Volume 2. Atlanta, GA, USA; 1999:12.

    Google Scholar 

  30. Das A, Reddy NP, Narayanan J: Hybrid fuzzy logic committee neural networks for recognition of swallow acceleration signals. Comput Methods Prog Biomed 2001, 64(2):87-99. 10.1016/S0169-2607(00)00099-7

    Article  Google Scholar 

  31. Chau T, Casas M, Berall G, Kenny D: Testing the stationarity and normality of paediatric aspiration signals. In Proc of the Second Joint EMBS/BMES Conference. Volume 1. Houston, TX, USA; 2002:186-187.

    Google Scholar 

  32. Chau T, Chau D, Casas M, Berall G, Kenny DJ: Investigating the stationarity of paediatric aspiration signals. IEEE Trans Neural Sys Rehabil Eng 2005, 13(1):99-105. 10.1109/TNSRE.2004.841384

    Article  Google Scholar 

  33. Lee J, Blain S, Casas M, Kenny D, Berall G, Chau T: A radial basis classifier for the automatic detection of aspiration in children with dysphagia. J NeuroEng Rehabil 2006, 3(140):17.

    Google Scholar 

  34. Lee J, Blain S, Casas M, Kenny D, Berall G, Chau T: A radial basis function classifier for pediatric aspiration detection. In Proc of 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS '06). Volume 2006. New York City; 2006:3553-3556.

    Google Scholar 

  35. Reddy NP, Katakam A, Gupta V, Unnikrishnan R, Narayanan J, Canilang EP: Measurements of acceleration during videofluorographic evaluation of dysphagic patients. Med Eng Phys 2000, 22(6):405-412. 10.1016/S1350-4533(00)00047-3

    Article  Google Scholar 

  36. Kim Y, McCullough GH: Maximum hyoid displacement in normal swallowing. Dysphagia 2008, 23(3):274-279. 10.1007/s00455-007-9135-y

    Article  Google Scholar 

  37. Ishida R, Palmer JB, Hiiemae KM: Hyoid motion during swallowing: factors affecting forward and upward displacement. Dysphagia 2002, 17(4):262-272. 10.1007/s00455-002-0064-5

    Article  Google Scholar 

  38. Lee J, Steele CM, Chau T: Time and time-frequency characterization of dual-axis swal-lowing accelerometry signals. Physiol Meas 2008, 29(9):1105-1120. 10.1088/0967-3334/29/9/008

    Article  Google Scholar 

  39. Sejdić E, Steele CM, Chau T: Segmentation of dual-axis swallowing accelerometry signals in healthy subjects with analysis of anthropometric effects on duration of swallowing activities. IEEE Trans Biomed Eng 2009, 56(4):1090-1097.

    Article  Google Scholar 

  40. Sejdić E, Komisar V, Steele CM, Chau T: Baseline characteristics of dual-axis swallowing accelerometry signals. Ann Biomed Eng 2010, 38(3):1048-1059. 10.1007/s10439-009-9874-z

    Article  Google Scholar 

  41. Sejdić E, Steele CM, Chau T: Understanding statistical persistence of dual-axis swallowing accelerometry signals. Comput Biol Med 2010, 40(11):839-844. 10.1016/j.compbiomed.2010.09.002

    Article  Google Scholar 

  42. Damouras S, Sejdić E, Steele CM, Chau T: An on-line swallow detection algorithm based on the quadratic variation of dual-axis accelerometry. IEEE Trans Signal Process 2010, 58(6):3352-3359.

    Article  MathSciNet  Google Scholar 

  43. Sejdić E, Steele CM, Chau T: Scaling analysis of baseline dual-axis cervical accelerometry signals. Comput Methods Prog Biomed 2011, 103(3):113-120. 10.1016/j.cmpb.2010.06.010

    Article  Google Scholar 

  44. Senay S, Chaparro LF, Durak L: Reconstruction of nonuniformly sampled time-limited signals using prolate spheroidal wave functions. Signal Process 2009, 89(12):2585-2595. 10.1016/j.sigpro.2009.04.020

    Article  Google Scholar 

  45. Mamaghanian H, Khaled N, Atienza D, Vandergheynst P: Compressed sensing for realtime energy-efficient ECG compression on wireless body sensor nodes. IEEE Trans Biomed Eng 2011, 58(9):2456-2466.

    Article  Google Scholar 

  46. Rauhut H, Schnass K, Vandergheynst P: Compressed sensing and redundant dictionaries. IEEE Trans Inf Theory 2008, 54(5):2210-2219.

    Article  MathSciNet  Google Scholar 

  47. Mallat SG, Zhang Z: Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 1993, 41(12):3397-3415. 10.1109/78.258082

    Article  Google Scholar 

  48. Candes EJ, Wakin MB: An introduction to compressive sampling. IEEE Signal Process Mag 2008, 25(2):21-30.

    Article  Google Scholar 

  49. Kovačević J, Chabira A: Life beyond bases: the advent of thes frames (part I). IEEE Signal Process Mag 2007, 24(4):86-104.

    Article  Google Scholar 

  50. Slepian D: Prolate spheroidal wave functions, Fourier analysis, and uncertainty-V: the discrete case. The Bell Syst Tech J 1978, 57(5):1371-1430.

    Article  Google Scholar 

  51. Zemen T, Mecklenbräuker CF: Time-variant channel estimation using discrete prolate spheroidal sequences. IEEE Trans Signal Process 2005, 53(9):3597-3607.

    Article  MathSciNet  Google Scholar 

  52. Proakis J: Digital Communications. 4th edition. McGraw-Hill, New York; 2001.

    Google Scholar 

  53. Zemen T, Hofstetter H, Steinbnock G: Successive Slepian subspace projection in time and frequency for time-variant channel estimation. In 14th IST Mobile and Wireless Summit. Dresden, Germany; 2005:1-4.

    Google Scholar 

  54. Blu T, Dragotti P-L, Vetterli M, Marziliano P, Coulot L: Sparse sampling of signal innovations. IEEE Signal Process Mag 2008, 25(2):31-40.

    Article  Google Scholar 

  55. Boucheham B, Ferdi Y, Batouche MC: Recursive versus sequential multiple error measures reduction: a curve simplification approach to ECG data compression. Comput Methods Prog Biomed 2006, 81(2):162-173. 10.1016/j.cmpb.2005.11.008

    Article  Google Scholar 

  56. Scholkmann F, Spichtig S, Muehlemann T, Wolf M: How to detect and reduce movement artifacts in near-infrared imaging using moving standard deviation and spline interpolation. Physiol Meas 2010, 31(5):649-662. 10.1088/0967-3334/31/5/004

    Article  Google Scholar 

  57. Mann HB, WhitneyL DR: On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 1947, 18(1):50-60. 10.1214/aoms/1177730491

    Article  Google Scholar 

  58. Kruskal WH, Wallis WA: Use of ranks in one-criterion analysis of variance. J Am Stat Assoc 1952, 47(260):583-621.

    Article  Google Scholar 

  59. Candés E, Romberg JK, Tao T: Stable signal recovery from incomplete and inaccurate measurements. Commun Pure Appl Math 2006, 59(8):1207-1223. 10.1002/cpa.20124

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ervin Sejdić.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Sejdić, E., Can, A., Chaparro, L.F. et al. Compressive sampling of swallowing accelerometry signals using time-frequency dictionaries based on modulated discrete prolate spheroidal sequences. EURASIP J. Adv. Signal Process. 2012, 101 (2012). https://doi.org/10.1186/1687-6180-2012-101

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1687-6180-2012-101

Keywords