Publications

Refine Results

(Filters Applied) Clear All

Speech enhancement using sparse convolutive non-negative matrix factorization with basis adaptation

Published in:
INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

Summary

We introduce a framework for speech enhancement based on convolutive non-negative matrix factorization that leverages available speech data to enhance arbitrary noisy utterances with no a priori knowledge of the speakers or noise types present. Previous approaches have shown the utility of a sparse reconstruction of the speech-only components of an observed noisy utterance. We demonstrate that an underlying speech representation which, in addition to applying sparsity, also adapts to the noisy acoustics improves overall enhancement quality. The proposed system performs comparably to a traditional Wiener filtering approach, and the results suggest that the proposed framework is most useful in moderate- to low-SNR scenarios.
READ LESS

Summary

We introduce a framework for speech enhancement based on convolutive non-negative matrix factorization that leverages available speech data to enhance arbitrary noisy utterances with no a priori knowledge of the speakers or noise types present. Previous approaches have shown the utility of a sparse reconstruction of the speech-only components of...

READ MORE

Supervector LDA - a new approach to reduced-complexity i-vector language recognition

Published in:
INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

Summary

In this paper, we extend our previous analysis of Gaussian Mixture Model (GMM) subspace compensation techniques using Gaussian modeling in the supervector space combined with additive channel and observation noise. We show that under the modeling assumptions of a total-variability i-vector system, full Gaussian supervector scoring can also be performed cheaply in the total subspace, and that i-vector scoring can be viewed as an approximation to this. Next, we show that covariance matrix estimation in the i-vector space can be used to generate PCA estimates of supervector covariance matrices needed for Joint Factor Analysis (JFA). Finally, we derive a new technique for reduced-dimension i-vector extraction which we call Supervector LDA (SV-LDA), and demonstrate a 100-dimensional i-vector language recognition system with equivalent performance to a 600-dimensional version at much lower complexity.
READ LESS

Summary

In this paper, we extend our previous analysis of Gaussian Mixture Model (GMM) subspace compensation techniques using Gaussian modeling in the supervector space combined with additive channel and observation noise. We show that under the modeling assumptions of a total-variability i-vector system, full Gaussian supervector scoring can also be performed...

READ MORE

Vocal-source biomarkers for depression - a link to psychomotor activity

Published in:
INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

Summary

A hypothesis in characterizing human depression is that change in the brain's basal ganglia results in a decline of motor coordination. Such a neuro-physiological change may therefore affect laryngeal control and dynamics. Under this hypothesis, toward the goal of objective monitoring of depression severity, we investigate vocal-source biomarkers for depression; specifically, source features that may relate to precision in motor control, including vocal-fold shimmer and jitter, degree of aspiration, fundamental frequency dynamics, and frequency-dependence of variability and velocity of energy. We use a 35-subject database collected by Mundt et al. in which subjects were treated over a six-week period, and investigate correlation of our features with clinical (HAMD), as well as self-reported (QIDS) Total subject assessment scores. To explicitly address the motor aspect of depression, we compute correlations with the Psychomotor Retardation component of clinical and self-reported Total assessments. For our longitudinal database, most correlations point to statistical relationships of our vocal-source biomarkers with psychomotor activity, as well as with depression severity.
READ LESS

Summary

A hypothesis in characterizing human depression is that change in the brain's basal ganglia results in a decline of motor coordination. Such a neuro-physiological change may therefore affect laryngeal control and dynamics. Under this hypothesis, toward the goal of objective monitoring of depression severity, we investigate vocal-source biomarkers for depression...

READ MORE

Individual and group dynamics in the reality mining corpus

Published in:
Proc. 2012 ASE/IEEE Int. Conf. on Social Computing, 3-5 September 2012, pp. 61-70.

Summary

Though significant progress has been made in recent years, traditional work in social networks has focused on static network analysis or dynamics in a large-scale sense. In this work, we explore ways in which temporal information from sociographic data can be used for the analysis and prediction of individual and group behavior in dynamic, real-world situations. Using the MIT Reality Mining corpus, we show how temporal information in highly-instrumented sociographic data can be used to gain insights otherwise unavailable from static snapshots. We show how pattern of life features extend from the individual to the group level. In particular, we show how anonymized location information can be used to infer individual identity. Additionally, we show how proximity information can be used in a multilinear clustering framework to detect interesting group behavior over time. Experimental results and discussion suggest temporal information has great potential for improving both individual and group level understanding of real-world, dense social network data.
READ LESS

Summary

Though significant progress has been made in recent years, traditional work in social networks has focused on static network analysis or dynamics in a large-scale sense. In this work, we explore ways in which temporal information from sociographic data can be used for the analysis and prediction of individual and...

READ MORE

Amplifier-free slab-coupled optical waveguide optoelectronic oscillator systems

Published in:
Opt. Express, Vol. 20, No. 17, 13 August 2012, pp. 19589-19598.
Topic:

Summary

We demonstrate a free-running 3-GHz slab-coupled optical waveguide (SCOW) optoelectronic oscillator (OEO) with low phase-noise (88 dB down from carrier). The SCOW-OEO uses highpower low-noise SCOW components in a single-loop cavity employing 1.5- km delay. The noise properties of our SCOW external-cavity laser (SCOWECL) and SCOW photodiode (SCOWPD) are characterized and shown to be suitable for generation of high spectral purity microwave tones. Through comparisons made with SCOW-OEO topologies employing amplification, we observe the sidemode levels to be degraded by any amplifiers (optical or RF) introduced within the OEO cavity.
READ LESS

Summary

We demonstrate a free-running 3-GHz slab-coupled optical waveguide (SCOW) optoelectronic oscillator (OEO) with low phase-noise (88 dB down from carrier). The SCOW-OEO uses highpower low-noise SCOW components in a single-loop cavity employing 1.5- km delay. The noise properties of our SCOW external-cavity laser (SCOWECL) and SCOW photodiode (SCOWPD) are characterized...

READ MORE

Probabilistic reasoning for streaming anomaly detection

Published in:
2012 SSP: 2012 IEEE Statistical Signal Processing Workshop, 5-8 August 2012, pp. 377-380.

Summary

In many applications it is necessary to determine whether an observation from an incoming high-volume data stream matches expectations or is anomalous. A common method for performing this task is to use an Exponentially Weighted Moving Average (EWMA), which smooths out the minor variations of the data stream. While EWMA is efficient at processing high-rate streams, it can be very volatile to abrupt transient changes in the data, losing utility for appropriately detecting anomalies. In this paper we present a probabilistic approach to EWMA which dynamically adapts the weighting based on the observation probability. This results in robustness to data anomalies yet quick adaptability to distributional data shifts.
READ LESS

Summary

In many applications it is necessary to determine whether an observation from an incoming high-volume data stream matches expectations or is anomalous. A common method for performing this task is to use an Exponentially Weighted Moving Average (EWMA), which smooths out the minor variations of the data stream. While EWMA...

READ MORE

Toward matched filter optimization for subgraph detection in dynamic networks

Published in:
2012 SSP: 2012 IEEE Statistical Signal Processing Workshop, 5-8 August 2012, pp. 113-116.

Summary

This paper outlines techniques for optimization of filter coefficients in a spectral framework for anomalous subgraph detection. Restricting the scope to the detection of a known signal in i.i.d. noise, the optimal coefficients for maximizing the signal's power are shown to be found via a rank-1 tensor approximation of the subgraph's dynamic topology. While this technique optimizes our power metric, a filter based on average degree is shown in simulation to work nearly as well in terms of power maximization and detection performance, and better separates the signal from the noise in the eigenspace.
READ LESS

Summary

This paper outlines techniques for optimization of filter coefficients in a spectral framework for anomalous subgraph detection. Restricting the scope to the detection of a known signal in i.i.d. noise, the optimal coefficients for maximizing the signal's power are shown to be found via a rank-1 tensor approximation of the...

READ MORE

Multifunction phased array radar (MPAR) spectral usage analysis

Summary

This report addressed two technical risks associated with replacing current air traffic and weather surveillance radars with a single type of multifunction phased array radar (MPAR). The first risk is whether radio spectrum usage would increase with the MPAR network and whether the allocated band will have enough spectral space. This question is addressed in two steps. First, single-radar spectrum usage is estimated based on certain assumptions regarding the radar design. Second, locations based on a previous radar placement study are used together with a terrain-dependent propagation model to compute the number of frequency channels needed at each site. We conclude that the overall spectrum usage is likely to increase with MPAR, but that the targeted window in S band will be able to accommodate the occupancy at all sites. The second risk is whether self-interference will limit the ability of the MPAR to operate asynchronously and adaptively on different antenna faces. This question is addressed by employing a simple bistatic ground clutter model to characterize the interference between adjacent faces. We conclude that some interference is unavoidable, but it would likely only occur during times when a transmit beam is at its maximum off-broadside angle (~2% of the time).
READ LESS

Summary

This report addressed two technical risks associated with replacing current air traffic and weather surveillance radars with a single type of multifunction phased array radar (MPAR). The first risk is whether radio spectrum usage would increase with the MPAR network and whether the allocated band will have enough spectral space...

READ MORE

Nanosatellites for Earth environmental monitoring: the MicroMAS project

Summary

The Micro-sized Microwave Atmospheric Satellite (MicroMAS) is a 3U cubesat (34x10x10 cm, 4.5 kg) hosting a passive microwave spectrometer operating near the 118.75-GHz oxygen absorption line. The focus of the first MicroMAS mission (hereafter, MicroMAS-1) is to observe convective thunderstorms, tropical cyclones, and hurricanes from a near-equatorial orbit at approximately 500-km altitude. A MicroMAS flight unit is currently being developed in anticipation of a 2014 launch. A parabolic reflector is mechanically rotated as the spacecraft orbits the earth, thus directing a cross-track scanned beam with FWHM beamwidth of 2.4-degrees, yielding an approximately 20-km diameter footprint at nadir incidence from a nominal altitude of 500 km. Radiometric calibration is carried out using observations of cold space, the earth?s limb, and an internal noise diode that is weakly coupled through the RF front-end electronics. A key technology feature is the development of an ultra-compact intermediate frequency processor module for channelization, detection, and A-to-D conversion. The antenna system and RF front-end electronics are highly integrated and miniaturized. A MicroMAS-2 mission is currently being planned using a multiband spectrometer operating near 118 and 183 GHz in a sunsynchronous orbit of approximately 800-km altitude. A HyMAS- 1 (Hyperspectral Microwave Atmospheric Satellite) mission with approximately 50 channels near 118 and 183 GHz is also being planned. In this paper, the mission concept of operations will be discussed, the radiometer payload will be described, and the spacecraft subsystems (avionics, power, communications, attitude determination and control, and mechanical structures) will be summarized.
READ LESS

Summary

The Micro-sized Microwave Atmospheric Satellite (MicroMAS) is a 3U cubesat (34x10x10 cm, 4.5 kg) hosting a passive microwave spectrometer operating near the 118.75-GHz oxygen absorption line. The focus of the first MicroMAS mission (hereafter, MicroMAS-1) is to observe convective thunderstorms, tropical cyclones, and hurricanes from a near-equatorial orbit at approximately...

READ MORE

A preliminary investigation of Tower Flight Data Manager safety benefits

Published in:
Applied Human Factors and Ergonomics Conf., 21 July 2012.

Summary

Improvements to current air traffic management technologies and techniques are required to move toward the next generation air transportation system (NextGen). The Tower Flight Data Manager (TFDM) is a prototype air traffic control system consisting of the: (1) Flight Data Manager (FDM) facilitating interaction with electronic flight data, (2) Tower Information Display System (TIDS) providing enhanced surveillance information, and (3) Supervisor Display providing a means for front line managers and traffic management coordinators to interact with strategic and tactical planning and decision support tools. Given that TFDM aims to enable safe and efficient operations under NextGen, it is critical to analyze potential safety impacts and determine what types of real-world safety issues can be prevented or mitigated by TFDM. With this goal in mind, we reviewed 560 National Transportation Safety Board (NTSB) and Aviation Safety Reporting System (ASRS) reports focusing on commercial air carrier operations over a five year period. Over 100 reports were deemed relevant to TFDM and further analyzed to determine the likelihood that these safety-related events could have been mitigated or prevented by the key TFDM capabilities outlined above. A systematic method for generating probabilistic estimates of benefits for a technology not yet deployed was utilized to produce effectiveness ratings for the various TFDM components.
READ LESS

Summary

Improvements to current air traffic management technologies and techniques are required to move toward the next generation air transportation system (NextGen). The Tower Flight Data Manager (TFDM) is a prototype air traffic control system consisting of the: (1) Flight Data Manager (FDM) facilitating interaction with electronic flight data, (2) Tower...

READ MORE