Publications

Refine Results

(Filters Applied) Clear All

Query-by-example using speaker content graphs

Published in:
INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

Summary

We describe methods for constructing and using content graphs for query-by-example speaker recognition tasks within a large speech corpus. This goal is achieved as follows: First, we describe an algorithm for constructing speaker content graphs, where nodes represent speech signals and edges represent speaker similarity. Speech signal similarity can be based on any standard vector-based speaker comparison method, and the content graph can be constructed using an efficient incremental method for streaming data. Second, we apply random walk methods to the content graph to find matching examples to an unlabeled query set of speech signals. The content-graph based method is contrasted to a more traditional approach that uses supervised training and stack detectors. Performance is compared in terms of information retrieval measures and computational complexity. The new content-graph based method is shown to provide a promising low-complexity scalable alternative to standard speaker recognition methods.
READ LESS

Summary

We describe methods for constructing and using content graphs for query-by-example speaker recognition tasks within a large speech corpus. This goal is achieved as follows: First, we describe an algorithm for constructing speaker content graphs, where nodes represent speech signals and edges represent speaker similarity. Speech signal similarity can be...

READ MORE

Analyzing and interpreting automatically learned rules across dialects

Published in:
INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

Summary

In this paper, we demonstrate how informative dialect recognition systems such as acoustic pronunciation model (APM) help speech scientists locate and analyze phonetic rules efficiently. In particular, we analyze dialect-specific characteristics automatically learned from APM across two American English dialects. We show that unsupervised rule retrieval performs similarly to supervised retrieval, indicating that APM is useful in practical applications, where word transcripts are often unavailable. We also demonstrate that the top-ranking rules learned from APM generally correspond to the linguistic literature, and can even pinpoint potential research directions to refine existing knowledge. Thus, the APM system can help phoneticians analyze rules efficiently by characterizing large amounts of data to postulate rule candidates, so they can reserve time to conduct more targeted investigations. Potential applications of informative dialect recognition systems include forensic phonetics and diagnosis of spoken language disorders.
READ LESS

Summary

In this paper, we demonstrate how informative dialect recognition systems such as acoustic pronunciation model (APM) help speech scientists locate and analyze phonetic rules efficiently. In particular, we analyze dialect-specific characteristics automatically learned from APM across two American English dialects. We show that unsupervised rule retrieval performs similarly to supervised...

READ MORE

Vocal-source biomarkers for depression - a link to psychomotor activity

Published in:
INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

Summary

A hypothesis in characterizing human depression is that change in the brain's basal ganglia results in a decline of motor coordination. Such a neuro-physiological change may therefore affect laryngeal control and dynamics. Under this hypothesis, toward the goal of objective monitoring of depression severity, we investigate vocal-source biomarkers for depression; specifically, source features that may relate to precision in motor control, including vocal-fold shimmer and jitter, degree of aspiration, fundamental frequency dynamics, and frequency-dependence of variability and velocity of energy. We use a 35-subject database collected by Mundt et al. in which subjects were treated over a six-week period, and investigate correlation of our features with clinical (HAMD), as well as self-reported (QIDS) Total subject assessment scores. To explicitly address the motor aspect of depression, we compute correlations with the Psychomotor Retardation component of clinical and self-reported Total assessments. For our longitudinal database, most correlations point to statistical relationships of our vocal-source biomarkers with psychomotor activity, as well as with depression severity.
READ LESS

Summary

A hypothesis in characterizing human depression is that change in the brain's basal ganglia results in a decline of motor coordination. Such a neuro-physiological change may therefore affect laryngeal control and dynamics. Under this hypothesis, toward the goal of objective monitoring of depression severity, we investigate vocal-source biomarkers for depression...

READ MORE

Individual and group dynamics in the reality mining corpus

Published in:
Proc. 2012 ASE/IEEE Int. Conf. on Social Computing, 3-5 September 2012, pp. 61-70.

Summary

Though significant progress has been made in recent years, traditional work in social networks has focused on static network analysis or dynamics in a large-scale sense. In this work, we explore ways in which temporal information from sociographic data can be used for the analysis and prediction of individual and group behavior in dynamic, real-world situations. Using the MIT Reality Mining corpus, we show how temporal information in highly-instrumented sociographic data can be used to gain insights otherwise unavailable from static snapshots. We show how pattern of life features extend from the individual to the group level. In particular, we show how anonymized location information can be used to infer individual identity. Additionally, we show how proximity information can be used in a multilinear clustering framework to detect interesting group behavior over time. Experimental results and discussion suggest temporal information has great potential for improving both individual and group level understanding of real-world, dense social network data.
READ LESS

Summary

Though significant progress has been made in recent years, traditional work in social networks has focused on static network analysis or dynamics in a large-scale sense. In this work, we explore ways in which temporal information from sociographic data can be used for the analysis and prediction of individual and...

READ MORE

Amplifier-free slab-coupled optical waveguide optoelectronic oscillator systems

Published in:
Opt. Express, Vol. 20, No. 17, 13 August 2012, pp. 19589-19598.
Topic:

Summary

We demonstrate a free-running 3-GHz slab-coupled optical waveguide (SCOW) optoelectronic oscillator (OEO) with low phase-noise (88 dB down from carrier). The SCOW-OEO uses highpower low-noise SCOW components in a single-loop cavity employing 1.5- km delay. The noise properties of our SCOW external-cavity laser (SCOWECL) and SCOW photodiode (SCOWPD) are characterized and shown to be suitable for generation of high spectral purity microwave tones. Through comparisons made with SCOW-OEO topologies employing amplification, we observe the sidemode levels to be degraded by any amplifiers (optical or RF) introduced within the OEO cavity.
READ LESS

Summary

We demonstrate a free-running 3-GHz slab-coupled optical waveguide (SCOW) optoelectronic oscillator (OEO) with low phase-noise (88 dB down from carrier). The SCOW-OEO uses highpower low-noise SCOW components in a single-loop cavity employing 1.5- km delay. The noise properties of our SCOW external-cavity laser (SCOWECL) and SCOW photodiode (SCOWPD) are characterized...

READ MORE

Probabilistic reasoning for streaming anomaly detection

Published in:
2012 SSP: 2012 IEEE Statistical Signal Processing Workshop, 5-8 August 2012, pp. 377-380.

Summary

In many applications it is necessary to determine whether an observation from an incoming high-volume data stream matches expectations or is anomalous. A common method for performing this task is to use an Exponentially Weighted Moving Average (EWMA), which smooths out the minor variations of the data stream. While EWMA is efficient at processing high-rate streams, it can be very volatile to abrupt transient changes in the data, losing utility for appropriately detecting anomalies. In this paper we present a probabilistic approach to EWMA which dynamically adapts the weighting based on the observation probability. This results in robustness to data anomalies yet quick adaptability to distributional data shifts.
READ LESS

Summary

In many applications it is necessary to determine whether an observation from an incoming high-volume data stream matches expectations or is anomalous. A common method for performing this task is to use an Exponentially Weighted Moving Average (EWMA), which smooths out the minor variations of the data stream. While EWMA...

READ MORE

Toward matched filter optimization for subgraph detection in dynamic networks

Published in:
2012 SSP: 2012 IEEE Statistical Signal Processing Workshop, 5-8 August 2012, pp. 113-116.

Summary

This paper outlines techniques for optimization of filter coefficients in a spectral framework for anomalous subgraph detection. Restricting the scope to the detection of a known signal in i.i.d. noise, the optimal coefficients for maximizing the signal's power are shown to be found via a rank-1 tensor approximation of the subgraph's dynamic topology. While this technique optimizes our power metric, a filter based on average degree is shown in simulation to work nearly as well in terms of power maximization and detection performance, and better separates the signal from the noise in the eigenspace.
READ LESS

Summary

This paper outlines techniques for optimization of filter coefficients in a spectral framework for anomalous subgraph detection. Restricting the scope to the detection of a known signal in i.i.d. noise, the optimal coefficients for maximizing the signal's power are shown to be found via a rank-1 tensor approximation of the...

READ MORE

Multifunction phased array radar (MPAR) spectral usage analysis

Summary

This report addressed two technical risks associated with replacing current air traffic and weather surveillance radars with a single type of multifunction phased array radar (MPAR). The first risk is whether radio spectrum usage would increase with the MPAR network and whether the allocated band will have enough spectral space. This question is addressed in two steps. First, single-radar spectrum usage is estimated based on certain assumptions regarding the radar design. Second, locations based on a previous radar placement study are used together with a terrain-dependent propagation model to compute the number of frequency channels needed at each site. We conclude that the overall spectrum usage is likely to increase with MPAR, but that the targeted window in S band will be able to accommodate the occupancy at all sites. The second risk is whether self-interference will limit the ability of the MPAR to operate asynchronously and adaptively on different antenna faces. This question is addressed by employing a simple bistatic ground clutter model to characterize the interference between adjacent faces. We conclude that some interference is unavoidable, but it would likely only occur during times when a transmit beam is at its maximum off-broadside angle (~2% of the time).
READ LESS

Summary

This report addressed two technical risks associated with replacing current air traffic and weather surveillance radars with a single type of multifunction phased array radar (MPAR). The first risk is whether radio spectrum usage would increase with the MPAR network and whether the allocated band will have enough spectral space...

READ MORE

Nanosatellites for Earth environmental monitoring: the MicroMAS project

Summary

The Micro-sized Microwave Atmospheric Satellite (MicroMAS) is a 3U cubesat (34x10x10 cm, 4.5 kg) hosting a passive microwave spectrometer operating near the 118.75-GHz oxygen absorption line. The focus of the first MicroMAS mission (hereafter, MicroMAS-1) is to observe convective thunderstorms, tropical cyclones, and hurricanes from a near-equatorial orbit at approximately 500-km altitude. A MicroMAS flight unit is currently being developed in anticipation of a 2014 launch. A parabolic reflector is mechanically rotated as the spacecraft orbits the earth, thus directing a cross-track scanned beam with FWHM beamwidth of 2.4-degrees, yielding an approximately 20-km diameter footprint at nadir incidence from a nominal altitude of 500 km. Radiometric calibration is carried out using observations of cold space, the earth?s limb, and an internal noise diode that is weakly coupled through the RF front-end electronics. A key technology feature is the development of an ultra-compact intermediate frequency processor module for channelization, detection, and A-to-D conversion. The antenna system and RF front-end electronics are highly integrated and miniaturized. A MicroMAS-2 mission is currently being planned using a multiband spectrometer operating near 118 and 183 GHz in a sunsynchronous orbit of approximately 800-km altitude. A HyMAS- 1 (Hyperspectral Microwave Atmospheric Satellite) mission with approximately 50 channels near 118 and 183 GHz is also being planned. In this paper, the mission concept of operations will be discussed, the radiometer payload will be described, and the spacecraft subsystems (avionics, power, communications, attitude determination and control, and mechanical structures) will be summarized.
READ LESS

Summary

The Micro-sized Microwave Atmospheric Satellite (MicroMAS) is a 3U cubesat (34x10x10 cm, 4.5 kg) hosting a passive microwave spectrometer operating near the 118.75-GHz oxygen absorption line. The focus of the first MicroMAS mission (hereafter, MicroMAS-1) is to observe convective thunderstorms, tropical cyclones, and hurricanes from a near-equatorial orbit at approximately...

READ MORE

A preliminary investigation of Tower Flight Data Manager safety benefits

Published in:
Applied Human Factors and Ergonomics Conf., 21 July 2012.

Summary

Improvements to current air traffic management technologies and techniques are required to move toward the next generation air transportation system (NextGen). The Tower Flight Data Manager (TFDM) is a prototype air traffic control system consisting of the: (1) Flight Data Manager (FDM) facilitating interaction with electronic flight data, (2) Tower Information Display System (TIDS) providing enhanced surveillance information, and (3) Supervisor Display providing a means for front line managers and traffic management coordinators to interact with strategic and tactical planning and decision support tools. Given that TFDM aims to enable safe and efficient operations under NextGen, it is critical to analyze potential safety impacts and determine what types of real-world safety issues can be prevented or mitigated by TFDM. With this goal in mind, we reviewed 560 National Transportation Safety Board (NTSB) and Aviation Safety Reporting System (ASRS) reports focusing on commercial air carrier operations over a five year period. Over 100 reports were deemed relevant to TFDM and further analyzed to determine the likelihood that these safety-related events could have been mitigated or prevented by the key TFDM capabilities outlined above. A systematic method for generating probabilistic estimates of benefits for a technology not yet deployed was utilized to produce effectiveness ratings for the various TFDM components.
READ LESS

Summary

Improvements to current air traffic management technologies and techniques are required to move toward the next generation air transportation system (NextGen). The Tower Flight Data Manager (TFDM) is a prototype air traffic control system consisting of the: (1) Flight Data Manager (FDM) facilitating interaction with electronic flight data, (2) Tower...

READ MORE