Publications
Uni-traveling-carrier variable confinement waveguide photodiodes
Summary
Summary
Uni-traveling-carrier waveguide photodiodes (PDs) with a variable optical confinement mode size transformer are demonstrated. The optical mode is large at the input for minimal front-end saturation and the mode transforms as the light propagates so that the absorption profile is optimized for both high-power and high-speed performance. Two differently designed...
Assessing the speaker recognition performance of naive listeners using Mechanical Turk
Summary
Summary
In this paper we attempt to quantify the ability of naive listeners to perform speaker recognition in the context of the NIST evaluation task. We describe our protocol: a series of listening experiments using large numbers of naive listeners (432) on Amazon's Mechanical Turk that attempts to measure the ability...
Virtuoso: narrowing the semantic gap in virtual machine introspection
Summary
Summary
Introspection has featured prominently in many recent security solutions, such as virtual machine-based intrusion detection, forensic memory analysis, and low-artifact malware analysis. Widespread adoption of these approaches, however, has been hampered by the semantic gap: in order to extract meaningful information about the current state of a virtual machine, detailed...
Towards reduced false-alarms using cohorts
Summary
Summary
The focus of the 2010 NIST Speaker Recognition Evaluation (SRE) was the low false alarm regime of the detection error trade-off (DET) curve. This paper presents several approaches that specifically target this issue. It begins by highlighting the main problem with operating in the low-false alarm regime. Two sets of...
The MIT LL 2010 speaker recognition evaluation system: scalable language-independent speaker recognition
Summary
Summary
Research in the speaker recognition community has continued to address methods of mitigating variational nuisances. Telephone and auxiliary-microphone recorded speech emphasize the need for a robust way of dealing with unwanted variation. The design of recent 2010 NIST-SRE Speaker Recognition Evaluation (SRE) reflects this research emphasis. In this paper, we...
NAP for high level language identification
Summary
Summary
Varying channel conditions present a difficult problem for many speech technologies such as language identification (LID). Channel compensation techniques have been shown to significantly improve performance in LID for acoustic systems. For high-level token systems, nuisance attribute projection (NAP) has been shown to perform well in the context of speaker...
Informative dialect recognition using context-dependent pronunciation modeling
Summary
Summary
We propose an informative dialect recognition system that learns phonetic transformation rules, and uses them to identify dialects. A hidden Markov model is used to align reference phones with dialect specific pronunciations to characterize when and how often substitutions, insertions, and deletions occur. Decision tree clustering is used to find...
A time-warping framework for speech turbulence-noise component estimation during aperiodic phonation
Summary
Summary
The accurate estimation of turbulence noise affects many areas of speech processing including separate modification of the noise component, analysis of degree of speech aspiration for treating pathological voice, the automatic labeling of speech voicing, as well as speaker characterization and recognition. Previous work in the literature has provided methods...
Identification and compensation of Wiener-Hammerstein systems with feedback
Summary
Summary
Efficient operation of RF power amplifiers requires compensation strategies to mitigate nonlinear behavior. As bandwidth increases, memory effects become more pronounced, and Volterra series based compensation becomes onerous due to the exponential growth in the number of necessary coefficients. Behavioral models such as Wiener-Hammerstein systems with a parallel feedforward or...
Thermally tuned dual 20-channel ring resonator filter bank in SOI (silicon-on-insulator)
Summary
Summary
Two 20-channel second-order optical filter banks have been fabricated. With tuning, the requirements for a wavelength multiplexed photonic AD-converter (insertion loss 1-3 dB, extinction >30 dB and optical bandwidth 22-27 GHz) are met.