Publications

Refine Results

(Filters Applied) Clear All

R&D Areas

R&D Groups

Year

Items per page

High performance computing productivity model synthesis

January 1, 2004

Journal Article

Author:

Jeremy Kepner

Published in:

Int. J. High Perform. Comp. App., Vol. 12, No. 4, Winter 2004, pp. 505-516.

Topic:

high performance computing

R&D area:

R&D group:

ISR Systems and Architectures

Summary

The Defense Advanced Research Projects Agency (DARPA) High Productivity Computing System (HPCS) program is developing systems that deliver increased value to users at a rate commensurate with the rate of improvement in the underlying technologies. For example, if the relevant technology was silicon, the goal of such a system would be to double in productivity (or value) every 18 months, following Moore's law. The key questions are how we define and measure productivity, and what the underlying technologies that affect productivity are. The goal of this paper is to synthesize from several different productivity models a single model that captures the main features of all the models. In addition we will start the process of putting the model on an empirical foundation by incorporating selected results from the software engineering and high performance computing (HPC) communities. An asymptotic analysis of the model is conducted to check that it makes sense in certain special cases. The model is extrapolated to a HPC context and several examples are explored, including HPC centers, HPC users, and interactive grid computing. Finally, the model hints at a profoundly different way of viewing HPC systems, where the user must be included in the equation, and innovative hardware is a key aspect to lowering the very high costs of HPC software.

READ LESS

Summary

High performance computing productivity model synthesis

Beyond cepstra: exploiting high-level information in speaker recognition

December 11, 2003

Conference Paper

Author:

Douglas A. Reynolds

…

Published in:

Proc. Multimodal User Authentication Workshop, 11-12 December, 2003, pp. 223-9.

Topic:

speaker recognition

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

Traditionally speaker recognition techniques have focused on using short-term, low-level acoustic information such as cepstra features extracted over 20-30 ms windows of speech. But speech is a complex behavior conveying more information about the speaker than merely the sounds that are characteristic of his vocal apparatus. This higher-level information includes speaker-specific prosodics, pronunciations, word usage and conversational style. In this paper, we review some of the techniques to extract and apply these sources of high-level information with results from the NIST 2003 Extended Data Task.

READ LESS

Summary

Beyond cepstra: exploiting high-level information in speaker recognition

Exploiting nonacoustic sensors for speech enhancement

December 11, 2003

Conference Paper

Author:

Thomas F. Quatieri

…

Published in:

Proc. Workshop on Multimodal User Authentication, 11-12 December 2003, pp. 66-73.

Topic:

speaker recognition

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

Nonacoustic sensors such as the general electromagnetic motion sensor (GEMS), the physiological microphone (P-mic), and the electroglottograph (EGG) offer multimodal approaches to speech processing and speaker and speech recognition. These sensors provide measurements of functions of the glottal excitation and, more generally, of the vocal tract articulator movements that are relatively immune to acoustic disturbances and can supplement the acoustic speech waveform. This paper describes an approach to speech enhancement that exploits these nonacoustic sensors according to their capability in representing specific speech characteristics in different frequency bands. Frequency-domain sensor phase, as well as magnitude, is found to contribute to signal enhancement. Preliminary testing involves the time-synchronous multi-sensor DARPA Advanced Speech Encoding Pilot Speech Corpus collected in a variety of harsh acoustic noise environments. The enhancement approach is illustrated with examples that indicate its applicability as a pre-processor to low-rate vocoding and speaker authentication, and for enhanced listening from degraded speech.

READ LESS

Summary

Exploiting nonacoustic sensors for speech enhancement

Multimodal speaker authentication using nonacuostic sensors

December 11, 2003

Conference Paper

Author:

William M. Campbell

…

Published in:

Proc. Workshop on Multimodal User Authentication, 11-12 December 2003, pp. 215-222.

Topic:

speech enhancement

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

Many nonacoustic sensors are now available to augment user authentication. Devices such as the GEMS (glottal electromagnetic micro-power sensor), the EGG (electroglottograph), and the P-mic (physiological mic) all have distinct methods of measuring physical processes associated with speech production. A potential exciting aspect of the application of these sensors is that they are less influenced by acoustic noise than a microphone. A drawback of having many sensors available is the need to develop features and classification technologies appropriate to each sensor. We therefore learn feature extraction based on data. State of the art classification with Gaussian Mixture Models and Support Vector Machines is then applied for multimodal authentication. We apply our techniques to two databases--the Lawrence Livermore GEMS corpus and the DARPA Advanced Speech Encoding Pilot corpus. We show the potential of nonacoustic sensors to increase authentication accuracy in realistic situations.

READ LESS

Summary

Multimodal speaker authentication using nonacuostic sensors

Passive operating system identification from TCP/IP packet headers

November 19, 2003

Conference Paper

Author:

Richard P. Lippmann

…

Published in:

ICDM Workshop on Data Mining for Computer Security, DMSEC, 19 November 2003.

Topic:

machine learning

R&D area:

Cyber Security and Information Sciences

R&D group:

Cyber Operations and Analysis Technology

Summary

Accurate operating system (OS) identification by passive network traffic analysis can continuously update less-frequent active network scans and help interpret alerts from intrusion detection systems. The most recent open-source passive OS identification tool (ettercap) rejects 70% of all packets and has a high 75-class error rate of 30% for non-rejected packets on unseen test data. New classifiers were developed using machine-learning approaches including cross-validation testing, grouping OS names into fewer classes, and evaluating alternate classifier types. Nearest neighbor and binary tree classifiers provide a low 9-class OS identification error rate of roughly 10% on unseen data without rejecting packets. This error rate drops to nearly zero when 10% of the packets are rejected.

READ LESS

Summary

Passive operating system identification from TCP/IP packet headers

Biometrically enhanced software-defined radios

November 17, 2003

Conference Paper

Author:

Joseph P. Campbell Jr

…

Published in:

Software Defined Radio Tech. Conf., SDR, 17-19 November 2003.

Topic:

speaker recognition

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

Software-defined radios and cognitive radios offer tremendous promise, while having great need for user authentication. Authenticating users is essential to ensuring authorized access and actions in private and secure communications networks. User authentication for software-defined radios and cognitive radios is our focus here. We present various means of authenticating users to their radios and networks, authentication architectures, and the complementary combination of authenticators and architectures. Although devices can be strongly authenticated (e.g., cryptographically), reliably authenticating users is a challenge. To meet this challenge, we capitalize on new forms of user authentication combined with new authentication architectures to support features such as continuous user authentication and varying levels of trust-based authentication. We generalize biometrics to include recognizing user behaviors and use them in concert with knowledge- and token-based authenticators. An integrated approach to user authentication and user authentication architectures is presented here to enhance trusted radio communications networks.

READ LESS

Summary

Biometrically enhanced software-defined radios

Auditory signal processing as a basis for speaker recognition

October 19, 2003

Conference Paper

Author:

Thomas F. Quatieri

…

Published in:

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 19-22 October, 2003, pp. 111-114.

Topic:

speaker recognition

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

In this paper, we exploit models of auditory signal processing at different levels along the auditory pathway for use in speaker recognition. A low-level nonlinear model, at the cochlea, provides accentuated signal dynamics, while a a high-level model, at the inferior colliculus, provides frequency analysis of modulation components that reveals additional temporal structure. A variety of features are derived from the low-level dynamic and high-level modulation signals. Fusion of likelihood scores from feature sets at different auditory levels with scores from standard mel-cepstral features provides an encouraging speaker recognition performance gain over use of the mel-cepstrum alone with corpora from land-line and cellular telephone communications.

READ LESS

Summary

Auditory signal processing as a basis for speaker recognition

System adaptation as a trust response in tactical ad hoc networks

October 13, 2003

Conference Paper

Author:

Bracha Epstein

…

Published in:

IEEE MILCOM 2003, 13-16 October 2003, pp. 209-214.

Topic:

cryptography

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

While mobile ad hoc networks offer significant improvements for tactical communications, these networks are vulnerable to node capture and other forms of cyberattack. In this paper we evaluated via simulation of the impact of a passive attacker, a denial of service (DoS) attack, and a data swallowing attack. We compared two different adaptive network responses to these attacks against a baseline of no response for 10 and 20 node networks. Each response reflects a level of trust assigned to the captured node. Our simulation used a responsive variant of the ad hoc on-demand distance vector (AODV) routing algorithm and focused on the response performance. We assumed that the attacks had been detected and reported. We compared performance tradeoffs of attack, response, and network size by focusing on metrics such as "goodput", i.e., percentage of messages that reach the intended destination untainted by the captured node. We showed, for example, that under general conditions a DoS attack response should minimize attacker impact while a response to a data swallowing attack should minimize risk to the system and trust of the compromised node with most of the response benefit. We show that the best network response depends on the mission goals, network configuration, density, network performance, attacker skill, and degree of compromise.

READ LESS

Summary

System adaptation as a trust response in tactical ad hoc networks

Acoustic, phonetic, and discriminative approaches to automatic language identification

September 1, 2003

Conference Paper

Author:

Elliot Singer

…

Published in:

Proc. EUROSPEECH 2003, 1-4 September 2003, pp. 1345-1348.

Topic:

language recognition

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

Formal evaluations conducted by NIST in 1996 demonstrated that systems that used parallel banks of tokenizer-dependent language models produced the best language identification performance. Since that time, other approaches to language identification have been developed that match or surpass the performance of phone-based systems. This paper describes and evaluates three techniques that have been applied to the language identification problem: phone recognition, Gaussian mixture modeling, and support vector machine classification. A recognizer that fuses the scores of three systems that employ these techniques produces a 2.7% equal error rate (EER) on the 1996 NIST evaluation set and a 2.8% EER on the NIST 2003 primary condition evaluation set. An approach to dealing with the problem of out-of-set data is also discussed.

READ LESS

Summary

Acoustic, phonetic, and discriminative approaches to automatic language identification

Integration of speaker recognition into conversational spoken dialogue systems

September 1, 2003

Conference Paper

Author:

Timothy J. Hazen

…

Published in:

EUROSPEECH 2003, 1-4 September 2003.

Topic:

speaker recognition

R&D area:

Cyber Security and Information Sciences

R&D group:

Artificial Intelligence Technology and Systems

Summary

In this paper we examine the integration of speaker identification/verification technology into two dialogue systems developed at MIT: the Mercury air travel reservation system and the Orion task delegation system. These systems both utilize information collected from registered users that is useful in personalizing the system to specific users and that must be securely protected from imposters. Two speaker recognition systems, the MIT Lincoln Laboratory text independent GMM based system and the MIT Laboratory for Computer Science text-constrained speaker-adaptive ASR-based system, are evaluated and compared within the context of these conversational systems.

READ LESS

Summary

Integration of speaker recognition into conversational spoken dialogue systems

Publications

Refine Results

High performance computing productivity model synthesis

Summary

Summary

Beyond cepstra: exploiting high-level information in speaker recognition

Summary

Summary

Exploiting nonacoustic sensors for speech enhancement

Summary

Summary

Multimodal speaker authentication using nonacuostic sensors

Summary

Summary

Passive operating system identification from TCP/IP packet headers

Summary

Summary

Biometrically enhanced software-defined radios

Summary

Summary

Auditory signal processing as a basis for speaker recognition

Summary

Summary

System adaptation as a trust response in tactical ad hoc networks

Summary

Summary

Acoustic, phonetic, and discriminative approaches to automatic language identification

Summary

Summary

Integration of speaker recognition into conversational spoken dialogue systems

Summary

Summary

Showing Results