| Affirming a Nation: The Construction of U.S. National Identity During the Persian Gulf War and the War on Terrorism |
2003 |
149 pages |
| Authors:
John S. Hutcheson; UNIV OF WASHINGTON SEATTLE
|
 | This thesis explores the construction of U.S. national identity by the president and the news media during two of America's most significant crises in the late 20th century: the 1991 Persian Gulf War and the September 11, 2001 attacks and subsequent war on terrorism. Quantitative and qualitative content analysis of major presidential addresses and editorials in the New York Times and The Washington Post before and during these crises suggest ... |
|
| Finding the FOO: A Pilot Study for a Multimodal Interface |
2003 |
7 pages |
| Authors:
Dennis Perzanowski; Derek Brock; William Adams; Magdalena Bugajska; Alan C. Schultz; J. G. Trafton; Sam Blisard; Majorie Skubic; NAVAL RESEARCH LAB WASHINGTON DC CENTER FOR APPLIED RESEARCH IN ARTIFICIAL INTELLIGENCE
|
 | In their research on intuitive means for humans and intelligent, mobile robots to collaborate, the authors use a multimodal interface that supports speech and gestural inputs. As a preliminary step to evaluate their approach and to identify practical areas for future work, they conducted a Wizard-of-Oz pilot study with five participants who each collaborated with a robot on a search task in a separate room. The goal was to find ... |
|
| Automating After Action Review: Attributing Blame or Credit in Team Training |
2003 |
11 pages |
| Authors:
Jonathan Gratch; Wenji Mao; UNIVERSITY OF SOUTHERN CALIFORNIA MARINA DEL REY INFORMATION SCIENCES INST
|
 | This paper presents automated methods for facilitating after action review in team training exercises. Much of the learning from team training arises from frank after-the-fact discussions of the exercise, combining individual attributions of blame or credit into a more objective view of what transpired. These individual attributions are social judgments involving not only causality but also explanations of individual responsibility, free will and mitigating circumstances. Such judgments are a key ... |
|
| Dialogue Management for an Automated Multilingual Call Center |
2003 |
4 pages |
| Authors:
Hilda Hardy; Tomek Strzalkowski; Min Wu; STATE UNIV OF NEW YORK AT ALBANY INST FOR INFORMATICS LOGICS AND SECURITY STUDIES
|
 | The AMITIES project (Automated Multilingual Interaction with Information and Services) has been established under joint funding from the European Commission's 5th Framework Program and the U.S. DARPA to develop the next generation of empirically-induced human-computer interaction capabilities in spoken language. One of the central goals of this project is to create a dialogue management system capable of engaging the user in human-like conversation within a specific domain. The domain we ... |
|
| The Pragmatics of Taking a Spoken Language System Out of the Laboratory |
2003 |
4 pages |
| Authors:
Jody J. Daniels; Helen W. Hastie; LOCKHEED MARTIN CORP CAMDEN NJ ADVANCED TECHNOLOGY LABS
|
 | Lockheed Martin's Advanced Technology Laboratories has been designing developing testing and evaluating spoken language understanding systems in several unique operational environments over the past five years. Through these experiences we have encountered numerous challenges in making each system become an integral part of a user's operations. In this paper we discuss these challenges and report how we overcame them with respect to a number a of domains. |
|
| Dramatic Expression in Opera, and Its Implications for Conversational Agents |
2003 |
9 pages |
| Authors:
W. L. Johnson; UNIVERSITY OF SOUTHERN CALIFORNIA MARINA DEL REY INFORMATION SCIENCES INST
|
 | It is commonly agreed among embodied conversational agent (ECA) researchers that ECA behavior should be based upon principles of human face-to-face communication (Cassell et al., 2000; Traum & Rickel, 2002). It is less commonly acknowledged that principles of human acting can inform the design of ECA behavior, particularly in making behavior engaging and understandable. Character animators, in contrast, understand clearly the relationship between character behavior and acting (Porter, 1997), and ... |
|
| Nonlinear Nonlocal Cochlear Models, Multitones, Noises and Masking Thresholds |
30 DEC 2002 |
11 pages |
| Authors:
Jack Xin; TEXAS UNIV AT AUSTIN DEPT OF MATHEMATICS
|
 | An important part of voice signal processing is to perform a nonlinear operation along frequency on the short time spectrogram, while the nonlinear adaptation along time is better understood. We developed, computed and analyzed a class of nonlinear nonlocal cochlear models to approximate this nonlinear aspect. The model is mechanical in nature, and outputs the acoustic responses on the basilar membrane. In case of two or three tones, our results ... |
|
| Evaluation and Improvement of a Speaker Verification System in Military Environments |
26 DEC 2002 |
23 pages |
| Authors:
David A. Heide; NAVAL RESEARCH LAB WASHINGTON DC
|
 | The U.S. Navy is constantly looking for ways to improve security. One of the possible solutions proposed has been the use of biometrics, defined by the Biometric Consortium as "automated methods of recognizing a person base on a physiological or behavioral characteristic". One of the least costly, most convenient, and least invasive methods of biometrics is speaker verification. The Naval Research Laboratory's Voice Systems Section has undergone an extensive study ... |
|
| Voice Biometrics for Information Assurance Applications |
05 DEC 2002 |
43 pages |
| Authors:
George S. Kang; Yvette Lee; NAVAL RESEARCH LAB WASHINGTON DC
|
 | In 2002, the President of the United States established an organization within the DOD to develop and promulgate biometrics technologies to achieve security in information, information systems, weapons, and facilities. NRL has been tasked to study voice biometrics for applications in which other biometrics techniques are difficult to apply. The ultimate goal of voice biometrics is to enable the use of voice as a password. Voice biometrics are "man-in-the-loop" systems ... |
|
| Performance Assessment of a COTS Speech Recognition System on the N4 Database |
DEC 2002 |
17 pages |
| Authors:
David T. Williamson; Robin A. Snyder Jr; AIR FORCE RESEARCH LAB WRIGHT-PATTERSON AFB OH HUMAN EFFECTIVENESS DIRECTORATE
|
 | This report discusses the evaluation of a commercially available speech recognition system on the NATO Native and Non-Native (N4) database. Using the statistical language modeling techniques, trigram language models were generated for each of three countries in the database, CA, NL, and UK. Due to time constraints, the DE database was not evaluated. For each of the countries, two factors were assessed. The first was overall word accuracy and the ... |
|
| An Automated Method of Facilitating Analysis of Voice Communications |
NOV 2002 |
|
| Authors:
Philip G. Clark; Rowland E. Dickinson; DEFENCE SCIENCE AND TECHNOLOGY ORGANISATION SALISBURY (AUSTRALIA) INFO SCIENCES LAB
|
 | DSTO has historically gathered voice communications data for analysis. Preparing this data for analysis is very time intensive and in many cases cannot be undertaken due to resource constraints. This paper describes a simple computer tool to automatically log and compress live communications activity or previously tape recorded analogue information. The tool presents this information in a form ready for immediate computer based analysis. |
|
| The Effects of Background Noise on the Performance of an Automatic Speech Recogniser |
NOV 2002 |
|
| Authors:
Jason Littlefield; Ahmad Hashemi-Sakhtsari; DEFENCE SCIENCE AND TECHNOLOGY ORGANISATION SALISBURY (AUSTRALIA) INFO SCIENCES LAB
|
 | Ambient or environmental noise is a major factor that affects the performance of an automatic speech recogniser. Large vocabulary, speaker- dependent, continuous speech recognisers are commercially available. Speech recognisers perform well in a quiet environment, but poorly in a noisy environment. Speaker-dependent speech recognisers require training prior to them being tested, where the level of background noise in both phases affects the performance of the recogniser. This study aims to ... |
|
| Integrating Multiple Knowledge Sources for Utterance-Level Confidence Annotation in the CMU Communicator Spoken Dialog System |
NOV 2002 |
31 pages |
| Authors:
Dan Bohus; Alex Rudnicky; CARNEGIE-MELLON UNIV PITTSBURGH PA SCHOOL OF COMPUTER SCIENCE
|
 | In the recent years, automated speech recognition has been the main drive behind the advent of spoken language interfaces, but at the same a time a severe limiting factor in the development of these systems. We believe that increased robustness in the face of recognition errors can be achieved by making the systems aware of their own misunderstandings, and employing appropriate recovery techniques when breakdowns in interacted occur. In this ... |
|
| Interfacing COTS Speech Recognition and Synthesis Software to a Lotus Notes Military Command and Control Database |
OCT 2002 |
|
| Authors:
Oliver Carr; DEFENCE SCIENCE AND TECHNOLOGY ORGANISATION SALISBURY (AUSTRALIA) INFO SCIENCES LAB
|
 | Speech recognition and synthesis technologies have become commercially viable over recent years. Two current market leading products in speech recognition technology are Dragon NaturallySpeaking and IBM ViaVoice. This report describes the development of speech user interfaces incorporating these products with Lotus Notes and Java applications. These interfaces enable data entry using speech recognition and allow warnings and instructions to be issued via speech synthesis. The development of a military vocabulary ... |
|
| Exploitation of Existing Voice Over Internet Protocol Technology for Department of the Navy Application |
SEP 2002 |
127 pages |
| Authors:
Henry M. Vegter Jr.; David T. Wallace; NAVAL POSTGRADUATE SCHOOL MONTEREY CA
|
 | This thesis documents an investigation into the technology of Voice over Internet Protocol (VoIP) VoIP promises to be a widely accepted technology in the future, The issues of efficient use of bandwidth over network choke points, cost savings gained from a common data and voice infrastructure, reduced cost associated with toll calls and the merger of the telephone with the desktop will keep adoption of this technology on the path ... |
|
| An Audio Architecture Integrating Sound and Live Voice for Virtual Environments |
SEP 2002 |
197 pages |
| Authors:
Eric M. Krebs; NAVAL POSTGRADUATE SCHOOL MONTEREY CA
|
 | The purpose behind this thesis was to design and implement audio system architecture, both in hardware and in software, for use in virtual environments The hardware and software design requirements were aimed at implementing acoustical models, such as reverberation and occlusion, and live audio streaming to any simulation employing this architecture, Several free or open-source sound APIs were evaluated, and DirectSound3DTM was selected as the core component of the audio ... |
|
| Telecommunications Technology and Service Changes Since the Telecommunications Act of 1996 |
SEP 2002 |
95 pages |
| Authors:
Matthew R. Simmons; NAVAL POSTGRADUATE SCHOOL MONTEREY CA
|
 | The Telecom Act of 1996 was intended to address the lack of competition and reduce regulation in local telephone services and in other areas of the telecommunications sector, The competitive situation in telecommunications, particularly regarding local telephone services, has experienced a limited amount of positive change as a result of the Act, Local Telephone service consumers are still given little choice as the Incumbent Local Exchange Carriers (ILECs) continue to ... |
|
| Rough 'N' Ready: A Meeting Recorder and Browser |
SEP 2002 |
19 pages |
| Authors:
John Makhoul; Francis Kubala; BBN TECHNOLOGIES CAMBRIDGE MA
|
 | The objective of this effort is to integrate and enhance existing technologies in speech recognition, speaker identification, and topic classification to provide cost-effective transcription, structural summarization, and retrieval of user-specified aspects of meetings. A software system consisting of a meeting recorder and browser was designed and developed to provide a higher level view of collaborative meetings, co-locational or distributed and a way to browse through and listen to those parts ... |
|
| National Airspace System: FAA's Approach to Its New Communications System Appears Prudent, but Challenges Remain |
JUL 2002 |
28 pages |
| Authors:
GENERAL ACCOUNTING OFFICE WASHINGTON DC
|
 | The Federal Aviation Administration (FAA) uses radios to provide air- ground voice and data communications for pilots and air traffic controllers to safely coordinate all flight operations-ground movements of aircraft at airports, take-offs and landings, and separation distances between aircraft as they cruise at high altitudes. However, the anticipated growth in air traffic, coupled with FAA's efforts to reduce air traffic delays and introduce new air traffic services, will create ... |
|
| Multi-modal Speech Recognition Workshop 2002 |
12 JUN 2002 |
92 pages |
| Authors:
Sung H. Yoon; NORTH CAROLINA AGRICULTURAL AND TECHNICAL STATE UNIV GREENSBORO
|
 | Proceedings of the 2002 Multi-modal Speech Recognition Workshop held in Greensboro, North Carolina, June 10-12, 2002. Sponsored by Defense Advanced Research Projects Agency and the Center for Intelligent Mobile Information Systems, North Carolina A&T State University. The workshop consisted of two sessions; Session 1: Sensors and Sensor Fusion, and Session 2: Audio-Visual Speech Recognition. |
|
| The Effect of Sound Spatialization on Responses to Overlapping Messages |
JUN 2002 |
137 pages |
| Authors:
James R. Campbell; NAVAL POSTGRADUATE SCHOOL MONTEREY CA
|
 | The purpose of this research was to determine if a spatialized headphone display would improve users' recognition accuracy when listening to more than two overlapping messages. This type of task has numerous applications in a variety of different military settings, such as aviation communications and combat information centers. Two experiments were conducted in the Advanced Auditory Displays Laboratory at the Naval Postgraduate School. The first experiment was a pilot study ... |
|
| Communicative Arts. A Selected Bibliography |
JUN 2002 |
23 pages |
| Authors:
ARMY WAR COLL LIBRARY CARLISLE BARRACKSPA
|
 | The U.S. Army War College Library presents Communicative Arts: A selected Bibliography now in its sixteenth revised edition, as an invitation for you to enjoy the wealth of resources available in our library that will help you improve your ability to communicate. Like the earlier versions, this annual bibliography lists references for materials which will provide skills, techniques, and approaches you may need to effectively express your ideas. The bibliography's ... |
|
| Nonlinear Auditory Modeling as a Basis for Speaker Recognition |
17 MAY 2002 |
|
| Authors:
T. F. Quatieri; MASSACHUSETTS INST OF TECH LEXINGTON LINCOLN LAB
|
 | In this report, we develop a front-end nonlinear auditory model based on recent work of Dau, Puschel, and Kohlrausch (DPK) Dau, Puschel, and Kohlrausch, 1997. An important aspect of the model is the robust accentuation of temporal change in a signal at the cochlea level that forms the basis of a feature set for automatic speaker recognition. Preliminary speaker recognition experiments with the DPK front-end auditory model give performance close ... |
|
| Speech-to-Speech Translation: Technology and Applications Study |
10 MAY 2002 |
|
| Authors:
C. J. Weinstein; MASSACHUSETTS INST OF TECH LEXINGTON LINCOLN LAB
|
 | This report describes a study effort on the state-of-the-art and lessons learned in automated, two-way, speech-to-speech translation and its potential application to military problems. The study includes and comments upon an extensive set of references on prior and current work in speech translation. The study includes recommendations on future military applications and on R&D needed to successfully achieve those applications. Key findings of the study include: (1) R&D speech translation ... |
|
| Advanced Interfaces and Testbed for Space Operator Consoles SBIR Phase II, Final Report |
APR 2002 |
571 pages |
| Authors:
Thomas J. Starkey; Thomas W. Marlow; Robert J. Remington; John D. Ianni; MONTEREY TECHNOLOGIES INC MONTEREY CA
|
 | This report summarizes the development of an improved Human-System Interface (HSI) for space operators, and the development of a testbed for evaluating this and other new HSI concepts. This work was performed under a Small Business Innovation Research (SBIR) contract to Monterey Technologies, Inc. A task analysis was performed to identify bottlenecks in the current HSI. It was found that controllers were limited by the number of manual and visual ... |
|
| An Examination of the Role of Communication Problems in Preventable Medical Adverse Events |
APR 2002 |
50 pages |
| Authors:
Fred P. Stone; AIR UNIV MAXWELL AFB AL
|
 | This exploratory, descriptive study examined 30 medical malpractice case files and 30 medical incident investigations to identify the prevalence of three barriers to effective communication among healthcare professionals. These cases were randomly selected from the files of the Office of the Air Force Surgeon General. Barriers included problems with encoding and decoding information, hierarchical structure of teams, and time pressures and workload. The results of this study indicated that communication ... |
|
| Innovation on a Hospital Perinatal Unit: Cell Phone Use by Nurses |
APR 2002 |
79 pages |
| Authors:
Thomas J. Petrilak; JOHNS HOPKINS HOSPITAL BALTIMORE MD
|
 | The nursing shortage in the United States has a direct impact on all aspects of health care throughout the nation. Consequences of this shortage affect cost, job satisfaction, turnover rate, recruitment effort, and ultimately the quality of care provided to the patient. Many initiatives are underway to confront this shortage. One area of focus is the reduction of nurse times pent performing functions that could be accomplished by hospital support ... |
|
| Services Based Collaboration/Coalition Networks |
APR 2002 |
11 pages |
| Authors:
Ramana Reddy; Kankanahalli Srinivas; Sumitra M. Reddy; Rao Mikkilineni; WEST VIRGINIA UNIV MORGANTOWN LANE DEPARTMENT OF COMPUTER SCIENCE AND ELECTRICAL ENGG
|
 | We propose a novel framework and a scalable architecture that can be used for rapid formation of coalitions and to perform collaborative transactions. Our framework combines the traditional theory of organizations with the theory of signaling from telephone networks to bring about a unifying theory of collaboration to enable dynamic virtual enterprises toned on-demand. The new framework, overcomes many limitations of the traditional frameworks. Our framework known as the PPP/SST ... |
|
| Telecommunications: Federal and State Universal Service Programs and Challenges to Funding |
FEB 2002 |
67 pages |
| Authors:
GENERAL ACCOUNTING OFFICE WASHINGTON DC
|
 | Title 1 of the Communications Act of 1934 sets forth the nation's telecommunications policy, including making communication services available so far as possible, to all the people of the United States. Early efforts by FCC, state regulators, and industry to promote universal service generally began in the 1950s. At that time, increasing amounts of the costs associated with providing local telephone service were recovered from rates for long distance services. ... |
|
| Quotient Signal Decomposition and Order Estimation |
2002 |
30 pages |
| Authors:
D. Napoletani; C. A. Berenstein; P. S. Krishnaprasad; MARYLAND UNIV COLLEGE PARK INST FOR SYSTEMS RESEARCH
|
 | In this paper, the authors propose a method for blind signal decomposition of speech signals that does not require that the sources are independent or stationary. This method, which they consider a simple instance of nonlinear projection pursuit, is based on the possibility of recovering the areas in the time-frequency domain where the original signals are isolated, or almost isolated, with the use of suitable quotients of linear combinations of ... |
|
| Limited Domain Synthesis of Expressive Military Speech for Animated Characters |
2002 |
5 pages |
| Authors:
W. L. Johnson; Shrikanth Narayanan; Richard Whitney; Rajat Das; Catherine LaBore; UNIVERSITY OF SOUTHERN CALIFORNIA LOS ANGELES INFORMATION SCIENCES INSTITUTE
|
 | Text-to-speech synthesis can play an important role in interactive education and training applications, as voices for animated agents. Such agents need high-quality voices capable of expressing intent and emotion. This paper presents preliminary results in an effort aimed at synthesizing expressive military speech for training applications. Such speech has acoustic and prosodic characteristics that can differ markedly from ordinary conversational speech. A limited domain synthesis approach is used employing samples ... |
|
| Limited Domain Synthesis of Expressive Military Speech for Animated Characters |
2002 |
5 pages |
| Authors:
W. L. Johnson; S. Narayanan; R. Whitney; R. Das; M. Bulut; C. LaBore; UNIVERSITY OF SOUTHERN CALIFORNIA LOS ANGELES INFORMATION SCIENCES INSTITUTE
|
 | Text-to-speech synthesis can play an important role in interactive education and training applications, as voices for animated agents. Such agents need high-quality voices capable of expressing intent and emotion. This paper presents preliminary results in an effort aimed at synthesizing expressive military speech for training applications. Such speech has acoustic and prosodic characteristics that can differ markedly from ordinary conversational speech. A limited domain synthesis approach is used employing samples ... |
|
| Building VoiceXML-Based Applications |
2002 |
5 pages |
| Authors:
Christina Bennett; Ariadna Font Llitjos; Stefanie Shriver; Alexander Rudnicky; Alan W. Black; CARNEGIE-MELLON UNIV PITTSBURGH PA LANGUAGE TECHNOLOGIES INST
|
 | The Language Technologies Institute (LTI) at Carnegie Mellon University has, for the past several years, conducted a lab course in building spoken-language dialog systems. In the most recent versions of the course, we have used (commercial) web-based development environments to build systems. This paper describes our experiences and discusses the characteristics of applications that are developed within this framework. |
|
| Variable-Data-Rate Voice Encoder for Voice Over Internet Protocol (VOIP) (CD-ROM) |
28 DEC 2001 |
|
| Authors:
George S. Kang; NAVAL RESEARCH LAB WASHINGTON DC
|
 | ELECTRONIC FILE CHARACTERISTICS: 6 files; Adobe PDF document and QuickTime audio-video fiels (.MOV). PHYSICAL DESCRIPTION: 1 computer laser optical disc (CD-ROM); 4 3/4 in.; 3.34 MB. SYSTEMS DETAIL NOTE: IBM-clone PC-compatible; requires audio-video drivers. ABSTRACT: Over the years, all DOD voice terminals transmitted speech at a constant data rate. That was the era when each user was allotted a fixed amount of channel resources (e.g., bandwidth). Now, the age of ... |
|
| Tactical Voice Communications Over Shipboard Local Area Networks |
DEC 2001 |
117 pages |
| Authors:
Glenn R. Urie; NAVAL POSTGRADUATE SCHOOL MONTEREY CA
|
 | The United States Navy's next generation ship(s) scheduled for commissioning in the year 2004 and beyond will integrate tactical shipboard voice communications system into the local area network (LAN). A single network eliminates separate voice and data infrastructures, consolidates services, and reduces the cost of communications. The existing installation of high-speed shipboard data networks has laid the foundation for the convergence of these two technologies. Currently, there is no high ... |
|
| Techniques for Preprocessing Speech Signals for More Effective Audio Interfaces |
DEC 2001 |
99 pages |
| Authors:
Phillip L. DeLeon; NEW MEXICO STATE UNIV LAS CRUCES DEPT OF ELECTRICAL AND COMPUTER ENGINEERING
|
 | A user receiving instructions from various speech sources may be overwhelmed when these sources are speaking simultaneously in an unfavorable acoustical and noisy environment. In such a case, the user is required to separate the various sources from the mixture in order to make the speech intelligible. If no one source dominates or the mixing occurs for a sustained period of time, the human user may become mentally and physically ... |
|
| Investigation and Evaluation of Voice Stress Analysis Technology |
NOV 2001 |
115 pages |
| Authors:
Darren Haddad; Sharon Walter; Roy Ratley; Megan Smith; AIR FORCE RESEARCH LAB ROME NY INFORMATION DIRECTORATE
|
 | Numerous police officers and agencies have been approached in recent years by vendors touting computer-based systems capable of measuring stress in a person's voice as an indicator of deception. These systems are advertised as being cheaper, easier to use, less invasive in use, and less constrained in their operation than polygraph technology. They claim that a speaker's medical condition, age, or consumption of drugs does not affect use of their ... |
|
| Experimental Comparisons of Data Entry by Automatic Speech Recognition, Keyboard, and Mouse |
NOV 2001 |
|
| Authors:
Helen Mitchard; Jim Winkles; DEFENCE SCIENCE AND TECHNOLOGY ORGANISATION SALISBURY (AUSTRALIA) ELECTRONICS AND SURVEILLANCE RESEARCH
|
 | The objective was to determine the conditions under which Automatic Speech Recognition (ASR) is an efficient choice for data entry. In particular the focus was on data entry tasks that are part of constructing military messages. The ADF Formatted Messaging System utilises a structured formatting system to constrain the semantics of a message but also includes a field for unlimited and unstructured text. Hence the data entry tasks involved range ... |
|
| CDMA Reverse Link Spatial Combining Gains: Optimal vs. MRC in a Faded Voice-Data System Having a Single Dominant High Data User |
Nov-2001 |
6 pages |
| Authors:
James R Zeidler; Joseph P Burke; SPACE AND NAVAL WARFARE SYSTEMS CENTER SAN DIEGO CA
|
 | The per user carrier-to-interference ratio (CINR) enhancement in the Reverse Link (mobile to base station) of a CDMA communications system is analyzed using different antenna array spatial combining algorithms: Optimum Combining (OC) versus Maximal Ratio Combining (MRC) in a multi-rate (combined voice and data users) multi-antenna scenario. Many low data voice users and a single dominant high data user are used to achieve a high degree of colored spatial interference ... |
|
| Compressibility Analysis of the Tongue During Speech |
25 OCT 2001 |
|
| Authors:
Devrim Unay; Cengizhan Ozturk; Maureen Stone; BOGAZICI UNIV ISTANBUL (TURKEY) INST OF BIOMEDICAL ENGINEERING
|
 | The motivation of this study is to observe the compressibility of the tongue during speech. The tongue has a complicated muscular structure. Real-time MRI (16 frames/s) with tagging has been used for imaging during the utterance of four short syllables, "sha", "ga", "ta" and "ba". Four- dimensional parametric motion field analysis has been used which allows point tracking everywhere on the tongue. In this paper, 3D compression and expansion analysis ... |
|
| Isolated Speech Recognition Using Artificial Neural Networks |
25 OCT 2001 |
|
| Authors:
Prasad D. Polur; Ruobing Zhou; Jun Yang; Fedra Adnani; Rosalyn S. Hobson; VIRGINIA COMMONWEALTH UNIV RICHMOND SCHOOL OF ENGINEERING
|
 | In this project Artificial Neural Networks are used as research tool to accomplish Automated Speech Recognition of normal speech. A small size vocabulary containing the words YES and NO is chosen. Spectral features using cepstral analysis are extracted per frame and imported to a feedforward neural network which uses a backpropagation with momentum training algorithm. The network is trained to recognize and classify the incoming words into the respective categories. ... |
|
| Preserving Spectral Contrast in Amplitude Compression for Hearing Aids |
25 OCT 2001 |
|
| Authors:
Juan C. Tejero-Calado; Janet C. Rutledge; Peggy B. Nelson; MARYLAND UNIV BALTIMORE DIV OF OTOLARYNGOLOGY
|
 | Amplitude compression processing is used to reduce the amplitude level variations of speech to fit the reduced dynamic ranges of sensorineural impaired listeners. However this processing results in spectral smearing due in part to reduced peak-to-valley ratios. Presented here are two variations of a compression processing algorithm based on a sinusoidal speech model that preserves the important spectral peaks. Both models operate on a time-varying, frequency- dependent basis to adjust ... |
|
| Study of Acoustic Features of Newborn Cries that Correlate with the Context |
25 OCT 2001 |
|
| Authors:
H. E. Baeck; M. N. Souza; UNIVERSIDADE FEDERAL DO RIO DE JANIERO (BRAZIL) BIOMEDICAL ENGINEERING PROGRAM
|
 | Many researches related to the infant cry analysis intent to estimate the context and/or obtain objective information concerning the physical and emotional condition of newborns. Using several techniques in signal processing, peculiar acoustics features, such as the fundamental frequency and formants, are classically analyzed. However, the findings reveal the existence of some contests with respect to the conclusions. In this article a specific phonologic program was used to analyze the ... |
|
| Hidden Markov Model Classification of Myoelectric Signals in Speech |
25 OCT 2001 |
|
| Authors:
A. D. Chan; K. Englehart; B. Hudgins; D. F. Lovely; NEW BRUNSWICK UNIV FREDERICTON
|
 | A hidden Markov model based classifier is proposed in this paper to perform automatic speech recognition using myoelectric signals from the muscles of vocal articulation. The classifier's resilience to temporal variance is compared to a linear discriminant analysis classifier that was used in a pervious study. Speech recognition was performed, using five channels of myoelectric signals, on isolated words from a 10-word vocabulary. Temporal variance was induced by temporally misaligning ... |
|
| Capability Integration Framework for Developing C4ISTAR Solutions |
21 SEP 2001 |
|
| Authors:
David Sundstrom; Rick Holt; LOCKHEED MARTIN CORP BETHESDA MD
|
|
| Time-Series Segmentation |
31 AUG 2001 |
18 pages |
| Authors:
Paul M. Baggenstoss; DEPARTMENT OF THE NAVY WASHINGTON DC
|
 | This patent application discloses a method for segmenting a signal into segments having similar spectral characteristics. Initially the method generates a table of previous values from older signal values that contains a scoring value for the best segmentation of previous values and a segment length of the last previously identified segment. The method then receives a new sample of the signal, and computes a new spectral characteristic function for the ... |
|
| Pilot Visual Acquisition of Traffic: Operational Communications from OpEval-1 |
MAY 2001 |
18 pages |
| Authors:
O. V. Prinzo; FEDERAL AVIATION ADMINISTRATION OKLAHOMA CITY OK CIVIL AEROMEDICAL INST
|
 | Avionics devices designed to provide pilots with graphically displayed traffic information will enable pilots to acquire and verify the identity of any intruder aircraft within the general area, either before or in accordance with a controller-issued traffic advisory or alert. A preliminary evaluation was performed of an airborne capability to display traffic information (OpEval-1 July 1999). As part of that evaluation, audiotapes were analyzed of the communications between pilots flying ... |
|
| Data-Linked Pilot Reply Time On Controller Workload and Communication in a Simulated Terminal Option |
MAY 2001 |
21 pages |
| Authors:
O. V. Prinzo; FEDERAL AVIATION ADMINISTRATION OKLAHOMA CITY OK CIVIL AEROMEDICAL INST
|
 | This report describes an analysis of air traffic control communication and workload in a simulated terminal radar approach control environment. The objective of this study was to investigate how pilot-to- controller data-link acknowledgment time might affect controller-perceived workload and operational communication. Eight controllers provided air traffic services to simulated arrival aircraft during a moderate-traffic density simulation in which voice radio and data link communications were available. The effect of a ... |
|
| The Intelligibility of Multiple Talkers Separated Spatially in Noise |
MAY 2001 |
28 pages |
| Authors:
Mark A. Ericson; Richard L. McKinley; AIR FORCE RESEARCH LAB WRIGHT-PATTERSON AFB OH HUMAN EFFECTIVENESS DIRECTORATE
|
 | Speech communication are seldom isolated auditory events in quiet environments. Frequently, the desired speech signal is confounded with other speech signals and noises. Real-world environments often degrade the intelligibility of the desired speech signal. In this book chapter (Binaural and Spatial Hearing in Real and Virtual Environments, Lawrence Erlbaum Associates, Mahwah NJ, publishers, 1997), the literature on the speech intelligibility of competing messages and the masking of speech is reviewed. ... |
|
| Implementation of Call Activity Detection for ATM Voice Services in Project Parakeet |
MAR 2001 |
|
| Authors:
W. D. Blair; A. B. Reynolds; DEFENCE SCIENCE AND TECHNOLOGY ORGANISATION SALISBURY (AUSTRALIA) ELECTRONICS AND SURVEILLANCE RESEARCH
|
 | Asynchronous Transfer Mode (ATM) has been introduced into the land tactical communications system (specifically the Parakeet system) and offers the potential for increasing the effectiveness of the communications trunks by dynamically sharing the capacity between competing demands. Such a capability has been limited in respect of the trunks between the Parakeet circuit switches since Parakeet uses military standard protocols that do not integrate easily with the civil standard ATM. The ... |
|