Robust Multilingual End-to-End Speech Recognition

Warning

This opportunity is closed.

Organization

Office of the Director of National Intelligence (ODNI)

Reference Code

ICPD-2023-19

How to Apply

Create and release your Profile on Zintellect – Postdoctoral applicants must create an account and complete a profile in the on-line application system. Please note: your resume/CV may not exceed 2 pages.

Complete your application – Enter the rest of the information required for the IC Postdoc Program Research Opportunity. The application itself contains detailed instructions for each one of these components: availability, citizenship, transcripts, dissertation abstract, publication and presentation plan, and information about your Research Advisor co-applicant.

Additional information about the IC Postdoctoral Research Fellowship Program is available on the program website located at: https://orise.orau.gov/icpostdoc/index.html.

If you have questions, send an email to ICPostdoc@orau.org. Please include the reference code for this opportunity in your email.

Application Deadline

2/28/2023 6:00:00 PM Eastern Time Zone

Description

Research Topic Description, including Problem Statement:

Recent deep learning developments have produced dramatic improvements in the performance of speech recognition technologies. Many advancements are due to end-to-end (E2E) neural network models, which optimize a direct mapping from audio data directly to text information with a single network. However, the supervised manner in which these E2E models are trained does not produce state-of-the-art performance results in adverse conditions, such as noisy audio environments and low-resource language scenarios.

With this project, we aim to identify methods to improve E2E speech recognition performance robustness for noisy audio environments and low-resource language scenarios.

Example Approaches:

Advanced speech enhancement modules could be developed as a data preprocessing step for noisy audio environments. Although using speech enhancement modules to improve the listening quality of noisy audio is a well-established research technique, it is known to be harmful as a speech recognition front-end module. Solving this degradation issue is an active research challenge.
Pre-training/fine-tuning approaches could be explored for low-resource language scenarios. Supervised E2E models require large amounts of labeled data to achieve state-of-the-art performance results, which is an obstacle for achieving comparable results for low-resource languages. Exploring approaches using self-supervised learning representation models pre-trained with multiple languages and fine-tuning for low-resource target languages is an active research area that could be explored.

Relevance to the Intelligence Community (IC):

Speech recognition is a fundamental Human Language Technology (HLT) capability that provides audio content triage capabilities. New state-of-the-art E2E neural network approaches remove onerous large, labeled data requirements needed to develop a speech recognition capability for a language of interest; however, these gains are at the expense of lower performance results in adverse audio environments and low-resource scenarios. The work described above would identify solutions to these issues which are critical to maintaining a competitive edge in language analysis capabilities.

Qualifications

Postdoc Eligibility

U.S. citizens only
Ph.D. in a relevant field must be completed before beginning the appointment and within five years of the application deadline
Proposal must be associated with an accredited U.S. university, college, or U.S. government laboratory
Eligible candidates may only receive one award from the IC Postdoctoral Research Fellowship Program

Research Advisor Eligibility

Must be an employee of an accredited U.S. university, college or U.S. government laboratory
Are not required to be U.S. citizens

Key Words: #Speech Recognition, #Multilingual, #Deep Learning, #Neural Network, #End-to-end, #Low Resource, #Noise Robustness

Eligibility Requirements

Citizenship: U.S. Citizen Only
Degree: Doctoral Degree.
Discipline(s):
- Chemistry and Materials Sciences (12 )
- Communications and Graphics Design (5 )
- Computer, Information, and Data Sciences (17 )
- Earth and Geosciences (21 )
- Engineering (27 )
- Environmental and Marine Sciences (14 )
- Life Health and Medical Sciences (48 )
- Mathematics and Statistics (11 )
- Other Non-Science & Engineering (2 )
- Physics (16 )
- Science & Engineering-related (1 )
- Social and Behavioral Sciences (29 )

ORISE GO

The ORISE GO mobile app helps you stay engaged, connected and informed during your ORISE experience – from application, to offer, through your appointment and even as an ORISE alum!

Visit ORISE GO

Download PDF