Total found: 1694. Displayed: 198.
Publication date: 13-07-2020

METHOD FOR IMPROVING SPEECH INTELLIGIBILITY FOR ELDERLY PEOPLE WHEN LISTENING TO AUDIO PROGRAMS ON HEADPHONES

Number: RU2726326C1

The invention relates to means for improving speech intelligibility for elderly people when listening to audio programs on headphones. The technical result is a more effective improvement of speech intelligibility for elderly people. A headphone amplifier is used whose frequency response is the inverse of the audiogram of an elderly person of the corresponding age group. Any mismatch between the audiogram parameters of the selected age group and the way a particular person perceives sound does not prevent the improvement in speech intelligibility. 1 drawing.
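The idea of an amplifier response that mirrors an audiogram can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the audiogram values are placeholders rather than data from the patent, and the gain cap `max_gain_db` and the helper names are hypothetical.

```python
# Hypothetical age-group audiogram: hearing loss in dB at test frequencies (Hz).
# These numbers are illustrative placeholders, not data from the patent.
AUDIOGRAM_DB = {250: 10.0, 500: 10.0, 1000: 15.0, 2000: 25.0, 4000: 40.0, 8000: 55.0}


def inverse_audiogram_gain(audiogram_db, max_gain_db=30.0):
    """Return per-frequency amplifier gain (dB) that mirrors the hearing loss.

    The gain at each frequency equals the loss at that frequency, capped at
    max_gain_db (an assumed safety limit, not part of the source).
    """
    return {freq: min(loss, max_gain_db) for freq, loss in sorted(audiogram_db.items())}


def db_to_linear(gain_db):
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


gains = inverse_audiogram_gain(AUDIOGRAM_DB)
for freq, gain_db in gains.items():
    print(f"{freq:>5} Hz: +{gain_db:.1f} dB (x{db_to_linear(gain_db):.2f})")
```

A real amplifier would interpolate between the test frequencies and apply the gains with a filter bank or equalizer; the sketch only shows how the gain curve follows the loss curve.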

Publication date: 12-12-2019

DEVICE AND METHOD FOR IMPROVING PRIVACY

Number: DE112018001454T5
Author: Wiles, Paul

Embodiments of the present invention provide a vehicle privacy system (700) comprising audio input means (130, 190, 720) for receiving an external audio signal (725) indicative of audio from within a vehicle (900), audio source means (710, 910) for receiving the external audio signal (725) and determining an output audio signal (735) in dependence thereon for reducing the external intelligibility of speech within the vehicle (900), and audio output means (145, 146, 147, 730, 920) for receiving the output audio signal (735) and outputting audio (925) corresponding thereto so as to be at least partially audible outside the vehicle (900).

Publication date: 15-07-2009

PROCEDURE AND DEVICE FOR THE SQUELCH

Number: AT0000435481T

Publication date: 25-04-2019

AUDIO SIGNAL

Number: CA0003079640A1
Assignee: CPST INTELLECTUAL PROPERTY

A computer device (100) for processing audio signals is described. The computer device (100) includes at least a processor and a memory. The computer device (100) is configured to receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal. The computer device (100) is configured to compress the combined audio signal to provide a compressed audio signal. The computer device (100) is configured to control a dynamic range of the compressed audio signal to provide an output audio signal. In this way, a quality of the speech included in the output audio signal is improved.

Publication date: 21-06-2018

SOUND MANAGEMENT METHOD AND SYSTEM

Number: CA0003044079A1
Assignee: MOFFAT & CO.

A computer implemented method for managing a sound emitting device comprising: receiving data associated with operation of the sound emitting device at a predetermined location; processing said data to determine an operating characteristic of that device for that location; comparing the operating characteristic with a predetermined mathematical relationship to determine whether a difference exists; and identifying an input adjustment to correct the difference wherein the input adjustment optionally is within a predetermined range and optionally does not exceed a predetermined maximum increment; wherein the predetermined mathematical relationship is between an input variable and an output variable in respect of the sound emitting device.

Publication date: 07-09-2018

METHOD AND SYSTEM FOR LIMITING SOUND VOLUME CONTAINING CEMENT

Number: FR0003063566A1
Assignee: PEUGEOT CITROEN AUTOMOBILES SA

The invention relates to a car telephone (20) with a sound volume limiting system (25), comprising a mobile telephony input receiver (21) and output transmitter (22) able to communicate over a radio network (RH) with a remote station (40), as well as a loudspeaker (23) for sound emission and a microphone (24) for sound capture, connected respectively to the input receiver (21) and to the output transmitter (22) via the limiting system (25). The limiting system (25) comprises, connected together, a voiceprint detection module (27), a volume controller (30) and anti-echo equipment (26). In addition, the detection module (27) is arranged between the input receiver (21) and the volume controller (30), the latter being connected to the loudspeaker (23), and the anti-echo equipment (26) is mounted between the microphone (24) and the output transmitter (22).

Publication date: 25-02-2005

Diver/above surface two way mobile telephone communication technique having adaptive system with microphone measuring presence/absence expelled air sounds and cancelling if present

Number: FR0002859040A1

The invention relates to a method for processing an electrical signal by a communication device. At least part of the communication device, comprising at least one microphone that generates the electrical signal, is located in a liquid medium, and a diver in the liquid medium is close to the microphone and expels a gas mixture into the liquid medium. The method is characterized in that it comprises the steps of determining, in the electrical signal, the presence or absence of at least one component originating from an acoustic signal generated by the expulsion of the gas mixture into the liquid medium; generating a signal adapted to the presence or absence of that component in the electrical signal; and transferring the adapted signal to a correspondent. The invention also relates to the associated processing device.

Publication date: 11-01-2022

Mixing apparatus, mixing method, and non-transitory computer-readable recording medium

Number: US0011222649B2

A mixing apparatus having a stereo output includes: a first signal processor that mixes a first signal and a second signal in a first channel; a second signal processor that mixes a third signal and a fourth signal in a second channel; a third channel that processes a weighted sum of a signal of the first channel and a signal of the second channel; and a gain deriving part that generates a gain mask commonly used in the first channel and the second channel, wherein the gain deriving part determines a first gain commonly applied to the first signal and the third signal, and a second gain commonly applied to the second signal and the fourth signal, so that predetermined conditions for simultaneous gain generation are satisfied at least at the first channel and the second channel among the first channel, the second channel, and the third channel.

Publication date: 15-07-2021

POST-PROCESSING GAINS FOR SIGNAL ENHANCEMENT

Number: US20210217435A1

A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
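The delta gain smoothing described in this abstract can be sketched as a first-order smoother whose factor depends on the gain delta. This is a minimal sketch, not the patented method: the abstract only says the factor depends on the delta, so the linear mapping, its direction (less smoothing for large deltas) and the parameters `base_alpha` and `sensitivity` are assumptions.

```python
def delta_gain_smoothing(raw_gains, base_alpha=0.2, sensitivity=1.0):
    """Smooth per-frame gains with a factor that grows with the gain delta.

    delta = |raw gain (current frame) - post-processed gain (previous frame)|
    alpha = min(1, base_alpha + sensitivity * delta)   # assumed mapping
    post  = previous + alpha * (raw - previous)
    """
    smoothed = []
    previous = None
    for raw in raw_gains:
        if previous is None:
            post = raw  # no history yet: pass the first frame through
        else:
            delta = abs(raw - previous)
            alpha = min(1.0, base_alpha + sensitivity * delta)
            post = previous + alpha * (raw - previous)
        smoothed.append(post)
        previous = post
    return smoothed


raw = [1.0, 0.95, 1.0, 0.1, 0.12, 0.1]
print([round(g, 3) for g in delta_gain_smoothing(raw)])
```

With this mapping, small frame-to-frame fluctuations are heavily smoothed while a large jump in the raw gain is followed almost immediately.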

Publication date: 16-09-2021

AUTOMATIC GAIN CONTROL BASED ON MACHINE LEARNING LEVEL ESTIMATION OF THE DESIRED SIGNAL

Number: US20210287691A1

Method includes receiving, through a plurality of channels, audio data corresponding to a plurality of frequency ranges; determining, for each channel's frequency ranges, speech audio and/or noise energy level using a model trained by machine learning; determining a speech signal with removed noise for each channel; determining one or more statistical values associated with an energy level of a channel's speech signal with the removed noise; determining a strongest channel that has highest statistical values associated with an energy level of a speech signal; determining that the one or more statistical values associated with the energy level of the strongest channel's speech signal satisfy a threshold condition; comparing statistical values associated with an energy level of a speech signal of each channel with those of the strongest channel; and determining whether to update a gain value for a channel based on the channel's statistical values associated with the energy level.

Publication date: 15-02-2024

METHOD AND APPARATUS FOR DETERMINING A MEASURE OF SPEECH INTELLIGIBILITY

Number: US20240055013A1

A method of estimating speech intelligibility is disclosed. The method comprises the steps of providing at least a first time-dependent signal derived from a first auditory stimulus and a corresponding first measured EEG response; comparing at least part of the first signal with at least part of the first measured EEG response in order to determine a signal-response latency difference; comparing the signal-response latency difference to a reference value; and deriving a measure of speech intelligibility based on the comparison of the signal-response latency difference and the reference value.

Publication date: 08-04-2020

DYNAMIC TEXT-TO-SPEECH RESPONSE FROM A SMART SPEAKER

Number: EP3631792A1

Publication date: 28-11-2018

EFFECTIVE PRE-ECHO ATTENUATION IN A DIGITAL AUDIO SIGNAL

Number: EP2867893B1
Assignee: Orange

Publication date: 20-09-2023

METHOD OF COMPENSATING A PROCESSED AUDIO SIGNAL

Number: EP3671740B1
Assignee: GN Audio A/S

Publication date: 26-07-2017

Dynamic frequency-dependent sidetone generation

Number: GB0002546563A

A multi-microphone device (e.g. a mobile phone, fig. 2B) generates a sidetone based on the mode of operation (e.g. phone call, speaker recognition, automatic speech recognition ASR) of the device according to the microphone signals (M1, M2). The sidetone may accentuate higher frequencies of the speaker's voice, enhance the signal-to-noise ratio, cancel bone-conducted speech or compensate for an occlusion effect.

Publication date: 21-06-2023

Method and apparatus for improving speech intelligibility in a room

Number: GB0002605693B
Assignee: PORSCHE AG [DE]

Publication date: 06-01-2004

AUDIO SIGNAL PROCESSING APPARATUS

Number: AU2003263380A1

Publication date: 07-08-2013

TWO MODE AGC FOR SINGLE AND MULTIPLE SPEAKERS

Number: CA0002803615A1

A control system for varying an audio level in a communication system, the control system comprising a receiving unit for receiving an audio signal and a video signal, a determining unit for determining a number of individuals speaking determined by performing recognition on either the audio signal or the video signal; and a gain adjustment unit for adjusting a gain of the audio signal based on said number of determined individuals that are speaking.

Publication date: 05-11-2019

CONFERENCING SYSTEM INCLUDING A REMOTE MICROPHONE AND METHOD OF USING THE SAME

Number: CA0002857173C

A conferencing system and method are disclosed. An exemplary system includes a conference device and one or more user devices. The conference device is configured to receive audio information from one or more user devices. The audio information received by the conference device can be mixed with audio information received by conference device microphones.

Publication date: 14-07-2015

CONFERENCING SYSTEM INCLUDING A REMOTE MICROPHONE AND METHOD OF USING THE SAME

Number: CA0002857173A1

A conferencing system and method are disclosed. An exemplary system includes a conference device and one or more user devices. The conference device is configured to receive audio information from one or more user devices. The audio information received by the conference device can be mixed with audio information received by conference device microphones.

Publication date: 27-06-2017

Effective pre-echo attenuation in a digital audio signal

Number: BR112014032587A2

Publication date: 19-09-2013

AUDIO SIGNAL PROCESSING DEVICE AND AUDIO SIGNAL PROCESSING METHOD

Number: WO2013136846A1

Provided is an audio signal processing device for adjusting attack sound, reverberation, and noise components, and matching the output sound to the listener's preferences. The audio signal processing device comprises the following: an FFT unit for determining a frequency spectrum signal by converting an input audio signal from the time domain to the frequency domain, and generating a first amplitude spectrum signal and a phase spectrum signal; an attack component control unit (10) for generating a second amplitude spectrum signal by controlling the attack component of the first amplitude spectrum signal; a reverberation component control unit (20) for generating a third amplitude spectrum signal by controlling the reverberation component of the first amplitude spectrum signal; a first addition unit (40) for generating a fourth amplitude spectrum signal by synthesizing the first amplitude spectrum signal, the second amplitude spectrum signal, and the third amplitude spectrum signal; and an IFFT ...

Publication date: 12-09-2019

VOICE PROCESSING METHOD, VOICE PROCESSING DEVICE, AND RECORDING MEDIUM

Number: WO2019172396A1

A voice processing device that comprises a time period lengthening/shortening part 31 that, for a voice signal X that represents a voice, anteriorly shortens a first steady time period from among a plurality of steady time periods that have temporally stable acoustic characteristics and, immediately after the first steady time period from among the plurality of steady time periods, anteriorly lengthens a transition time period that is between the first steady time period and a second steady time period that has a different pitch than the first steady time period.

Publication date: 06-07-2021

Speech enhancement method and apparatus, device and storage medium

Number: US0011056130B2

The present disclosure provides a speech enhancement method and apparatus, a device and a storage medium. The method includes: acquiring a first speech signal and a second speech signal; obtaining a signal to noise ratio of the first speech signal; determining, according to the signal to noise ratio of the first speech signal, a fusion coefficient of filtered signals corresponding to the first speech signal and the second speech signal; and performing, according to the fusion coefficient, speech fusion processing on the filtered signals corresponding to the first speech signal and the second speech signal to obtain an enhanced speech signal. Thereby, it is realized that a fusion coefficient of speech signals of a non-air conduction speech sensor and an air conduction speech sensor is adaptively adjusted according to environment noise, thereby improving the signal quality after speech fusion, and improving the effect of speech enhancement.
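The SNR-driven fusion in this abstract can be sketched as a weighted sum whose weight is derived from the signal-to-noise ratio of the first signal. This is a sketch under stated assumptions: the abstract does not give the mapping from SNR to fusion coefficient, so the logistic curve and its `midpoint_db` and `slope` parameters are hypothetical, as are all function names.

```python
import math


def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in dB from two power estimates."""
    return 10.0 * math.log10(signal_power / noise_power)


def fusion_coefficient(snr, midpoint_db=10.0, slope=0.5):
    """Map SNR (dB) to a weight in (0, 1) with a logistic curve (assumed mapping)."""
    return 1.0 / (1.0 + math.exp(-slope * (snr - midpoint_db)))


def fuse(first, second, coeff):
    """Weighted sum of two equally long sample sequences."""
    if len(first) != len(second):
        raise ValueError("signals must have the same length")
    return [coeff * a + (1.0 - coeff) * b for a, b in zip(first, second)]


# The noisier the first signal, the more weight moves to the second one.
coeff = fusion_coefficient(snr_db(signal_power=4.0, noise_power=1.0))
print(fuse([0.2, 0.4], [0.1, 0.3], coeff))
```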

Publication date: 31-10-2019

VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Number: US2019334497A1

Volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

Publication date: 07-09-2023

SIGNAL PROCESSING DEVICE AND METHOD, AND PROGRAM

Number: US20230282226A1
Author: YUKI YAMAMOTO

The present technique relates to a signal processing device, method, and program which are capable of reducing the production cost of content. The signal processing device includes: a voice detection unit that, based on a mixed audio signal containing a sound of a target sound source and a sound of a non-target sound source different from the target sound source, detects a time segment of the sound of the target sound source from the mixed audio signal; and a voice determination unit that, based on (i) label information indicating the time segment of the sound of the target sound source in an audio signal of the target sound source and (ii) a detection result for the time segment of the sound of the target sound source, performs determination processing for determining whether the sound of the target sound source in the mixed audio signal is easy to hear. The present technique can be applied in a signal processing device.

Publication date: 17-10-2023

System and method for phonetic hashing and named entity linking from output of speech recognition

Number: US0011790175B2
Assignee: Fresh Consulting, Inc

A system and method for named entity linking from the output of speech-to-text systems by using an approximate string matching that normalizes common sounds, removes ambiguities, removes silent consonants, and accounts for speech slurring for long names. Additionally, the system and method for named entity linking from the output of speech-to-text systems employs a hierarchical matching system that performs multiple attempts using various mechanisms for resolving the name, starting with a very strict mechanism, and proceeding sequentially through less strict mechanisms.

Publication date: 22-02-2024

ENHANCED DE-ESSER FOR IN-CAR COMMUNICATIONS SYSTEMS

Number: US20240062770A1

Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.

Publication date: 08-11-2017

A SIGNAL PROCESSOR

Number: EP3242295A1

A signal processor comprising: a signal-manipulation-block configured to: receive a cepstrum-input-signal, wherein the cepstrum-input-signal is in the cepstrum domain and comprises a plurality of bins; receive a pitch-bin-identifier that is indicative of a pitch-bin in the cepstrum-input-signal; and generate a cepstrum-output-signal based on the cepstrum-input-signal by: scaling the pitch-bin relative to one or more of the other bins of the cepstrum-input-signal; or determining an output-pitch-bin-value based on the pitch-bin, and setting one or more of the other bins of the cepstrum-input-signal to a predefined value; or determining an output-other-bin-value based on one or more of the other bins of the cepstrum-input-signal, and setting the pitch-bin to a predefined value.

Publication date: 15-01-2020

SYSTEM AND METHOD FOR RELATIVE ENHANCEMENT OF VOCAL UTTERANCES IN AN ACOUSTICALLY CLUTTERED ENVIRONMENT

Number: EP3593349A1

Publication date: 27-03-2019

Audio Signal

Number: GB0002566760A

A device 100 for processing audio signals includes at least a processor and a memory and is configured to receive a bitstream comprising a combined audio signal AS, the combined audio signal comprising a first audio signal AS1 including speech S1 and a second audio signal AS2 (comprising a music signal M2). The device is configured to compress the combined audio signal S102 to provide a compressed audio signal, and to control a dynamic range of the compressed audio signal to provide an output audio signal AS’. In this way, a quality of the speech included in the output audio signal is improved. The device is configured to compress the combined audio signal by selectively reducing an amplitude of the second audio signal, selectively increasing an amplitude of the speech included in the first audio signal, and matching amplitudes of the first audio signal and the second audio signal. The device may be configured to selectively harmonically excite the compressed audio signal. The device may ...

Publication date: 12-10-2022

Method and apparatus for improving speech intelligibility in a room

Number: GB0002605693A

Speech intelligibility of a person in a room or vehicle is improved by detecting active speech via a microphone, identifying the active speaker by matching to speech profiles or via face recognition, and then reducing the sound in the room at speech-sensitive frequencies (eg. A, B & C) via sound damping from eg. adjustable panelling or destructive interference from eg. a smart speaker. The frequency-dependent cancellation may be focussed on parts of the room where an audience or listener are located, and recorded speech may be delayed or corrected for phase when reproduced.

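Publication date: 07-11-2016

Frequency-dependent sidetone calibration

Number: KR1020160128412A

... A personal audio device includes a sidetone circuit having one or more adjustable coefficients that generates a sidetone signal from the output of a first microphone. The sidetone circuit has one or more adjustable coefficients for altering the relationship between the first microphone signal and the sidetone signal. The personal audio device also includes a transducer for reproducing playback audio and the sidetone signal at the listener's ear, and a second microphone for measuring the output of the transducer as delivered to the listener's ear. The sidetone circuit includes a calibration circuit for estimating the response of the second microphone signal to the sidetone signal and adjusting the coefficients of the sidetone circuit according to the estimated response.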

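Publication date: 05-12-2016

Method for improving speech intelligibility in a noisy environment using syllable-type-based phoneme weighting, and recording medium recording the same

Number: KR0101682796B1
Author: 최승호, 이영호, 주종한

... The present invention relates to a method for improving speech intelligibility in a noisy environment using a syllable-type-based phoneme weighting technique. The method comprises: detecting syllables from a speech signal; analyzing the detected syllables to classify their syllable type; adjusting the power of consonants differently for each syllable type in a noise environment estimated from ambient noise; and, through power normalization, outputting the speech signal with boosted consonant power at the same power as the unmodified speech signal. According to the invention, weighting phonemes based on syllable type improves intelligibility in noisy environments.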
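The last two steps of this abstract (boosting consonant power, then normalizing so that total power is unchanged) can be sketched as follows. This is a minimal sketch under stated assumptions: `consonant_mask` and `boost_db` are hypothetical stand-ins for the syllable-type analysis and the noise-dependent weighting, which the abstract does not specify.

```python
import math


def mean_power(samples):
    """Average power (mean of squared samples)."""
    return sum(s * s for s in samples) / len(samples)


def boost_and_normalize(samples, consonant_mask, boost_db):
    """Boost samples flagged as consonant, then rescale to the original power.

    consonant_mask and boost_db stand in for the syllable-type analysis and
    noise-dependent weighting, which the abstract does not specify.
    """
    factor = 10.0 ** (boost_db / 20.0)
    boosted = [s * factor if is_cons else s for s, is_cons in zip(samples, consonant_mask)]
    boosted_power = mean_power(boosted)
    if boosted_power == 0.0:
        return list(samples)  # silent input: nothing to normalize
    scale = math.sqrt(mean_power(samples) / boosted_power)
    return [s * scale for s in boosted]


speech = [0.1, -0.1, 0.5, -0.5]          # toy samples: weak consonant, strong vowel
mask = [True, True, False, False]        # first two samples belong to the consonant
print(boost_and_normalize(speech, mask, boost_db=6.0))
```

Because of the normalization, the consonant gains level relative to the vowel while the overall loudness of the utterance stays the same.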

Publication date: 28-08-2019

Number: KR1020190100470A

Publication date: 18-07-2017

Respirator mask speech enhancement method and apparatus

Number: BR112015018441A2
Author: ROGER KIHLBERG

Publication date: 02-01-2020

APPARATUSES AND ASSOCIATED METHODS FOR SPATIAL PRESENTATION OF AUDIO

Number: WO2020002022A1

An apparatus, the apparatus comprising means configured to: receive audio content comprising voice audio and ambient audio and directional information indicative of a direction of the at least one sound source and the direction of the remote user relative to the reference point; receive a reference location; provide for presentation of the ambient audio with a first spatial audio effect, based on the directional information, and presentation of the voice audio with a second spatial audio effect, based on the directional information, receive repositioning signalling from the remote user device; and provide for presentation of the audio content using a modification of the first spatial audio effect to reposition an ambient-perceived direction based on the repositioning signalling and/or a modification of the second spatial audio effect to reposition a voice-perceived direction based on the repositioning signalling to increase the spatial separation between the voice- perceived direction and ...

Publication date: 17-06-2021

ADJUSTING AUDIO AND NON-AUDIO FEATURES BASED ON NOISE METRICS AND SPEECH INTELLIGIBILITY METRICS

Number: WO2021119102A1

Some implementations involve determining a noise metric and/or a speech intelligibility metric and determining a compensation process corresponding to the noise metric and/or the speech intelligibility metric. The compensation process may involve altering a processing of audio data and/or applying a non-audio-based compensation method. In some examples, altering the processing of the audio data does not involve applying a broadband gain increase to the audio signals. Some examples involve applying the compensation process in an audio environment. Other examples involve determining compensation metadata corresponding to the compensation process and transmitting an encoded content stream that includes encoded compensation metadata, encoded video data and encoded audio data from a first device to one or more other devices.

Publication date: 19-10-2021

Audio quality of speech in sound systems

Number: US0011151981B2

A computer implemented method, apparatus, and computer program product for a sound system. Speech recognition is performed on input audio data comprising speech input to a sound system. Speech recognition is additionally performed on at least one instance of output audio data comprising speech reproduced by one or more audio speakers of the sound system. A difference between a result of speech recognition performed on the input audio data and a result of speech recognition performed on an instance of corresponding output audio data is determined. The quality of the reproduced speech is determined as unsatisfactory when the difference is greater than or equal to a threshold. A corrective action may be performed, to improve the quality of the speech reproduced by the sound system, if it is determined that the speech quality of the reproduced sound is unsatisfactory.
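The comparison step in this abstract can be sketched by scoring the difference between two recognition results and testing it against a threshold. This is only a sketch: the abstract does not say how the difference is measured, so the word-level similarity from Python's standard `difflib` module and the default `threshold` of 0.2 are assumptions, and the function names are hypothetical.

```python
from difflib import SequenceMatcher


def transcript_difference(input_text, output_text):
    """Return a difference in [0, 1] between two transcripts, compared word by word."""
    a = input_text.lower().split()
    b = output_text.lower().split()
    return 1.0 - SequenceMatcher(None, a, b).ratio()


def is_unsatisfactory(input_text, output_text, threshold=0.2):
    """Flag reproduced speech when the difference reaches the threshold."""
    return transcript_difference(input_text, output_text) >= threshold


source = "please board at gate four"       # recognized from the input audio
reproduced = "peas bored at late four"     # recognized from a loudspeaker output
if is_unsatisfactory(source, reproduced):
    print("speech quality unsatisfactory: trigger corrective action")
```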

Publication date: 25-12-2018

Audio encoder and decoder

Number: US0010163446B2

This disclosure falls into the field of audio coding, in particular it is related to the field of spatial audio coding, where the audio information is represented by multiple audio objects including at least one dialog object. In particular the disclosure provides a method and apparatus for enhancing dialog in a decoder in an audio system. Furthermore, this disclosure provides a method and apparatus for encoding such audio objects for allowing dialog to be enhanced by the decoder in the audio system.

Publication date: 07-01-2021

Method for Processing an Acoustic Speech Input Signal and Audio Processing Device

Number: US20210006910A1
Assignee: Two Pi GMBH

The invention relates to a method for processing an acoustic input signal, preferably a speech signal, said method comprising the following steps: a) receiving a digital representation (S) of an acoustic input signal, b) calculating at least one statistical parameter (P) of the digital representation (S) of the acoustic input signal, c) calculating a compression ratio function (CR) based on a prescribed constant compression ratio (CR), said prescribed constant compression ratio (CR) uniformly mapping acoustic input signals of a selected magnitude to acoustic output signals of a selected magnitude, and on at least one statistical parameter (P) calculated in step b), and d) applying the non-uniform compression ratio function (CR) according to step c) on the digital representation (S) of the acoustic input signal delivering a digital representation (S) of an enhanced acoustic output signal.

1. A method for processing an acoustic input signal, the method comprising: a) receiving a digital representation of an acoustic input signal, b) calculating at least one statistical parameter of the digital representation of the acoustic input signal, c) calculating a compression ratio function based on a prescribed constant compression ratio, where the prescribed constant compression ratio uniformly maps acoustic input signals of a selected magnitude to acoustic output signals of a selected magnitude, and on at least one statistical parameter calculated in step b), wherein the compression ratio function deviates from the prescribed constant compression ratio by including a non-uniform mapping of acoustic input signals of a selected magnitude to acoustic output signals of a selected magnitude, wherein the non-uniformity of the mapping procedure is determined based on at least one statistical parameter calculated according to step b), and d) applying the non-uniform compression ratio function according to step c) on the digital representation of the acoustic input signal to ...

Publication date: 23-06-2015

Method and apparatus for audio intelligibility enhancement and computing apparatus

Number: US0009064497B2
Assignee: HTC Corporation

Method and apparatus for audio intelligibility enhancement and computing apparatus are provided. The method includes the following steps. Environment noise is detected by performing voice activity detection according to a detected audio signal from at least a microphone of a computing device. Noise information is obtained according to the detected environment noise and a first audio signal. A second audio signal is outputted by boosting the first audio signal under an adjustable headroom by the computing device according to the noise information and the first audio signal.

Publication date: 03-09-2015

VOICE PROCESSING DEVICE, NOISE SUPPRESSION METHOD, AND COMPUTER-READABLE RECORDING MEDIUM STORING VOICE PROCESSING PROGRAM

Number: US20150248895A1
Author: Chikako Matsumoto
Assignee: FUJITSU LIMITED

A voice processing device includes a noise-originating coefficient calculation section that calculates a noise-originating coefficient that gradually decreases as a target value of stationary noise for each frequency increases, the target value being calculated based on an amplitude value of a frequency spectrum obtained by time-frequency transforming a voice signal for a predetermined period of time, and a suppression signal generation section that generates, when the frequency spectrum is determined as being stationary on the basis of the amplitude value, a suppression signal by multiplying a suppression coefficient based on the noise-originating coefficient by the amplitude value, the suppression signal being frequency-time transformed to be output.

Publication date: 07-06-2022

Phone stand using a plurality of microphones

Number: US0011355135B1
Author: Chi Fai Ho, John Chiong
Assignee: TP Lab, Inc.

A phone stand includes a phone holder for coupling to a phone for conducting an audio session, the audio session including at least one voice session conducted by an application executing on the phone and a plurality of microphones including a particular microphone closer to a location where a user is expected to be positioned than other microphones. The phone stand further includes a system controller configured to: receive sound signals from the particular microphone, the sound signals comprising the user's speech; separate the sound signals into speech signals and non-speech signals; obtain one or more input mixing attributes for the speech signals and the non-speech signals; modify the speech signals and the non-speech signals based on the one or more input mixing attributes; generate mixed signals by combining the modified speech signals and the modified non-speech signals; and send the mixed signals to the phone.

04-03-2020 дата публикации

DYNAMIC TEXT-TO-SPEECH PROVISIONING

Номер: EP3510591B1
Принадлежит: Google LLC

07-08-2019 дата публикации

AUDIO ENCODER AND DECODER

Номер: RU2696952C2

The invention relates to spatial audio coding in which the audio information is represented by a plurality of audio objects that includes at least one dialogue object. The technical result is improved audio coding efficiency. A plurality of downmix signals is obtained, the downmix signals being the result of downmixing a plurality of audio objects that includes at least one object representing dialogue. Side information is obtained that indicates coefficients enabling reconstruction of the plurality of audio objects from the plurality of downmix signals. Data are obtained that identify which of the plurality of audio objects represents dialogue. The coefficients are modified using a gain parameter and the data identifying which of the plurality of audio objects represents dialogue. At least the at least one object representing dialogue is reconstructed using the modified ...

23-10-2019 дата публикации

Audio Signal

Номер: GB0002566760B
Принадлежит: PLEASE HOLD UK LTD, Please Hold (UK) Ltd

15-06-2003 дата публикации

SPEECH PROCESSING

Номер: AT0000241845T
Принадлежит:

15-11-2010 дата публикации

SYSTEM AND PROCEDURE FOR THE DECREASE OF UPLINKGERÄU

Номер: AT0000487214T
Принадлежит:

04-06-2020 дата публикации

Audio signal

Номер: AU2018351031A1
Принадлежит: Halfords IP

A computer device (100) for processing audio signals is described. The computer device (100) includes at least a processor and a memory. The computer device (100) is configured to receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal. The computer device (100) is configured to compress the combined audio signal to provide a compressed audio signal. The computer device (100) is configured to control a dynamic range of the compressed audio signal to provide an output audio signal. In this way, a quality of the speech included in the output audio signal is improved.

22-10-2020 дата публикации

DIALOGUE ENHANCEMENT IN AUDIO CODEC

Номер: CA3134792A1
Принадлежит:

Dialogue enhancement of an audio signal, comprising obtaining a set of time-varying parameters configured to estimate a dialogue component present in said audio signal, estimating the dialogue component from the audio signal, applying a compressor only to the estimated dialogue component, to generate a processed dialogue component, applying a user-determined gain to the processed dialogue component, to provide an enhanced dialogue component. The processing of the estimated dialogue may be performed on the decoder side or encoder side. The invention enables an improved dialogue enhancement.
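The processing chain in this abstract (estimate the dialogue, compress only that estimate, then apply a user gain) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the time-varying parameters are modeled as one weight per frame, the compressor is a simple static curve driven by the frame RMS, and the names `compress_db` and `enhance_dialogue` are hypothetical.

```python
import math


def compress_db(level_db, threshold_db=-30.0, ratio=3.0):
    """Static compressor curve (assumed): levels above the threshold grow 1/ratio as fast."""
    if level_db <= threshold_db:
        return level_db
    return threshold_db + (level_db - threshold_db) / ratio


def enhance_dialogue(frames, dialogue_weights, user_gain_db=6.0):
    """Return the enhanced dialogue component, frame by frame.

    frames: list of frames, each a list of samples.
    dialogue_weights: one weight in [0, 1] per frame, standing in for the
    time-varying parameters that estimate the dialogue (an assumption).
    """
    enhanced = []
    for frame, weight in zip(frames, dialogue_weights):
        # Step 1: estimate the dialogue component from the audio signal.
        dialogue = [weight * x for x in frame]
        # Step 2: compress only the estimated dialogue (frame-RMS based).
        rms = math.sqrt(sum(x * x for x in dialogue) / len(dialogue)) if dialogue else 0.0
        if rms > 0.0:
            level_db = 20.0 * math.log10(rms)
            comp_gain_db = compress_db(level_db) - level_db  # always <= 0 dB
        else:
            comp_gain_db = 0.0
        # Step 3: apply the user-determined gain to the processed dialogue.
        total_gain = 10.0 ** ((comp_gain_db + user_gain_db) / 20.0)
        enhanced.append([total_gain * x for x in dialogue])
    return enhanced
```

A frame whose dialogue estimate stays below the threshold passes through the compressor unchanged, so only the user gain affects it; louder frames are reduced before the user gain is applied.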

26-06-2014 дата публикации

EFFECTIVE ATTENUATION OF PRE-ECHOS IN A DIGITAL AUDIO SIGNAL

Номер: CA0002894743A1
Принадлежит:

The invention relates to a method for processing pre-echo attenuation in a digital audio signal decoded by transform decoding. The method comprises the following steps: decomposition (E603) of the decoded signal into at least two sub-signals according to a predetermined decomposition criterion; calculation (E604) of attenuation factors per sub-signal and per sample of a previously determined pre-echo zone; attenuation (E605) of pre-echo in the pre-echo zone of each of the sub-signals by applying the attenuation factors to the sub-signals; and obtaining (E606) the attenuated signal by adding the attenuated sub-signals. The invention also relates to a processing device implementing the steps of the described method, and to a decoder comprising such a device.

06-04-2018 дата публикации

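SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING

Номер: KR1020180035937A
Принадлежит:

... Embodiments are described for an adaptive audio system that processes audio data comprising a plurality of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or an object-based stream. Channel-based streams have rendering information encoded by means of the channel name; object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) so as to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.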

03-12-2015 дата публикации

ENHANCING INTELLIGIBILITY OF SPEECH CONTENT IN AN AUDIO SIGNAL

Номер: WO2015183728A2
Принадлежит:

Embodiments of the present invention relate to signal processing. Methods for enhancing intelligibility of speech content in an audio signal are disclosed. One of the methods comprises obtaining reference loudness of the audio signal. The method further comprises enhancing the intelligibility of the speech content by adjusting partial loudness of the audio signal based on the reference loudness and a degree of the intelligibility. Corresponding systems and computer program products are also disclosed.

25-04-2019 дата публикации

AUDIO SIGNAL

Номер: WO2019077373A1
Автор: COOKE, Michael
Принадлежит:

A computer device (100) for processing audio signals is described. The computer device (100) includes at least a processor and a memory. The computer device (100) is configured to receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal. The computer device (100) is configured to compress the combined audio signal to provide a compressed audio signal. The computer device (100) is configured to control a dynamic range of the compressed audio signal to provide an output audio signal. In this way, a quality of the speech included in the output audio signal is improved.

29-05-2018 дата публикации

Sound verification

Номер: US0009984703B2

In some examples, sound verification may include a speaker device that may be configured to transmit sound at a dynamic volume level and a listening device that may be configured to receive the sound and provide feedback to the speaker device based on the received sound. The primary transceiver device may be further configured to adjust the dynamic volume level based on the feedback provided by the secondary transceiver device.

01-09-2016 дата публикации

AUDIO PROCESSING FOR MULTI-PARTICIPANT COMMUNICATION SYSTEMS

Номер: US20160255203A1
Принадлежит: AT&T Intellectual Property I, L.P.

Audio processing is provided to determine whether an audio issue is present within a multi-participant communication system such as a teleconference or videoconference bridge or a trunk dispatch system. Audio issues such as background noise, background conversations, or other unwanted audio that is being interjected into the multi-participant conversation and that may be dominating the audio are detected by measuring characteristics of audio samples taken from the communication ports of the multi-participant communication system. A correction may then be applied to the audio received through the communication port by a processor of the multi-participant communication system without intervention by an administrator, such as by muting the port, applying a noise cancellation to audio from the port, or time-shifting the audio from the port.

01-07-2021 дата публикации

Audio Device with Speech-Based Audio Signal Processing

Номер: US20210201926A1
Автор: Michael Stark
Принадлежит:

An audio device with an electro-acoustic transducer and a processor that is configured to determine if input audio signals are speech-based, and if the input audio signals are determined to be speech-based apply speech dynamic range compression to the input audio signals, to develop revised audio signals. The revised audio signals are provided to the transducer. 1. A computer program product having a non-transitory computer-readable medium including computer program logic encoded thereon that, when performed on an audio device that is configured to play audio signals over an electro-acoustic transducer, causes the audio device to: determine if input audio signals are speech-based; if the input audio signals are determined to be speech-based, apply speech dynamic range compression to the input audio signals, to develop revised audio signals; and provide the revised audio signals to the transducer. 2. The computer program product of claim 1, wherein if the input audio signals are determined to be speech-based the computer program product further causes the audio device to apply at least one of speech static equalization or speech dynamic equalization to the input audio signals. 3. The computer program product of claim 2, wherein if the input audio signals are not determined to be speech-based the computer program product causes the audio device to apply at least one of non-speech static equalization, non-speech dynamic equalization, or non-speech dynamic range compression to the input audio signals. 4. The computer program product of claim 3, wherein if the input audio signals are not determined to be speech-based the computer program product causes the audio device to apply non-speech static equalization, non-speech dynamic equalization, and non-speech dynamic range compression to the input audio signals. 5. The computer program product of claim 3, wherein speech static equalization has less low frequency compensation and more high ...

17-09-2019 дата публикации

Voice activity detector for audio signals

Номер: US0010418052B2

According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having a sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
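The final decision step of this method (per-subband signal to noise ratios, their average, and a weighted average of subband energies) can be sketched in Python as follows. This is only an illustration under stated assumptions: the subband energies and noise estimates are taken as already computed, the subband split and the moving average filter on the lowest subband are left out, and the thresholds, the way the two measures are combined, and the name `speech_activity` are hypothetical rather than taken from the patent.

```python
import math


def speech_activity(subband_energy, noise_energy, weights=None,
                    snr_threshold_db=6.0, energy_threshold=1e-4):
    """Return (mean SNR in dB, weighted mean energy, speech decision) for one frame."""
    eps = 1e-12  # guards against log10(0) and division by zero
    n = len(subband_energy)
    if weights is None:
        weights = [1.0] * n  # equal weighting unless the caller says otherwise
    # Signal to noise ratio for each subband, in dB.
    snr_db = [10.0 * math.log10(max(e, eps) / max(nz, eps))
              for e, nz in zip(subband_energy, noise_energy)]
    mean_snr_db = sum(snr_db) / n
    # Weighted average of the energy of each subband.
    weighted_energy = (sum(w * e for w, e in zip(weights, subband_energy))
                       / sum(weights))
    # Assumed combination rule: both measures must exceed their thresholds.
    is_speech = mean_snr_db > snr_threshold_db and weighted_energy > energy_threshold
    return mean_snr_db, weighted_energy, is_speech
```

For example, `speech_activity([1.0, 1.0], [0.01, 0.01])` reports a mean SNR of about 20 dB and flags the frame as speech, while a frame whose energies equal the noise estimates is not flagged.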

20-05-2021 дата публикации

Data Driven Radio Enhancement

Номер: US20210151069A1
Принадлежит: BabbleLabs LLC

Systems and methods are disclosed for data driven radio enhancement. For example, methods may include demodulating a radio signal to obtain a demodulated audio signal; determining a window of audio samples based on the demodulated audio signal; applying an audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the audio enhancement network includes a machine learning network that has been trained using demodulated audio signals derived from radio signals; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

27-01-2021 дата публикации

A METHOD FOR OPERATING A BINAURAL HEARING SYSTEM

Номер: EP3252764B1
Принадлежит: Sivantos Pte. Ltd.

10-01-2017 дата публикации

EFFECTIVE ATTENUATION OF PRE-ECHOES IN A DIGITAL AUDIO SIGNAL

Номер: RU2607418C2
Принадлежит: ORANGE (FR)

The invention relates to means for attenuating pre-echoes in a digital audio signal. The technical result is that high frequencies and spurious pre-echoes can be attenuated at decoding without the encoder transmitting any auxiliary information. Pre-echoes are attenuated in a digital audio signal produced by transform coding. An attack position is detected in the decoded signal. A pre-echo zone preceding the attack position detected in the decoded signal is determined. Attenuation factors are calculated for each sub-block of the pre-echo zone as a function of at least the frame in which the attack was detected and the preceding frame. The pre-echo is attenuated in the sub-blocks of the pre-echo zone using the corresponding attenuation factors. The pre-echo attenuation method further comprises a step of applying ...

19-09-2017 дата публикации

OPTIMIZING LOUDNESS AND DYNAMIC RANGE ACROSS DIFFERENT PLAYBACK DEVICES

Номер: RU2631139C2

The invention relates to audio signal processing, in particular to processing audio bitstreams with metadata. The technical result is enabling reception of the bitstreams. The technical result is achieved by analyzing the metadata to determine whether the metadata is or includes profile metadata indicating a target profile, the profile metadata being usable to perform at least one of loudness control, loudness normalization, or dynamic range control of the audio data in accordance with the target profile, wherein the target profile defines a target loudness and/or at least one target dynamic range characteristic of a rendered version of the audio data for playback by an audio playback device from a group of audio playback devices. 4 independent and 15 dependent claims, 17 figures.

09-12-1998 дата публикации

Speech processing

Номер: GB0009822529D0
Автор:
Принадлежит:

02-11-2006 дата публикации

NOISE SUPPRESSION PROCESS AND DEVICE

Номер: CA0002574468A1
Принадлежит:

13-06-2019 дата публикации

VOICE AWARE AUDIO SYSTEM AND METHOD

Номер: CA0003084890A1
Принадлежит: STIKEMAN ELLIOTT S.E.N.C.R.L.,SRL/LLP

A voice aware audio system and a method for a user wearing a headset to be aware of an outer sound environment while listening to music or any other audio source. An adjustable sound awareness zone gives the user the flexibility to avoid hearing far distant voices. The outer sound can be analyzed in a frequency domain to select an oscillating frequency candidate and in a time domain to determine if the oscillating frequency candidate is the signal of interest. If the signal directed to the outer sound is determined to be a signal of interest the outer sound is mixed with audio from the audio source.

19-02-2013 дата публикации

SYSTEM FOR IMPROVING SPEECH INTELLIGIBILITY THROUGH HIGH FREQUENCY COMPRESSION

Номер: CA0002569221C
Принадлежит: QNX SOFTWARE SYSTEMS LIMITED

... A speech enhancement system that improves the intelligibility and the perceived quality of processed speech includes a frequency transformer and a spectral compressor. The frequency transformer converts speech signals from the time domain to the frequency domain. The spectral compressor compresses a pre-selected portion of the high frequency band and maps the compressed high frequency band to a lower band limited frequency range. ...

26-10-2016 дата публикации

The acoustic signal processing apparatus and acoustic signal processing method

Номер: CN0104185870B
Автор:
Принадлежит:

24-10-2019 дата публикации

MIXING DEVICE, MIXING METHOD, AND MIXING PROGRAM

Номер: WO2019203124A1
Принадлежит:

Provided is a mixing technology with which it is possible to suppress degradation of a low-priority sound and output a more natural mixed sound irrespective of the size or quality of a playback device. This mixing device for mixing a first signal and a second signal on a time-frequency plane has: a control signal generation unit which generates a control signal that indicates whether or not to perform a preferential mixing that includes amplification of the first signal and attenuation of the second signal; and a gain derivation unit which derives, on the basis of the control signal, a first gain to amplify the first signal and a second gain to attenuate the second signal. The control signal at least takes a first value and a second value different from the first value. The first value does not continue beyond a specific bandwidth on the time-frequency axis. When the control signal takes the first value, the mixing device applies the preferential mixing to the first and second signals, ...

25-06-2020 дата публикации

BIOMETRIC USER RECOGNITION

Номер: WO2020128476A1
Автор: LESSO, John Paul
Принадлежит:

A method of biometric user recognition comprises, in an enrolment stage, receiving first biometric data relating to a biometric identifier of the user; generating a plurality of biometric prints for the biometric identifier, based on the received first biometric data, and enrolling the user based on the plurality of biometric prints. Then, during a verification stage, the method comprises receiving second biometric data relating to the biometric identifier of the user; performing a comparison of the received second biometric data with the plurality of biometric prints; and performing user recognition based on the comparison.

06-06-2017 дата публикации

Methods and apparatus for improving understandability of audio corresponding to dictation

Номер: US0009671999B2

According to some aspects, a method for improving understandability of audio corresponding to dictation to assist a transcriptionist in transcribing the dictation is provided. The method comprises presenting a user interface to the transcriptionist, the user interface including at least one control that can be selectively set to one of a plurality of settings, receiving a selection of one of the plurality of settings via the at least one control, and compressing a dynamic range of at least a portion of the audio using at least one parameter value associated with the selected setting.

03-09-2019 дата публикации

Enhancing audio content for voice isolation and biometric identification by adjusting high frequency attack and release times

Номер: US0010403302B2
Принадлежит: YOBE, INC., YOBE INC, Yobe, Inc

Systems and methods for isolating audio content and biometric authentication include receiving, with an audio receiver, an audio signal spanning a plurality of frequency bands, identifying a speech signal carried by a voice frequency band selected from the plurality of frequency bands, enhancing the speech signal relative to other audio content within the audio signal, and extracting a voice profile key that uniquely identifies the speech signal, wherein enhancing the first speech signal comprises adjusting attack and release times of the speech signal based on sound events within the speech signal, the attack time being associated with very high frequency sounds that are not phase-shifted.

22-11-2022 дата публикации

Hearing system containing a hearing instrument and a method for operating the hearing instrument

Номер: US0011510018B2
Принадлежит: Sivantos Pte. Ltd.

A hearing system contains a hearing instrument and the hearing instrument is configured to support the hearing of a hearing-impaired user. The hearing instrument is operated via an operating method. The method includes capturing a sound signal from an environment of the hearing instrument, processing the captured sound signal to at least partially compensate the hearing-impairment of the user and outputting the processed sound signal to the user. The captured sound signal is analyzed to recognize speech intervals, in which the captured sound signal contains speech. During recognized speech intervals, at least one time derivative of an amplitude and/or a pitch of the captured sound signal is determined. The amplitude of the processed sound signal is temporarily increased, if the at least one derivative fulfills a predefined criterion.
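The last two steps of this operating method (take a time derivative of the amplitude during speech intervals and temporarily raise the output amplitude when it meets a criterion) can be sketched in Python. The sketch rests on assumptions that are not in the abstract: the amplitude is given as a per-frame envelope, the derivative is a first difference, the criterion is a fixed slope threshold, and the boost lasts a fixed number of frames. The name `accent_boost` and all default values are hypothetical.

```python
def accent_boost(envelope, is_speech, slope_threshold=0.05, boost=1.5, hold=3):
    """Return one gain per frame: `boost` for `hold` frames after a steep
    envelope rise inside a recognized speech interval, otherwise 1.0."""
    gains = []
    remaining = 0  # frames of boost still to apply
    prev = envelope[0] if envelope else 0.0
    for level, speech in zip(envelope, is_speech):
        derivative = level - prev  # first difference as the time derivative
        prev = level
        # Assumed criterion: a rise steeper than the threshold during speech.
        if speech and derivative > slope_threshold:
            remaining = hold
        if remaining > 0:
            gains.append(boost)
            remaining -= 1
        else:
            gains.append(1.0)
    return gains
```

With the envelope `[0.1, 0.1, 0.3, 0.3, 0.3]`, all frames marked as speech and `hold=2`, the gains are `[1.0, 1.0, 1.5, 1.5, 1.0]`; outside speech intervals the gain stays at 1.0.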

04-06-2014 дата публикации

ADAPTIVE VOICE INTELLIGIBILITY PROCESSOR

Номер: EP2737479A2
Принадлежит:

09-09-2024 дата публикации

VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Номер: RU2826268C2

The invention relates to audio signal processing, in particular to apparatuses and methods for classifying and processing audio signals, especially to controlling a dialogue enhancer, a surround virtualizer, a volume leveler and an equalizer. The technical result is automatic, continuous adjustment of the audio enhancement device according to the audio content being played, so as to prevent audible distortions at transition points. A loudness normalization method and an audio processing apparatus for loudness normalization based on a target loudness value are claimed. In one embodiment, dynamic gain parameters applied to audio segments of an audio signal are determined based on the target loudness value: for a first audio segment, depending on a short-term characteristic of the audio signal; for a second audio segment, depending on a long-term characteristic of the audio ...

26-03-2021 дата публикации

"SPEECH CORRECTOR": A DEVICE FOR IMPROVING SPEECH INTELLIGIBILITY

Номер: RU203218U1

The utility model relates to rehabilitation equipment and medicine and can be used at home and in medical and educational institutions. The technical result is that the device for improving speech intelligibility can be used without prior adjustment, thanks to band-pass filters tuned to the frequencies of the spectral zones of the phonetic features of speech. The technical result is achieved by a device for improving speech intelligibility that consists of a housing in which are mounted a microphone, band-pass filters tuned to the frequencies of the spectral zones of the phonetic features of speech, a power amplifier with a volume control, and a headphone jack, connected in series, as well as a Bluetooth headset whose microphone is the microphone of the device and whose headphone output is connected to the input of the band-pass filters of the device, a battery that powers the device, and a battery charging circuit with a jack for connecting an external ...

25-12-2018 дата публикации

Method for improving speech intelligibility

Номер: RU2676022C1

The invention relates to means for improving speech intelligibility. The technical result is improved speech intelligibility. The speech signal is amplified, filtered by a bank of band-pass filters, and fed to an earphone or loudspeaker. The band-pass filters can be switched off in various combinations. The band-pass filters are tuned to the spectral zones that carry the main phonetic features of speech sounds. 1 dependent claim.

20-08-2016 дата публикации

EFFECTIVE ATTENUATION OF PRE-ECHOES IN A DIGITAL AUDIO SIGNAL

Номер: RU2015102814A
Принадлежит:

... 1. A method of attenuating pre-echoes in a digital audio signal produced by transform coding, wherein, at decoding, the method comprises the steps of: detecting (Detect.) an attack position in the decoded signal; determining (ZPE) a pre-echo zone preceding the attack position detected in the decoded signal; calculating (F.Att.) attenuation factors for each sub-block of the pre-echo zone as a function of at least the frame in which the attack was detected and the preceding frame; attenuating (Att.) the pre-echo in the sub-blocks of the pre-echo zone using the corresponding attenuation factors, wherein the method further comprises the step of: applying adaptive filtering (F) for spectral shaping of the pre-echo zone on the current frame up to the detected attack position. 2. The method of claim 1, further comprising the steps of calculating at least one ...

13-08-2015 дата публикации

Respirator mask speech enhancement apparatus and method

Номер: AU2014212792A1
Принадлежит:

Speech enhancement apparatus and respirator masks including speech enhancement apparatus, as well as methods of enhancing speech transmission for the wearer of a respirator mask are described herein. In one or more embodiments, the speech enhancement apparatus and methods described herein detect acoustic energy within a first frequency range in the clean air envelope of a respirator mask and deliver compensating acoustic energy outside of the clean air envelope using a speaker. The compensating acoustic energy, in one or more embodiments, exhibits a predetermined attenuated amplitude profile such that the compensating acoustic energy has an amplitude less than 6 dB greater than the acoustic attenuation profile of the mask body over at least 90% of a predetermined attenuated frequency range.

07-02-2003 дата публикации

SOUND INTELLIGIBILITY ENHANCEMENT USING A PSYCHOACOUSTIC MODEL AND AN OVERSAMPLED FILTERBANK

Номер: CA0002354755A1
Принадлежит:

A sound intelligibility enhancement system is disclosed. The system uses a psychoacoustic model and an oversampled filterbank where the level of a signal-of-interest that falls below the environmental noise is selectively amplified as a function of the input level so that it is audible above the noise.
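The core idea of level-dependent amplification (raise only those bands where the signal of interest falls below the noise) can be sketched per band in Python. This is a simplified illustration under assumptions of its own: the band levels are given in dB as if a filterbank had already produced them, the psychoacoustic model is reduced to a fixed audibility margin, and the gain is capped. The name `band_gains_db` and the default values are hypothetical.

```python
def band_gains_db(signal_db, noise_db, margin_db=3.0, max_gain_db=20.0):
    """Return one gain in dB per band.

    Bands where the signal is at or above the noise are left alone (0 dB).
    Bands below the noise are raised to `margin_db` above it, with the gain
    limited to `max_gain_db`.
    """
    gains = []
    for s, n in zip(signal_db, noise_db):
        if s >= n:
            gains.append(0.0)  # already audible above the noise
        else:
            gains.append(min((n + margin_db) - s, max_gain_db))
    return gains
```

For band levels of 40, 60 and 10 dB against a flat 50 dB noise floor, the gains are 13, 0 and 20 dB: the first band is lifted just above the noise, the second is untouched, and the third hits the cap.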

03-01-2014 дата публикации

EFFECTIVE PRE-ECHO ATTENUATION IN A DIGITAL AUDIO SIGNAL

Номер: CA0002874965A1
Принадлежит:

The invention relates to a method for processing pre-echo attenuation in a digital audio signal generated by transform coding, in which, at decoding, the method comprises the steps of detecting (Detect.) an attack position in the decoded signal, determining (ZPE) a pre-echo zone preceding the attack position detected in the decoded signal, calculating (F. Att.) attenuation factors per sub-block of the pre-echo zone as a function of at least the frame in which the attack was detected and the preceding frame, and attenuating (Att.) the pre-echo in the sub-blocks of the pre-echo zone by the corresponding attenuation factors. The method further comprises applying spectral-shaping filtering (F) to the pre-echo zone on the current frame up to the detected attack position. The invention also relates to a device implementing the method and to a decoder comprising such a device.

17-11-2020 дата публикации

EFFECTIVE ATTENUATION OF PRE-ECHOS IN A DIGITAL AUDIO SIGNAL

Номер: CA0002894743C
Принадлежит: ORANGE

A method is provided for processing attenuation of pre-echo in a digital audio signal decoded by transform decoding. The method includes the following acts: decomposition of the decoded signal into at least two sub-signals according to a pre-determined decomposition criterion; calculation of attenuation factors per sub-signal and per sample of a previously determined pre-echo zone; attenuation of pre-echo in the pre-echo zone of each of the sub-signals by applying attenuation factors to the sub-signals; and production of the attenuated signal by addition of the attenuated sub-signals. Also provided are a processing device implementing the acts of the described method, and a decoder including such a device.
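The four acts listed in this abstract (decompose, compute factors, attenuate each sub-signal in the pre-echo zone, add the results) can be sketched in Python. The details are assumptions made for illustration only: the decomposition criterion is a 3-tap moving average that splits the signal into a low and a high part, the pre-echo zone is passed in as a sample range, and a constant factor per sub-signal stands in for the per-sample factors the abstract mentions. The function names are hypothetical.

```python
def moving_average(signal, width=3):
    """Centered moving average with shortened windows at the edges."""
    half = width // 2
    out = []
    for i in range(len(signal)):
        window = signal[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def attenuate_pre_echo(signal, zone_start, zone_end,
                       low_factor=0.8, high_factor=0.3):
    """Attenuate samples in [zone_start, zone_end) separately per sub-signal."""
    # Act 1: decompose into two sub-signals that sum back to the input.
    low = moving_average(signal)
    high = [s - l for s, l in zip(signal, low)]
    out = []
    for i, (l, h) in enumerate(zip(low, high)):
        # Acts 2 and 3: apply each sub-signal's factor inside the zone only.
        if zone_start <= i < zone_end:
            l, h = low_factor * l, high_factor * h
        # Act 4: the attenuated signal is the sum of the sub-signals.
        out.append(l + h)
    return out
```

Because the two sub-signals sum back to the input, samples outside the pre-echo zone are unchanged, and a stronger `high_factor` reduction targets the high-frequency part of the pre-echo while leaving more of the low part.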

25-04-2019 дата публикации

NOISE REDUCTION USING MACHINE LEARNING

Номер: WO2019079713A1
Принадлежит:

The technology described in this document can be embodied in a method for processing an input signal that represents a signal of interest captured in the presence of noise to generate a de-noised estimate of the signal of interest. The method includes receiving an input signal representing a signal of interest captured in the presence of noise. The method also includes processing at least a portion of the input signal using a digital filter to generate a filtered signal, the digital filter configured to suppress at least a portion of spectrum of the noise. The method further includes processing the filtered signal using a first neural network to generate a de-noised estimate of the signal of interest, wherein the first neural network is trained to compensate for distortions introduced by the digital filter in the signal of interest.

19-01-2017 дата публикации

METHODS OF FREQUENCY-MODULATED PHASE CODING (FMPC) FOR COCHLEAR IMPLANTS AND COCHLEAR IMPLANTS APPLYING SAME

Номер: WO2017011396A1
Принадлежит:

A method of generating frequency-modulated pulse trains in a CI includes dividing data representing audio spanning frequency bands into a plurality of bins associated with each frequency band, each bin representing an energy level of the data within the frequency band within a time period; associating each frequency band with a phase probability that starts at an initial phase probability value (PPV), resets to a minimum PPV after generating a pulse, and increases from the minimum PPV to a maximum PPV over a time period; for each bin, assigning a power probability as a normalized intensity being a number between a minimum power probability and a maximum power probability representing the energy level of the bin, and generating a pulse in an electrode associated with the frequency band associated with the bin when a random number generated is less than the power probability divided by the phase probability.

21-09-2017 дата публикации

AN AUDIO SIGNAL PROCESSING APPARATUS AND METHOD FOR PROCESSING AN INPUT AUDIO SIGNAL

Номер: WO2017157427A1
Принадлежит:

The invention relates to an audio signal processing apparatus (100) and method for processing an input audio signal (101) into an output audio signal (103). The audio signal processing apparatus (100) comprises a decomposer (105) configured to decompose the input audio signal (101) into a direct audio signal (102a) and a diffuse audio signal (102b), a modifier (107) configured to modify the direct audio signal (102a) in order to obtain a modified direct audio signal (102a'), wherein the modifier (107) comprises a bandwidth extender (107a) configured to extend an upper cutoff frequency of the frequency range of the direct audio signal (102a), and a combiner (109) configured to combine the modified direct audio signal (102a') with the diffuse audio signal (102b) in order to obtain the output audio signal (103).

17-11-2020 дата публикации

Speaker enrollment

Номер: US0010839810B2

A method of speaker modelling for a speaker recognition system, comprises: receiving a signal comprising a speaker's speech; and, for a plurality of frames of the signal: obtaining a spectrum of the speaker's speech; generating at least one modified spectrum, by applying effects related to a respective vocal effort; and extracting features from the spectrum of the speaker's speech and the at least one modified spectrum. The method further comprises forming at least one speech model based on the extracted features.

05-01-2012 дата публикации

Audio human verification

Номер: US20120004914A1
Принадлежит: Microsoft Corp

A system generates an audio challenge that includes a first voice and one or more second voices, the first voice being audibly distinguishable, by a human, from the one or more second voices. The first voice conveys first information and the second voice conveys second information. The system provides the audio challenge to a user and verifies that the user is human based on whether the user can identify the first information in the audio challenge.

17-05-2012 дата публикации

Post-noise suppression processing to improve voice quality

Номер: US20120123775A1
Принадлежит: Individual

Provided are methods and systems for improving quality of speech communications. The method may be for improving quality of speech communications in a system having a speech encoder configured to encode a first audio signal using a first set of encoding parameters associated with a first noise suppressor. A method may involve receiving a second audio signal at a second noise suppressor which provides much higher quality noise suppression than the first noise suppressor. The second audio signal may be generated by a single microphone or a combination of multiple microphones. The second noise suppressor may suppress the noise in the second audio signal to generate a processed signal which may be sent to a speech encoder. A second set of encoding parameters may be provided by the second noise suppressor for use by the speech encoder when encoding the processed signal into corresponding data.

Publication date: 14-06-2012

Method and system for reconstructing speech from an input signal comprising whispers

Number: US20120150544A1
Assignee: NANYANG TECHNOLOGICAL UNIVERSITY

A system for reconstructing speech from an input signal comprising whispers is disclosed. The system comprises an analysis unit configured to analyse the input signal to form a representation of the input signal; an enhancement unit configured to modify the representation of the input signal to adjust a spectrum of the input signal, wherein the adjusting of the spectrum of the input signal comprises modifying a bandwidth of at least one formant in the spectrum to achieve a predetermined spectral energy distribution and amplitude for the at least one formant; and a synthesis unit configured to reconstruct speech from the modified representation of the input signal.

Publication date: 02-08-2012

Voice correction device, voice correction method, and recording medium storing voice correction program

Number: US20120197634A1
Assignee: Fujitsu Ltd

A voice correction device includes a detector that detects a response from a user, a calculator that calculates an acoustic characteristic amount of an input voice signal, an analyzer that outputs an acoustic characteristic amount of a predetermined amount when having acquired a response signal due to the response from the detector, a storage unit that stores the acoustic characteristic amount output by the analyzer, a controller that calculates a correction amount of the voice signal on the basis of a result of a comparison between the acoustic characteristic amount calculated by the calculator and the acoustic characteristic amount stored in the storage unit, and a correction unit that corrects the voice signal on the basis of the correction amount calculated by the controller.

Publication date: 04-07-2013

VOICE CLARIFICATION APPARATUS

Number: US20130173262A1
Assignee: YAMAHA CORPORATION

The voice clarification apparatus includes a plurality of band-pass filters that respectively extract a plurality of band components, which are included in a voice band, from an input audio signal; a gain determination unit that determines a gain according to the level of a signal of a band component which is extracted by at least one band-pass filter of the plurality of band-pass filters; a level adjustment unit that adjusts the levels of signals of the plurality of band components which are extracted by the plurality of band-pass filters using the gain; and a first addition unit that adds a signal which is based on the audio signal to a signal in which the gain is adjusted by the level adjustment unit, and outputs a signal obtained through the addition.

1. A voice clarification apparatus comprising: a plurality of band-pass filters that respectively extract a plurality of band components, which are included in a voice band, from an input audio signal; a gain determination unit that determines a gain according to a level of a signal of a band component which is extracted by at least one band-pass filter of the plurality of band-pass filters; a level adjustment unit that adjusts levels of signals of the plurality of band components which are extracted by the plurality of band-pass filters using the gain; and a first addition unit that adds a signal which is based on the audio signal to a signal in which the gain is adjusted by the level adjustment unit, and outputs a signal obtained through the addition.

2. The voice clarification apparatus according to claim 1, wherein the gain determination unit includes a conversion unit which converts input levels based on a signal indicative of voice components into a gain which has predetermined input and output characteristics, and wherein the conversion unit outputs the gain which is greater than “1” when an absolute value of a level of the signal indicative of the voice components is equal to or less than a threshold, and outputs ...

Publication date: 26-12-2013

Method of simultaneously transforming a plurality of voice signals input to a communications system

Number: US20130346071A1
Author: Jean-Pierre Baudry
Assignee: Eurocopter SA

A method of simultaneously transforming at least two input voice signals x_i of a communications system (30), each input voice signal x_i being received at a specific reception frequency F_i and corresponding to the voice of a remote party communicating with a user of the communications system (30). During an initialization stage, a transformation T_i is allocated to at least one reception frequency F_i of the input voice signals x_i, and during a utilization stage, transformations T_i are applied simultaneously to the input voice signals x_i as a function of the reception frequencies F_i, modifying at least one characteristic of each of the input voice signals x_i. Thus, the voice of each remote party in communication with the user of the communications system (30) is modified artificially by a transformation T_i, thereby making it easier for the user to perceive and discriminate between simultaneous voices from the remote parties.

Publication date: 06-03-2014

Binaural enhancement of tone language for hearing assistance devices

Number: US20140064496A1
Author: Ning Li
Assignee: Starkey Laboratories Inc

Disclosed herein, among other things, are methods and apparatus for binaural enhancement of tone language for hearing assistance devices. One aspect of the present subject matter includes a method for enhancing pitch in a hearing assistance system having a first and second hearing assistance device. A signal is received using a microphone of the first hearing assistance device. Pitch detection is performed on the signal to obtain a pitch value. The pitch value is wirelessly transmitted from the first hearing assistance device to the second hearing assistance device. In various embodiments, the pitch value of the first hearing assistance device is combined with a pitch value of the second hearing assistance device. The gain is adjusted based on the combined pitch value, in various embodiments.

Publication date: 06-03-2014

Adjustment apparatus and method

Number: US20140067383A1
Author: Kaori Endo
Assignee: Fujitsu Ltd

A disclosed adjustment apparatus includes: a calculation unit that calculates a ratio between a first frequency characteristic in a first frequency bandwidth of voice signals and a second frequency characteristic in a second frequency bandwidth of the voice signals, which is higher than the first frequency bandwidth, and calculates an adjustment amount for adjusting at least a portion of a frequency characteristic of the voice signals so that the calculated ratio approaches a predetermined reference, when the calculated ratio does not satisfy the predetermined reference; and a modification unit that modifies at least the portion of the frequency characteristic of the voice signals according to the adjustment amount.
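
The ratio-and-adjustment idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: the use of band powers as the "frequency characteristic", the decibel scale, and the names `band_ratio_db`, `adjustment_db`, `tolerance_db` and `step` are all assumptions made for illustration.

```python
import math


def band_ratio_db(low_power, high_power):
    """Ratio of high-band power to low-band power, in dB (assumed measure)."""
    return 10.0 * math.log10(high_power / low_power)


def adjustment_db(low_power, high_power, reference_db, tolerance_db=1.0, step=0.5):
    """Gain (dB) to apply to the high band so the ratio moves toward the reference.

    Returns 0.0 when the ratio already satisfies the reference within
    tolerance_db. 'step' (hypothetical) closes only part of the gap per call.
    """
    error = reference_db - band_ratio_db(low_power, high_power)
    if abs(error) <= tolerance_db:
        return 0.0
    return step * error


def apply_gain_db(power, gain_db):
    """Scale a power value by a gain expressed in dB."""
    return power * 10.0 ** (gain_db / 10.0)


# Example: the high band is 20 dB below the low band, the reference is -10 dB.
low, high = 1.0, 0.01
gain = adjustment_db(low, high, reference_db=-10.0)
new_high = apply_gain_db(high, gain)
```

With the default `step` of 0.5 the example closes half of the 10 dB gap, so the adjusted ratio lands between the original ratio and the reference rather than jumping straight to it.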

Publication date: 03-04-2014

Communication and speech enhancement system

Number: US20140093117A1
Assignee: TOKTOME ACOUSTICS LLC

A communication and speech enhancement system featuring a first transducer designed to be temporarily affixed to a human such as a hospital patient to convert the audible vibrations of human speech into an electrical signal. The transducer provides this electrical signal to one or more electronic modules which modify and enhance the signal. The enhanced signal may then be amplified and converted back into audible sound by means of a second transducer. A user of the system controls the electronic modules through a user interface. In an embodiment, one or both of the user interface and second transducer feature smooth surfaces amenable to cleaning and sterilizing with liquid agents.

Publication date: 07-01-2016

Communication and speech enhancement system

Number: US20160001110A1
Assignee: Delores Speech Products LLC

A communication and speech enhancement system featuring a first transducer designed to be temporarily affixed to a human such as a hospital patient to convert the audible vibrations of human speech into an electrical signal. The transducer provides this electrical signal to one or more electronic modules which modify and enhance the signal. The enhanced signal may then be amplified and converted back into audible sound by means of a second transducer. A user of the system controls the electronic modules through a user interface. In an embodiment, one or both of the user interface and second transducer feature smooth surfaces amenable to cleaning and sterilizing with liquid agents.

Publication date: 06-01-2022

Signal processing device, sound-reproduction system, and sound reproduction method

Number: US20220005485A1
Authors: Kanro Oyama, Masafumi TAO

A signal processing device includes: a processor; and a memory having instructions. The instructions, when executed by the processor, cause the signal processing device to perform operations. The operations include performing a modulation processing of modulating a sound signal by using a modulation parameter based on an interaural phase difference at a listening position of the sound signal.

Publication date: 05-01-2017

ENHANCEMENT OF NOISY SPEECH BASED ON STATISTICAL SPEECH AND NOISE MODELS

Number: US20170004841A1
Author: JENSEN Jesper
Assignee: OTICON A/S

A system for enhancement of noisy speech comprises an input unit configured to subdivide the spectrum of the input signal into a plurality of frequency sub-bands and to provide time-frequency coefficients X(k,m) for a sequence [X(k, m′−D+1) . . . X(k,m′)] of observable noisy signal samples for each of said frequency sub-bands, where k and m are frequency and time indices, respectively, and D is larger than 1. The system further comprises an enhancement processing unit configured to receive X(k,m) and to provide enhanced time-frequency coefficients Ŝ(k, m), a storage for statistical model(s) of speech and for statistical model(s) of noise, and an optimizing unit configured to provide said enhanced time-frequency coefficients Ŝ(k,m) using said statistical model of speech and said statistical model of noise, while considering said sequence [X(k, m′−D+1) . . . X(k, m′)] of observable noisy signal samples. Thereby the enhancement processing unit is able to determine the enhanced time-frequency coefficients based on the time-frequency coefficients for each of said frequency sub-bands.

1. A method for enhancement of speech in noise, the method comprising: providing a noisy input signal in a plurality of frequency sub-bands (k); for each of said frequency sub-bands providing time-frequency coefficients X(k,m) corresponding to a sequence [X(k,m′−D+1) . . . X(k,m′)] of observable noisy signal samples, where k and m are frequency and time indices, respectively, and D is larger than 1; enhancing said time-frequency coefficients X(k,m) thereby providing enhanced time-frequency coefficients Ŝ(k,m); providing a statistical model of speech; providing a statistical model of noise; providing said enhanced time-frequency coefficients Ŝ(k,m) using said statistical model of speech and said statistical model of noise, while considering said sequence [X(k, m′−D+1) . . . X(k,m′)] of observable noisy signal samples.

2. The method according to wherein said statistical model of speech comprises a ...

Publication date: 07-01-2016

VOICE EMPHASIS DEVICE

Number: US20160005420A1
Assignee: Mitsubishi Electric Corporation

An input signal analyzer determines a boundary frequency within the limit of a range which does not exceed a first frequency from the mode of an input signal. A spectrum compressor compresses a power spectrum of frequencies in a band higher than the first frequency in a frequency direction. A gain corrector performs a gain correction on the compressed power spectrum. A spectrum synthesizer reflects the power spectrum outputted from the gain corrector in a band determined by both the first frequency and the boundary frequency. A frequency-to-time converter converts both a synthesized power spectrum provided by the spectrum synthesizer and a phase spectrum of the input signal into ones in the time domain, and outputs these spectra.

1. A voice emphasis device comprising: a time-to-frequency converter that converts an input signal in a time domain into a power spectrum which is a signal in a frequency domain; an input signal analyzer that analyzes a mode of said input signal from said power spectrum; a band determinator that determines a boundary frequency within a limit of a range which does not exceed a predetermined first frequency from the mode of said input signal; a spectrum compressor that compresses a power spectrum of frequencies in a band higher than said first frequency in a frequency direction; a spectrum synthesizer that reflects said compressed power spectrum in a band determined by both said first frequency and said boundary frequency; and a frequency-to-time converter that converts both a synthesized power spectrum outputted from said spectrum synthesizer and a phase spectrum of said input signal into ones in the time domain, to acquire an emphasized signal.

2. The voice emphasis device according to claim 1, wherein a gain corrector that, by correcting the power spectrum compressed by said spectrum compressor in such a way that power of the power spectrum before the compression in a band on which said spectrum compressor performs the compression is ...

Publication date: 13-01-2022

MULTI-STREAM TARGET-SPEECH DETECTION AND CHANNEL FUSION

Number: US20220013134A1

Audio processing systems and methods include an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal and target-speech detection logic and an automatic speech recognition engine or VoIP application. An audio processing device includes a target speech enhancement engine configured to analyze a multichannel audio input signal and generate a plurality of enhanced target streams, a multi-stream target-speech detection generator comprising a plurality of target-speech detector engines each configured to determine a probability of detecting a specific target-speech of interest in the stream, wherein the multi-stream target-speech detection generator is configured to determine a plurality of weights associated with the enhanced target streams, and a fusion subsystem configured to apply the plurality of weights to the enhanced target streams to generate an enhancement output signal.

1. A system comprising: a target-speech enhancement engine configured to receive a multichannel audio input signal and generate a plurality of enhanced target streams based on the received multichannel audio input signal, each of the plurality of enhanced target streams being generated according to different enhancement separation criteria; and a fusion subsystem configured to enhance a target audio signal associated with the multichannel audio input signal based at least in part on the plurality of enhanced target streams.

2. The system of claim 1, wherein the enhancement separation criteria include at least one of an adaptive spatial filtering algorithm, a beamforming algorithm, a blind source separation algorithm, a single channel enhancement algorithm, or a neural network.

3. The system of claim 1, further comprising: an audio sensor array configured to detect sound in an environment and generate the multichannel audio input signal based on the detected sound.

4. The method of claim 3, ...
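
The fusion step described in the abstract (weights applied to enhanced target streams) might look roughly like the sketch below. The abstract does not say how weights are derived from the detection probabilities; normalizing the probabilities so they sum to one is an assumption here, and `fuse_streams` is a hypothetical name.

```python
def fuse_streams(streams, probabilities):
    """Weighted sum of equally long enhanced streams.

    Assumption: each weight is the stream's target-speech detection
    probability divided by the sum of all probabilities. If every
    probability is zero, the streams are averaged uniformly.
    """
    if not streams or len(streams) != len(probabilities):
        raise ValueError("need one probability per stream")
    length = len(streams[0])
    if any(len(s) != length for s in streams):
        raise ValueError("streams must have equal length")

    total = sum(probabilities)
    if total > 0:
        weights = [p / total for p in probabilities]
    else:
        weights = [1.0 / len(streams)] * len(streams)

    fused = [
        sum(w * s[i] for w, s in zip(weights, streams))
        for i in range(length)
    ]
    return fused, weights


# Two toy "enhanced streams"; the first detector is more confident.
fused, weights = fuse_streams([[1.0, 2.0], [3.0, 4.0]], [0.75, 0.25])
```

A stream whose detector reports a higher probability therefore contributes more to the output, which is the qualitative behaviour the abstract describes.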

Publication date: 08-01-2015

Speech intelligibility detection

Number: US20150010156A1
Author: Yaakov Chen
Assignee: DSP Group Israel Ltd

Methods and systems are provided for enhancing speech intelligibility in electronic devices. During outputting of acoustic signal via an electronic device, measurement of forces applied by user of the electronic device against the device (or enclosure thereof) may be obtained. The force measurements may be used to assess and/or estimate the listening intelligibility experienced by the user. Further, the force measurements may be used to control or adjust a listening intelligibility stage applied during generation and/or processing of the acoustic signals that are outputted via the electronic device. In some instances, an audio input, corresponding to ambient noise affecting intelligibility, may be obtained, and may be used to control or assist in controlling the listening intelligibility stage.

Publication date: 09-01-2020

Processing spoken commands to control distributed audio outputs

Number: US20200013397A1
Assignee: Amazon Technologies Inc

A system that is capable of controlling multiple entertainment systems and/or speakers using voice commands. The system receives voice commands and may determine audio sources and speakers indicated by the voice commands. The system may generate audio data from the audio sources and may send the audio data to the speakers using multiple interfaces. For example, the system may send the audio data directly to the speakers using a network address, may send the audio data to the speakers via a voice-enabled device or may send the audio data to the speakers via a speaker controller. The system may generate output zones including multiple speakers and may associate input devices with speakers within the output zones. For example, the system may receive a voice command from an input device in an output zone and may reduce output audio generated by speakers in the output zone.

Publication date: 26-01-2017

NOISE ELIMINATION CIRCUIT

Number: US20170025133A1
Author: Liu Lian

A noise elimination circuit of particular application in enhancing vocal clarity in a teleconference includes a first voice processing circuit, a second voice processing circuit, and a subtracter. The first voice processing circuit receives and processes a first voice from a first microphone and the second voice processing circuit receives and processes the same voice from a second microphone (second voice). The first voice and the second voice include voice signals and noises. The subtracter is electrically connected to the two voice processing circuits to receive the first voice and the second voice respectively processed by the first voice processing circuit and the second voice processing circuit. The subtracter subtracts the second voice from the first voice, and outputs a clear voice from which noise has been eliminated.

1. A noise elimination circuit, comprising: a first voice processing circuit, configured to receive and process a first voice from a first microphone, and the first voice comprises a first voice signal and a first noise; a second voice processing circuit, configured to receive and process a second voice from a second microphone, and the second voice comprises a second voice signal and a second noise; and a subtracter, coupled to the first voice processing circuit and the second voice processing circuit, configured to receive the first voice and the second voice processed by the first voice processing circuit and the second voice processing circuit, and to subtract the second voice from the first voice to output a voice signal without noises.

2. The noise elimination circuit of claim 1, wherein the subtracter comprises a first integrated operational amplifier, having a first input port coupled to a first voice processing circuit output port; and having a second input port coupled to a second voice processing circuit output port and a first integrated operational amplifier output port.

3. The noise elimination circuit of claim 1, wherein ...
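
The subtraction at the heart of this circuit can be sketched in a few lines of digital code. This is only an illustration under idealized assumptions that the abstract does not state: the noise is identical on both channels, and the voice reaches the second microphone at a quarter of the level it has at the first (the 0.25 factor and the sample values are made up).

```python
def subtract_channels(first, second):
    """Sample-wise difference of two equally long channels."""
    if len(first) != len(second):
        raise ValueError("channels must have the same length")
    return [a - b for a, b in zip(first, second)]


# Toy data: a voice waveform plus noise common to both microphones.
voice = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5]
noise = [0.2, -0.1, 0.3, -0.2, 0.1, 0.0]

first_mic = [v + n for v, n in zip(voice, noise)]          # close to the talker
second_mic = [0.25 * v + n for v, n in zip(voice, noise)]  # farther away

cleaned = subtract_channels(first_mic, second_mic)
```

Under these assumptions the common noise cancels and the output is the voice scaled by 0.75. With real microphones the noise on the two channels is only partly correlated, so the cancellation would be partial.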

Publication date: 17-02-2022

DEVICE AND METHOD FOR WIRELESSLY COMMUNICATING ON BASIS OF NEURAL NETWORK MODEL

Number: US20220051688A1

Disclosed are a device and method for wirelessly communicating. The device according to one example embodiment of the present disclosure may comprise a transceiver and a controller connected to the transceiver, wherein the controller is configured to identify at least one additional sample on the basis of a digital signal by using a neural network model and upscale the digital signal by adding the at least one identified additional sample to a plurality of samples of the digital signal.

1. A device for wireless communication, the device comprising: a transceiver; and a controller connected to the transceiver, wherein the controller is configured to: identify at least one additional sample by using a neural network model, based on a digital signal, and upscale the digital signal by adding the identified at least one additional sample to a plurality of samples of the digital signal.

2. The device of claim 1, wherein to identify the at least one additional sample using the neural network model based on the digital signal, the controller is further configured to: determine a weight in response to the digital signal; and identify the at least one additional sample based on the digital signal and the weight.

3. The device of claim 1, wherein the controller is further configured to generate the neural network model by: obtaining a first output digital signal upscaled from a first input digital signal in response to the first input digital signal; obtaining a difference between the first output digital signal and one reference digital signal of a set of at least one reference digital signal; and obtaining a second output digital signal upscaled from a second input digital signal based on the difference and the second input digital signal.

4. The device of claim 3, wherein the difference is related to at least one sample not corresponding to a plurality of samples of the first output digital signal among a plurality of samples of the one reference digital signal.

5. ...

Publication date: 04-02-2021

SPEECH SIGNAL CASCADE PROCESSING METHOD, TERMINAL, AND COMPUTER-READABLE STORAGE MEDIUM

Number: US20210035596A1
Author: LIANG Junbin

A method for improving speech signal intelligibility is performed at a device. A speech signal is obtained. A correspondence between the speech signal and a respective user group among different user groups having distinct voice characteristics is identified. Pre-encoding signal augmentation is performed on the speech signal with a respective pre-augmentation filtering coefficient that corresponds to the respective user group to obtain a group-specific pre-augmented speech signal. The device encodes the pre-augmented speech signal for subsequent transmission through the voice communication channel. An encoded version of the pre-augmented speech signal has reduced loss of signal quality as compared to an encoded version of the speech signal that is obtained without the pre-encoding signal augmentation.

1. A speech signal cascade processing method performed at a first terminal having one or more processors and memory storing a plurality of computer programs to be executed by the one or more processors, comprising: obtaining a speech signal from a second terminal via a voice communication channel, wherein the speech signal is processed with different audio codecs at the first terminal and the second terminal, respectively; performing feature recognition on the speech signal to determine a set of feature characteristics for the speech signal; when the set of feature characteristics matches a first set of predefined features, performing pre-augmented filtering on the speech signal by using a first set of pre-augmented filter coefficients, to obtain a pre-augmented speech signal; when the set of feature characteristics matches a second set of predefined features, performing pre-augmented filtering on the speech signal by using a second set of pre-augmented filter coefficients, to obtain the pre-augmented speech signal; and performing cascade encoding/decoding to the pre-augmented speech signal to generate an augmented speech signal.

2. The method according to claim 1, wherein ...

Publication date: 07-02-2019

Automatic Gain Adjustment for Improved Wake Word Recognition in Audio Systems

Number: US20190043521A1
Assignee: Intel Corporation

A mechanism is described for facilitating automatic gain adjustment in audio systems according to one embodiment. A method of embodiments, as described herein, includes determining status of one or more of gain settings, mute settings, and boost settings associated with one or more microphones based on a configuration of a computing device including a voice-enabled device. The method may further comprise recommending adjustment of microphone gain based on the configuration and the status of one or more of the gain, mute, and boost settings, and applying the recommended adjustment of the microphone gain.

1. An apparatus comprising: detection and observation logic to determine status of one or more of gain settings, mute settings, and boost settings associated with one or more microphones based on a configuration of the apparatus including a voice-enabled device; gain/boost adjustment and decision logic (“gain/boost logic”) to recommend adjustment of microphone gain based on the configuration and the status of one or more of the gain, mute, and boost settings; and gain/boost application logic (“application logic”) to apply the recommended adjustment of the microphone gain.

2. The apparatus of claim 1, further comprising mute enforcement logic to enforce muting of the one or more microphones based on the mute settings and according to a mute command.

3. The apparatus of claim 1, wherein the recommended adjustment comprises a first gain compensation including muting one or more microphone signals before the one or more microphone signals enter a wake word recognizer (WWR) based on the configuration where the gain and boost settings do not modify signal reception capabilities of the WWR.

4. The apparatus of claim 1, wherein the recommended adjustment comprises a second gain compensation including compensating microphone boost and muting the one or more microphone signals before the one or more microphone signals enter the wake word recognizer (WWR) based on the configuration ...

Publication date: 16-02-2017

SYSTEMS AND METHODS FOR SPEECH PROCESSING

Number: US20170047081A1
Assignee: Yobe, Inc.

Systems and methods described herein modify audio content on an electronic device. Embodiments can be configured to detect a mode of the electronic device to determine whether the device is in a telephone mode; receive a speech signal from a speech source while the device is in the telephone mode; and process the speech signal to improve the perceived quality of the speech at a recipient when the electronic device is in a telephone mode; wherein processing the speech signal to improve the perceived quality of the speech comprises decreasing the signal level of audio content outside of a determined frequency band relative to the signal level of the audio content within the determined frequency band; and wherein the determined frequency band is a frequency band associated with a vocal range of the anticipated speech content.

1. A method for modifying audio content on an electronic device, the method comprising: detecting an audio mode in which the electronic device is operating to determine a type of audio content processing to apply to audio content based on the detected audio mode, wherein the type of audio content processing comprises speech processing when the electronic device is detected to be in a speech-related audio mode, and music processing when the electronic device is detected to be in a playback audio mode; if the detected audio mode is a speech-related audio mode, receiving a speech signal from a speech source; and processing the speech signal to improve a perceived quality of the speech at a recipient; wherein processing the speech signal to improve the perceived quality of the speech comprises: decreasing a signal level of audio content outside of a determined frequency band relative to a signal level of audio content within the determined frequency band; and adjusting attack and release times of the speech signal based on sound events within the speech signal; and wherein the determined frequency band is a frequency band associated with a vocal range of ...

Publication date: 06-02-2020

METHODS FOR HEARING-ASSIST SYSTEMS IN VARIOUS VENUES

Number: US20200045482A1
Author: Epstein Barry

A hearing-assist system for use in a venue in which the system includes circuitry inserted in a signal path between a program source feed of sound and hearing-assist units of users in that venue which improves the quality of sound heard by the users via the hearing-assist units.

1. A hearing-assist system for use in a venue intermediate a program source feed and hearing assist devices borne by patrons in the venue, the hearing-assist system adapted to modify the hearing quality of sound transmitted from the program source feed to the hearing assist devices, the hearing-assist system comprising: (a) processing circuitry configured to reduce selected high energy components of the so-transmitted sound; (b) processing circuitry configured to optimize selected components of the so-transmitted sound; and (c) processing circuitry configured to modify the dynamic range of the so-transmitted sound.

2. The hearing-assist system defined in further including circuitry for introducing time delays to the signals corresponding to the sound received by the hearing assist devices.

3. The hearing-assist system defined in in which the time delays are respectively different to correspond to different hearing needs.

4. The hearing assist system defined in in which the time delays are respectively different to correspond to different locations of the hearing assist devices in the venue.

5. A hearing-assist system for use in a venue intermediate a program source feed and hearing assist devices of patrons in the venue, the hearing assist system adapted to improve the hearing quality of sound received by the hearing assist devices compared to the hearing quality of sound transmitted from the program source feed toward the hearing assist devices, the hearing assist system comprising processing circuitry having at least one processing stage from a group consisting of a processing stage for modifying selected high energy components of the transmitted sound, a processing ...

Publication date: 03-03-2022

SYSTEM AND METHOD FOR PHONETIC HASHING AND NAMED ENTITY LINKING FROM OUTPUT OF SPEECH RECOGNITION

Number: US20220067291A1

A system and method for named entity linking from the output of speech-to-text systems by using an approximate string matching that normalizes common sounds, removes ambiguities, removes silent consonants, and accounts for speech slurring for long names. Additionally, the system and method for named entity linking from the output of speech-to-text systems employs a hierarchical matching system that performs multiple attempts using various mechanisms for resolving the name, starting with a very strict mechanism, and proceeding sequentially through less strict mechanisms.
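
A strict-to-loose matching hierarchy of the kind this abstract outlines could be sketched as follows. The specific normalization rules, the three tiers, the `min_ratio` threshold and all function names are illustrative assumptions, not the rules of the patent; the final tier uses the standard-library `difflib.SequenceMatcher` as a stand-in for approximate matching.

```python
import difflib
import re


def _word_key(word):
    """Crude phonetic key for one word (hypothetical rules)."""
    s = re.sub(r"[^a-z]", "", word.lower())
    s = re.sub(r"^kn", "n", s)           # silent leading k: "knight" -> "night"
    s = re.sub(r"^wr", "r", s)           # silent leading w: "wright" -> "right"
    s = re.sub(r"(?<=.)gh", "", s)       # treat non-initial "gh" as silent
    s = s.replace("ph", "f").replace("ck", "k")
    s = re.sub(r"(.)\1+", r"\1", s)      # collapse doubled letters
    # keep the first letter, drop vowels and weak consonants after it
    return s[:1] + re.sub(r"[aeiouyhw]", "", s[1:])


def phonetic_key(name):
    return " ".join(k for k in map(_word_key, name.split()) if k)


def link_entity(mention, entities, min_ratio=0.8):
    """Try an exact match, then a phonetic-key match, then a fuzzy key match."""
    lowered = mention.strip().lower()
    for entity in entities:
        if entity.lower() == lowered:
            return entity, "exact"

    key = phonetic_key(mention)
    for entity in entities:
        if phonetic_key(entity) == key:
            return entity, "phonetic"

    best, best_ratio = None, 0.0
    for entity in entities:
        ratio = difflib.SequenceMatcher(None, key, phonetic_key(entity)).ratio()
        if ratio > best_ratio:
            best, best_ratio = entity, ratio
    if best is not None and best_ratio >= min_ratio:
        return best, "fuzzy"
    return None, "none"
```

Returning the tier alongside the entity lets a caller treat a loose match with more caution than a strict one, for example by asking for confirmation.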

Publication date: 03-03-2022

Method for operating a hearing device based on a speech signal, and hearing device

Number: US20220068293A1
Assignee: Sivantos Pte Ltd

A method for operating a hearing device on the basis of a speech signal. An acousto-electric input transducer of the hearing device records a sound containing the speech signal from surroundings of the hearing device and converts the sound into an input audio signal. A signal processing operation generates an output audio signal based on the input audio signal. At least one articulatory and/or prosodic feature of the speech signal is quantitatively acquired through analysis of the input audio signal by way of the signal processing operation, and a quantitative measure of a speech quality of the speech signal is derived on the basis of that feature. At least one parameter of the signal processing operation for generating the output audio signal based on the input audio signal is set on the basis of the quantitative measure of the speech quality of the speech signal.

Publication date: 03-03-2022

METHOD FOR RATING THE SPEECH QUALITY OF A SPEECH SIGNAL BY WAY OF A HEARING DEVICE

Number: US20220068294A1
Author: LUGGER MARKO, THIEMT JANA

A method for rating the speech quality of a speech signal by a hearing device. An acousto-electric input transducer records sound containing the speech signal and converts it into an input audio signal. At least one articulatory and/or prosodic property of the speech signal is quantitatively acquired through analysis of the input audio signal, and a quantitative measure of speech quality is derived based on the articulatory and/or prosodic property. Also described is a hearing device with an acousto-electric input transducer configured to record a sound and convert it into an input audio signal, and a signal processing apparatus that is designed to quantitatively acquire at least one articulatory and/or prosodic property of a component, contained in the input audio signal, of a speech signal based on analysis of the input audio signal and to derive a quantitative measure of the speech quality based on the at least one articulatory and/or prosodic property.
1. A method for rating a speech quality of a speech signal by a hearing device, the method comprising:
recording a sound with an acousto-electric input transducer of the hearing device, the sound containing the speech signal from surroundings of the hearing device, and converting the sound into an input audio signal;
quantitatively acquiring at least one articulatory property and/or prosodic feature of the speech signal through analysis of the input audio signal by a signal processing operation, and
deriving a quantitative measure of the speech quality based on the at least one articulatory property and/or prosodic feature.
2. The method according to claim 1, the method further comprising acquiring, as articulatory property of the speech signal, at least one of:
a characteristic variable correlated with the precision of predefined formants of vowels in the speech signal;
a characteristic variable correlated with the dominance of consonants and/or fricatives in the speech signal; or
a characteristic variable correlated ...

Publication date: 03-03-2022

USER CALL QUALITY IMPROVEMENT

Number: US20220070695A1
Author: Karanam Hemanth

The disclosed system provides a facility for improving user call quality at a mobile device. The system includes an over-the-top (OTT) client that may be installed on the mobile device for allowing a user to initiate call quality tests. The system performs a call quality test to generate a call quality score or metric from the obtained audio sample via a Call Quality Algorithm. If the call quality score or metric falls below a predetermined threshold, the system may suggest (via the OTT client or via a push notification) that the user of the mobile device switch from the first communication interface (e.g., a radio network such as 4G) to a second communication interface (e.g., Wi-Fi) on the mobile device. The disclosed system tracks the call quality score or metric of multiple mobile devices operating within a telecommunications network, identifies service outages within the network, and notifies impacted mobile devices.

Publication date: 26-02-2015

Methods and systems for enhancing pitch associated with an audio signal presented to a cochlear implant patient

Number: US20150057998A1
Assignee: ADVANCED BIONICS AG

An exemplary method of enhancing pitch of an audio signal presented to a cochlear implant patient includes 1) determining a frequency spectrum of an audio signal presented to a cochlear implant patient, the frequency spectrum comprising a plurality of frequency bins that each contain spectral energy, 2) generating a modified spectral envelope of the frequency spectrum of the audio signal, 3) identifying each frequency bin included in the plurality of frequency bins that contains spectral energy above the modified spectral envelope and each frequency bin included in the plurality of frequency bins that contains spectral energy below the modified spectral envelope, 4) enhancing the spectral energy contained in each frequency bin identified as containing spectral energy above the modified spectral envelope, and 5) compressing the spectral energy contained in each frequency bin identified as containing spectral energy below the modified spectral envelope. Corresponding methods and systems are also disclosed.
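
The five steps in this abstract can be sketched in a few lines of Python. The sketch assumes the frequency spectrum is already available as a list of per-bin energies, and it stands in a simple moving average for the "modified spectral envelope"; that smoothing choice, the `boost` and `cut` factors, and the function names are hypothetical and not taken from the patent.

```python
def smooth_envelope(energies, half_width=1):
    """Moving-average envelope over the bin energies (an assumed stand-in)."""
    n = len(energies)
    envelope = []
    for i in range(n):
        lo, hi = max(0, i - half_width), min(n, i + half_width + 1)
        window = energies[lo:hi]
        envelope.append(sum(window) / len(window))
    return envelope


def enhance_pitch(energies, boost=1.5, cut=0.5, half_width=1):
    """Boost bins above the envelope and compress bins below it."""
    envelope = smooth_envelope(energies, half_width)
    out = []
    for energy, level in zip(energies, envelope):
        if energy > level:
            out.append(energy * boost)  # enhance spectral peaks
        elif energy < level:
            out.append(energy * cut)    # compress spectral valleys
        else:
            out.append(energy)          # leave bins on the envelope untouched
    return out
```

Widening the gap between peaks and valleys in this way increases spectral contrast, which is the effect the abstract describes.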

Publication date: 01-03-2018

System and Method for Auditing and Filtering Digital Audio Files

Number: US20180061430A1
Author: Brunton Alan

A computerized method for filtering a digital audio file to generate an output audio file that induces optimal health and cognitive ability in a listener of a playback of the output audio file is described herein. The method includes the steps of identifying a plurality of target frequencies that span within an octave, identifying a plurality of mid-point frequencies that are situated at mid-points between any two adjacent target frequencies, applying a peaking filter to the digital audio file centered around the plurality of mid-point frequencies to produce highest frequency attenuation at the plurality of mid-point frequencies, and generating the output audio file.
1. A computerized method for filtering a digital audio file to generate an output audio file that induces optimal health and cognitive ability in a listener of a playback of the output audio file, comprising:
identifying a plurality of target frequencies that span within at least one octave;
identifying a plurality of mid-point frequencies that are situated at mid-points between any two adjacent target frequencies;
applying a set of peaking filters to the digital audio file centered around the plurality of mid-point frequencies to produce highest frequency attenuation at the plurality of mid-point frequencies; and
generating the output audio file.
2. The computerized method of claim 1, wherein identifying a plurality of target frequencies comprises identifying a plurality of target frequencies that span more than one octave.
3. The computerized method of claim 1, wherein identifying a plurality of target frequencies comprises receiving a user input indicative of a number of target frequencies to be identified.
4. The computerized method of claim 1, wherein identifying a plurality of target frequencies comprises identifying seven target frequencies.
5. The computerized method of claim 1, wherein applying a set of peaking filters comprises applying five peaking filters of different bandwidths centered about ...
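
To make the filtering steps concrete, here is a minimal Python sketch that computes mid-point frequencies between adjacent targets and builds one attenuating peaking filter per mid-point. It assumes arithmetic mid-points and uses the widely known Audio EQ Cookbook peaking biquad; the text does not say which mid-point definition or filter design is intended, and the Q value and function names are illustrative.

```python
import cmath
import math


def midpoints(targets):
    """Arithmetic mid-points between adjacent target frequencies (Hz)."""
    ordered = sorted(targets)
    return [(lo + hi) / 2 for lo, hi in zip(ordered, ordered[1:])]


def peaking_biquad(f0, fs, gain_db, q=4.0):
    """Peaking-EQ biquad coefficients, normalized so that a[0] == 1."""
    a_lin = 10 ** (gain_db / 40)
    w0 = 2 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2 * q)
    b = [1 + alpha * a_lin, -2 * math.cos(w0), 1 - alpha * a_lin]
    a = [1 + alpha / a_lin, -2 * math.cos(w0), 1 - alpha / a_lin]
    return [x / a[0] for x in b], [x / a[0] for x in a]


def magnitude(b, a, f, fs):
    """Magnitude response of the biquad at frequency f (Hz)."""
    z1 = cmath.exp(-1j * 2 * math.pi * f / fs)  # z^-1 on the unit circle
    num = b[0] + b[1] * z1 + b[2] * z1 * z1
    den = a[0] + a[1] * z1 + a[2] * z1 * z1
    return abs(num / den)


# One attenuating filter per mid-point; the 12 dB depth is an assumed value.
filters = [peaking_biquad(f, 48000, -12.0) for f in midpoints([440.0, 494.0, 523.0])]
```

With a negative `gain_db`, each filter's deepest attenuation falls exactly at its centre frequency, which matches the "highest frequency attenuation at the plurality of mid-point frequencies" wording.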

Publication date: 20-02-2020

PLAYBACK ENHANCEMENT IN AUDIO SYSTEMS

Number: US20200058317A1
Author: Gaalaas Joseph

Audio systems and methods are provided that enhance a portion of audio content relative to other portions of the audio content. The systems and methods select the portion to be enhanced and calculate an intelligibility metric of the selected portion, such as a dialogue portion. The systems and methods determine a gain based at least in part upon the intelligibility metric and apply the gain to the selected portion to provide an enhanced portion. The systems and methods provide an audio signal, based at least in part upon the enhanced portion, to an output for conversion to an acoustic signal, such as by an acoustic transducer.
1. An audio sound system, comprising:
an input to receive input audio content;
an output configured to provide an audio signal for conversion to acoustic signals in a listening environment; and
a processor coupled to the input and to the output and configured to select a portion of the input audio content to be enhanced relative to other portions of the input audio content, to calculate an intelligibility metric of the selected portion, to determine a gain based at least in part upon the intelligibility metric, to apply the gain to the selected portion to provide an enhanced portion, to produce output audio content by combining the enhanced portion with the other portions of the input audio content, and to provide the output audio content to the output as the audio signal.
2. The audio sound system of wherein the processor is further configured to select the portion of the input audio content as a dialogue portion and to calculate the intelligibility metric as a speech intelligibility metric of the selected dialogue portion relative to the other portions of the input audio content.
3. The audio sound system of wherein the processor is further configured to select the portion of the input audio content as a dialogue portion based upon at least one of a center channel of the input audio content and a correlated portion of a left and right channel of ...

Publication date: 04-03-2021

EFFICIENT DRC PROFILE TRANSMISSION

Number: US20210065728A1
Assignee: DOLBY INTERNATIONAL AB

A method for decoding an encoded audio signal is described. The encoded audio signal comprises a sequence of frames. Furthermore, the encoded audio signal is indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles from the plurality of DRC profiles are comprised within different frames of the sequence of frames, such that two or more frames of the sequence of frames jointly comprise the plurality of DRC profiles. The method comprises determining a first rendering mode from the plurality of different rendering modes; determining one or more DRC profiles from a subset of DRC profiles comprised within a current frame of the sequence of frames; determining whether at least one of the one or more DRC profiles is applicable to the first rendering mode; selecting a default DRC profile as a current DRC profile, if none of the one or more DRC profiles is applicable to the first rendering mode; wherein definition data of the default DRC profile is known at a decoder for decoding the encoded audio signal; and decoding the current frame using the current DRC profile.
1. A method for decoding an encoded audio signal; wherein the encoded audio signal comprises a sequence of frames comprising encoded audio data and metadata, the metadata including a plurality of different sets of dynamic range control, referred to as DRC, gains, wherein the encoded audio signal further comprises an indication of a loudness of the audio signal, and DRC configuration metadata in one or more frames of the sequence of frames, wherein the DRC configuration metadata indicates a plurality of DRC profiles associated with the encoded audio signal, and, for each DRC profile, a range of output reference levels for which the DRC profile is applicable, wherein each set of DRC gains corresponds to one of the plurality of DRC profiles, the method comprising: ...
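
The per-frame selection logic in the abstract (use a profile carried in the current frame if one applies to the rendering mode, otherwise fall back to a default known at the decoder) might look like the following Python sketch. The `DrcProfile` type, its fields, and the mode names are hypothetical; real bitstream parsing and gain application are omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DrcProfile:
    name: str
    rendering_modes: frozenset  # modes this profile is applicable to


# Default profile whose definition is assumed to be built into the decoder.
DEFAULT_PROFILE = DrcProfile("default", frozenset())


def select_profile(frame_profiles, rendering_mode, default=DEFAULT_PROFILE):
    """Pick the first profile in this frame that fits the rendering mode."""
    for profile in frame_profiles:
        if rendering_mode in profile.rendering_modes:
            return profile
    return default  # no applicable profile was carried in this frame


# Example: a frame that carries only a subset of all profiles.
frame = [
    DrcProfile("night", frozenset({"home_theater"})),
    DrcProfile("portable", frozenset({"mobile"})),
]
current = select_profile(frame, "mobile")
```

Because each frame carries only some of the profiles, the fallback keeps decoding possible for frames that happen not to include a matching one.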

Publication date: 20-02-2020

TRANSDUCER APPARATUS FOR HIGH SPEECH INTELLIGIBILITY IN NOISY ENVIRONMENTS

Number: US20200059717A1

A transducer apparatus to provide high speech-intelligibility in a noisy environment. The transducer apparatus comprises a vibration-sensing transducer adapted to be placed on the non-boney and non-cartilaginous, i.e., fleshy, part of the head of the user, either on the all-flesh part of the cheek or on the all-flesh area under the chin. The vibrations sensed are vibrations arising from the user's voice in his mouth and conducted to the surface of the fleshy area of the user's cheek or under-chin, and not by bone vibration. The embodiments of the invention include its application into headsets, earsets and helmets; and a switching means; and a means to realize a vibration transducer from an acoustical microphone.
1. A transducer apparatus comprising a transducer, wherein
the transducer is adapted to be placed on and to sense vibrations on the non-boney or non-cartilaginous part of the user's head, and
the vibrations arise from the user's voice.
2. A transducer apparatus according to claim 1, wherein
the non-boney part of the user's head is the fleshy area of the user's cheek near the mouth of the user, and
the vibrations are conducted to the surface of the fleshy area of the user's cheek through the flesh of the user's cheek.
3. A transducer apparatus according to claim 1, wherein
the transducer is an accelerometer, shock sensor, gyroscope, vibration microphone or vibration sensor.
4. A transducer apparatus comprising a transducer, wherein
the transducer is an acoustical-sensing microphone adapted to sense vibrations,
the acoustical microphone having a housing,
the housing having a hole that serves as the acoustical input port, and
the adaption of the acoustical microphone to sense vibrations is by means of the hole being placed on or pressed against the non-boney or non-cartilaginous part of said user's head.
5. A transducer apparatus according to claim 4, wherein
the hole is adapted to be covered by a membrane.
6. A transducer apparatus according to claim 1, wherein
the non-boney or ...

Publication date: 10-03-2016

Method and System for Scaling Ducking of Speech-Relevant Channels in Multi-Channel Audio

Number: US20160071527A1
Author: Muesch Hannes

A method and system for filtering a multi-channel audio signal having a speech channel and at least one non-speech channel, to improve intelligibility of speech determined by the signal. In typical embodiments, the method includes steps of determining at least one attenuation control value indicative of a measure of similarity between speech-related content determined by the speech channel and speech-related content determined by the non-speech channel, and attenuating the non-speech channel in response to the at least one attenuation control value. Typically, the attenuating step includes scaling of a raw attenuation control signal (e.g., a ducking gain control signal) for the non-speech channel in response to the at least one attenuation control value. Some embodiments are a general or special purpose processor programmed with software or firmware and/or otherwise configured to perform filtering in accordance with the invention.
1. A method for filtering a multi-channel audio signal having a speech channel and at least one non-speech channel, to improve intelligibility of speech determined by the signal, said method including the steps of:
(a) determining at least one attenuation control value indicative of a measure of similarity between speech-related content determined by the speech channel and speech-related content determined by at least one non-speech channel of the multi-channel audio signal, where the attenuation control value is generated based on at least one speech enhancement likelihood value for the non-speech channel, and the speech enhancement likelihood value is indicative of a likelihood that said at least one non-speech channel is indicative of content that enhances perceived quality of speech content determined by the speech channel; and
(b) attenuating at least one non-speech channel of the multi-channel audio signal in response to the at least one attenuation control value.
2. The method of claim 1, wherein each attenuation control value determined ...
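
One way to picture the scaling step is the short Python sketch below. It assumes the speech enhancement likelihood is a number in [0, 1] and that the raw ducking attenuation (in dB) is scaled linearly by one minus that likelihood, so a non-speech channel that is likely to help the speech is ducked less. This mapping is an assumption for illustration; the text does not give the actual scaling rule.

```python
def scaled_ducking_gain(raw_duck_db: float, enhancement_likelihood: float) -> float:
    """Linear gain for a non-speech channel after scaling the raw ducking.

    raw_duck_db: attenuation (positive dB) the ducker would apply unscaled.
    enhancement_likelihood: assumed to lie in [0, 1]; 1 means the channel
    very likely carries content that enhances the speech.
    """
    if not 0.0 <= enhancement_likelihood <= 1.0:
        raise ValueError("enhancement_likelihood must be in [0, 1]")
    attenuation_db = raw_duck_db * (1.0 - enhancement_likelihood)
    return 10 ** (-attenuation_db / 20)


def duck_channel(samples, raw_duck_db, enhancement_likelihood):
    """Apply the scaled ducking gain to one non-speech channel."""
    gain = scaled_ducking_gain(raw_duck_db, enhancement_likelihood)
    return [gain * s for s in samples]
```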

Publication date: 28-02-2019

SELECTIVE ENFORCEMENT OF PRIVACY AND CONFIDENTIALITY FOR OPTIMIZATION OF VOICE APPLICATIONS

Number: US20190066686A1

A computer-implemented method includes identifying a plurality of protected pieces from a conversation. The computer-implemented method further includes generating one or more confidence scores for each protected piece, wherein a confidence score is a degree of associativity between a protected piece and a type of sensitive information. The computer-implemented method further includes determining that the protected piece is associated with the type of sensitive information. The computer-implemented method further includes determining a type of protection action for each protected piece in the plurality of protected pieces. The computer-implemented method further includes performing the type of protection action for each protected piece in the plurality of protected pieces to form a modified conversation that is devoid of the sensitive information. A corresponding computer system and computer program product are also disclosed.
1. A computer-implemented method comprising:
identifying a plurality of protected pieces from a conversation, wherein each protected piece in the plurality of protected pieces corresponds to a portion of the conversation that includes sensitive information;
generating one or more confidence scores for each protected piece in the plurality of protected pieces, wherein a confidence score is a degree of associativity between a protected piece and a type of sensitive information;
determining that the protected piece is associated with the type of sensitive information based, at least in part, on the confidence score exceeding a given threshold level;
determining a type of protection action for each protected piece in the plurality of protected pieces based, at least in part, on the type of sensitive information associated with the protected piece; and
performing the type of protection action for each protected piece in the plurality of protected pieces to form a modified conversation, wherein the modified conversation is devoid of the sensitive ...
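
The claimed flow (score each protected piece, keep the type whose confidence exceeds a threshold, choose an action by type, then rewrite the conversation) can be sketched in Python as follows. The scores are supplied by the caller; the `ACTIONS` table, the 0.7 threshold, and the replacement strings are hypothetical and only serve the example.

```python
# Hypothetical mapping from sensitive-information type to protection action.
ACTIONS = {"credit_card": "redact", "health": "generalize"}


def protect(conversation, pieces, threshold=0.7):
    """Return a modified conversation with protected pieces replaced.

    pieces: list of (text, {info_type: confidence_score}) pairs.
    """
    result = conversation
    for text, scores in pieces:
        if not scores:
            continue
        info_type, score = max(scores.items(), key=lambda kv: kv[1])
        if score <= threshold:
            continue  # not confidently tied to any sensitive type
        action = ACTIONS.get(info_type, "redact")
        if action == "redact":
            replacement = "[REDACTED]"
        else:
            replacement = f"[{info_type.upper()} DETAILS]"
        result = result.replace(text, replacement)
    return result
```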

Publication date: 05-03-2020

Information processing apparatus and information processing method

Number: US20200074994A1
Author: Tatsuya Igarashi
Assignee: Sony Corp

A system that acquires first audio data including a voice command captured by a microphone; identifies second audio data included in broadcast content corresponding to a timing at which the first audio data is captured by the microphone; extracts the second audio data from the first audio data to generate third audio data; converts the third audio data to text data corresponding to the voice command; and outputs the text data.

Publication date: 05-03-2020

Data Driven Radio Enhancement

Number: US20200075033A1
Assignee: BabbleLabs LLC

Systems and methods are disclosed for data driven radio enhancement. For example, methods may include demodulating a radio signal to obtain a demodulated audio signal; determining a window of audio samples based on the demodulated audio signal; applying an audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the audio enhancement network includes a machine learning network that has been trained using demodulated audio signals derived from radio signals; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

Publication date: 05-03-2020

ELECTRONIC DEVICE AND OPERATION METHOD THEREOF

Number: US20200076389A1
Assignee: SAMSUNG ELECTRONICS CO., LTD.

Provided are an electronic device and an operation method thereof. The operation method of an electronic device for processing an audio signal may include obtaining viewing environment information related to sound intelligibility, processing an input audio signal by separating the input audio signal into a first channel including a primary signal and a second channel including an ambient signal based on the viewing environment information, processing the input audio signal based on a frequency band and based on the viewing environment information, and generating an output signal based on processing the input audio signal.
1. An operation method of an electronic device for processing an audio signal, the operation method comprising:
obtaining viewing environment information related to sound intelligibility;
processing an input audio signal by separating the input audio signal into a first channel including a primary signal and a second channel including an ambient signal based on the viewing environment information;
processing the input audio signal based on a frequency band and based on the viewing environment information; and
generating an output signal based on processing the input audio signal.
2. The operation method of claim 1, wherein the viewing environment information includes at least one of information associated with ambient noise around the electronic device, information associated with a space where the electronic device is located, information associated with an ambient device around the electronic device, and information associated with an installation environment of the electronic device.
3. The operation method of claim 1, wherein the processing the input audio signal by separating the input audio signal into the first channel including the primary signal and the second channel including the ambient signal comprises:
determining weights for the primary signal and the ambient signal based on the viewing environment information; ...

Publication date: 05-03-2020

Methods and systems for wireless audio

Number: US20200077175A1
Author: Kozo Okuda
Assignee: Semiconductor Components Industries LLC

Various embodiments of the present technology comprise a method and system for wireless audio. In various embodiments, the system comprises a set of wirelessly connected ear buds, each ear bud suitable for placing in a human ear canal. Each ear bud comprises a microphone, an asynchronous sampling rate converter, a timer, and an audio clock. One ear bud from the set further comprises a control circuit and a synchronizer to synchronize the input of sound signals captured by the microphones and/or synchronize the processing and output of the sound signals.

Publication date: 23-03-2017

Residual Noise Suppression

Number: US20170084289A1

A method includes determining a preprocessed audio signal by removing some noise from an input audio signal. Here, portions of the preprocessed audio signal that include speech are separated by portions of the preprocessed audio signal that include residual noise. Additionally, the method includes determining an amplified signal by suppressing the preprocessed audio signal over the portions that include residual noise, and maintaining the preprocessed audio signal over the portions that include speech.
1. A method comprising:
determining a preprocessed audio signal by removing some noise from an input audio signal, wherein portions of the preprocessed audio signal that include speech are separated by portions of the preprocessed audio signal that include residual noise; and
determining an amplified signal by
suppressing the preprocessed audio signal over the portions that include residual noise, and
maintaining the preprocessed audio signal over the portions that include speech.
2. The method of claim 1, further comprising:
determining the portions of the preprocessed audio signal that include residual noise as corresponding to times when an envelope of the preprocessed audio signal is less than or equal to a first threshold signal; and
determining the portions of the preprocessed signal that include speech as corresponding to times when the envelope of the preprocessed audio signal is larger than the first threshold signal.
3. The method of claim 2, wherein a value of the first threshold signal is in a range from 5% to 20% of a maximum value of the envelope of the preprocessed audio signal.
4. The method of claim 2, further comprising:
setting a gain signal for ...
a value equal to a maximum gain value for the portions of the preprocessed audio signal that include speech, and
at least one value smaller than the maximum gain value and larger than or equal to a threshold ratio for the portions of the preprocessed audio signal that include residual noise.
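
Claims 2 to 4 describe an envelope-threshold gate, which the Python sketch below illustrates on an already preprocessed signal. It assumes a simple trailing-maximum envelope and a fixed threshold at 10% of the envelope peak (inside the 5% to 20% range of claim 3); the envelope detector, the `noise_gain` value, and the function names are assumptions for the example.

```python
def envelope(signal, window=1):
    """Trailing maximum of |x| over `window` samples (an assumed detector)."""
    mags = [abs(x) for x in signal]
    return [max(mags[max(0, i - window + 1): i + 1]) for i in range(len(mags))]


def suppress_residual_noise(signal, threshold_ratio=0.1, noise_gain=0.2, window=1):
    """Keep speech portions at full gain and scale residual-noise portions down."""
    env = envelope(signal, window)
    peak = max(env, default=0.0)
    threshold = threshold_ratio * peak  # e.g. 10% of the envelope maximum
    out = []
    for x, e in zip(signal, env):
        gain = 1.0 if e > threshold else noise_gain  # speech vs residual noise
        out.append(x * gain)
    return out
```

A larger `window` holds the envelope up for a few samples after each speech peak, which avoids chopping the tail of a word; production systems would typically also smooth the gain to prevent audible clicks.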

Publication date: 12-03-2020

SELECTIVE ENFORCEMENT OF PRIVACY AND CONFIDENTIALITY FOR OPTIMIZATION OF VOICE APPLICATIONS

Number: US20200082123A1

A computer-implemented method includes identifying a plurality of protected pieces from a conversation. The computer-implemented method further includes generating one or more confidence scores for each protected piece, wherein a confidence score is a degree of associativity between a protected piece and a type of sensitive information. The computer-implemented method further includes determining that the protected piece is associated with the type of sensitive information. The computer-implemented method further includes determining a type of protection action for each protected piece in the plurality of protected pieces. The computer-implemented method further includes performing the type of protection action for each protected piece in the plurality of protected pieces to form a modified conversation that is devoid of the sensitive information. A corresponding computer system and computer program product are also disclosed.
1. A computer-implemented method comprising:
identifying a plurality of protected pieces from a conversation, wherein each protected piece in the plurality of protected pieces corresponds to a portion of the conversation that includes sensitive information;
determining a type of protection action for each protected piece in the plurality of protected pieces based, at least in part, on the type of sensitive information associated with the protected piece; and
performing the type of protection action for each protected piece in the plurality of protected pieces to form a modified conversation, wherein the modified conversation is devoid of the sensitive information.
2. The computer-implemented method of claim 1, wherein determining the type of protection action for each protected piece in the plurality of protected pieces is further based on a type of medium in which the conversation is stored.
3. The computer-implemented method of claim 1, further comprising:
requesting additional clarifying information about the protected piece based on a ...

Publication date: 25-03-2021

PITCH EMPHASIS APPARATUS, METHOD AND PROGRAM FOR THE SAME

Number: US20210090587A1

Provided is pitch enhancement processing having little unnaturalness even in time segments for consonants, and having little unnaturalness to listeners caused by discontinuities even when time segments for consonants and other time segments switch frequently. A pitch emphasis apparatus carries out the following as the pitch enhancement processing: for a time segment in which a spectral envelope of a signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain σ0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time.
1. A pitch emphasis apparatus that obtains an output signal by executing pitch enhancement processing on each of time segments of a signal originating from an input audio signal, the apparatus comprising:
for a time segment in which a spectral envelope of the signal has been determined to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past than the time by a number of samples T0 corresponding to a pitch period of the time segment, a pitch gain σ0 of the time segment, a predetermined constant B0, and a value greater than 0 and less than 1, to (2) the signal of the time, and
for a time segment in which a spectral envelope of the signal has been determined not to be flat, obtaining an output signal for each of times in the time segment, the output signal being a signal including a signal obtained by adding (1) a signal obtained by multiplying the signal of a time, further in the past
...
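
For the flat-envelope case, the operation described in the abstract amounts to adding a scaled copy of the signal from one pitch period earlier. A minimal Python sketch follows, where `t0` is the pitch period in samples, `sigma0` the pitch gain, `b0` the predetermined constant, and `alpha` the value between 0 and 1. The default numbers are placeholders, samples before the start of the buffer are treated as zero, and any normalization the actual apparatus may apply is left out.

```python
def emphasize_pitch_flat_segment(x, t0, sigma0, b0=0.8, alpha=0.5):
    """y[n] = x[n] + b0 * sigma0 * alpha * x[n - t0] for a flat-envelope segment.

    x: samples of the time segment (earlier history is assumed to be zero here).
    """
    if t0 <= 0:
        raise ValueError("t0 must be a positive number of samples")
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must be greater than 0 and less than 1")
    coeff = b0 * sigma0 * alpha
    return [
        x[n] + (coeff * x[n - t0] if n >= t0 else 0.0)
        for n in range(len(x))
    ]
```

Because `alpha` is below 1, the emphasis in flat-envelope segments (typically consonants) is weaker than it would be with the unscaled product of the constant and the pitch gain, which is how the abstract's goal of avoiding unnatural-sounding consonants is approached.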

Publication date: 29-03-2018

UTILIZATION OF LOCATION AND ENVIRONMENT TO IMPROVE RECOGNITION

Number: US20180090134A1

A portable terminal has a network interface that receives a set of instructions having a sequence of at least one location and audio properties associated with the at least one location from a server. An audio circuit receives audio signals picked up by a microphone and processes the audio signals in a manner defined by the audio properties associated with the at least one location. A speech recognition module receives processed signals from the audio circuit and carries out a speech recognition process thereupon.
1. A device, comprising:
a network interface that receives a set of instructions from a server, the instructions comprising at least one location where at least one action is to be carried out by a user and audio processing parameters comprising audio properties associated with the at least one location;
an audio circuit that receives audio signals picked up by a microphone and processes the audio signals in a manner defined by the audio processing parameters comprising the audio properties associated with the at least one location, the audio processing parameters having been ascertained from the set of instructions; and
a speech recognition module that receives processed signals from the audio circuit and carries out a speech recognition process thereupon.
2. The device according to claim 1, where audio signals picked up by the microphone are stored and conveyed to a server.
3. The device according to claim 1, where the speech recognition module utilizes a user template that characterizes speech of a particular user to enhance recognition accuracy.
4. The device according to claim 1, where the audio circuit comprises an amplifier and where the gain of the amplifier is set by the audio processing parameters comprising the audio properties for the at least one location.
5. The device according to claim 1, where the audio circuit comprises a noise comparison circuit that compares the audio with a noise model defined by the audio processing parameters ...

Publication date: 05-05-2022

COMMUNICATION DEVICE AND SIDETONE VOLUME ADJUSTING METHOD THEREOF

Number: US20220139414A1

A communication device and a sidetone volume adjusting method thereof are disclosed. The communication device includes a sound processor, a far-end sound receiver, a near-end sound receiver, a volume adjuster, and a sound player. The far-end sound receiver is configured to receive a far-end sound and transmit it to the sound processor. The near-end sound receiver is configured to receive a near-end sound such that the sound processor receives the near-end sound to form a sidetone. The volume adjuster is configured to adjust the volume of the far-end sound and the sidetone to form the adjusted far-end sound and the adjusted sidetone, wherein the volume of the adjusted sidetone is based on the near-end sound and the adjusted far-end sound. The sound player is used to play the adjusted sidetone and the adjusted far-end sound.

Publication date: 31-03-2016

Assistive listening system and method for television, radio & music systems

Number: US20160094920A1
Author: Kenneth A. Ullrich
Assignee: Individual

An assistive-listening system is used with sound-producing equipment that includes a signal source, and first and second sound sources operatively associated with the signal source and configured to produce sound corresponding to signals received from the signal source. The assistive-listening system includes a volume control operatively associated with the signal source and configured proportionally to change the volume of both the first and second sound sources. Also included is a support structure configured to support and position the second sound source so that a hearing-impaired listener may listen effectively to sound controlled by the volume control without disturbing normal-hearing listeners.

Publication date: 05-05-2022

Spatial Audio Processing

Number: US20220141612A1
Assignee: NOKIA TECHNOLOGIES OY

According to an example embodiment, a technique for spatial audio processing including: determining at least one spatial parameter based, at least partially, on at least one input audio signal captured with at least one first device, configured to represent at least a portion of an audio scene; identifying a portion of interest of the audio scene based, at least partially, on the at least one spatial parameter; generating at least one first audio signal based, at least partially, on the at least one input audio signal; generating at least one second audio signal based, at least partially, on at least one audio signal captured with at least one second device; and combining, at least partially, the at least one first audio signal and the at least one second audio signal into at least one combined audio signal.

Publication date: 30-03-2017

Automatic Calculation of Gains for Mixing Narration Into Pre-Recorded Content

Number: US20170092290A1
Author: Barkale Suraj Suhas

A system and method of mixing narration into content. The system automatically reduces the volume of the content according to a threshold value and a knee value. In this manner, the audio of the content does not overwhelm the narration. 1. A method of automatically mixing first audio and second audio that are associated with video, the method comprising: receiving, by a mobile device, a user selection of a first content item, wherein the first content item has video data and first audio data, and wherein the first audio data is synchronized with the video data; outputting, by the mobile device, the video data and the first audio data; receiving, by the mobile device, second audio data from a microphone of the mobile device, wherein the second audio data is received contemporaneously with outputting the video data and the first audio data; calculating, by the mobile device, a loudness measure, wherein the loudness measure includes a loudness of the first audio data; attenuating, by the mobile device, the first audio data according to the loudness measure to form attenuated first audio data; mixing, by the mobile device, the attenuated first audio data and the second audio data to form a second content item, wherein the second content item has the video data, the attenuated first audio data, and the second audio data, and wherein the attenuated first audio data and the second audio data are synchronized with the video data; and storing, by the mobile device, the second content item having been formed. 2. The method of claim 1, wherein the second audio data corresponds to narration by a user of the mobile device. 3. The method of claim 1, further comprising: receiving, by the mobile device, the video data and the first audio data, wherein the video data is received from a camera of the mobile device, and wherein the first audio data is received from a microphone of the mobile device; and storing, by the mobile device, the video data and the first audio data as the first ...
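
The threshold-and-knee attenuation named in this abstract can be illustrated with a minimal sketch. It assumes a textbook soft-knee compressor gain curve, which the listing does not confirm is the patent's method; the `ratio` parameter, the default values, and both function names are hypothetical.

```python
def soft_knee_gain_db(level_db, threshold_db, knee_db, ratio=4.0):
    """Gain in dB (never positive) for content measured at level_db.

    Assumption: a textbook soft-knee compressor curve; `ratio` is hypothetical.
    """
    over = level_db - threshold_db
    slope = 1.0 / ratio - 1.0
    if knee_db > 0 and 2.0 * abs(over) <= knee_db:
        # Inside the knee: quadratic blend between no change and the full slope.
        return slope * (over + knee_db / 2.0) ** 2 / (2.0 * knee_db)
    if over > 0:
        # Above the knee: attenuate in proportion to the overshoot.
        return slope * over
    # Below the knee: leave the content untouched.
    return 0.0


def mix_with_narration(content, narration, content_level_db,
                       threshold_db=-20.0, knee_db=6.0):
    """Attenuate the content samples, then add the narration sample by sample."""
    gain_db = soft_knee_gain_db(content_level_db, threshold_db, knee_db)
    gain = 10.0 ** (gain_db / 20.0)
    return [gain * c + n for c, n in zip(content, narration)]
```

With these assumed defaults, content measured well below the threshold passes through unchanged, while louder content is pulled down so that the narration stays audible.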

Publication date: 01-04-2021

CONFERENCING AUDIO MANIPULATION FOR INCLUSION AND ACCESSIBILITY

Number: US20210098013A1
Author: Day Phil Noel

Various embodiments herein each include at least one of systems, methods, and software for conference audio manipulation for inclusion and accessibility. One embodiment is in the form of a method that may be performed, for example, on a server or a participant computing device. This method includes receiving a voice signal via a network and modifying an audible characteristic of the voice signal that is perceptible when the voice signal is audibly output. The method further includes outputting the voice signal including the modified audible characteristic. 1. A method comprising: receiving a voice signal via a network; selecting a speaker profile based on at least one of a property and content of the voice signal; modifying, based in part on the selected speaker profile, an audible characteristic of the voice signal that is perceptible when the voice signal is audibly output; and outputting the voice signal including the modified audible characteristic. 2. The method of claim 1, wherein the audible characteristic of the voice signal is an audible frequency range. 3. The method of claim 2, wherein modifying the audible characteristic includes changing occurrences of the audible frequency range within the voice signal to a different audible frequency range based on a user setting. 4. The method of claim 1, wherein the speaker profile is selected based on processing of the audio signal by a speaker recognition process to obtain speaker identity data that is used to select the speaker profile. 5. The method of claim 1, wherein a speaker profile identifies at least one audible characteristic of the voice signal that is to be modified and how each of the at least one audible characteristics are to be modified. 6. The method of claim 1, wherein the modifying of the audible characteristic of the voice signal includes applying a filter to remove an audible portion of the voice signal. 7. The method of claim 1, further comprising: receiving user input that identifies the audible ...

Publication date: 14-04-2016

RESPIRATOR MASK SPEECH ENHANCEMENT APPARATUS AND METHOD

Number: US20160101301A1
Author: Kihlberg Roger

Speech enhancement apparatus and respirator masks including speech enhancement apparatus, as well as methods of enhancing speech transmission for the wearer of a respirator mask are described herein. In one or more embodiments, the speech enhancement apparatus and methods described herein detect acoustic energy within a first frequency range in the clean air envelope of a respirator mask and deliver compensating acoustic energy outside of the clean air envelope using a speaker. The compensating acoustic energy is, in one or more embodiments, delivered in one or more predetermined attenuated frequency ranges that cover less than all of the detected first frequency range. In one or more embodiments, the compensating acoustic energy may be delivered with an attenuated amplitude profile that is uniform or non-uniform over the one or more attenuated frequency ranges. 1. A respirator mask comprising: a mask body configured to define a clean air envelope between the mask and the mouth and nose of a wearer; a microphone configured for attachment to the mask body, the microphone further configured to detect acoustic energy within the clean air envelope when attached to the mask body; a speaker configured to produce acoustic energy outside of the clean air envelope; and a controller operably ... receive a speech signal from the microphone, wherein the speech signal is indicative of acoustic energy detected by the microphone within a first frequency range; and deliver an output signal to the speaker, wherein the output signal is configured to cause the speaker to emit compensating acoustic energy, wherein the compensating acoustic energy is emitted in one or more predetermined attenuated frequency ranges that cover less than all of the first frequency range, and wherein the compensating acoustic energy comprises a predetermined attenuated amplitude profile over each predetermined attenuated frequency range of the one or more predetermined attenuated frequency ranges. ...

Publication date: 06-04-2017

ENHANCING INTELLIGIBILITY OF SPEECH CONTENT IN AN AUDIO SIGNAL

Number: US20170098456A1

Embodiments of the present invention relate to signal processing. Methods for enhancing intelligibility of speech content in an audio signal are disclosed. One of the methods comprises obtaining reference loudness of the audio signal. The method further comprises enhancing the intelligibility of the speech content by adjusting partial loudness of the audio signal based on the reference loudness and a degree of the intelligibility. Corresponding systems and computer program products are also disclosed. 1. A method for enhancing intelligibility of speech content in an audio signal, the speech content contained in a speech component of the audio signal, the method comprising: obtaining reference loudness of the audio signal; and enhancing the intelligibility of the speech content by adjusting partial loudness of the audio signal based on the reference loudness and a degree of the intelligibility. 2. The method according to claim 1, wherein adjusting the partial loudness of the audio signal comprises: increasing the partial loudness of the speech component based on the reference loudness and the degree of the intelligibility. 3. The method according to claim 1, wherein adjusting the partial loudness of the audio signal comprises: in response to a determination that the audio signal contains a non-speech component, reducing the partial loudness of the non-speech component based on the reference loudness and the degree of the intelligibility. 4. The method according to claim 1, wherein enhancing the intelligibility of the speech content by adjusting the partial loudness of the audio signal comprises: adjusting the partial loudness of the audio signal to the reference loudness; determining whether an intelligibility criterion is met by the intelligibility of the speech content in the adjusted audio signal; determining target loudness in response to the intelligibility criterion being not met; and adjusting the partial loudness of the audio signal to the target loudness. 5. The ...

Publication date: 12-05-2022

METHOD AND APPARATUS FOR PROCESSING AN AUDIO SIGNAL, AUDIO DECODER, AND AUDIO ENCODER

Number: US20220148609A1

A method is described that processes an audio signal. A discontinuity between a filtered past frame and a filtered current frame of the audio signal is removed using linear predictive filtering. 1. A method for processing an audio signal, the method comprising: removing a discontinuity between a filtered past frame and a filtered current frame of the audio signal using linear predictive filtering. 2. The method of claim 1, comprising filtering the current frame of the audio signal and removing the discontinuity by modifying a beginning portion of the filtered current frame by a signal acquired by linear predictive filtering a predefined signal with initial states of the linear predictive filter defined on the basis of a last part of the past frame. 3. The method of claim 2, wherein the initial states of the linear predictive filter are defined on the basis of a last part of the unfiltered past frame filtered using the set of filter parameters for filtering the current frame. 4. The method of claim 1, further comprising estimating the linear predictive filter on the filtered or non-filtered audio signal. 5. The method of claim 4, wherein estimating the linear predictive filter comprises estimating the filter based on the past and/or current frame of the audio signal or based on the past filtered frame of the audio signal using the Levinson-Durbin algorithm. 6. The method of claim 1, wherein the linear predictive filter comprises a linear predictive filter of an audio codec. 7. The method of claim 1, wherein removing the discontinuity comprises processing the beginning portion of the filtered current frame, wherein the beginning portion of the current frame comprises a predefined number of samples being less than or equal to the total number of samples in the current frame, and wherein processing the beginning portion of the current frame comprises subtracting a beginning portion of a zero-input-response (ZIR) from the beginning portion of the filtered ...

Publication date: 13-04-2017

VEHICLE AUDIO TRANSMISSION CONTROL

Number: US20170103773A1

Methods and systems for controlling audio communications between occupants of a vehicle are provided. In accordance with one embodiment, a system includes an interface and a processor. The interface is configured to at least facilitate receiving a request for sound transmission from a first occupant inside a vehicle to a second occupant inside the vehicle. The processor is coupled to the interface, and is configured to at least facilitate identifying respective locations of the first occupant and the second occupant, and performing the sound transmission with an adjustment for a phase difference based at least in part on the respective locations of the first occupant and the second occupant. 1. A method comprising: receiving a request for sound transmission from a first occupant inside a vehicle to a second occupant inside the vehicle; identifying respective locations of the first occupant and the second occupant; adjusting for a phase difference between a transmitted sound from the first occupant and a reflected sound from the first occupant, wherein the adjusting for the phase difference is made based at least in part on the respective locations of the first occupant and the second occupant; and performing the sound transmission with the adjustment for the phase difference, wherein the step of performing the sound transmission comprises providing the sound transmission of an amplified sound from the first occupant via an audio speaker that is disposed inside the vehicle proximate to the second occupant, and wherein the adjustment adjusts for a latency between the amplified sound and the reflected sound. 2.-3. (canceled) 4. The method of claim 1, wherein the adjustment adjusts for a latency between the amplified sound and the reflected sound. 5. The method of claim 4, further comprising: determining a distance between the first occupant and the second occupant; and determining the latency using the distance. 6. The method of claim 5, wherein the step of determining the ...
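
Claim 5 derives the latency from the distance between the two occupants. A minimal sketch of that step follows, assuming the latency is simply the acoustic travel time over that distance; the speed-of-sound constant, the sample-based delay line, and all function names are illustrative assumptions rather than details from the patent.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumption: dry air at roughly 20 degrees Celsius


def acoustic_latency_s(distance_m):
    """Travel time of sound over distance_m, in seconds."""
    if distance_m < 0:
        raise ValueError("distance must be non-negative")
    return distance_m / SPEED_OF_SOUND_M_S


def delay_in_samples(distance_m, sample_rate_hz):
    """Latency expressed as a whole number of samples."""
    return round(acoustic_latency_s(distance_m) * sample_rate_hz)


def delay_signal(samples, n):
    """Delay a block of samples by n samples, padding the start with silence."""
    n = min(n, len(samples))
    return [0.0] * n + list(samples[:len(samples) - n])
```

Delaying the amplified signal by this amount is one plausible way to line it up with sound that reaches the second occupant through the cabin; the listing does not say how the patent applies the latency.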

Publication date: 13-04-2017

Audio Signal Processing

Number: US20170103774A1
Author: Karsten V. Sørensen
Assignee: Microsoft Technology Licensing LLC

An estimated system gain spectrum of an acoustic system is generated, and updated in real-time to respond to changes in the acoustic system. Peak gains in the estimated system gain spectrum are tracked as the estimated system gain spectrum is updated. Based on the tracking, at least one frequency at which the estimated system gain spectrum is currently exhibiting a peak gain is identified. Based on the identification of the at least one frequency, an audio equalizer is controlled to apply, to a first speech containing signal to be played out via an audio output device of the audio device and/or to a second speech containing signal received via an audio input device of the audio device, an equalization filter to reduce the level of that signal at the identified frequency. The equalization filter is applied continuously throughout intervals of both speech activity and speech inactivity in that signal.

Publication date: 21-04-2016

SYSTEMS, METHODS, AND DEVICES FOR INTELLIGENT SPEECH RECOGNITION AND PROCESSING

Number: US20160111111A1
Author: Levitt Harry

Systems, methods, and devices for intelligent speech recognition and processing are disclosed. According to one embodiment, a method for improving intelligibility of a speech signal may include (1) at least one processor receiving an incoming speech signal comprising a plurality of sound elements; (2) the at least one processor recognizing a sound element in the incoming speech signal to improve the intelligibility thereof; (3) the at least one processor processing the sound element by at least one of modifying and replacing the sound element; and (4) the at least one processor outputting the processed speech signal comprising the processed sound element. 1. A method for improving intelligibility of a speech signal, comprising: at least one processor receiving an incoming speech signal comprising a plurality of sound elements; the at least one processor recognizing a sound element in the incoming speech signal to improve the intelligibility thereof; the at least one processor processing the sound element by at least one of modifying and replacing the sound element; and the at least one processor outputting the processed speech signal comprising the processed sound element. 2. The method of claim 1, wherein the sound element comprises at least one of a continuant sound element and a non-continuant sound element. 3. The method of claim 1, wherein the processing increases a duration of the sound element. 4. The method of claim 1, wherein the processing decreases a duration of the sound element. 5. The method of claim 1, further comprising: the at least one processor recognizing a second sound element in the incoming speech signal to improve the intelligibility thereof; and the at least one processor processing the second sound element by at least one of modifying and replacing the sound element; wherein the second sound element is modified or replaced to compensate for the processing of the first sound element. 6. The method of claim 1, wherein the sound element is a speech ...

Publication date: 20-04-2017

Voice converting apparatus and method for converting user voice thereof

Number: US20170110143A1
Assignee: SAMSUNG ELECTRONICS CO LTD

A voice converting apparatus and a voice converting method are provided. The method of converting a voice using a voice converting apparatus includes receiving a voice from a counterpart, analyzing the voice and determining whether the voice is abnormal, converting the voice into a normal voice by adjusting a harmonic signal of the voice in response to determining that the voice is abnormal, and transmitting the normal voice.

Publication date: 29-04-2021

APPARATUS AND METHOD FOR POWER EFFICIENT SIGNAL CONDITIONING FOR A VOICE RECOGNITION SYSTEM

Number: US20210125607A1
Assignee: Google Technology Holdings LLC

A disclosed method includes monitoring an audio signal energy level while having a noise suppressor deactivated to conserve battery power, buffering the audio signal in response to a detected increase in the audio energy level, activating and running a voice activity detector on the audio signal in response to the detected increase in the audio energy level and activating and running a noise estimator in response to voice being detected in the audio signal by the voice activity detector. The method may further include activating and running the noise suppressor only if the noise estimator determines that noise suppression is required. The method activates and runs a noise type classifier to determine the noise type based on information received from the noise estimator and selects a noise suppressor algorithm, from a group of available noise suppressor algorithms, where the selected noise suppressor algorithm is the most power consumption efficient. 1. A computer-implemented method that, when executed on data processing hardware of a computing device, causes the data processing hardware to perform operations comprising: receiving an audio signal detected by a first microphone in a group of microphones of the computing device while a second microphone in the group of microphones is powered off; while the second microphone is powered off, determining an audio signal energy level of the audio signal detected by the first microphone has deviated from a baseline audio signal energy level by more than a threshold amount; and in response to determining that the audio signal energy level of the audio signal detected by the first microphone has deviated from the baseline audio signal energy level by more than the threshold amount, triggering the second microphone to power on. 2. The computer-implemented method of claim 1, wherein the operations further comprise, in response to determining the audio signal energy level of the audio signal detected by the first microphone has ...
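
The energy check in claim 1 (deviation from a baseline by more than a threshold amount) can be sketched as follows. This is a minimal illustration under stated assumptions: frame energy is taken as mean-square level in dB, and the 6 dB default threshold and all function names are hypothetical, not taken from the patent.

```python
import math


def frame_energy_db(frame):
    """Mean-square energy of one audio frame, in dB (assumed energy measure)."""
    if not frame:
        raise ValueError("frame must contain at least one sample")
    mean_square = sum(s * s for s in frame) / len(frame)
    # A small floor avoids log10(0) on an all-zero frame.
    return 10.0 * math.log10(mean_square + 1e-12)


def should_wake(frame, baseline_db, threshold_db=6.0):
    """True when the frame energy deviates from the baseline by more than the threshold."""
    return abs(frame_energy_db(frame) - baseline_db) > threshold_db


def first_wake_index(frames, baseline_db, threshold_db=6.0):
    """Index of the first frame that would trigger the later stages, or None."""
    for i, frame in enumerate(frames):
        if should_wake(frame, baseline_db, threshold_db):
            return i
    return None
```

Only this cheap comparison runs all the time; under this reading, the costlier stages (second microphone, voice activity detector, noise estimator) start only after `should_wake` returns true.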

Publication date: 27-04-2017

METHOD AND SYSTEM FOR ADJUSTING USER SPEECH IN A COMMUNICATION SESSION

Number: US20170116883A1

A system that incorporates the subject disclosure may, for example, receive user speech captured at a second end user device during a communication session between the second end user device and a first end user device, apply speech recognition to the user speech, identify an unclear word in the user speech based on the speech recognition, adjust the user speech to generate adjusted user speech by replacing all or a portion of the unclear word with replacement audio content, and provide the adjusted user speech to the first end user device during the communication session. Other embodiments are disclosed. 1. A method, comprising: detecting, by a processing system including a processor, a communication session between a first user and a second user; receiving, by the processing system, user input from the first user, wherein the user input is sent from a first communication device; determining, by the processing system, an impairment of the first user responsive to analyzing, by the processing system, the user input from the first user; modifying, by the processing system, the user input according to a group of adjustment techniques resulting in modified user input responsive to determining the impairment; and providing, by the processing system, the modified user input to a second communication device, wherein the second communication device is associated with the second user. 2. The method of claim 1, wherein the determining of the impairment of the first user comprises: accessing, by the processing system, a first user profile for the first user; and determining, by the processing system, the impairment of the first user according to the first user profile. 3. The method of claim 1, wherein the determining of the impairment of the first user comprises: monitoring, by the processing system, previous communications of the first user resulting in monitored previous communications; and determining, by the processing system, the impairment of the first user according ...

Publication date: 09-04-2020

Information processing apparatus and information processing method

Number: US20200111505A1
Assignee: Sony Corp

[Object] To more flexibly control the affinity of a spoken utterance for a background sound in accordance with the importance degree of an information notification. [Solution] There is provided an information processing apparatus including an utterance control unit that controls an output of a spoken utterance corresponding to notification information. The utterance control unit controls an output mode of the spoken utterance on the basis of an importance degree of the notification information and affinity for a background sound. In addition, there is provided an information processing method including controlling, by a processor, an output of a spoken utterance corresponding to notification information. The controlling further includes controlling an output mode of the spoken utterance on the basis of an importance degree of the notification information and affinity for a background sound.

Publication date: 25-08-2022

AUDIO SIGNAL PROCESSING METHOD, APPARATUS AND DEVICE, AND STORAGE MEDIUM

Number: US20220270631A1

An electronic device obtains audio signals collected by different microphones in a microphone array. The device filters the audio signals using a first filter to obtain a first target beam. The first filter is configured to suppress an interference speech in the audio signals and enhance a target speech in the audio signals. The device filters the audio signals using a second filter to obtain a first interference beam. The second filter is configured to suppress the target speech and enhance the interference speech. The device obtains a second interference beam of the first interference beam using a third filter. The device determines a difference between the first target beam and the second interference beam as a first audio processing output. The device adaptively updates at least one of the second filter and the third filter, and updates the first filter according to the updated second filter and/or third filter. 1. An audio signal processing method performed by an electronic device, the method comprising: obtaining audio signals collected by different microphones in a microphone array; filtering the audio signals using a first filter to obtain a first target beam, wherein the first filter is configured to suppress an interference speech in the audio signals and enhance a target speech in the audio signals; filtering the audio signals using a second filter to obtain a first interference beam, wherein the second filter is configured to suppress the target speech and enhance the interference speech; obtaining a second interference beam of the first interference beam using a third filter, wherein the third filter is configured to perform a weighted adjustment on the first interference beam; determining a difference between the first target beam and the second interference beam as a first audio processing output; and adaptively updating at least one of the second filter and the third filter; and updating the first filter according to the updated second filter and/or third filter. 2 ...
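
The structure in this abstract (target beam minus a weighted interference beam, with adaptive updating) resembles a generalized sidelobe canceller. The sketch below is a deliberately reduced two-microphone illustration under several assumptions: the first and second filters are fixed sum and difference beams, the third filter is a single scalar weight adapted with a normalized LMS rule, and the step of updating the first filter is omitted. None of these choices, nor the names used, come from the patent.

```python
def two_mic_canceller(mic_a, mic_b, mu=0.5, eps=1e-8):
    """Return (output samples, final weight) for two time-aligned microphone signals.

    Assumptions: the target arrives identically at both microphones, so the
    difference beam contains only interference; `mu` and `eps` are hypothetical.
    """
    weight = 0.0  # "third filter": one adaptive scalar weight
    output = []
    for a, b in zip(mic_a, mic_b):
        target_beam = 0.5 * (a + b)        # "first filter": fixed sum beam
        interference_beam = 0.5 * (a - b)  # "second filter": blocks the target
        # Output is the target beam minus the weighted interference beam.
        y = target_beam - weight * interference_beam
        # Normalized LMS step that reduces the interference left in the output.
        weight += mu * y * interference_beam / (interference_beam ** 2 + eps)
        output.append(y)
    return output, weight
```

With an interferer that reaches the two microphones at different gains and no target present, the weight settles at the value that cancels the interferer from the output.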

Publication date: 25-08-2022

EVALUATION APPARATUS, TRAINING APPARATUS, METHODS AND PROGRAMS FOR THE SAME

Number: US20220270635A1

An evaluation device applies a lowpass filter with a cutoff frequency being a first predetermined value or a second predetermined value greater than the first predetermined value with or without change of feedback formant frequencies which are formant frequencies of a picked-up speech signal, converts the picked-up speech signal, feeds back the converted speech signal to a subject, and includes an evaluation unit that calculates a compensatory response vector by using pickup formant frequencies which are formant frequencies of a speech signal acquired by picking up an utterance made by the subject while feeding back a speech signal that has been converted with change of the feedback formant frequencies to the subject, and pickup formant frequencies which are formant frequencies of a speech signal acquired by picking up an utterance made by the subject while feeding back a speech signal that has been converted without change of the feedback formant frequencies to the subject, and determines an evaluation based on a compensatory response vector for each cutoff frequency. 1. An evaluation device comprising: a signal analyzer configured to analyze a picked-up speech signal, and determine a first formant frequency and a second formant frequency; a convertor configured to apply a lowpass filter with a cutoff frequency being a first predetermined value or a second predetermined value greater than the first predetermined value with or without change of feedback formant frequencies which are formant frequencies of the picked-up speech signal, and convert the picked-up speech signal; a feedback feeder configured to feed back the converted speech signal to a subject; and an evaluator configured to determine a compensatory response vector by using pickup formant frequencies which are formant frequencies of a speech signal acquired by picking up an utterance made by the subject while feeding back a speech signal that has been converted with change of the feedback formant frequencies ...

Publication date: 27-05-2021

Audio Signal

Number: US20210158833A1
Author: Cooke Michael

A computer device for processing audio signals is described. The computer device includes at least a processor and a memory. The computer device is configured to receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal. The computer device is configured to compress the combined audio signal to provide a compressed audio signal. The computer device is configured to control a dynamic range of the compressed audio signal to provide an output audio signal. In this way, a quality of the speech included in the output audio signal is improved. 1. A computer device for processing audio signals, the computer device including at least a processor and a memory, wherein the computer device is configured to: receive a bitstream comprising a combined audio signal, the combined audio signal comprising a first audio signal including speech and a second audio signal; compress the combined audio signal to provide a compressed audio signal; and control a dynamic range of the compressed audio signal to provide an output audio signal; whereby a quality of the speech included in the output audio signal is improved. 2. The computer device according to claim 1, wherein the computer device is configured to compress the combined audio signal by selectively reducing an amplitude of the second audio signal. 3. The computer device according to any previous claim, wherein the computer device is configured to compress the combined audio signal by selectively increasing an amplitude of the speech included in the first audio signal. 4. The computer device according to any previous claim, wherein the computer device is configured to compress the combined audio signal by matching amplitudes of the first audio signal and the second audio signal. 5. The computer device according to any previous claim, wherein the computer device is configured to: selectively ...

Publication date: 11-05-2017

Method and Device for Processing Sound Signal for Communications Device

Number: US20170133032A1

A method and a device for processing a sound signal for a communications device, where a relationship between values of a volume of a first sound signal collected by a main microphone and a volume of a second sound signal collected by an auxiliary microphone is acquired by comparison, to determine a sound signal processing policy, and according to the sound signal processing policy, a sound signal to be sent to a peer communications terminal is determined, where the sound signal processing policy is used to ensure that a volume of the sound signal to be sent to the peer communications terminal exceeds a preset volume threshold. 1. A method for processing a sound signal for a communications device, comprising: acquiring a first sound signal collected by a main microphone and a second sound signal collected by an auxiliary microphone; determining a sound signal processing policy according to a relationship between values of a volume of the first sound signal and a volume of the second sound signal; and determining, according to the sound signal processing policy, a sound signal to be sent to a peer communications terminal, wherein the sound signal processing policy ensures a volume of the sound signal to be sent to the peer communications terminal exceeds a preset volume threshold. 2. The method according to claim 1, wherein the sound signal processing policy comprises a first sound signal processing policy, wherein determining the sound signal processing policy comprises determining that the sound signal processing policy is the first sound signal processing policy when the volume of the first sound signal is less than the volume of the second sound signal, wherein the first sound signal processing policy comprises determining whether the volume of the second sound signal is greater than or equal to the preset volume threshold, and wherein determining the sound signal to be sent to the peer communications terminal comprises: sending the second ...

Publication date: 11-05-2017

ENHANCEMENT OF AUDIO CAPTURED BY MULTIPLE MICROPHONES AT UNSPECIFIED POSITIONS

Number: US20170133036A1

Embodiments disclosed herein provide systems, methods, and computer readable media for steering a camera and enhancing audio captured by microphones at unspecified positions. In a particular embodiment, a method provides receiving audio captured by the plurality of microphones at a location and receiving video captured of a scene that includes the plurality of microphones captured by a first camera at a first camera position. The method further provides identifying the plurality of microphones in the scene and determining physical positions of the plurality of microphones at the location relative to the first camera position. The method then provides adjusting the audio based on the physical positions of the plurality of microphones. 1. A method of determining positions of a plurality of microphones , the method comprising:receiving audio captured by the plurality of microphones at a location;receiving video captured of a scene that includes the plurality of microphones captured by a first camera at a first camera position;identifying the plurality of microphones in the scene;determining physical positions of the plurality of microphones at the location relative to the first camera position; andadjusting the audio based on the physical positions of the plurality of microphones.2. The method of claim 1 , further comprising:identifying a speaker in the audio;determining a first physical position of the speaker based on the physical positions of the plurality of microphones; andadjusting a video camera to feature the first physical position.3. The method of claim 2 , wherein determining a first physical position of the speaker comprises:determining a time difference between when each of the plurality of microphones captured a portion of the audio from the speaker.4. The method of claim 1 , wherein identifying the plurality of microphones comprises:performing image recognition on the video to identify each microphone of the plurality of microphones.5. The method of ...
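
Claim 3 relies on the time difference between microphones capturing the same sound. A minimal sketch of one common way to estimate such a difference is a brute-force cross-correlation over candidate lags. The function names `best_lag` and `arrival_delays` and the choice of cross-correlation are assumptions for illustration; the source does not say how the time difference is computed.

```python
def best_lag(ref, other, max_lag):
    """Return the lag (in samples) at which `other` best matches `ref`.

    A positive result means `other` captured the sound later than `ref`.
    """
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, r in enumerate(ref):
            j = i + lag
            if 0 <= j < len(other):
                score += r * other[j]
        if score > best_score:
            best, best_score = lag, score
    return best

def arrival_delays(channels, sample_rate, max_lag):
    """Delay in seconds of every channel relative to the first channel."""
    ref = channels[0]
    return [best_lag(ref, ch, max_lag) / sample_rate for ch in channels]
```

For example, with a short pulse and a copy delayed by three samples at a 1000 Hz sample rate, `arrival_delays` reports a 3 ms delay for the second channel.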

Publication date: 01-09-2022

Playback enhancement in audio systems

Number: US20220277759A1
Author: Joseph Gaalaas
Assignee: Bose Corp

Audio systems and methods are provided that enhance a portion of audio content relative to other portions of the audio content. The systems and methods select the portion to be enhanced and calculate an intelligibility metric of the selected portion, such as a dialogue portion. The systems and methods determine a gain based at least in part upon the intelligibility metric and apply the gain to the selected portion to provide an enhanced portion. The systems and methods provide an audio signal, based at least in part upon the enhanced portion, to an output for conversion to an acoustic signal, such as by an acoustic transducer.

Publication date: 21-05-2015

NOISE ADAPTIVE POST FILTERING

Number: US20150142425A1

An apparatus comprising at least one processor and at least one memory including computer code for one or more programs, the at least one memory and the computer code configured to, with the at least one processor, cause the apparatus to at least perform: estimating a signal to noise ratio value for an audio signal; generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal. 1-30. (canceled) 31. A method comprising: estimating a signal to noise ratio value for an audio signal; generating a post-filter comprising at least one of: a first formant frequency filter and a second formant frequency filter, wherein the post-filter is dependent on the signal to noise ratio value for the audio signal. 32. The method as claimed in claim 31, wherein the post-filter is configured to move energy of the audio signal to higher frequencies. 33. The method as claimed in claim 31, wherein generating the post-filter comprising the first formant frequency filter further comprises generating a first formant frequency parameter configured to attenuate first formant frequency components of the audio signal dependent on the signal to noise ratio value for the audio signal. 34. 
The method as claimed in claim 33 , wherein generating the first formant frequency parameter dependent on the signal to noise ratio value for the audio signal comprises:comparing the signal to noise ratio value for the audio signal against a first signal to noise ratio threshold value;generating a maximum post-filter first formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being greater than the signal to noise ratio threshold value; andgenerating a second post-filter formant frequency parameter value dependent on the signal to noise ratio value for the audio signal being less than the signal to noise ...

Publication date: 23-04-2020

PERSONALIZED, REAL-TIME AUDIO PROCESSING

Number: US20200126580A1

An apparatus and method for real-time audio processing employs a gaze detection sensor to detect a direction of a user's gaze and output a gaze signal corresponding to the detected direction of the user's gaze. A digital signal processing unit responds to a plurality of signals corresponding to a plurality of sounds received at the apparatus, and the determined direction of gaze to identify a signal of interest from the plurality of signals using the gaze signal. The signal of interest is processed for output to the user. In embodiments, a microphone array provides the plurality of signals. An imaging sensor may work with either the microphone array or the gaze detection sensor to identify the signal of interest. 1. (canceled)2. A computer-implemented method performed by a user wearable device , the method comprising:detecting a direction of a gaze of a user based at least in part on monitoring user head motion;identifying, in an electronic data storage, one or more available actions that correspond to the direction of the gaze of the user;selecting a first action from one or more available actions based at least in part on the direction of the gaze of the user; andexecuting a set of computer-readable instructions that correspond to the first action.3. The method of claim 2 , wherein detecting the direction of the gaze of the user comprises:detecting the direction of the gaze of the user based at least in part on monitoring a position of an eye of the user.4. The method of claim 2 , wherein detecting the direction of the gaze of the user comprises:detecting the direction of the gaze of the user based at least in part on an image.5. The method of claim 2 , wherein selecting the first action from the one or more available actions comprises:selecting to adjust playback of audio, video, or both, based at least in part on the direction of the gaze of the user.6. 
The method of claim 2 , wherein selecting the first action from the one or more available actions comprises: ...

Publication date: 08-09-2022

METHOD FOR DENOISING VOICE DATA, DEVICE, AND STORAGE MEDIUM

Number: US20220284914A1
Author: Liu Rong

The present disclosure provides a method for denoising voice data, an electronic device, and a computer readable storage medium. The present disclosure relates to the technical field of artificial intelligence, such as Internet of Vehicles, smart cockpit, smart voice, and voice recognition. A specific embodiment of the method includes: receiving an input to-be-played first piece of voice data; and invoking, in response to not detecting a synthetic voice interruption signal in a process of playing the first piece of voice data, a preset first denoising algorithm to filter out noise data except for the first piece of voice data. 1. A method for denoising voice data , comprising:receiving an input to-be-played first piece of voice data; andinvoking, in response to not detecting a synthetic voice interruption signal in a process of playing the first piece of voice data, a preset first denoising algorithm to filter out noise data except for the first piece of voice data.2. The method according to claim 1 , wherein the method further comprises:receiving, in response to detecting the synthetic voice interruption signal in the process of playing the first piece of voice data, an input second piece of voice data based on the synthetic voice interruption signal, and invoking a preset second denoising algorithm to filter out voice data except for human voice data from the second piece of voice data.3. The method according to claim 1 , wherein the invoking the preset first denoising algorithm to filter out the noise data except for the first piece of voice data comprises:identifying in-vehicle regular noises based on a preset in-vehicle regular noise feature set; andremoving an in-vehicle regular noise mixedly played with the first piece of voice data.4. The method according to claim 2 , wherein the invoking the preset second denoising algorithm to filter out the voice data except for the human voice data in the second piece of voice data comprises:identifying in-vehicle ...

Publication date: 09-05-2019

Speech Synthesis Device and Method

Number: US20190139535A1

This invention is an improvement of technology for automatically generating response voice to voice uttered by a speaker (user), and is characterized by controlling a pitch of the response voice in accordance with a pitch of the speaker's utterance. A voice signal of the speaker's utterance (e.g., question) is received, and a pitch (e.g., highest pitch) of a representative portion of the utterance is detected. Voice data of a response to the utterance is acquired, and a pitch (e.g., average pitch) based on the acquired response voice data is acquired. A pitch shift amount for shifting the acquired pitch to a target pitch having a particular relationship to the pitch of the representative portion is determined. When response voice is to be synthesized on the basis of the response voice data, the pitch of the response voice to be synthesized is shifted in accordance with the pitch shift amount. 1. A speech synthesis method comprising: receiving a voice signal of an utterance; detecting a voiced section of the voice signal; detecting a pitch of a trailing end portion of the voiced section; acquiring voice data of a response to the utterance; acquiring a representative pitch based on the voice data of the response; determining one shift amount for shifting the representative pitch to a target pitch having a particular relationship to the detected pitch of the trailing end portion; and synthesizing voice of the response based on the voice data of the response, while shifting pitch of the voice data in accordance with the one shift amount. 2. The speech synthesis method as claimed in claim 1, wherein the voiced section of the voice signal is a portion where a pitch of the voice signal is detectable. 3. The speech synthesis method as claimed in claim 1, wherein the trailing end portion is a part of the voiced section. 4. The speech synthesis method as claimed in claim 1, wherein the trailing end portion has a predetermined time width. 5. 
The speech ...
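
The shift-amount step of claim 1 can be illustrated with a small sketch. The "particular relationship" between the target pitch and the detected pitch is not specified in the text above, so the default interval of seven semitones below the detected pitch is purely an assumption, as are the function names and the use of semitones as the unit.

```python
import math

def semitones_between(f_from, f_to):
    """Signed distance in semitones from frequency f_from to frequency f_to."""
    return 12.0 * math.log2(f_to / f_from)

def pitch_shift_amount(trailing_pitch_hz, response_pitch_hz, interval_semitones=-7.0):
    """One shift amount that moves the response's representative pitch to the target.

    The target is the detected trailing-end pitch offset by a fixed interval
    (hypothetical default: seven semitones below).
    """
    target_hz = trailing_pitch_hz * 2.0 ** (interval_semitones / 12.0)
    return semitones_between(response_pitch_hz, target_hz)

def shift_contour(contour_hz, shift_semitones):
    """Apply the same shift to every pitch value of the response."""
    ratio = 2.0 ** (shift_semitones / 12.0)
    return [f * ratio for f in contour_hz]
```

Because a single shift amount is applied to the whole contour, the intonation shape of the response is preserved and only its overall register moves.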

Publication date: 30-04-2020

Method and Device for Recognizing State of Meridian

Number: US20200135228A1
Author: Zhonghua Ci
Assignee: Individual

The present application relates to a method and device for recognizing the state of a human body meridian by utilizing a voice recognition technology, the method comprising: receiving an input voice of a user; preprocessing the input voice; extracting a stable feature of the preprocessed input voice; primarily classifying the stable feature on the basis of a feature recognition model, and determining a basic classification pitch, wherein the basic classification pitch comprises Gong, Shang, Jue, Zhi and Yu (respectively equivalent to do, re, mi, sol and la); secondarily classifying the stable feature on the basis of the feature recognition model, and determining a secondary classification tone in the basic classification pitch; and recognizing the state of a meridian according to the secondary classification tone. The method for recognizing the state of a human body meridian of the present invention can accurately recognize the state of a human body meridian by classifying individual voices, thus solving the problem that conventional voice recognition and classification are completely dependent on human experience.

Publication date: 10-06-2021

PROCESSING SPOKEN COMMANDS TO CONTROL DISTRIBUTED AUDIO OUTPUTS

Number: US20210174802A1

A system is capable of controlling multiple entertainment systems and/or speakers using voice commands. The system receives voice commands and may determine audio sources and speakers indicated by the voice commands. The system may generate audio data from the audio sources and may send the audio data to the speakers using multiple interfaces. For example, the system may send the audio data directly to the speakers using a network address, may send the audio data to the speakers via a voice-enabled device or may send the audio data to the speakers via a speaker controller. The system may generate output zones including multiple speakers and may associate input devices with speakers within the output zones. For example, the system may receive a voice command from an input device in an output zone and may reduce output audio generated by speakers in the output zone. 1-20. (canceled) 21. A computer-implemented method comprising: detecting, by an input device in a first environment, input audio corresponding to an utterance; determining an output device causing first audio to be output in the first environment; and based at least in part on detecting the input audio, sending, to a networking component associated with the output device, an override command to reduce a volume of the first audio. 22. The computer-implemented method of claim 21, further comprising: outputting, by the input device, second audio in the first environment; and based at least in part on detecting the input audio, reducing a volume of the second audio. 23. The computer-implemented method of claim 21, wherein the input device is paired with the output device using a wireless connection. 24. The computer-implemented method of claim 21, further comprising: determining an identifier corresponding to the output device, wherein sending the override command is further based at least in part on the identifier. 25. 
The computer-implemented method of claim 21 , further comprising:determining that the output ...

Publication date: 25-05-2017

METHOD AND APPARATUS FOR DISCRIMINATING BETWEEN VOICE SIGNALS

Number: US20170149461A1
Assignee: MOTOROLA SOLUTIONS, INC.

A method and apparatus for distinguishing voice signals that are played together over the same speaker employs spectral reshaping of one or more of the audio signals. The spectral reshaping modifies the timbre of the voice signal while not modifying the pitch of the voice signal. Additional techniques can be used to further distinguish voice signals, such as dynamic gain offset and frequency shifting. After processing one or more signals to spectrally reshape them, they can be played over the same speaker. A user hearing the resulting acoustic signal will be more able to distinguish between the multiple voice signals being played. 1. A method for differentiating audio signals when played together over a speaker, comprising: receiving, at the same time, a primary audio signal on a primary channel and a secondary audio signal on a secondary channel; spectrally reshaping at least one of the primary audio signal and the secondary audio signal based on spectral content of the other audio signal to produce resulting signals including at least one reshaped signal; mixing the resulting signals; and playing the resulting signals over the speaker. 2. The method of claim 1, wherein spectrally reshaping is performed in response to detecting voice content in the primary audio signal. 3. The method of claim 1, further comprising adjusting a gain of at least one of the resulting signals to maintain a preselected gain offset between the resulting signals. 4. The method of claim 3, wherein the preselected gain is based on the instantaneous energy of the primary audio signal. 5. The method of claim 1, wherein spectrally reshaping is performed continuously based on spectral comparison of the spectral content of the primary and secondary audio signals. 6. 
The method of claim 1 , wherein spectrally reshaping is performed by calculating an energy level in each of a plurality of sub-bands of the primary and secondary audio signals and applying a dynamic equalization adjustment to at ...

Publication date: 17-06-2021

ADAPTIVE SPEECH INTELLIGIBILITY CONTROL FOR SPEECH PRIVACY

Number: US20210183402A1

In some examples, adaptive speech intelligibility control for speech privacy may include determining, based on background noise at a near-end of a speaker, a noise estimate associated with speech emitted from the speaker, and comparing, by using a specified factor, the noise estimate to a speech level estimate for the speech emitted from the speaker. Adaptive speech intelligibility control for speech privacy may further include determining, based on the comparison, a gain value to be applied to the speaker to produce the speech at a specified level to maintain on-axis intelligibility with respect to the speaker, and applying the gain value to the speaker. 1. An adaptive speech intelligibility control for speech privacy apparatus comprising: a processor; and a memory storing machine readable instructions that when executed by the processor cause the processor to: determine, based on background noise at a near-end of a speaker, a noise estimate associated with speech emitted from the speaker; compare, by using a specified factor, the noise estimate to a speech level estimate for the speech emitted from the speaker; determine, based on the comparison, a gain value to be applied to the speaker to produce the speech at a specified level to maintain on-axis intelligibility with respect to the speaker; and apply the gain value to the speaker. 2. The apparatus according to claim 1, wherein the speaker includes an ultrasonic modulator to modulate the speech, and a piezo-transducer to receive the modulated speech and to generate a directional audio wavefront for a target listener at a specified location. 3. 
The apparatus according to claim 1, wherein the machine readable instructions to determine, based on the background noise at the near-end of the speaker, the noise estimate associated with the speech emitted from the speaker further comprise machine readable instructions to cause the processor to: determine, based on the background noise ...
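
The noise-versus-speech comparison in claim 1 can be sketched as a simple gain rule. Everything specific here is an assumption made for illustration: RMS is used as the level estimate, the "specified factor" is treated as a multiplier on the noise estimate that sets the target speech level, and the `max_gain` cap and all function names are hypothetical.

```python
import math

def rms(frame):
    """Root-mean-square level of a frame (assumed level estimate)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))

def privacy_gain(noise_frame, speech_frame, factor=2.0, max_gain=4.0):
    """Gain that places the speech level at `factor` times the noise estimate.

    Keeping speech only modestly above the background noise is one way to
    stay intelligible on-axis without being louder than necessary.
    """
    noise_level = rms(noise_frame)
    speech_level = rms(speech_frame)
    if speech_level == 0.0:
        return 1.0  # nothing to scale
    target_level = factor * noise_level
    return min(target_level / speech_level, max_gain)

def apply_gain(frame, gain):
    """Scale every sample of the frame by the gain."""
    return [s * gain for s in frame]
```

In a running system the gain would typically be recomputed per frame and smoothed over time; that part is left out of the sketch.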

Publication date: 17-06-2021

FREQUENCY EXTRACTION METHOD USING DJ TRANSFORM

Number: US20210183403A1
Author: Kim Dong Jin

A method, of which each step is performed by a computer, for extracting a frequency of an input sound according to an embodiment of the present disclosure comprises the steps of: modeling a plurality of springs which have natural frequencies different from each other and oscillate according to an input sound; calculating transient-state-pure-tone amplitudes of the plurality of modeled springs; calculating expected steady-state amplitudes of the plurality of modeled springs; calculating predicted pure-tone amplitudes based on the expected steady-state amplitudes; calculating filtered pure-tone amplitudes by multiplying the transient-state-pure-tone amplitudes with the predicted pure-tone amplitudes; and extracting the natural frequency of the spring which corresponds to a local maximum value among the filtered pure-tone amplitudes. 1. A method, of which each step is performed by a computer, for extracting a frequency of an input sound, comprising the steps of: modeling a plurality of springs which have natural frequencies different from each other and oscillate according to an input sound; calculating transient-state-pure-tone amplitudes of the plurality of modeled springs; calculating expected steady-state amplitudes of the plurality of modeled springs; calculating predicted pure-tone amplitudes based on the expected steady-state amplitudes; calculating filtered pure-tone amplitudes by multiplying the transient-state-pure-tone amplitudes with the predicted pure-tone amplitudes; and extracting the natural frequency of the spring which corresponds to a local maximum value among the filtered pure-tone amplitudes. 2. The method according to claim 1, wherein said expected steady-state amplitude is calculated based on the amplitudes at least two time points within a duration of the input sound. 4. The method according to claim 2, wherein a difference between the two different time points is a period of the natural frequency of the corresponding spring. 5. 
The method according ...

Publication date: 01-06-2017

VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Number: US20170155369A1

A volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal. 1. A loudness normalization method based upon target loudness, the method comprising: determining one or more dynamic gain parameters based upon a content type or context; and modifying a loudness of an audio signal by employing the selected gain parameters, wherein a resulting loudness level of an audio recording on playback is consistent over a timeline based on a target loudness value. 2. The loudness normalization method of claim 1, wherein the dynamic gain parameters are identified and applied in real time. 3. The loudness normalization method of claim 1, wherein the content type comprises speech, short-term music, noise and/or background sound. 4. The loudness normalization method of claim 1, wherein dialog enhancement is applied having an effect of making dialog more prominent within a particular context. 5. The loudness normalization method of claim 1, wherein a loudness equalization is applied to have an effect on one or more playback levels on a tonal balance. 6. The loudness normalization method of claim 1, wherein a parameter smoothing is applied to the dynamic gain parameters. 7. 
An apparatus configured to normalize loudness based upon target loudness claim 1 , comprising:at least one processor; and in which the at least one memory with the computer program is configured with the at least one processor to cause the audio processing apparatus to at least ...

Publication date: 08-06-2017

POST-PROCESSING GAINS FOR SIGNAL ENHANCEMENT

Number: US20170162212A1

A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with the smoothing factor possibly dependent on the gain delta. 1. An audio processing apparatus, comprising: at least one processor; and at least one memory storing a computer program; in which the at least one memory with the computer program is configured with the at least one processor to cause the audio processing apparatus to at least: receive at least one input audio signal, determine raw gains to carry out dynamic range control (DRC), and modify the DRC gains and apply them to the input audio signal to remove audible artifacts from the audio output signal. 2. The apparatus of claim 1, wherein the gains are applied to carry out dynamic equalization. 3. The apparatus of claim 1, wherein post-processing is applied to the DRC gains to carry out gain smoothing of the modified DRC gains. 4. The apparatus of claim 3, wherein a first order linear smoothing filter is employed as post processor. 5. The apparatus of claim 3, wherein a delta smoothing post processor is employed. 6. The apparatus of claim 5, wherein the smoothing is controlled according to the output of a voice activity detector (VAD). 7. The apparatus of claim 3, wherein the post-processing is applied when there is a ...
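
The two smoothing schemes named in the abstract can be sketched as below. This is an illustration under stated assumptions rather than the patented method: the linear mapping from gain delta to smoothing factor (`base + slope * delta`), the Wiener-style relation `gain = snr / (1 + snr)` used to convert between gain and signal-to-noise ratio, the fixed smoothing factor in the decision-directed variant, the seeding of the filters with the first raw gain, and all function names are hypothetical.

```python
def delta_gain_smoothing(raw_gains, base=0.2, slope=1.0):
    """First-order smoothing whose factor grows with the gain delta.

    The gain delta is |raw gain of the current frame - post-processed gain of
    the previous frame|. Small deltas are smoothed heavily; large deltas are
    tracked quickly.
    """
    smoothed, prev = [], None
    for g in raw_gains:
        if prev is None:
            prev = g  # seed the filter with the first raw gain
        delta = abs(g - prev)
        alpha = min(1.0, base + slope * delta)
        prev = prev + alpha * (g - prev)
        smoothed.append(prev)
    return smoothed

def gain_to_snr(gain, eps=1e-6):
    """Invert gain = snr / (1 + snr); the gain is clipped below 1 to stay finite."""
    g = min(max(gain, 0.0), 1.0 - eps)
    return g / (1.0 - g)

def snr_to_gain(snr):
    """Wiener-style gain for a given signal-to-noise ratio (assumed relation)."""
    return snr / (1.0 + snr)

def decision_directed_smoothing(raw_gains, alpha=0.5):
    """Smooth in the signal-to-noise-ratio domain, then convert back to a gain."""
    smoothed, prev_snr = [], None
    for g in raw_gains:
        snr = gain_to_snr(g)
        if prev_snr is None:
            prev_snr = snr
        prev_snr = prev_snr + alpha * (snr - prev_snr)
        smoothed.append(snr_to_gain(prev_snr))
    return smoothed
```

Smoothing in the signal-to-noise-ratio domain treats gain changes near 1 differently from changes near 0, which is the main practical difference from smoothing the gain directly.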

Publication date: 23-05-2019

NOISE SUPPRESSOR AND METHOD OF IMPROVING AUDIO INTELLIGIBILITY

Number: US20190156850A1
Assignee: SAMSUNG ELECTRONICS CO., LTD.

There is provided a noise suppressor comprising a receiver operable to receive an input audio signal and to produce from the input audio signal a first signal and a second signal, the input audio signal comprising desired audio and transmission end noise. The noise suppressor further comprises a first processor operable to perform a first process on the first signal, the first process comprising noise suppression to remove at least a portion of the transmission end noise from the first signal before outputting the first signal to a first audio channel. The noise suppressor further comprises a second processor operable to perform a second process on the second signal, the second process comprising outputting the second signal to a second audio channel. The first process comprises more aggressive noise suppression than the second process. 1. A noise suppressor comprising: a receiver operable to receive an input audio signal and to produce from the input audio signal a first signal and a second signal, the input audio signal comprising desired audio and transmission end noise; a first processor operable to perform a first process on the first signal, the first process comprising noise suppression to remove at least a portion of the transmission end noise from the first signal before outputting the first signal to a first audio channel; and a second processor operable to perform a second process on the second signal, the second process comprising outputting the second signal to a second audio channel. 2. The noise suppressor of claim 1, wherein the second process comprises noise suppression, and the noise suppression of the first process is more aggressive than the noise suppression of the second process. 3. The noise suppressor of claim 1, wherein the second process does not comprise noise suppression. 4. The noise suppressor of claim 1, wherein the first process further comprises introducing a time delay to the first signal before outputting the first signal to the first ...

Publication date: 23-05-2019

Enhanced De-Esser For In-Car Communication Systems

Number: US20190156855A1

Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications. 1. A method of deessing a speech signal , the method comprising:a) for each time frame of a speech signal presented to a speech processing system, analyzing a full spectral envelope to identify frequency content for deessing; andb) spectrally weighting the speech signal as a function of results of the analyzing.2. The method of claim 1 , wherein the analyzing includes calculating a psychoacoustic measure from the full spectral envelope.3. The method of claim 2 , wherein the analyzing further includes detecting sibilant sounds of the speech signal using the psychoacoustic measure.4. The method of claim 2 , wherein the psychoacoustic measure includes at least one of a measure of sharpness and a measure of roughness.5. The method of claim 2 , wherein the psychoacoustic measure includes a measure of sharpness claim 2 , and wherein the analyzing further includes calculating deesser weights based on the measure of sharpness.6. The method of claim 1 , wherein the spectrally weighting the speech signal includes applying ...

Publication date: 22-09-2022

METHOD AND SYSTEM FOR NORMALIZING PLATFORM-ADAPTIVE AUDIO

Number: US20220302892A1

A method for normalizing platform-adaptive audio includes encoding input video content and generating video stream data as original data to store the video stream data in storage; generating loudness metadata for audio data of the video content and storing the loudness metadata in the storage; receiving a request for the video content from a client; searching the storage for video stream data of the video content corresponding to the request, the loudness metadata, and a device profile corresponding to device information included in the request; and transmitting, to the client, a response including the video stream data, the loudness metadata, and the device profile that are found in the storage. 1. A platform adaptive audio normalization method performed by a computer device having at least one processor , the platform adaptive audio normalization method comprising:encoding input video content, generating video stream data as original data, and storing the video stream data in a storage;generating loudness metadata for audio data of the video content and storing the loudness metadata in the storage;receiving a request for the video content from a client;retrieving the video stream data of the video content corresponding to the request, the loudness metadata, and a device profile corresponding to device information included in the request from the storage; andtransmitting a response that includes the video stream data, the loudness metadata, and the device profile retrieved from the storage to the client.2. The platform adaptive audio normalization method of claim 1 , wherein the device profile includes an adjustment value that adjusts a normalization factor by analyzing at least one of a number claim 1 , positions claim 1 , and distances of audio output devices of a playback device based on audio that is output through the playback device for playing back the video content and is input at a preset playback position.3. The platform adaptive audio normalization ...

Publication date: 18-06-2015

Effective Pre-Echo Attenuation in a Digital Audio Signal

Number: US20150170668A1

A method is provided for processing pre-echo attenuation in a digital audio signal generated from a transform coding, wherein, at the decoding point, the method includes: detection of a position of attack in the decoded signal; determination of a pre-echo region preceding the position of attack detected in the decoded signal; calculation of attenuation factors per sub-block of the pre-echo region, according to at least the frame wherein the attack has been detected and the preceding frame; and pre-echo attenuation in the sub-blocks of the pre-echo region by the corresponding damping factors. The method also includes application of a filter for the spectral shaping of the pre-echo region on the current frame up to the detected position of the attack. A device and a decoder including the device are also provided for implementing the method.

1. A method of processing attenuation of pre-echo in a digital audio signal engendered on the basis of a transform-based coding, in which, on decoding, the method comprises the following performed by a processing device: detection of an attack position in the decoded signal; determination of a pre-echo zone preceding the attack position detected in the decoded signal; calculation of attenuation factors per sub-block of the pre-echo zone, as a function at least of the frame in which the attack has been detected and of the previous frame; attenuation of pre-echo in the sub-blocks of the pre-echo zone by the corresponding attenuation factors; and application of an adaptive filtering of spectral shaping of the pre-echo zone on the current frame until as far as the detected position of the attack.
2. The method as claimed in claim 1, wherein the method furthermore comprises calculation of at least one decision parameter regarding the filtering to be applied to the pre-echo zone and the adaptation of the coefficients of the filtering as a function of said at least one decision parameter.
3. The method as claimed in claim 2, wherein at least ...

Publication date: 24-06-2021

METHODS AND SYSTEM FOR CONTROLLING TACTILE CONTENT

Number: US20210191689A1

An audio system presented herein includes a transducer array, sensor array, and a controller. The controller controls tactile content imparted to a user via actuation of at least one transducer in the transducer array while presenting audio content to the user. The transducer array presents the audio content with the tactile content to the user. The audio system can be part of a headset.

1. An audio system comprising: at least one transducer configured to present acoustic content and tactile content to a user; and a controller configured to control the tactile content imparted to the user via actuation of the at least one transducer by altering spectral content of the acoustic content to improve perception of the tactile content, wherein the at least one transducer is further configured to present the acoustic content having the altered spectral content and the tactile content to the user.
2. The audio system of claim 1, wherein the controller is further configured to provide navigation instructions to the user using the tactile content.
3. The audio system of claim 2, wherein: the at least one transducer comprises at least one cartilage conduction transducer attached to a corresponding ear of the user; and the controller is further configured to selectively apply the tactile content to the corresponding ear via the at least one cartilage conduction transducer to provide the navigation instructions to the user.
4. The audio system of claim 1, wherein the controller is further configured to increase speech intelligibility for audio content presented to the user by controlling the tactile content, the audio content comprising the acoustic content and the tactile content.
5. The audio system of claim 1, wherein the controller is further configured to generate audio content having a defined level of a near field effect by controlling the tactile content, the audio content comprising the acoustic content and the tactile content.
6. The audio system of claim 1 ...

Publication date: 08-06-2017

MODIFICATION OF AUDIO SIGNAL BASED ON USER AND LOCATION

Number: US20170163813A1

In one aspect, a device includes a processor and storage accessible to the processor. The storage bears instructions executable by the processor to receive at least one audio signal, identify one or more of a user associated with at least one received audio signal and a location of the user, and modify at least one received audio signal based at least in part on identification of one or more of the user and the location.

1. A device, comprising: a processor; and storage accessible to the processor and bearing instructions executable by the processor to: receive at least one audio signal; identify one or more of a user associated with at least one received audio signal and a location of the user; and based at least in part on identification of one or more of the user and the location, modify at least one received audio signal by adjusting an accent of words spoken by the user from a first accent associated with a first geographic region to a second accent associated with a second geographic region different from the first geographic region.
2. The device of claim 1, wherein the device is a first device, and wherein the instructions are executable by the processor to: transmit the modified audio signal to a second device different from the first device.
3. The device of claim 1, comprising at least one speaker, wherein the instructions are executable by the processor to: present, using the speaker, audio output based on the audio signal.
4. The device of claim 1, wherein at least one received audio signal is modified using digital signal processing.
5. The device of claim 1, comprising a microphone, wherein the at least one audio signal is received from the microphone.
6. The device of claim 1, wherein the device is a first device, and wherein the at least one audio signal is received from a second device different from the first device.
7. (canceled)
8. The device of claim 1, wherein the instructions are executable by the processor to: ...

Publication date: 22-09-2022

Sound Field Related Rendering

Number: US20220303710A1

An apparatus for spatial audio reproduction including circuitry configured to: obtain at least one focus parameter configured to define a focus shape; process a spatial audio signal that represents an audio scene to generate a processed spatial audio signal that represents a modified audio scene, so as to control relative emphasis in, at least in part, a portion of the spatial audio signal in the focus shape relative to, at least in part, other portions of the spatial audio signals outside the focus shape; and output the processed spatial audio signal, wherein the modified audio scene enables the relative emphasis in, at least in part, the portion of the spatial audio signal in the focus shape relative to, at least in part, other portions of the spatial audio signals outside the focus shape.

1. An apparatus comprising at least one processor and at least one non-transitory memory including a computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to: obtain at least one focus parameter configured to define a focus shape; process a spatial audio signal that represents an audio scene to generate a processed spatial audio signal that represents a modified audio scene, so as to control relative emphasis in, at least in part, a portion of the spatial audio signal in the focus shape relative to at least in part other portions of the spatial audio signals outside the focus shape; and output the processed spatial audio signal, wherein the modified audio scene enables the relative emphasis in, at least in part, the portion of the spatial audio signal in the focus shape relative to at least in part other portions of the spatial audio signals outside the focus shape.
2. The apparatus according to claim 1, wherein at least one focus parameter is further configured to define a focus amount, and the at least one memory and the computer program code are configured to, ...

Publication date: 24-06-2021

Training a voice morphing apparatus

Number: US20210193159A1
Author: Steve Pearson
Assignee: SoundHound Inc

Systems and methods for training a voice morphing apparatus are described. The voice morphing apparatus is trained to morph input audio data to mask an identity of a speaker. Training is performed by evaluating an objective function that is a function of the input audio data and an output of the voice morphing apparatus. The objective function may have a first term that is based on speaker identification and a second term that is based on audio fidelity. By optimizing the objective function, parameters of the voice morphing apparatus may be adjusted so as to reduce a confidence of speaker identification and maintain an audio fidelity of the morphed audio data. The voice morphing apparatus, once trained, may be used as part of an automatic speech recognition system.

Publication date: 24-06-2021

ENHANCING AUDIO USING MULTIPLE RECORDING DEVICES

Number: US20210193180A1

In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for identifying that a first audio stream includes first, second, and third sources of audio. A computing system identifies that a second audio stream includes the first, second, and third sources of audio. The computing system determines that the first and second sources of audio are part of a first conversation. The computing system generates a third audio stream that combines the first source of audio from the first audio stream, the first source of audio from the second audio stream, the second source of audio from the first audio stream, and the second source of audio from the second audio stream, and diminishes the third source of audio from the first audio stream, and the third source of audio from the second audio stream.

1. A computer-implemented method for enhancing audio, the method comprising: receiving, using a hardware processor, an audio stream for playback on a media device; extracting, using the hardware processor, a first audio source, a second audio source, and a third audio source from the audio stream; determining, using the hardware processor, that a conversation between the first audio source and the second audio source occurs within a first portion of the audio stream and a second conversation between the first audio source and the third audio source occurs within a second portion of the audio stream at a second time point; and generating, using the hardware processor, an updated audio stream that enhances the first audio source and the second audio source extracted from the first portion of the audio stream and diminishes the third audio source extracted from the first portion of the audio stream and that enhances the first audio source and the third audio source extracted from the second portion of the audio stream and diminishes the second audio source extracted from the second portion of the audio stream.
2. The computer- ...

Publication date: 14-06-2018

VOICE SIGNAL PROCESSING APPARATUS AND VOICE SIGNAL PROCESSING METHOD

Number: US20180166090A1
Assignee: ACER INCORPORATED

A voice signal processing apparatus and a voice signal processing method are provided. A loudness of an input voice signal is detected to obtain a reference loudness. Reference loudness gains corresponding to frequency bands are calculated according to the reference loudness and wide dynamic range compression curves corresponding to the frequency bands. Loudnesses of filter signals of the frequency bands are adjusted according to the reference loudness gains of the frequency bands.

1. A voice signal processing apparatus, comprising: an input voice signal filter receiving an input voice signal and filtering the input voice signal to generate a plurality of filter signals of different frequency bands; and a processor detecting a loudness of the input voice signal to obtain a reference loudness, calculating reference loudness gains corresponding to the frequency bands according to the reference loudness and wide dynamic range compression curves corresponding to the frequency bands, multiplying the filter signals by the reference loudness gains corresponding to the filter signals to obtain a plurality of loudness adjusted filter signals corresponding to the frequency bands, and adding up the loudness adjusted filter signals to generate an output voice signal.
2. The voice signal processing apparatus according to claim 1, wherein the wide dynamic range compression curves are obtained by performing wide dynamic range compression processes corresponding to the frequency bands on a unit gain curve, and the processor further calculates the reference loudness gains according to first output loudnesses corresponding to the reference loudness on the wide dynamic range compression curves corresponding to the frequency bands and a second output loudness corresponding to the reference loudness on the unit gain curve.
3. The voice signal processing apparatus according to claim 1, wherein the processor further detects loudnesses of the filter signals to obtain a plurality of ...
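
The gain computation described in the abstract and in claim 2 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: RMS level in dB stands in for loudness, the compression curve is a simple threshold-and-ratio line, and every name here (`rms_db`, `wdrc_output_db`, `reference_gain`, `process`, `threshold_db`, `ratio`) is hypothetical.

```python
import math


def rms_db(samples):
    """Level of a signal in dB (RMS); an assumed stand-in for loudness."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20 * math.log10(max(rms, 1e-12))


def wdrc_output_db(level_db, threshold_db, ratio):
    """Assumed static WDRC curve: unit gain below the threshold, compressed above."""
    if level_db <= threshold_db:
        return level_db
    return threshold_db + (level_db - threshold_db) / ratio


def reference_gain(ref_db, threshold_db, ratio):
    """Linear gain for one band: WDRC output minus unit-gain output at ref_db."""
    gain_db = wdrc_output_db(ref_db, threshold_db, ratio) - ref_db
    return 10 ** (gain_db / 20)


def process(input_signal, band_signals, band_curves):
    """Scale each band by its reference gain, then sum the bands sample by sample."""
    ref_db = rms_db(input_signal)
    gains = [reference_gain(ref_db, t, r) for t, r in band_curves]
    return [
        sum(g * band[i] for g, band in zip(gains, band_signals))
        for i in range(len(input_signal))
    ]
```

Because every band gain is derived from the single reference loudness of the whole input, the band filters only have to split the signal; the per-band curves decide how much each band is scaled.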

Publication date: 16-06-2016

Audio Signal Processing Method and Apparatus and Differential Beamforming Method and Apparatus

Number: US20160173978A1
Author: Deming Zhang, Haiting Li
Assignee: Huawei Technologies Co Ltd

An audio signal processing method and apparatus and a differential beamforming method and apparatus are provided to resolve the problem that an existing audio signal processing system cannot process audio signals in multiple application scenarios at the same time. The method includes determining a super-directional differential beamforming weighting coefficient, acquiring an audio input signal and determining a current application scenario and an audio output signal, acquiring a weighting coefficient corresponding to the current application scenario, performing super-directional differential beamforming processing on the audio input signal using the acquired weighting coefficient in order to obtain a super-directional differential beamforming signal in the current application scenario, and performing processing on the formed signal to obtain a final audio signal required by the current application scenario. By using this method, a requirement that different application scenarios require different audio signal processing manners can be met.

Подробнее
28-05-2020 дата публикации

SPEECH SIGNAL PROCESSING METHOD AND APPARATUS

Номер: US20200168237A1
Автор: YUAN Haolei
Принадлежит:

A speech signal processing method is performed at a terminal device, including: obtaining a recorded signal and a to-be-output speech signal, the recorded signal including a noise signal and an echo signal; calculating a loop transfer function according to the recorded signal and the speech signal; calculating a power spectrum of the echo signal and a power spectrum of the noise signal according to the recorded signal, the speech signal, and the loop transfer function; calculating a frequency weighted coefficient according to the two power spectra of the echo signal and the noise signal; adjusting a frequency amplitude of the speech signal based on the frequency weighted coefficient; and outputting the adjusted speech signal to a speaker electrically coupled to the terminal device. As such, the frequency amplitude of the speech signal is automatically adjusted according to the relative frequency distribution of a noise signal and the speech signal.

1. A speech signal processing method performed at a terminal device having one or more processors, a microphone, a speaker, and memory storing one or more programs to be executed by the one or more processors, the method comprising: receiving, via an instant messaging application, a speech signal from a second terminal device, wherein the second terminal device is connected to the terminal device via a computer network; recording, via the microphone, an audio signal, the audio signal including a noise signal from an environment surrounding the terminal device and an echo signal from the speaker; calculating a loop transfer function using the recorded audio signal and the speech signal; calculating a power spectrum of the echo signal and a power spectrum of the noise signal using the recorded audio signal, the speech signal and the loop transfer function; calculating a frequency weighted coefficient according to the power spectrum of the echo signal and the power spectrum of the noise signal, wherein the frequency weighted ...

Publication date: 29-06-2017

Enhancing An Audio Recording

Number: US20170186463A1

A system and method are provided for enhancing an audio recording which comprises a recording of a sound signal obtained from the play-out of an audio signal via a speaker. The audio signal, and thereby the sound signal, may represent certain audio content, e.g., a radio station or TV audio. To perform the enhancing, the recording of the sound signal is suppressed using the audio signal, thereby obtaining an intermediate audio recording. An original version of the audio content is then added to the intermediate audio recording to obtain an enhanced audio recording. This original version is generally of higher quality as it generally does not represent a background audio component but rather was purposefully recorded or generated.

Publication date: 29-06-2017

Automated equalization

Number: US20170188148A1
Assignee: Intel Corporation

Techniques for improving speech recognition are described. An example of an electronic device includes an extracting unit to extract a reference spectral profile from a reference signal and a device spectral profile from a device signal. A comparing unit compares the reference spectral profile and the device spectral profile. A delta calculating unit calculates a delta between the reference spectral profile and the device spectral profile. A design unit designs a correction filter based on the computed delta.

1. An electronic device for improving speech recognition of a device under test (DUT), comprising: an extracting unit to extract a reference spectral profile from a reference signal and a DUT spectral profile from a DUT signal; a comparing unit to compare the reference spectral profile and the DUT spectral profile; a delta calculating unit to compute a delta between the reference spectral profile and the DUT spectral profile to obtain a computed delta; and a design unit to design a correction filter based on the computed delta.
2. The electronic device of claim 1, comprising a first calculating unit to calculate the reference signal from a set of recordings.
3. The electronic device of claim 1, wherein the design unit designs the correction filter using a plurality of recordings, and wherein the plurality of recordings are obtained from one or more devices.
4. The electronic device of claim 1, comprising an application unit to apply the correction filter to a microphone of the DUT.
5. The electronic device of claim 4, comprising an orientation sensor to determine an orientation of the DUT and employ an appropriate correction filter.
6. The electronic device of claim 4, comprising a proximity sensor to determine a distance from a user to the DUT and employ the appropriate correction filter.
7. The electronic device of claim 4, comprising an angle sensor to determine an angle between the user and the DUT and employ the appropriate correction filter.
8. The ...
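
The delta and correction-filter steps of claim 1 can be illustrated with a short sketch. It assumes that each spectral profile is a list of per-band levels in dB and that the correction filter is a set of per-band gains limited to a boost and cut range; the function names and the 12 dB limits are hypothetical and do not come from the patent.

```python
def spectral_delta(reference_db, dut_db):
    """Per-band difference (reference minus DUT), both profiles in dB."""
    if len(reference_db) != len(dut_db):
        raise ValueError("profiles must have the same number of bands")
    return [r - d for r, d in zip(reference_db, dut_db)]


def design_correction(delta_db, max_boost_db=12.0, max_cut_db=12.0):
    """Per-band correction gains in dB, clamped to an assumed safe range."""
    return [max(-max_cut_db, min(max_boost_db, d)) for d in delta_db]


reference = [0.0, -3.0, -6.0]   # reference spectral profile, dB per band
dut = [-2.0, -3.0, -20.0]       # profile measured on the device under test
correction = design_correction(spectral_delta(reference, dut))
```

Clamping is a design choice of this sketch: a very deep notch in the DUT response would otherwise call for a boost that mostly amplifies noise.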

Publication date: 18-09-2014

Apparatus and Method for Power Efficient Signal Conditioning for a Voice Recognition System

Number: US20140278393A1
Assignee: MOTOROLA MOBILITY LLC

A disclosed method includes monitoring an audio signal energy level while having a plurality of signal processing components deactivated and activating at least one signal processing component in response to a detected change in the audio signal energy level. The method may include activating and running a voice activity detector on the audio signal in response to the detected change where the voice activity detector is the at least one signal processing component. The method may further include activating and running the noise suppressor only if a noise estimator determines that noise suppression is required. The method may activate and run a noise type classifier to determine the noise type based on information received from the noise estimator and may select a noise suppressor algorithm, from a group of available noise suppressor algorithms, where the selected noise suppressor algorithm is the most power consumption efficient.
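
The staged activation described in this abstract can be sketched as a small gating function. The order of the stages, the thresholds, and all names (`condition_frame`, `change_ratio`, `noise_threshold`, the `handles` and `power_mw` fields) are assumptions made for illustration; the abstract does not specify them.

```python
def frame_energy(frame):
    """Mean squared amplitude of one audio frame."""
    return sum(s * s for s in frame) / len(frame)


def condition_frame(frame, baseline, vad, estimate_noise, classify_noise,
                    suppressors, change_ratio=2.0, noise_threshold=0.01):
    """Return (activated components, chosen suppressor name or None)."""
    active = []
    # Stage 0: only the cheap energy monitor runs; everything else stays off.
    if frame_energy(frame) < change_ratio * baseline:
        return active, None
    # Stage 1: an energy change wakes the voice activity detector.
    active.append("vad")
    if not vad(frame):
        return active, None
    # Stage 2: the noise estimator decides whether suppression is needed.
    active.append("noise_estimator")
    noise = estimate_noise(frame)
    if noise <= noise_threshold:
        return active, None
    # Stage 3: classify the noise from the estimate, then pick the
    # suitable suppressor with the lowest assumed power cost.
    active.append("noise_classifier")
    noise_type = classify_noise(noise)
    candidates = [s for s in suppressors if noise_type in s["handles"]]
    if not candidates:
        return active, None
    active.append("noise_suppressor")
    return active, min(candidates, key=lambda s: s["power_mw"])["name"]
```

Passing the detector, estimator, and classifier as callables keeps each stage from running until the previous one has asked for it, which is the point of the power-saving scheme.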

Publication date: 16-07-2015

Distributed beamforming based on message passing

Number: US20150200454A1
Assignee: Google LLC

Methods and systems are provided for implementing a distributed algorithm for beam-forming (e.g., MVDR beam-forming) using a message-passing algorithm. The message-passing algorithm provides for computations to be performed in a distributed manner across a network, rather than in a centralized processing center or “fusion center”. The message-passing algorithm may also function for any network topology, and may continue operations when various changes are made in the network (e.g., nodes appearing, nodes disappearing, etc.). Additionally, the message-passing algorithm may minimize the transmission power per iteration and, depending on the particular network, also may minimize the transmission power required for communication between network nodes.

Publication date: 05-07-2018

Personalized, real-time audio processing

Number: US20180190309A1
Assignee: eBay Inc

An apparatus and method for real-time audio processing employs a gaze detection sensor to detect a direction of a user's gaze and output a gaze signal corresponding to the detected direction of the user's gaze. A digital signal processing unit responds to a plurality of signals corresponding to a plurality of sounds received at the apparatus, and the determined direction of gaze to identify a signal of interest from the plurality of signals using the gaze signal. The signal of interest is processed for output to the user. In embodiments, a microphone array provides the plurality of signals. An imaging sensor may work with either the microphone array or the gaze detection sensor to identify the signal of interest.

Publication date: 06-07-2017

ADAPTIVE AUDITORY ALERTS

Number: US20170195499A1
Author: Eischeid Todd Michael

A method includes recording, at an electronic device utilizing a microphone of the electronic device, ambient noise of an environment the electronic device is disposed in; electronically analyzing, utilizing one or more processors, the recorded ambient noise of the environment to determine one or more frequency bands to avoid; dynamically adapting, based on the electronic analysis, an auditory alert to be played at the electronic device, such adaptation including frequency equalization adjustments based on the determination of one or more frequency bands to avoid; and playing, at the electronic device utilizing one or more speakers of the electronic device, the adapted auditory alert.

1. A method comprising: (a) recording, at an electronic device utilizing a microphone of the electronic device, ambient noise of an environment the electronic device is disposed in; (b) electronically analyzing, utilizing one or more processors, the recorded ambient noise of the environment to determine one or more frequency bands to avoid; (c) dynamically adapting, based on the electronic analysis, an auditory alert to be played at the electronic device, such adaptation including frequency equalization adjustments based on the determination of one or more frequency bands to avoid; and (d) playing, at the electronic device utilizing one or more speakers of the electronic device, the adapted auditory alert.
2. The method of claim 1, wherein the auditory alert comprises a ringtone.
3. The method of claim 1, wherein the auditory alert comprises a song.
4. The method of claim 1, wherein the auditory alert comprises an alarm.
5. The method of claim 1, wherein the electronic device comprises a cell phone.
6. The method of claim 1, wherein the electronic device comprises a mobile phone.
7. The method of claim 1, wherein the electronic device comprises a tablet.
8. The method of claim 1, wherein the electronic device comprises a beeper.
9. The method of claim 1, wherein electronically analyzing ...
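
Steps (b) and (c) of claim 1 can be sketched as follows. The rule used here to choose bands to avoid (ambient level at least `margin_db` above the mean band level) and the cut and boost values are assumptions chosen for illustration; the claim does not state how the bands are selected or how the equalization is set.

```python
def bands_to_avoid(noise_db, margin_db=6.0):
    """Indices of bands whose ambient noise exceeds the mean level by margin_db."""
    mean = sum(noise_db) / len(noise_db)
    return [i for i, level in enumerate(noise_db) if level >= mean + margin_db]


def alert_eq(noise_db, cut_db=-9.0, boost_db=3.0, margin_db=6.0):
    """Per-band equalizer gains for the alert: cut masked bands, boost the rest."""
    avoid = set(bands_to_avoid(noise_db, margin_db))
    return [cut_db if i in avoid else boost_db for i in range(len(noise_db))]


ambient = [-40.0, -38.0, -10.0, -42.0]  # measured noise level per band, dB
gains = alert_eq(ambient)
```

In this example the third band carries most of the ambient noise, so the alert's energy is shifted into the other three bands where it is less likely to be masked.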

Publication date: 20-06-2019

MULTI-CHANNEL SPEECH ENHANCEMENT

Number: US20190189144A1
Author: Dusan Sorin V.

Speech enhancers suppress impairments in an acoustic signal. An audio appliance has a first microphone and a second microphone. The first microphone provides a first signal, and the second microphone provides a second signal. A voice-activity detector can determine a presence of user speech responsive to a combination of voice-activity cues, including a first level difference between the first signal and the second signal within a first frequency band, and a second level difference between the first signal and the second signal within a second frequency band. A noise suppressor suppresses impairments originating from a direction of, e.g., up to about 75-degrees from an axis extending from the second microphone to the first microphone. An output device can output a noise-suppressed output-signal corresponding to a determined presence or absence of speech by the voice-activity detector. The impairments can be suppressed by, e.g., between about 3 dB and about 20 dB.

1. An audio appliance, comprising: a first microphone transducer to provide a first acoustic signal; a second microphone transducer to provide a second acoustic signal, wherein the first microphone transducer and the second microphone transducer are spaced apart from each other and define a longitudinal axis; a voice-activity detector configured to determine a presence or an absence of user speech responsive to a combination of voice-activity cues comprising, a first level difference between the first acoustic signal and the second acoustic signal within a first frequency band, and a second level difference between the first acoustic signal and the second acoustic signal within a second frequency band; and a noise suppressor configured, responsive to a determined presence of speech by the voice-activity detector, to suppress in a noise-suppressed output-signal impairments originating from a direction of up to about 75-degrees from the longitudinal axis by between about 3 dB and about 20 dB; and an output device ...

Publication date: 22-07-2021

SPEECH COMMUNICATION SYSTEM AND METHOD FOR IMPROVING SPEECH INTELLIGIBILITY

Number: US20210225388A1

A speech communication system for improving speech intelligibility may comprise one or more processors; and a memory storing instructions that, when executed by the one or more processors, cause the system to perform: determining a cutoff frequency based on an estimation of a spectrum of noise, wherein the cutoff frequency defines a noise dominant region of frequency; lifting a spectrum of a speech above the noise dominant region of frequency, wherein a frequency range of the spectrum of the speech increases by the cutoff frequency; and applying an adaptive filter to the speech to achieve echo cancelation, wherein the adaptive filter is controlled by a volume of the noise.

1. A speech communication system for improving speech intelligibility, comprising: one or more processors; and a memory storing instructions that, when executed by the one or more processors, cause the system to perform: determining a cutoff frequency based on an estimation of a spectrum of noise, wherein the cutoff frequency defines a noise dominant region of frequency; lifting a spectrum of a speech above the noise dominant region of frequency, wherein a frequency range of the spectrum of the speech increases by the cutoff frequency; and applying an adaptive filter to the speech to achieve echo cancelation, wherein the adaptive filter is controlled by a volume of the noise.
2. The system according to claim 1, wherein determining the cutoff frequency based on the estimation of the spectrum of the noise comprises: receiving a sound signal through a microphone of the system; estimating the spectrum of the noise in the sound signal; estimating a Signal-Noise-Ratio (SNR) of the sound signal; and determining the cutoff frequency based on the spectrum of the noise and the SNR.
3. The system of claim 2, wherein the SNR is an instantaneous SNR, and wherein the instantaneous SNR is smoothed over frames of the sound signal and adjacent sub-bands of frequency.
4. The system of claim 3, wherein determining ...

Publication date: 22-07-2021

METHODS FOR MEASURING SPEECH INTELLIGIBILITY, AND RELATED SYSTEMS AND APPARATUS

Number: US20210225389A1

In a method for efficiently and accurately measuring the intelligibility of speech, a user may utter a sample text, and an automatic speech assessment (ASA) system may receive an acoustic signal encoding the utterance. An automatic speech recognition (ASR) module may generate an N-best output corresponding to the utterance and generate an intelligibility score representing the intelligibility of the utterance based on the N-best output and the sample text. Generating the intelligibility score may involve (1) calculating conditional intelligibility value(s) for the N recognition result(s), and (2) determining the intelligibility score based on the conditional intelligibility value of the most intelligible recognition result. Optionally, the process of generating the intelligibility score may involve adjusting the intelligibility score to account for environmental information (e.g., a pronunciation score for the user's speech and/or a confidence score assigned to the 1-best recognition result). N may be greater than or equal to 2.

1. A speech intelligibility scoring method, comprising: receiving an acoustic signal encoding an utterance of a user, wherein the utterance comprises a verbalization of a sample text by the user; generating, by an automatic speech recognition (ASR) module, an N-best output corresponding to the utterance, wherein the N-best output comprises N recognition results generated by the ASR module for the utterance, wherein N is a positive integer; and generating an intelligibility score representing an intelligibility of the utterance based, at least in part, on the N-best output and the sample text, wherein: (1) N is at least 2, (2) the generating of the intelligibility score is further based on a confidence score, wherein the confidence score indicates a probability that a particular one of the recognition results is a correct transcription of the utterance, and/or (3) the generating of the intelligibility score is further based on a pronunciation ...
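
The two-step scoring described in the abstract (a conditional intelligibility value per recognition result, then the value of the most intelligible result) can be sketched as below. The similarity measure is an assumption: word-level matching with Python's `difflib.SequenceMatcher` stands in for whatever measure the patent uses, and scaling by a confidence score is only one possible form of the optional adjustment. All function names are hypothetical.

```python
from difflib import SequenceMatcher


def conditional_intelligibility(hypothesis, sample_text):
    """Word-level similarity in [0, 1] between one recognition result and the text."""
    hyp_words = hypothesis.lower().split()
    ref_words = sample_text.lower().split()
    return SequenceMatcher(None, hyp_words, ref_words).ratio()


def intelligibility_score(n_best, sample_text, confidence=None):
    """Score of the most intelligible of the N recognition results."""
    if not n_best:
        raise ValueError("n_best must contain at least one recognition result")
    best = max(conditional_intelligibility(h, sample_text) for h in n_best)
    # Assumed adjustment: scale by the recognizer's confidence when it is given.
    return best if confidence is None else best * confidence


n_best = ["the cat sat on a mat", "the cat sat on the mat"]
score = intelligibility_score(n_best, "The cat sat on the mat")
```

Taking the maximum over the N-best list means a speaker is not penalized when the recognizer's top hypothesis is wrong but a lower-ranked hypothesis matches the sample text.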

Publication date: 25-09-2014

Systems and methods for enhancing place-of-articulation features in frequency-lowered speech

Number: US20140288938A1
Author: Ying-Yee Kong
Assignee: Northeastern University Boston

To improve the intelligibility of speech for users with high-frequency hearing loss, the present systems and methods provide an improved frequency lowering system with enhancement of spectral features responsive to place-of-articulation of the input speech. High frequency components of speech, such as fricatives, may be classified based on one or more features that distinguish place of articulation, including spectral slope, peak location, relative amplitudes in various frequency bands, or a combination of these or other such features. Responsive to the classification of the input speech, a signal or signals may be added to the input speech in a frequency band audible to the hearing-impaired listener, said signal or signals having predetermined distinct spectral features corresponding to the classification, and allowing a listener to easily distinguish various consonants in the input.

Publication date: 12-07-2018

ACOUSTIC PARAMETER ADJUSTMENT

Number: US20180197560A1
Assignee:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for adjusting acoustic parameters. In one aspect, a method includes receiving an identifier associated with an enclosure for a computing device, transmitting data identifying the identifier associated with the enclosure for the computing device, and receiving one or more physical parameters of the enclosure for the computing device. The method also includes, based on the one or more physical parameters of the enclosure for the computing device, determining one or more acoustic parameter adjustments of the computing device in the enclosure, the one or more acoustic parameter adjustments being configured to preserve one or more acoustic characteristics of the computing device out of the enclosure while the computing device is in the enclosure, and, based on the one or more acoustic parameter adjustments, adjusting the one or more acoustic parameters of the computing device.

1. A computer-implemented method, comprising:
receiving, by a computing device, an identifier associated with an enclosure for the computing device;
transmitting, by the computing device, data identifying the identifier associated with the enclosure for the computing device;
receiving, by the computing device, one or more physical parameters of the enclosure for the computing device;
based on the one or more physical parameters of the enclosure for the computing device, determining, by the computing device, one or more acoustic parameter adjustments of the computing device in the enclosure, the one or more acoustic parameter adjustments being configured to preserve one or more acoustic characteristics of the computing device out of the enclosure while the computing device is in the enclosure; and
based on the one or more acoustic parameter adjustments, adjusting, by the computing device, the one or more acoustic parameters of the computing device.

2. The method of claim 1, comprising: determining that a use ...

Publication date: 21-07-2016

DEVICE FOR LANGUAGE PROCESSING ENHANCEMENT IN AUTISM

Number: US20160210872A1
Assignee:

Methods and devices can enhance language processing in an autism spectrum disorder (ASD) individual through auditory manipulation of an auditory stream. The auditory stream is received and includes an acoustic stimulus perceptually representing an object. An acoustic manipulation parameter for a predetermined acoustic detail characteristic is selected. The predetermined acoustic detail characteristic is associated with the ASD individual and is based on a measured language processing capability of the ASD individual. The auditory stream is modified based on the selected parameter, to reduce the predetermined acoustic detail characteristic while preserving a lexicality of the stimulus, such that the reduced acoustic detail characteristic enhances perception of the object by the ASD individual even when the stimulus includes two or more acoustically distinct stimuli each perceptually representing the object. The modified auditory stream is output to the ASD individual via at least one loudspeaker.

1. A method of auditory manipulation of an auditory stream for enhancement of language processing in an autism spectrum disorder (ASD) individual, the method comprising:
receiving, by a processor, the auditory stream, the auditory stream including an acoustic stimulus perceptually representing an object;
selecting an acoustic manipulation parameter for a predetermined acoustic detail characteristic, the predetermined acoustic detail characteristic associated with the ASD individual and based on a measured language processing capability of the ASD individual;
modifying, by the processor, the auditory stream based on the selected acoustic manipulation parameter, to reduce the predetermined acoustic detail characteristic while preserving a lexicality of the stimulus, such that the reduced acoustic detail characteristic enhances perception of the object by the ASD individual even when the stimulus includes two or more acoustically distinct stimuli each perceptually representing ...

Publication date: 29-07-2021

PITCH EMPHASIS APPARATUS, METHOD, PROGRAM, AND RECORDING MEDIUM FOR THE SAME

Number: US20210233549A1

As pitch enhancement processing, a pitch enhancement apparatus obtains, for a time segment judged to be a time segment including a signal that is a consonant, for each time of the time segment, as an output signal, a signal including a signal obtained by adding a signal, which was obtained by multiplying a signal at a time that is an earlier time than the time by the number of samples T₀ corresponding to a pitch period of the time segment, the pitch gain σ₀ of the time segment, a predetermined constant B₀, and a value that is greater than 0 and less than 1, and a signal at the time.

1. A pitch enhancement apparatus that obtains an output signal by performing, for each time segment, pitch enhancement processing on a signal derived from an input audio signal, the pitch enhancement apparatus comprising:
processing circuitry configured to perform, as the pitch enhancement processing,
for a time segment judged to be a time segment including the signal that is a consonant, for each time of the time segment, processing to obtain, as an output signal, a signal including a signal obtained by adding
a signal, which was obtained by multiplying the signal at a time that is an earlier time than the time by the number of samples T₀ corresponding to a pitch period of the time segment, pitch gain σ₀ of the time segment, a predetermined constant B₀, and a value that is greater than 0 and less than 1, and
the signal at the time, and
for a time segment judged to be a time segment including the signal that is not a consonant, for each time of the time segment, processing to obtain, as an output signal, a signal including a signal obtained by adding
a signal, which was obtained by multiplying the signal at a time that is an earlier time than the time by the number of samples T₀ corresponding to a pitch period of the time segment, pitch gain σ₀ of the time segment, and a predetermined constant B₀, and
the signal at the time.

2. A ...
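
The per-sample operation in claim 1 can be sketched as below. This is a hedged illustration rather than the patented implementation: the function name `enhance_segment`, the parameter name `gamma` for the "value greater than 0 and less than 1", its default of 0.5, and the choice to treat samples before the start of the signal as zero are all assumptions. The claim says the output "includes" this sum, so any further scaling is not shown.

```python
from typing import List


def enhance_segment(
    signal: List[float],
    start: int,
    end: int,
    t0: int,
    sigma0: float,
    b0: float,
    is_consonant: bool,
    gamma: float = 0.5,
) -> List[float]:
    """Add a weighted copy of the sample one pitch period (t0 samples) earlier.

    For consonant segments the weight is b0 * sigma0 * gamma, which is
    smaller than the weight b0 * sigma0 used for other segments.
    """
    if not 0.0 < gamma < 1.0:
        raise ValueError("gamma must be greater than 0 and less than 1")
    weight = b0 * sigma0 * (gamma if is_consonant else 1.0)
    out = []
    for n in range(start, end):
        # Assumption: samples before the start of the signal count as zero.
        past = signal[n - t0] if n - t0 >= 0 else 0.0
        out.append(signal[n] + weight * past)
    return out


x = [1.0, 2.0, 3.0, 4.0]
print(enhance_segment(x, 2, 4, t0=2, sigma0=0.5, b0=1.0, is_consonant=True))   # [3.25, 4.5]
print(enhance_segment(x, 2, 4, t0=2, sigma0=0.5, b0=1.0, is_consonant=False))  # [3.5, 5.0]
```

The extra factor below 1 weakens the emphasis in consonant segments, where the pitch estimate is less reliable than in voiced segments.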
