Настройки

Укажите год
-

Небесная энциклопедия

Космические корабли и станции, автоматические КА и методы их проектирования, бортовые комплексы управления, системы и средства жизнеобеспечения, особенности технологии производства ракетно-космических систем

Подробнее
-

Мониторинг СМИ

Мониторинг СМИ и социальных сетей. Сканирование интернета, новостных сайтов, специализированных контентных площадок на базе мессенджеров. Гибкие настройки фильтров и первоначальных источников.

Подробнее

Форма поиска

Поддерживает ввод нескольких поисковых фраз (по одной на строку). При поиске обеспечивает поддержку морфологии русского и английского языка
Ведите корректный номера.
Ведите корректный номера.
Ведите корректный номера.
Ведите корректный номера.
Укажите год
Укажите год

Применить Всего найдено 129114. Отображено 100.
10-03-2007 дата публикации

СТАТИСТИЧЕСКАЯ МОДЕЛЬ РЕЧИ

Номер: RU0000061924U1

1. Статистическая модель речи, включающая интерфейсный блок, соединенный соответствующими входами и выходами с блоком выбора, формирующим выборку дикторов, с блоком выбора звуков, осуществляющим выбор звуков и определение их параметров, с блоком формирования речевого потока, осуществляющим действия над элементами речевых сигналов и с базой данных, содержащей описания типовых дикторов, которая также соединена со входами указанных блоков, отличающаяся тем, что в блок выбора дикторов дополнительно включен модуль статистики параметров населения, между блоком выбора диктора и блоком выбора звуков включен блок выборки типовых дикторов, а между блоком выбора звуков и блоком формирования речевого потока дополнительно включен блок хранения просодики, причем в блок выбора звуков дополнительно введены модули правил именования аллофонов и следования звуков. 2. Статистическая модель речи по п.1, отличающаяся тем, что блок выбора диктора состоит из модулей: генератора выборки дикторов и статистики параметров населения, причем выход модуля статистики параметров населения соединен с входом генератора выборки дикторов, выход которого соединен с входом блока выборки типовых дикторов. 3. Статистическая модель речи п.1, отличающаяся тем, что блок выбора звуков состоит из последовательно соединенных модулей: формирования цепочек, приписывания интонационного контура, именования аллофонов, определения длительности, наложения интонационных контуров, а также модулей правил следования звуков и правил именования аллофонов, причем выходы двух последних модулей соединены с дополнительными входами модуля формирования цепочек и модуля именования аллофонов, а дополнительный выход модуля именования аллофонов соединен с выходом модуля наложения интонационных контуров, выходы которого соединены с блоком хранения просодики и интерфейсным блоком. 4. Статистическая модель речи по п.1, отличающаяся тем, что блок формирования речевого потока содержит последовательно-соединенные модули: формирования ...

Подробнее
10-04-2014 дата публикации

ВОССТАНОВИТЕЛЬ ОГИБАЮЩЕЙ У КЛИППИРОВАННОГО РЕЧЕВОГО СИГНАЛА ЖЕЛЕЗНОДОРОЖНОЙ РАДИОСВЯЗИ

Номер: RU0000139093U1

Восстановитель огибающей у клиппированного речевого сигнала железнодорожной радиосвязи, состоящего из последовательно подключённых к выходу приёмника ЧM радиосигналов блоков: дискретизатора по времени знакопеременного клиппированного речевого сигнала (PC), двустороннего усилителя-ограничителя амплитуды сигнала, интегратора по времени, ФНЧ, а также из усилителя звуковых сигналов (УЗЧ) с подключённым к его выходу громкоговорителя, генератора импульсов дискретизации, подключённого к высокочастотному входу дискретизатора, отличающийся тем, что в него дополнительно введён дифференциатор по времени, подключённый своим входом к выходу ФНЧ, выходом - к УЗЧ. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (13) 139 093 U1 (51) МПК G10L 21/00 (2013.01) B61L 29/00 (2006.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ОПИСАНИЕ (21)(22) Заявка: ПОЛЕЗНОЙ МОДЕЛИ К ПАТЕНТУ 2013141838/08, 13.09.2013 (24) Дата начала отсчета срока действия патента: 13.09.2013 (72) Автор(ы): Волков Анатолий Алексеевич (RU) Приоритет(ы): (22) Дата подачи заявки: 13.09.2013 (45) Опубликовано: 10.04.2014 Бюл. № 10 R U (73) Патентообладатель(и): Федеральное государственное бюджетное образовательное учреждение высшего профессионального образования "Московский государственный университет путей сообщения" МГУПС (МИИТ) (RU) Адрес для переписки: 127994, Москва, ул. Образцова, 9, стр. 9, МИИТ U 1 1 3 9 0 9 3 R U Стр.: 1 U 1 Формула полезной модели Восстановитель огибающей у клиппированного речевого сигнала железнодорожной радиосвязи, состоящего из последовательно подключённых к выходу приёмника ЧM радиосигналов блоков: дискретизатора по времени знакопеременного клиппированного речевого сигнала (PC), двустороннего усилителя-ограничителя амплитуды сигнала, интегратора по времени, ФНЧ, а также из усилителя звуковых сигналов (УЗЧ) с подключённым к его выходу громкоговорителя, генератора импульсов дискретизации, подключённого к высокочастотному входу дискретизатора, отличающийся тем, что в него дополнительно введён ...

Подробнее
10-02-2015 дата публикации

СИСТЕМА ОПРЕДЕЛЕНИЯ ПОДЛИННОСТИ ФОНОГРАММ

Номер: RU0000150244U1

1. Система определения подлинности фонограмм, включающая модуль ввода звукового сигнала, который соединен с модулем для аналого-цифрового преобразования, который соединен с модулем определения подлинности фонограмм, осуществляющим многоуровневое вейвлет-преобразование сигнала и визуализацию результатов анализа. 2. Система по п. 1, в которой модулем ввода звукового сигнала является микрофон для записи речевых сигналов идентифицируемого диктора. 3. Система по п. 1, в которой модулем ввода звукового сигнала является телефонный канал сети общего пользования. 4. Система по п. 1, в которой модулем ввода звукового сигнала является сотовый канал связи. 5. Система по п. 1, в которой модулем ввода звукового сигнала является устройство воспроизведения аналоговой магнитной записи. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (51) МПК G10L 21/00 (13) 150 244 U1 (2013.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ОПИСАНИЕ (21)(22) Заявка: ПОЛЕЗНОЙ МОДЕЛИ К ПАТЕНТУ 2013158829/08, 30.12.2013 (24) Дата начала отсчета срока действия патента: 30.12.2013 (45) Опубликовано: 10.02.2015 Бюл. № 4 1 5 0 2 4 4 R U Формула полезной модели 1. Система определения подлинности фонограмм, включающая модуль ввода звукового сигнала, который соединен с модулем для аналого-цифрового преобразования, который соединен с модулем определения подлинности фонограмм, осуществляющим многоуровневое вейвлет-преобразование сигнала и визуализацию результатов анализа. 2. Система по п. 1, в которой модулем ввода звукового сигнала является микрофон для записи речевых сигналов идентифицируемого диктора. 3. Система по п. 1, в которой модулем ввода звукового сигнала является телефонный канал сети общего пользования. 4. Система по п. 1, в которой модулем ввода звукового сигнала является сотовый канал связи. 5. Система по п. 1, в которой модулем ввода звукового сигнала является устройство воспроизведения аналоговой магнитной записи. Стр.: 1 U 1 U 1 (54) СИСТЕМА ОПРЕДЕЛЕНИЯ ПОДЛИННОСТИ ФОНОГРАММ 1 5 0 2 4 4 Адрес для ...

Подробнее
06-04-2017 дата публикации

УСТРОЙСТВО СЖАТИЯ АУДИОСИГНАЛА ДЛЯ ПЕРЕДАЧИ ПО КАНАЛАМ РАСПРОСТРАНЕНИЯ ДАННЫХ

Номер: RU0000169931U1

Полезная модель относится к устройству для достижения качественного представления звука в средах с высоким уровнем шума, в частности к устройству для сжатия аудио-данных, содержащих по меньшей мере один аудио-сигнал. Техническим результатом является повышение эффективности сжатия аудио-контента. Устройство для сжатия аудио-данных содержит последовательно соединенные: блок обработки и хранения аудио-данных, дельта-сигма модулятор с балансировкой заряда, цифровой интегратор, цифровой фильтр низких частот, компрессор, осуществляющий сжатие упомянутого аудио-сигнала посредством ослабления высокоамплитудных характеристик аудио-сигнала и усиления низкоамплитудных характеристик аудио-сигнала, а также модуль обратной связи, вход которого подключен к выходу цифрового интегратора, фазовый детектор, подключенный к выходу модуля обратной связи, и дифференциатор, вход которого подключен к выходу фазового детектора, а выход – к дельта-сигма модулятору, приемопередатчик, подключенный к компрессору и двухсторонней связью к блоку обработки и хранения аудио-данных. 169931 И 1 ко РОССИЙСКАЯ ФЕДЕРАЦИЯ ЭВо“” 169 931“ Ц4 ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ИЗВЕЩЕНИЯ К ПАТЕНТУ НА ПОЛЕЗНУЮ МОДЕЛЬ ММ9К Досрочное прекращение действия патента из-за неуплаты в установленный срок пошлины за поддержание патента в силе Дата прекращения действия патента: 03.11.2020 Дата внесения записи в Государственный реестр: 24.09.2021 Дата публикации и номер бюллетеня: 24.09.2021 Бюл. №27 Стр.: 1 р е6б9ру па ЕП

Подробнее
30-05-2017 дата публикации

УНИВЕРСАЛЬНЫЙ УСИЛИТЕЛЬ МОЩНОСТИ СИГНАЛОВ ЗВУКОВОЙ ЧАСТОТЫ

Номер: RU0000171417U1

Полезная модель относится к звукоусилительной технике, в частности к усилителям мощности сигналов звуковой частоты, выполненных по монолитной, дискретной или смешанной технологиям. Усилитель мощности нагружен на восемь громкоговорителей, четыре из которых попарно началами своих звуковых катушек подключены к неинвертирующим выходам обоих каналов, другие выводы катушек в каждой паре громкоговорителей подключены один к общему проводу, другой - к свободному выводу громкоговорителя из другой пары, а четыре других громкоговорителя попарно концами своих звуковых катушек подключены к инвертирующим выходам каналов, другие выводы катушек в каждой паре громкоговорителей подключены один к общему проводу, другой - к свободному выводу громкоговорителя из другой пары. Технический результат заключается в улучшении качественных показателей звуковоспроизведения в помещениях большого объема. 1 ил. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (13) 171 417 U1 (51) МПК H03F 3/181 (2006.01) G10L 21/02 (2013.01) H04R 3/12 (2006.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ОПИСАНИЕ ПОЛЕЗНОЙ МОДЕЛИ К ПАТЕНТУ (21)(22) Заявка: 2016152802, 30.12.2016 (24) Дата начала отсчета срока действия патента: 30.12.2016 30.05.2017 Приоритет(ы): (22) Дата подачи заявки: 30.12.2016 зал на колесах, Москва, МДК Пресс, 2000, с. 209, рис. 7.5. RU 164500 U1, 10.09.2016. RU 2575883 C2, 20.02.2016. RU 2568314 C2, 20.11.2015. RU 45218 U1, 27.04.2005. US 2016/ 0157018 A1, 02.06.2016. R U (54) УНИВЕРСАЛЬНЫЙ УСИЛИТЕЛЬ МОЩНОСТИ СИГНАЛОВ ЗВУКОВОЙ ЧАСТОТЫ (57) Реферат: Полезная модель относится к проводу, другой - к свободному выводу звукоусилительной технике, в частности к громкоговорителя из другой пары, а четыре усилителям мощности сигналов звуковой других громкоговорителя попарно концами своих частоты, выполненных по монолитной, звуковых катушек подключены к инвертирующим дискретной или смешанной технологиям. выходам каналов, другие выводы катушек в Усилитель мощности нагружен на восемь каждой паре ...

Подробнее
08-06-2017 дата публикации

Домофон с сенсорным экраном и беспроводной связью

Номер: RU0000171655U1

Домофон с сенсорным экраном и беспроводной связью (1) состоит из устройства обработки, хранения и передачи информации (2), модуля ввода-вывода информации (3) с сенсорным экраном, динамиком и микрофоном, модуля идентификации пользователей (4), модуля управления исполнительными устройствами (5) и модуля абонентской связи (6). Модуль абонентской связи представляет собой устройство беспроводной связи, которое позволяет использовать в качестве абонентских устройств обычные смартфоны, планшетные компьютеры или ноутбуки с помощью соответствующего программного обеспечения. Устройство позволяет обеспечить ввод и чтение информации непосредственно на экране домофона, что вместе с использованием в качестве абонентских устройств различных электронных гаджетов существенно расширяет его функциональные возможности, упрощает монтаж и повышает уровень безопасности. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (13) 171 655 U1 (51) МПК G08B 3/10 (2006.01) G10L 21/0356 (2013.01) G06K 9/00 (2006.01) G06F 3/041 (2006.01) H04H 60/91 (2008.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ОПИСАНИЕ ПОЛЕЗНОЙ МОДЕЛИ К ПАТЕНТУ (21)(22) Заявка: 2015150095, 23.11.2015 (24) Дата начала отсчета срока действия патента: 23.11.2015 (72) Автор(ы): Щетинин Сергей Геннадьевич (RU) (73) Патентообладатель(и): Щетинин Сергей Геннадьевич (RU) Дата регистрации: Приоритет(ы): (22) Дата подачи заявки: 23.11.2015 (66) Номер(а) и дата(ы) подачи ранее поданной(ых) заявки(ок): 2015146319 28.10.2015 48087 U1, 10.09.2005. RU 2281614 C1, 10.08.2006. RU 2251155 C1, 27.04.2005. RU 2554549 C2, 27.06.2015. RU 2012146013 A, 10.05.2014. EP 767571 B1, 08.09.2004. US 6504479 B1, 07.012.2003. 1 7 1 6 5 5 R U (54) Домофон с сенсорным экраном и беспроводной связью (57) Реферат: Домофон с сенсорным экраном и обычные смартфоны, планшетные компьютеры беспроводной связью (1) состоит из устройства или ноутбуки с помощью соответствующего обработки, хранения и передачи информации (2), программного обеспечения. модуля ввода-вывода ...

Подробнее
13-07-2017 дата публикации

УСТРОЙСТВО СИНХРОННОГО СБОРА ДАННЫХ С МАССИВА MEMS МИКРОФОНОВ C PDM ИНТЕРФЕЙСОМ

Номер: RU0000172596U1

Полезная модель относится к устройствам сбора данных и к компонентам электронно-вычислительных машин, обеспечивающих измерение и обработку акустической информации. Техническим результатом заявленного решения является повышение величины соотношения сигнал/шум (SNR) в данных, синхронно собираемых и обрабатываемых с по меньшей мере двух MEMS микрофонов с интерфейсом PDM. Для обеспечения указанного технического результата было разработано устройство сбора данных с по меньшей мере двух MEMS микрофонов с интерфейсом PDM, содержащее по меньшей мере два MEMS микрофона с интерфейсом PDM; блок DFSDM, информационные входы (DATA) которого соединены с MEMS микрофонами; и источник тактирования, соединенный с MEMS микрофонами и блоком DFSDM; причем блок DFSDM выполнен с возможностью синхронного сбора и синхронной обработки информации с упомянутых MEMS микрофонов через интерфейс PDM; блок хранения данных, выполненный с возможностью хранения результатов обработки информации с упомянутых MEMS микрофонов. И 1 172596 ко РОССИЙСКАЯ ФЕДЕРАЦИЯ 7 ВУ’? 172 596? 1 ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ИЗВЕЩЕНИЯ К ПАТЕНТУ НА ПОЛЕЗНУЮ МОДЕЛЬ ММ9К Досрочное прекращение действия патента из-за неуплаты в установленный срок пошлины за поддержание патента в силе Дата прекращения действия патента: 02.06.2019 Дата внесения записи в Государственный реестр: 20.03.2020 Дата публикации и номер бюллетеня: 20.03.2020 Бюл. №8 Стр.: 1 па бас др ЕП

Подробнее
27-09-2017 дата публикации

АУДИОВИЗУАЛЬНЫЙ МНОГОКАНАЛЬНЫЙ ДЕТЕКТОР НАЛИЧИЯ ГОЛОСА

Номер: RU0000174044U1

Полезная модель относится к измерительной технике, в частности к области определения наличия голоса в записываемом звуковом сигнале. Решение может быть использовано в комплексе с системой распознавания речи для выделения участков звукового сигнала, которые необходимо передать системе распознавания речи для анализа. Техническим результатом заявленного решения является повышение точности определения источников человеческой речи. Для обеспечения указанного технического результата было разработано устройство обработки по меньшей мере одного аудиосигнала, содержащее: видеокамеру; массив микрофонов, причем геометрический центр массива микрофонов совмещен с центром матрицы видеокамеры; блок обработки аудиосигнала, выполненный с возможностью: синхронного получения данных от микрофонов массива микрофонов для определения по меньшей мере одного направления на активные источники звука; получения изображения от видеокамеры для определения по меньшей мере одного направления на губы в системе координат камеры; определения наличия по меньшей мере одного источника голоса в полученном по меньшей мере одном аудиосигнале на основе по меньшей мере одного направления на активные источники звука и по меньшей мере одного направления на губы в системе координат камеры. И 1 174044 ко РОССИЙСКАЯ ФЕДЕРАЦИЯ 7 ВУ” 174 044” Чл ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ИЗВЕЩЕНИЯ К ПАТЕНТУ НА ПОЛЕЗНУЮ МОДЕЛЬ ММ9К Досрочное прекращение действия патента из-за неуплаты в установленный срок пошлины за поддержание патента в силе Дата прекращения действия патента: 30.05.2019 Дата внесения записи в Государственный реестр: 20.03.2020 Дата публикации и номер бюллетеня: 20.03.2020 Бюл. №8 Стр.: 1 па УУОТДТ ЕП

Подробнее
02-07-2018 дата публикации

ВЫКЛЮЧАТЕЛЬ СВЕТА С ГОЛОСОВЫМ УПРАВЛЕНИЕМ

Номер: RU0000180965U1

Полезная модель относится к электротехнике и может быть использована в бытовой автоматике в качестве голосового выключателя электрической цепи при произнесении слова "свет". Техническим результатом является обеспечение возможности включения света по голосовой команде. Выключатель света с голосовым управлением содержит микрофон, блок восстановления постоянной составляющей звукового сигнала, сигнальный вход которого соединен с выходом микрофона, первый его выход подключен к входу блока высокочастотного звукового фильтра, а второй выход соединен с первым блоком распознавания высокочастотного звукового сигнала, первый выход которого соединен через первый блок управления высокочастотного звукового сигнала с входом блока световой индикации высокочастотного звукового сигнала, а его второй выход соединен с блоком управления неголосовой команды, к второму входу которого подключен блок световой индикации звуковых сигналов неголосовой команды, а выход которого подключен к четвертому входу блока управления голосовой команды, первый вход которого соединен с выходом блока управления звуковыми сигналами, которые превышают по длине слова голосовую команду или не совпадают с ней, к второму входу которого подключен первый выход блока световой индикации звуковых сигналов неголосовой команды, а первый его вход соединен через блок распознавания звуковых сигналов, которые не совпадают с голосовой командой с выходом микрофона. 1 ил. И 1 18096 5 ко РОССИЙСКАЯ ФЕДЕРАЦИЯ ВУ” 180 965” 4 ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ИЗВЕЩЕНИЯ К ПАТЕНТУ НА ПОЛЕЗНУЮ МОДЕЛЬ ММ9К Досрочное прекращение действия патента из-за неуплаты в установленный срок пошлины за поддержание патента в силе Дата прекращения действия патента: 08.03.2020 Дата внесения записи в Государственный реестр: 01.12.2020 Дата публикации и номер бюллетеня: 01.12.2020 Бюл. №34 Стр.: 1 па <960381 ЕП

Подробнее
28-08-2018 дата публикации

УНИВЕРСАЛЬНЫЙ УСИЛИТЕЛЬ МОЩНОСТИ СИГНАЛОВ ЗВУКОВОЙ ЧАСТОТЫ

Номер: RU0000182702U1

Полезная модель относится к звукоусиливающей технике, в частности к усилителям мощности сигналов звуковой частоты. Технический результат заключается в обеспечении возможности осуществления одноканального режима усиления. Усилитель мощности сигналов звуковой частоты содержит два идентичных канала усиления. На входе каждого из каналов расположен фазоинвертор, вход которого образует вход канала усиления. К неинвертирующему выходу фазоинвертора подключен вход первого усилителя канала, а к инвертирующему - вход второго усилителя канала. Причем выход первого усилителя каждого из каналов образует неинвертирующий выход канала усиления, а выход второго усилителя образует инвертирующий выход канала усиления. При этом усилитель нагружен на четыре громкоговорителя, два из которых образуют пару и началами своих звуковых катушек подключены к объединенным неинвертитрующим выходам каналов. Два других громкоговорителя образуют другую пару и концами своих звуковых катушек подключены к объединенным инвертирующим выходам каналов. Другие выводы катушек в каждой паре громкоговорителей подключены один к общему проводу, другой - к свободному выводу громкоговорителя из другой пары. Между входами каналов усиления введен первоначально замкнутый выключатель. Оба канала усиления нагружены на один громкоговоритель, подключенный началом своей звуковой катушки к объединенным неинвертирующим выходам каналов, а концом - к объединенным инвертирующим выходам каналов. 1 ил. Ц 182702 ко РОССИЙСКАЯ ФЕДЕРАЦИЯ ВУ” 182 702” 44 ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ИЗВЕЩЕНИЯ К ПАТЕНТУ НА ПОЛЕЗНУЮ МОДЕЛЬ ММ9К Досрочное прекращение действия патента из-за неуплаты в установленный срок пошлины за поддержание патента в силе Дата прекращения действия патента: 26.12.2018 Дата внесения записи в Государственный реестр: 18.11.2019 Дата публикации и номер бюллетеня: 18.11.2019 Бюл. №32 Стр.: 1 па СОС ЕП

Подробнее
05-09-2019 дата публикации

УСТРОЙСТВО ДЛЯ АУДИОВИЗУАЛЬНОЙ НАВИГАЦИИ СЛЕПОГЛУХИХ ЛЮДЕЙ

Номер: RU0000192148U1

Заявленное техническое решение относится к приспособлениям для инвалидов по слуху и зрению, в частности к устройству для аудиовизуальной навигации слепоглухих людей, которое может быть использовано для самостоятельной навигации слепоглухих людей в помещении и на улице без использования дополнительных приспособлений или вместе с ними, например, с тактильной тростью в зависимости от степени нарушения сенсорных функций у пользователя. Техническим результатом является обеспечение безопасной навигации слепоглухого человека в заранее неизвестной обстановке за счет повышения точности локализации источников звука. Для достижения указанного технического результата разработано устройство для аудиовизуальной навигации слепоглухих людей, содержащее вычислительный модуль и соединенный с ним массив из по меньшей мере 3-х микрофонов, причем вычислительный модуль выполнен с возможностью: получения данных с массива микрофонов в виде звуковых кадров, считывающихся таким образом, чтобы обеспечивалась аппаратная синхронизация считывания звука со всех каналов; обработки данных звуковых кадров для локализации источников звука, включающей: определение коэффициентов правдоподобия наличия активного источника звука с частотой ω в заданных направлениях Θ; определение направления Θ, имеющего максимальное значение коэффициента правдоподобия, как направление на активный источник звука; формирование диаграммы направленности на основе данных азимутов и углов направления Θ на активный источник звука для получения одноканального выделенного звука целевого направления на активный источник звука с подавленными звуками по остальным направлениям; классификацию выделенного звука целевого направления с помощью глубокой свёрточной нейронной сети для определения на предмет наличия звуков потенциально опасных объектов; вывод информации об активном источнике звука пользователю в соответствии с классификатором. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (13) 192 148 U1 (51) МПК G10L 21/10 (2013.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ...

Подробнее
21-07-2020 дата публикации

ПОРТАТИВНОЕ УСТРОЙСТВО РАСПОЗНАВАНИЯ РЕЧИ И ЗВУКОВЫХ СИГНАЛОВ

Номер: RU0000198673U1

Полезная модель относится к ассистивным устройствам, предназначенным для использования людьми с ограниченными возможностями по слуху, слуху и зрению. Расширение функционала устройства, позволяющего распознавать не только речь собеседника, но и другие звуки, демонстрировать пользователю направление источника звука, а также трансформировать полученную и исходящую информацию в виде рельефно-точечного шрифта Брайля и выводить ее на дисплей достигнуто благодаря тому, что устройство дополнительно к стандартным программно-аппаратным средствам содержит плату светодиодов, выполненных с возможностью их видимости через прорези в крышке корпуса, закрытых стеклом, одноплатный компьютер, на платформу которого установлены блок автозапуска процессов, находящийся во взаимосвязи с командно-телеметрическим модулем, модулем захвата звуков, модулем управления микрофонами, модулем вывода текстовой информации на дисплей Брайля, блок контроля над работоспособностью указанных модулей и блока запуска, а также блок распознания звуков и блок управления устройством, связанные через модули с блоком автозапуска процессов. Устройство с расширенными функциями позволяет значительно расширить круг пользователей и преимущественно адаптировать людей с ограниченными возможностями по слуху, а также слуху и зрению к социальной жизни в обществе. 6 з.п. ф-лы; 6 ил. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (13) 198 673 U1 (51) МПК G10L 21/00 (2013.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ОПИСАНИЕ ПОЛЕЗНОЙ МОДЕЛИ К ПАТЕНТУ (52) СПК G10L 21/00 (2020.02) (21)(22) Заявка: 2020112603, 27.03.2020 (24) Дата начала отсчета срока действия патента: Дата регистрации: 21.07.2020 (45) Опубликовано: 21.07.2020 Бюл. № 21 1 9 8 6 7 3 R U (56) Список документов, цитированных в отчете о поиске: US 20020158816 A1, 31.10.2002. RU 2345422 C2, 27.01.2009. RU 2312646 C2, 20.12.2007. US 4378466 A1, 29.03.1983. US 9956407 B2, 01.05.2018. FR 2899097 B1, 13.02.2009. (54) ПОРТАТИВНОЕ УСТРОЙСТВО РАСПОЗНАВАНИЯ РЕЧИ И ...

Подробнее
05-01-2012 дата публикации

Speech audio processing

Номер: US20120004909A1
Принадлежит: Intel Corp

A speech processing engine is provided that in some embodiments, employs Kalman filtering with a particular speaker's glottal information to clean up an audio speech signal for more efficient automatic speech recognition.

Подробнее
05-01-2012 дата публикации

Audio human verification

Номер: US20120004914A1
Принадлежит: Microsoft Corp

A system generates an audio challenge that includes a first voice and one or more second voices, the first voice being audibly distinguishable, by a human, from the one or more second voices. The first voice conveys first information and the second voice conveys second information. The system provides the audio challenge to a user and verifies that the user is human based on whether the user can identify the first information in the audio challenge.

Подробнее
05-01-2012 дата публикации

Full-Band Scalable Audio Codec

Номер: US20120004918A1
Автор: Jinwei Feng, Peter Chu
Принадлежит: Plycom Inc

A scalable audio codec for a processing device determines first and second bit allocations for each frame of input audio. First bits are allocated for a first frequency band, and second bits are allocated for a second frequency band. The allocations are made on a frame-by-frame basis based on the energy ratio between the two bands. For each frame, the codec transform codes both frequency bands into two sets of transform coefficients, which are then packetized based on the bit allocations. The packets are then transmitted with the processing device. Additionally, the frequency regions of the transform coefficients can be arranged in order of importance determined by power levels and perceptual modeling. Should bit stripping occur, the decoder at a receiving device can produce audio of suitable quality given that bits have been allocated between the bands and the regions of transform coefficients have been ordered by importance.

Подробнее
26-03-2021 дата публикации

«РЕЧЕВОЙ КОРРЕКТОР» - УСТРОЙСТВО ДЛЯ УЛУЧШЕНИЯ РАЗБОРЧИВОСТИ РЕЧИ

Номер: RU0000203218U1

Полезная модель относится к реабилитационной технике и медицине и может применяться в быту, медицинских и образовательных организациях. Технический результат заключается в обеспечении возможности использования устройства для улучшения разборчивости речи без предварительной настройки за счет полосовых фильтров, настроенных на частоты спектральных зон фонетических признаков речи. Технический результат достигается за счет устройства для улучшения разборчивости речи, которое состоит из корпуса, внутри которого закреплены микрофон, полосовые фильтры, настроенные на частоты спектральных зон фонетических признаков речи, усилитель мощности с регулятором громкости, гнездо для подключения наушников, соединенные последовательно, а также блютуз-гарнитура (Bluetooth), микрофоном которой является микрофон устройства, а выход на наушники которой соединен с входом полосовых фильтров устройства, аккумулятор, обеспечивающий питание устройства, схема зарядки аккумулятора с гнездом для подключения внешнего источника тока, кнопка управления блютуз-гарнитурой и выключатель питания устройства. 5 з.п. ф-лы, 7 ил. РОССИЙСКАЯ ФЕДЕРАЦИЯ (19) RU (11) (13) 203 218 U1 (51) МПК G10L 21/0364 (2013.01) ФЕДЕРАЛЬНАЯ СЛУЖБА ПО ИНТЕЛЛЕКТУАЛЬНОЙ СОБСТВЕННОСТИ (12) ОПИСАНИЕ ПОЛЕЗНОЙ МОДЕЛИ К ПАТЕНТУ (52) СПК G10L 21/0364 (2021.02) (21)(22) Заявка: 2020141385, 15.12.2020 (24) Дата начала отсчета срока действия патента: Дата регистрации: (73) Патентообладатель(и): Общество с ограниченной ответственностью "Речевая аппаратура "Унитон" (RU) 26.03.2021 (45) Опубликовано: 26.03.2021 Бюл. № 9 2 0 3 2 1 8 R U (54) «РЕЧЕВОЙ КОРРЕКТОР» - УСТРОЙСТВО ДЛЯ УЛУЧШЕНИЯ РАЗБОРЧИВОСТИ РЕЧИ (57) Реферат: Полезная модель относится к частоты спектральных зон фонетических реабилитационной технике и медицине и может признаков речи, усилитель мощности с применяться в быту, медицинских и регулятором громкости, гнездо для подключения образовательных организациях. Технический наушников, соединенные последовательно, а результат ...

Подробнее
12-01-2012 дата публикации

Audio processing with time advanced inserted payload signal

Номер: US20120008803A1
Принадлежит: Sony Europe Ltd

An audio processing apparatus for modifying a primary audio signal includes a modulator that increases or decreases a level of a noise signal generated by a noise generator, in response to an increase or a decrease of a detected signal level of the primary audio signal, to generate a modulated noise signal. The apparatus further includes a combiner that combines the primary audio signal and the modulated noise signal. The modulator operates, with respect to a signal delayer, to time-advance a decrease in the level of said noise signal based on a corresponding decrease in the signal level of the primary audio signal.

Подробнее
19-01-2012 дата публикации

Method and device for audio signal classification

Номер: US20120016677A1
Принадлежит: Huawei Technologies Co Ltd

The present invention discloses a method and a device for audio signal classification, and relates to the field of communications technologies, which solve a problem of high complexity of type classification of audio signals in the prior art. In the present invention, after an audio signal to be classified is received, a tonal characteristic parameter of the audio signal to be classified, where the tonal characteristic parameter of the audio signal to be classified is in at least one sub-band, is obtained, and a type of the audio signal to be classified is determined according to the obtained characteristic parameter. The present invention is mainly applied to an audio signal classification scenario, and implements audio signal classification through a relatively simple method.

Подробнее
19-01-2012 дата публикации

Intelligent Automated Assistant

Номер: US20120016678A1
Принадлежит: Apple Inc

An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionally powered by external services with which the system can interact.

Подробнее
26-01-2012 дата публикации

Noise canceller and noise cancellation program

Номер: US20120020489A1
Автор: Tomohiro Narita
Принадлежит: Mitsubishi Electric Corp

A directivity control unit 10 calculates a main beam signal with its directivity turned toward an object sound direction and a sub-beam signal with its blind spot turned toward the object sound direction from output signals of a plurality of microphones 2 and 3 through signal processing, and a frequency analyzing unit 20 converts them to spectra. A sound source decision unit 30 decides on whether a sound source is voice, stationary noise or unstationary noise from the spectra of the main beam signal and sub-beam signal and outputs as a sound source decision result, and calculates the average spectrum which is a statistic of noise for the main beam signal. An interfering sound removing unit 50 subtracts the average spectrum from the spectrum of the main beam signal to remove interfering sounds.

Подробнее
26-01-2012 дата публикации

Audio signal processing apparatus, audio coding apparatus, and audio decoding apparatus

Номер: US20120022676A1
Принадлежит: Panasonic Corp

To provide an audio signal processing apparatus which can perform, with low operation amount, audio signal processing that is either time stretch and/or compression processing or frequency modulation processing. The audio signal processing apparatus is intended to transform an input audio signal sequence using a predetermined adjustment factor. The audio signal processing apparatus includes a filter bank ( 2601 ) which transforms the input audio signal sequence into Quadrature Mirror Filter (QMF) coefficients using a filter for Quadrature Mirror Filter analysis (a QMF analysis filter) and an adjusting unit ( 2602 ) which adjusts the QMF coefficients based on a predetermined adjustment factor.

Подробнее
26-01-2012 дата публикации

Speech and Noise Models for Speech Recognition

Номер: US20120022860A1
Принадлежит: Google LLC

An audio signal generated by a device based on audio input from a user may be received. The audio signal may include at least a user audio portion that corresponds to one or more user utterances recorded by the device. A user speech model associated with the user may be accessed and a determination may be made background audio in the audio signal is below a defined threshold. In response to determining that the background audio in the audio signal is below the defined threshold, the accessed user speech model may be adapted based on the audio signal to generate an adapted user speech model that models speech characteristics of the user. Noise compensation may be performed on the received audio signal using the adapted user speech model to generate a filtered audio signal with reduced background audio compared to the received audio signal.

Подробнее
26-01-2012 дата публикации

Speech to Text Conversion

Номер: US20120022867A1
Принадлежит: Google LLC

Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote to the electronic device. The textual output is transmitted to the electronic device.

Подробнее
26-01-2012 дата публикации

Geotagged environmental audio for enhanced speech recognition accuracy

Номер: US20120022870A1
Принадлежит: Google LLC

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving geotagged audio signals that correspond to environmental audio recorded by multiple mobile devices in multiple geographic locations, receiving an audio signal that corresponds to an utterance recorded by a particular mobile device, determining a particular geographic location associated with the particular mobile device, generating a noise model for the particular geographic location using a subset of the geotagged audio signals, where noise compensation is performed on the audio signal that corresponds to the utterance using the noise model that has been generated for the particular geographic location.

Подробнее
26-01-2012 дата публикации

Dynamic Range Improvement Technique

Номер: US20120022877A1
Автор: Larry Joseph Kirn
Принадлежит: Individual

Apparatus and methods are disclosed for detecting and progressively attenuating specific frequencies prevalent in an audio signal. In contrast to conventional wide-band enhancement techniques over long time frames, narrow bandwidths and short attenuation times employed are commensurate with resonances and timing typical of speech. Apparent dynamic range is therefore increased through attenuation of longer-duration elements with declining informational contribution.

Подробнее
02-02-2012 дата публикации

Systems, methods, apparatus, and computer-readable media for dynamic bit allocation

Номер: US20120029925A1
Принадлежит: Qualcomm Inc

A dynamic bit allocation operation determines a bit allocation for each of a plurality of vectors, based on a corresponding plurality of gain factors, and compares each allocation to a threshold value that is based on a dimensionality of the vector.

Подробнее
09-02-2012 дата публикации

Information Processing Apparatus, Information Processing Method, and Program

Номер: US20120035927A1
Принадлежит: Sony Corp

An information processing apparatus includes a plurality of information input units that inputs observation information of a real space, an event detection unit that generates event information including estimated position information and estimated identification (ID) information of a user present in the real space based on analysis of the information input from the information input unit, and an information integration processing unit that inputs the event information, and generates target information including a position and user ID information of each user based on the input event information and signal information representing a probability value for an event generating source. Here, the information integration processing unit includes an utterance source probability calculation unit having an identifier, and calculates an utterance source probability based on input information using the identifier in the utterance source probability calculation unit.

Подробнее
09-02-2012 дата публикации

System and method for synthetic voice generation and modification

Номер: US20120035933A1
Принадлежит: AT&T INTELLECTUAL PROPERTY I LP

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of a first text-to-speech voice and a second database of a second text-to-speech voice to generate a combined database, selects from the combined database, based on a policy, voice units of a phonetic category for the synthetic voice to yield selected voice units, and synthesizes speech based on the selected voice units. The system can synthesize speech without parameterizing the first text-to-speech voice and the second text-to-speech voice. A policy can define, for a particular phonetic category, from which text-to-speech voice to select voice units. The combined database can include multiple text-to-speech voices from different speakers. The combined database can include voices of a single speaker speaking in different styles. The combined database can include voices of different languages.

Подробнее
09-02-2012 дата публикации

Speech search device and speech search method

Номер: US20120036159A1
Принадлежит: Toyohashi University of Technology NUC

Provided are a speech search device, the search speed of which is very fast, the search performance of which is also excellent, and which performs fuzzy search, and a speech search method. Not only the fuzzy search is performed, but also the distance between phoneme discrimination features included in speech data is calculated to determine the similarity with respect to the speech using both a suffix array and dynamic programming, and an object to be searched for is narrowed by means of search keyword division based on a phoneme and search thresholds relative to a plurality of the divided search keywords, the object to be searched for is repeatedly searched for while increasing the search thresholds in order, and whether or not there is the keyword division is determined according to the length of the search keywords, thereby implementing speech search, the search speed of which is very fast and the search performance of which is also excellent.

Подробнее
16-02-2012 дата публикации

Sound recognition device and sound recognition method

Номер: US20120039478A1
Принадлежит: Panasonic Corp

A sound recognition device includes: a frequency analysis unit which analyzes a frequency signal of a sound signal; a phase curve calculation unit which calculates a phase curve approximating temporal fluctuations of a phase of the frequency signal; an error calculation unit which calculates an error between the phase curve and the phase of the frequency signal; and a sound signal recognition unit which recognizes whether or not the sound signal is a signal of a periodic sound, based on the calculated error. The phase curve is expressed by a quadratic polynomial in which a value of the phase is a variable.

Подробнее
16-02-2012 дата публикации

Methods and apparatus for embedding watermarks

Номер: US20120039504A1
Автор: Venugopal Srinivasan
Принадлежит: Individual

Methods and apparatus for embedding a watermark are disclosed. An example method disclosed herein to embed a watermark in a compressed data stream comprises obtaining a set of transform coefficients included in the compressed data stream, the set of transform coefficients having a respective first set of mantissa codes and a respective set of exponents, the first set of mantissa codes associated with a respective set of mantissa step sizes, identifying a first transform coefficient from the set of transform coefficients having a smallest magnitude among the set of transform coefficients, determining a second set of mantissa codes based on the first transform coefficient and the set of step sizes, and replacing the first set of mantissa codes included in the compressed data stream with the second set of mantissa codes to embed the watermark without uncompressing the compressed data stream.

Подробнее
16-02-2012 дата публикации

Teaching aid

Номер: US20120040315A1
Автор: Peter Lawrence King
Принадлежит: UNICUS INVESTMENTS Pty Ltd

The present invention relates to the field of voice and speech recognition, in one form, the invention relates to a teaching aid adapted to teach reading and spelling via a voice and/or speech recognition system adapted to assist persons having dyslexia. The invention also provides a mechanism to train a speech recognition system without the need for the used to read verbose passages of text.

Подробнее
23-02-2012 дата публикации

Sound source separation apparatus and sound source separation method

Номер: US20120045066A1
Принадлежит: Honda Motor Co Ltd

A sound source separation apparatus includes a transfer function storage unit that stores a transfer function from a sound source, a sound change detection unit that generates change state information indicating a change of the sound source on the basis of an input signal input from a sound input unit, a parameter selection unit that calculates an initial separation matrix on the basis of the change state information generated by the sound change detection unit, and a sound source separation unit that separates the sound source from the input signal input from the sound input unit using the initial separation matrix calculated by the parameter selection unit.

Подробнее
23-02-2012 дата публикации

System, method and apparatus with environmental noise cancellation

Номер: US20120045074A1
Принадлежит: C Media Electronics Inc

Disclosed herein are system, method and apparatus with environmental noise cancellation. The instant disclosure is particularly adapted to a receiver module having at least two inputs. The two inputs respectively receive a main audio portion and the audio with majority of environmental noise. The system firstly calibrates the audio signals to reduce the error caused by the difference between the two inputs. An adaptive beamforming technology and a speech extractor are respectively used to extract the environmental noise portion with less main audio and the main audio portion with less noise. After a process of time-to-frequency domain transformation, a non-linear noise suppression technology is introduced into estimating the environmental noise and acquiring a gain. After noise suppression processed with the gain, a sequence of audio signals is output after a frequency-to-time domain transformation.

Подробнее
23-02-2012 дата публикации

Apparatus and method for improving communication quality in mobile terminal

Номер: US20120046943A1
Принадлежит: SAMSUNG ELECTRONICS CO LTD

An apparatus and a method for voice communication of a mobile terminal are provided. More particularly, an apparatus and a method for clearly receiving a counterpart user's voice signal in a mobile terminal positioned at a place where a noise occurs are provided. The apparatus includes an input unit, an extension signal generator, and an adder. The input unit receives a voice signal. The extension signal generator generates, based on a voice signal received via the input unit, a harmonics signal corresponding to a frequency band that represents a reaction sensitive to a sense of hearing. The adder merges the generated harmonics signal with the received voice signal.

Подробнее
23-02-2012 дата публикации

Retrieval and presentation of network service results for mobile device using a multimodal browser

Номер: US20120046950A1
Принадлежит: Nuance Communications Inc

A method of obtaining information using a mobile device can include receiving a request including speech data from the mobile device, and querying a network service using query information extracted from the speech data, whereby search results are received from the network service. The search results can be formatted for presentation on a display of the mobile device. The search results further can be sent, along with a voice grammar generated from the search results, to the mobile device. The mobile device then can render the search results.

Подробнее
23-02-2012 дата публикации

Method and Apparatus for Telephonically Accessing and Navigating the Internet

Номер: US20120047216A1
Принадлежит: Ben Franklin Patent Holding LLC

A method for accessing and browsing the interne through the use of a telephone and the associated DTMF signals is disclosed. The preferred embodiment provides a system that converts the information content of a web page from text to speech (voice signals), signals the hyperlink selections of a web page in an audio manner, and allows selection of the hyperlinks through the use of DTMF signals generated from a telephone keypad. Upon receiving a DTMF signal corresponding to a hyperlink, the corresponding web page is fetched and again delivered to the user via one of the available delivery methods such as voice, fax-on-demand, electronic mail, or regular mail.

Подробнее
15-03-2012 дата публикации

System for extraction of reverberant content of an audio signal

Номер: US20120063608A1
Принадлежит: Harman International Industries Inc

A reverberant characteristic of an acoustic space is superimposed on an audio signal that is received by an apparatus. The apparatus decomposes the audio signal into an estimated original dry signal component and an estimated reverberant characteristic of the acoustic space. Estimation of the original dry signal component and the reverberant characteristic of the acoustic space is based on determination of an estimated impulse response of the acoustic space from the received audio signal. Once the audio signal is decomposed, the estimated original dry signal component and the estimated reverberant characteristic of the acoustic space may be independently modified by the apparatus. The modified or unmodified estimated original dry signal component and estimated reverberant characteristic of the acoustic space may be combined by the apparatus to produce one or more adjusted frequency spectra.

Подробнее
15-03-2012 дата публикации

System and method for pronunciation modeling

Номер: US20120065975A1
Принадлежит: AT&T INTELLECTUAL PROPERTY I LP

Systems, computer-implemented methods, and tangible computer-readable media for generating a pronunciation model. The method includes identifying a generic model of speech composed of phonemes, identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech, labeling the family of interchangeable phonemic alternatives as referring to the same phoneme, and generating a pronunciation model which substitutes each family for each respective phoneme. In one aspect, the generic model of speech is a vocal tract length normalized acoustic model. Interchangeable phonemic alternatives can represent a same phoneme for different dialectal classes. An interchangeable phonemic alternative can include a string of phonemes.

Подробнее
15-03-2012 дата публикации

Efficient Combined Harmonic Transposition

Номер: US20120065983A1
Принадлежит: DOLBY INTERNATIONAL AB

The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular; a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described, The system may comprise an analysis filter bank ( 501 ) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank ( 501 ) has a frequency resolution of Δf, The system further comprises a nonlinear processing unit ( 502 ) configured to determine a set of synthesis subband signals from the set of analysis subband signals using a transposition order P; wherein the set of synthesis subband signals comprises a portion of the set of analysis subband signals phase shifted by an amount derived from the transposition order P; and a synthesis filter bank ( 504 ) configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein the synthesis filter bank ( 504 ) has a frequency resolution of FΔf; with F being a resolution factor, with F≧1; wherein the transposition order P is different from the resolution factor F.

Подробнее
22-03-2012 дата публикации

Using codec parameters for endpoint detection in speech recognition

Номер: US20120072211A1
Принадлежит: Nuance Communications Inc

Systems, methods and apparatus for determining an estimated endpoint of human speech in a sound wave received by a mobile device having a speech encoder for encoding the sound wave to produce an encoded representation of the sound wave. The estimated endpoint may be determined by analyzing information available from the speech encoder, without analyzing the sound wave directly and without producing a decoded representation of the sound wave. The encoded representation of the sound wave may be transmitted to a remote server for speech recognition processing, along with an indication of the estimated endpoint.

Подробнее
29-03-2012 дата публикации

Fine/Coarse Gain Adjustment

Номер: US20120076320A1
Автор: Vasu Iyengar
Принадлежит: Bose Corp

Methods and apparatuses for deriving a signal-to-noise ratio based at least in part on a measured level of a signal carrying far-end speech, and a measured level of a signal carrying ambient acoustic noise, determining a target gain adjustment based at least in part on the derived signal-to-noise ratio, including determining a coarse gain adjustment and a fine gain adjustment, the target gain adjustment corresponding to a combination of the coarse gain adjustment and the fine gain adjustment, applying the coarse gain adjustment to the signal carrying far-end speech using first gain adjustment circuitry to produce a first gain-adjusted signal, applying the fine gain adjustment to the signal carrying far-end speech using second gain adjustment circuitry to produce a second gain-adjusted signal, and providing a result of combining the first gain-adjusted signal and the second gain-adjusted signal to audio output from a wireless communications device.

Подробнее
29-03-2012 дата публикации

Method and device for frequency compression with selective frequency shifting

Номер: US20120076333A1
Принадлежит: Siemens Medical Instruments Pte Ltd

A method and device for frequency compression of audio signals to reduce the occurrence of artifacts. A component of the audio signal having a plurality of frequency channels is shifted from a first frequency channel into a second frequency channel. A dominant instantaneous frequency is determined in the first frequency channel. During shifting or mapping, first the entire first frequency channel, including the dominant instantaneous frequency, is shifted or mapped into the second frequency channel, wherein the dominant instantaneous frequency obtains an intermediate frequency position. A final frequency position for the dominant instantaneous frequency is determined using a predefined compression characteristic in the second frequency channel, starting from the frequency position of the dominant instantaneous frequency in the first frequency channel. Finally, the dominant instantaneous frequency is shifted or mapped from the intermediate frequency position to the final frequency position.

Подробнее
19-04-2012 дата публикации

Automatically providing a user with substitutes for potentially ambiguous user-defined speech commands

Номер: US20120095765A1
Принадлежит: Nuance Communications Inc

A method for alleviating ambiguity issues of new user-defined speech commands. An original command for a user-defined speech command can be received. It can then be determined if the original command is likely to be confused with a set of existing speech commands. When confusion is unlikely, the original command can be automatically stored. When confusion is likely, a substitute command that is unlikely to be confused with existing commands can be automatically determined. The substitute can be presented as an alternative to the original command and can be selectively stored as the user-defined speech command.

Подробнее
03-05-2012 дата публикации

Adaptive audio transcoding

Номер: US20120109643A1
Принадлежит: Google LLC

A system and method provide an audio/video coding system for adaptively transcoding audio streams based on content characteristics of the audio streams. An audio stream metadata extraction module of the system is configured to extract metadata of a source audio stream. An audio stream classification module of the system is configured to classify the source audio stream into one of the several audio content categories based on the metadata of the source audio stream. An adaptive audio encoder of the system is configured to determine one or more transcoding parameters including target bitrate and sampling rate based on the metadata and classification of the source audio stream. An adaptive audio transcoder of the system is configured to transcode the source audio stream into an output audio stream using the transcoding parameters.

Подробнее
10-05-2012 дата публикации

Method and apparatus for encoding and decoding high frequency signal

Номер: US20120116757A1
Принадлежит: SAMSUNG ELECTRONICS CO LTD

Provided are a method and apparatus for encoding and decoding a high frequency signal by using a low frequency signal. The high frequency signal can be encoded by extracting a coefficient by linear predicting a high frequency signal, and encoding the coefficient, generating a signal by using the extracted coefficient and a low frequency signal, and encoding the high frequency signal by calculating a ratio between the high frequency signal and an energy value of the generated signal. Also, the high frequency signal can be decoded by decoding a coefficient, which is extracted by linear predicting a high frequency signal, and a low frequency signal, and generating a signal by using the decoded coefficient and the decoded low frequency signal, and adjusting the generated signal by decoding a ratio between the generated signal and an energy value of the high frequency signal.

Подробнее
10-05-2012 дата публикации

System for voice control of a medical implant

Номер: US20120116774A1
Автор: Peter Forsell
Принадлежит: MILUX HOLDING SA

An implantable system ( 11 ) for control of and communication with an implant ( 17 ) in a body, comprising a command input device ( 12 ) and a processing device ( 13 ) coupled thereto, the processing device ( 13 ) being adapted to generate input to a command generator ( 16 ) which is comprised in the system ( 11 ) coupled to the processing device ( 13 ) and which is adapted to generate and communicate commands to the medical implant ( 17 ) in response to input received from the processing device ( 13 ), the system ( 11 ) further comprising a memory unit ( 15 ) connected to at least one of said devices in the system ( 11 ) for storing a memory bank of commands. The command input device ( 12 ) is adapted to receive commands from a user as voice commands, and the processing device ( 13 ) comprises a filter adapted to filter voice commands against high frequency losses and frequency distortion caused by the mammal body ( 10 ).

Подробнее
17-05-2012 дата публикации

System and method for providing enhanced audio in a video environment

Номер: US20120120270A1
Принадлежит: Cisco Technology Inc

A method is provided in one example and includes receiving audio data at a microphone array that includes a plurality of microphones. The microphone array is provisioned at a first endpoint, which includes a camera element configured to capture video data associated with a video session involving the first endpoint and a second endpoint. The method also includes formatting the audio data into a time division multiplex (TDM) stream, and communicating the stream to a port for a subsequent communication over a network and to the second endpoint.

Подробнее
17-05-2012 дата публикации

Post-noise suppression processing to improve voice quality

Номер: US20120123775A1
Принадлежит: Individual

Provided are methods and systems for improving quality of speech communications. The method may be for improving quality of speech communications in a system having a speech encoder configured to encode a first audio signal using a first set of encoding parameters associated with a first noise suppressor. A method may involve receiving a second audio signal at a second noise suppressor which provides much higher quality noise suppression than the first noise suppressor. The second audio signal may be generated by a single microphone or a combination of multiple microphones. The second noise suppressor may suppress the noise in the second audio signal to generate a processed signal which may be sent to a speech encoder. A second set of encoding parameters may be provided by the second noise suppressor for use by the speech encoder when encoding the processed signal into corresponding data.

Подробнее
24-05-2012 дата публикации

Spatial noise suppression for a microphone array

Номер: US20120128176A1
Принадлежит: Microsoft Corp

A noise reduction system and a method of noise reduction includes utilizing an array of microphones to receive sound signals from stationary sound sources and a user that is speaking. Positions of the stationary sound sources relative to the array of microphones are estimated using sound signals emitted from the sound sources at an earlier time. Noise is suppressed in an audio signal based at least in part on the estimated positions of the stationary sound sources. A position of the user relative to the array of microphones can also be estimated

Подробнее
24-05-2012 дата публикации

Voice Volume Modulator

Номер: US20120130154A1

A small, compact voice volume monitor has been developed that provides feedback to the speaker/user regarding the volume of the user's speech. The monitor is based on a sensing sound vibrations in the ear bone during speech and converting these vibrations into an electrical signal reflecting the speech volume. Electronic circuitry is then used to compare the intensity of this signal with pre-set reference levels. When the intensity of the signal is outside the reference levels for a set amount of time, feedback is provided to the user, for example, from a small vibratory motor.

Подробнее
24-05-2012 дата публикации

Speech determination apparatus and speech determination method

Номер: US20120130711A1
Автор: Takaaki Yamabe
Принадлежит: JVCKenwood Corp

A signal portion per frame is extracted from an input signal, thus generating a per-frame signal. The per-frame signal in the time domain is converted into a per-frame signal in the frequency domain, thereby generating a spectral pattern of spectra. It is determined whether an energy ratio is higher than a threshold level. The energy ratio is a ratio of each spectral energy to subband energy in a subband that involves the spectrum. The subband is involved in subbands into which a frequency band is separated with a specific bandwidth. It is determined whether the per-frame signal is a speech segment, based on a result of the determination. Average energy is derived in the frequency direction for the spectra in the spectral pattern in each subband. Subband energy is derived per subband by averaging the average energy in the time domain.

Подробнее
31-05-2012 дата публикации

Noise suppression apparatus, method, and a storage medium storing a noise suppression program

Номер: US20120134509A1
Автор: Chikako Matsumoto
Принадлежит: Fujitsu Ltd

A noise suppression apparatus includes: a conversion unit to convert a recorded sound signal in a time domain into a spectrum in a frequency domain; a setting unit to set a suppression gain indicating a degree of suppression on each spectrum for each frequency spectrum on the basis of a nonstationarity-value variation in time of the respective spectrum; a suppression unit to suppress each of the spectrum on the basis of the suppression gain set by the setting unit for each frequency spectrum; and an inverse conversion unit to perform an inverse conversion to the conversion by the conversion unit on the spectrum having been subjected to the suppression processing by the suppression unit.

Подробнее
31-05-2012 дата публикации

System and Method for Selective Enhancement Of Speech Signals

Номер: US20120134522A1
Принадлежит: Individual

A system and method for selectively enhancing an audio signal to make sounds, particularly speech sounds, more distinguishable. The system and method are designed to divide an input auditory signal into a plurality of spectral channels having associated unenhanced signals and perform enhancement processing on a first subset of the spectral channels and not perform enhancement processing on a second subset of the spectral channels. The enhancement processing is performed by determining an output gain for at least the first subset of spectral channels based on a time-varying history of energy of the unenhanced signals associated with each channel in the first subset of the spectral channels and applying the output gain for each of the first subset of the spectral channels to the unenhanced signals to form enhanced signals associated with each of the first subset of the spectral channels. The system and method are then designed to combine the plurality of enhanced signals associated with each of the first subset of the spectral channels and the unenhanced signals associated with each of the second subset of the spectral channels to form a selectively enhanced output auditory signal.

Подробнее
31-05-2012 дата публикации

Smartphone-Based Methods and Systems

Номер: US20120134548A1
Принадлежит: Digimarc Corp

Methods and arrangements involving portable devices are disclosed. One arrangement enables a content creator to select software with which that content should be rendered—assuring continuity between artistic intention and delivery. Another arrangement utilizes the camera of a smartphone to identify nearby subjects, and take actions based thereon. Others rely on near field chip (RFID) identification of objects, or on identification of audio streams (e.g., music, voice). Some of the detailed technologies concern improvements to the user interfaces associated with such devices. Others involve use of these devices in connection with shopping, text entry, sign language interpretation, and vision-based discovery. Still other improvements are architectural in nature, e.g., relating to evidence-based state machines, and blackboard systems. Yet other technologies concern use of linked data in portable devices—some of which exploit GPU capabilities. Still other technologies concern computational photography. A great variety of other features and arrangements are also detailed.

Подробнее
14-06-2012 дата публикации

Telephone or other device with speaker-based or location-based sound field processing

Номер: US20120150542A1
Автор: Wei Ma
Принадлежит: National Semiconductor Corp

A method includes obtaining audio data representing audio content from at least one speaker. The method also includes spatially processing the audio data to create at least one sound field, where each sound field has a spatial characteristic that is unique to a specific speaker. The method further includes generating the at least one sound field using the processed audio data. The audio data could represent audio content from multiple speakers, and generating the at least one sound field could include generating multiple sound fields around a listener. The spatially processing could include performing beam forming to create multiple directional beams, and generating the multiple sound fields around the listener could include generating the directional beams with different apparent origins around the listener. The method could further include separating the audio data based on speaker, where each sound field is associated with the audio data from one of the speakers.

Подробнее
14-06-2012 дата публикации

Method and system for reconstructing speech from an input signal comprising whispers

Номер: US20120150544A1
Принадлежит: NANYANG TECHNOLOGICAL UNIVERSITY

A system for reconstructing speech from an input signal comprising whispers is disclosed. The system comprises an analysis unit configured to analyse the input signal to form a representation of the input signal; an enhancement unit configured to modify the representation of the input signal to adjust a spectrum of the input signal, wherein the adjusting of the spectrum of the input signal comprises modifying a bandwidth of at least one formant in the spectrum to achieve a predetermined spectral energy distribution and amplitude for the at least one formant; and a synthesis unit configured to reconstruct speech from the modified representation of the input signal.

Подробнее
21-06-2012 дата публикации

Sound processing apparatus and recording medium storing a sound processing program

Номер: US20120155674A1
Автор: Naoshi Matsuo
Принадлежит: Fujitsu Ltd

A sound processing apparatus includes a first calculator that calculates first power based on a first signal received by a first microphone that is among the first microphone and a second microphone; a second calculator that calculates second power based on a second signal received by the second microphone; a gain calculator that calculates a gain on the basis of the ratio of the first power to the second power; and a multiplier that processes the second signal using the gain calculated by the gain calculator.

Подробнее
28-06-2012 дата публикации

Adaptable audio instruction system and method

Номер: US20120164617A1
Автор: Dongju Chung
Принадлежит: Individual

An adaptable audio instruction system and method allows for tailoring and modification to audio sequences used for audio instruction of users. The tailoring and modification abilities of the system regard content and presentation details of the audio sequences to comply with user preferences and user progress in learning content contained in the audio sequences.

Подробнее
05-07-2012 дата публикации

Apparatus and method for voice command recognition based on a combination of dialog models

Номер: US20120173244A1
Принадлежит: SAMSUNG ELECTRONICS CO LTD

Provided are a voice command recognition apparatus and method capable of figuring out the intention of a voice command input through a voice dialog interface, by combining a rule based dialog model and a statistical dialog model rule. The voice command recognition apparatus includes a command intention determining unit configured to correct an error in recognizing a voice command of a user, and an application processing unit configured to check whether the final command intention determined in the command intention determining unit comprises the input factors for execution of an application.

Подробнее
19-07-2012 дата публикации

Device and method for controlling damping of residual echo

Номер: US20120183133A1
Принадлежит: Limes Audio AB

The present invention relates to a device, such as a communication device, comprising an adaptive foreground filter configured to calculate a first echo estimation signal based on a first input signal, and an adaptive background filter being more rapidly adapting than the foreground filter and configured to calculate a second echo estimation signal based on said first input signal. Embodiments of the device further comprise damping control means for controlling damping of an echo-cancelled output signal. The device in various embodiments includes that the damping control means is configured to calculate a maximum echo estimation signal using both the first and the second echo estimation signals, and control the damping of the echo-cancelled output signal based on said maximum echo estimation signal and/or a signal derived from said maximum echo estimation signal.

Подробнее
19-07-2012 дата публикации

Sound signal processing apparatus, sound signal processing method, and program

Номер: US20120183149A1
Автор: Atsuo Hiroe
Принадлежит: Sony Corp

An apparatus including a direction estimation unit detecting one or more direction points indicating a sound source direction of a sound signal for each of blocks divided in a predetermined time unit, and a direction tracking unit connecting the direction points to each other between the blocks and detecting a section in which a sound is active.

Подробнее
19-07-2012 дата публикации

Method and system for creating a voice recognition database for a mobile device using image processing and optical character recognition

Номер: US20120183221A1
Принадлежит: Denso Corp, Denso International America Inc

A method and system for controlling a mobile device from a head unit using voice control is disclosed. The head unit receives a graphical representation of a current user interface screen of the mobile device. The head unit than scans the graphical representation of the current user interface screen to determine the locations of potential input mechanisms. The potential input mechanisms are scanned using optical character recognition and voice commands are determined for the input mechanisms. The determined voice commands and their respective locations on the user interface screens are stored in a voice recognition database, which is queried with uttered voice commands during voice recognition.

Подробнее
02-08-2012 дата публикации

Oversampling in a combined transposer filter bank

Номер: US20120195442A1
Принадлежит: DOLBY INTERNATIONAL AB

The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank ( 501 ) comprising an analysis transformation unit ( 601 ) having a frequency resolution of Δf; and an analysis window ( 611 ) having a duration of D A ; the analysis filter bank ( 501 ) being configured to provide a set of analysis subband signals from the low frequency component of the signal; a nonlinear processing unit ( 502, 650 ) configured to determine a set of synthesis subband signals based on a portion of the set of analysis subband signals, wherein the portion of the set of analysis subband signals is phase shifted by a transposition order T; and a synthesis filter bank ( 504 ) comprising a synthesis transformation unit ( 602 ) having a frequency resolution of QΔf; and a synthesis window ( 612 ) having a duration of D s ; the synthesis filter bank ( 504 ) being configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein Q is a frequency resolution factor with Q≧1 and smaller than the transposition order T; and wherein the value of the product of the frequency resolution Δf and the duration D A of the analysis filter bank is selected based on the frequency resolution factor Q.

Подробнее
02-08-2012 дата публикации

Voice correction device, voice correction method, and recording medium storing voice correction program

Номер: US20120197634A1
Принадлежит: Fujitsu Ltd

A voice correction device includes a detector that detects a response from a user, a calculator that calculates an acoustic characteristic amount of an input voice signal, an analyzer that outputs an acoustic characteristic amount of a predetermined amount when having acquired a response signal due to the response from the detector, a storage unit that stores the acoustic characteristic amount output by the analyzer, a controller that calculates an correction amount of the voice signal on the basis of a result of a comparison between the acoustic characteristic amount calculated by the calculator and the acoustic characteristic amount stored in the storage unit, and a correction unit that corrects the voice signal on the basis of the correction amount calculated by the controller.

Подробнее
09-08-2012 дата публикации

Method and device for forming a mixed signal, method and device for separating signals, and corresponding signal

Номер: US20120203362A1

The invention relates to a method of formation of one or more mixed signals (S out ) on the basis of at least two digital source signals (S 1 , S 2 ), in particular audio signals, in which the mixed signal or signals (S out ) are formed by mixing the source signals (S 1 , S 2 ). In particular, a quantity characteristic of a source signal or of the mixing is determined and the value (W 1 , W 2 ) of the said characteristic quantity is watermarked on at least one of the signals (S 1 , S 2 , S out ). The invention also relates to a method of separation intended to separate, at least partially, at least one digital source signal contained in one or more mixed signals comprising a watermarked value of a quantity characteristic of a source signal or of the mixing. According to the method, the watermarked value of the quantity characteristic of the source signal or of the mixing is determined, and then the mixed signal or signals is or are processed as a function of the said value so as to obtain, at least partially, the said source signal. The invention also relates to the corresponding mixed signal (S out ), as well as the corresponding devices.

Подробнее
16-08-2012 дата публикации

Method And Background Estimator For Voice Activity Detection

Номер: US20120209604A1
Автор: Martin Sehlstedt
Принадлежит: Individual

The present invention relates to a method and a background estimator in voice activity detector for updating a background noise estimate for an input signal. The input signal for a current frame is received and it is determined whether the current frame of the input signal comprises non-noise. Further, an additional determination is performed whether the current frame of the non-noise input comprises noise by analyzing characteristics at least related to correlation and energy level of the input signal, and background noise estimate is updated if it is determined that the current frame comprises noise.

Подробнее
16-08-2012 дата публикации

Method and apparatus for information extraction from interactions

Номер: US20120209606A1
Принадлежит: Nice Systems Ltd

Obtaining information from audio interactions associated with an organization. The information may comprise entities, relations or events. The method comprises: receiving a corpus comprising audio interactions; performing audio analysis on audio interactions of the corpus to obtain text documents; performing linguistic analysis of the text documents; matching the text documents with one or more rules to obtain one or more matches; and unifying or filtering the matches.

Подробнее
16-08-2012 дата публикации

Speech signal restoration device and speech signal restoration method

Номер: US20120209611A1
Принадлежит: Mitsubishi Electric Corp

A synthesis filter 106 synthesizes a plurality of wide-band speech signals by combining wide-band phoneme signals and sound source signals from a speech signal code book 105 , and a distortion evaluation unit 107 selects one of the wide-band speech signals with a minimum waveform distortion with respect to an up-sampled narrow-band speech signal output from a sampling conversion unit 101 . A first bandpass filter 103 extracts a frequency component outside a narrow-band of the wide-band speech signal and a band synthesis unit 104 combines it with the up-sampled narrow-band speech signal.

Подробнее
23-08-2012 дата публикации

Methods and apparatus for formatting text for clinical fact extraction

Номер: US20120212337A1
Принадлежит: Nuance Communications Inc

An original text that is a representation of a narration of a patient encounter provided by a clinician may be received and re-formatted to produce a formatted text. One or more clinical facts may be extracted from the formatted text. A first fact of the clinical facts may be extracted from a first portion of the formatted text, and the first portion of the formatted text may be a formatted version of a first portion of the original text. A linkage may be maintained between the first fact and the first portion of the original text.

Подробнее
23-08-2012 дата публикации

Hearing assistance system for providing consistent human speech

Номер: US20120215532A1
Принадлежит: Apple Inc

Broadly speaking, the embodiments disclosed herein describe an apparatus, system, and method that allows a user of a hearing assistance system to perceive consistent human speech. The consistent human speech can be based upon user specific preferences.

Подробнее
23-08-2012 дата публикации

Sound Recognition Operation Apparatus and Sound Recognition Operation Method

Номер: US20120215537A1
Автор: Yoshihiro Igarashi
Принадлежит: Individual

According to one embodiment, a sound recognition operation apparatus includes a sound detection module, a keyword detection module, an audio mute module, and a transmission module. The sound detection module is configured to detect sound. The keyword detection module is configured to detect a particular keyword using voice recognition when the sound detection module detects sound. The audio mute module is configured to transmit an operation signal for muting audio sound when the keyword detection module detects the keyword. The transmission module is configured to recognize the voice command after the keyword is detected by the keyword detection module, and transmit an operation signal corresponding to the voice command.

Подробнее
30-08-2012 дата публикации

Network apparatus and methods for user information delivery

Номер: US20120221412A1
Автор: Robert F. Gazdzinski
Принадлежит: Individual

A network apparatus useful for providing directions and other information to a user of a client device in wireless communication therewith. In one embodiment, the apparatus includes one or more wireless interfaces and a network interface for communication with a server. User speech inputs in the form of digitized representations are received by the apparatus and used by the server as the basis for retrieving information including graphical representations of location or entities that the user wishes to find.

Подробнее
06-09-2012 дата публикации

Audio-signal correction apparatus, audio-signal correction method and audio-signal correction program

Номер: US20120226372A1
Автор: Masami Nakamura
Принадлежит: JVCKenwood Corp

Sequential digital audio signals are received to calculate a difference between each currently sampled digital audio signal and another digital audio signal sampled at one sampling period before each currently sampled digital audio signal. Differences for the sequential digital audio signals are stored. The number of digital audio signals consecutively clipped is counted in the received sequential digital audio signals. A specific difference is retrieved, from the stored differences, for a digital audio signal sampled at a specific number of sampling periods before each clipped digital audio signal. The specific number of sampling periods is determined based on the counted number of digital audio signals consecutively clipped. Each clipped digital audio signal is corrected based on the specific difference.

Подробнее
06-09-2012 дата публикации

Device and method for filtering out noise from speech of caller

Номер: US20120226495A1
Автор: Wei Wu, XIN YANG

A device and a method for filtering out noise from speech of caller are disclosed. The method is applied to the device, includes: inputting a speech sound of a caller; converting the speech sound to digital signals by an analyzing-to-digital converting unit; analyzing the digital signals to identify a pure speech of the caller and filtering out an extraneous noise thus obtaining pure speech signals of the caller; encoding the pure speech signals by a coder and decoder unit, and submitting the encoded speech signals to the receiver.

Подробнее
13-09-2012 дата публикации

Wireless synchronization of data and software components over a wireless network compatible to ieee802.11 standard(s) for mobile devices

Номер: US20120230315A1
Принадлежит: Flexiworld Technologies Inc

Wireless synchronization of data and software components over IEEE802.11 standard(s) are herein disclosed and enabled. An information apparatus, which includes a wireless communication unit compatible with IEEE802.11, may access a wireless local area network (WLAN). To setup the wireless synchronization, the user connects the information apparatus to a wireless output device over a wired connection (e.g., USB) and selects the wireless output device. Information associated with the wireless output device is saved in the mobile information apparatus for enabling wireless synchronization. Next, the user connects the mobile information apparatus to the WLAN, and, depending on the availability of the wireless output device in the network, the information apparatus may lock a wireless connection to the wireless output device for wireless synchronization. A client application in the mobile information apparatus and output controller software in the wireless output device may be required to facilitate the wireless synchronization over the WLAN.

Подробнее
13-09-2012 дата публикации

Bandwidth extension of a low band audio signal

Номер: US20120230515A1
Принадлежит: Telefonaktiebolaget LM Ericsson AB

Estimation of a high band extension of a low band audio signal includes the following steps: extracting (S 1 ) a set of features of the low band audio signal; mapping (S 2 ) extracted features to at least one high band parameter with generalized additive modeling; frequency shifting (S 3 ) a copy of the low band audio signal into the high band; controlling (S 4 ) the envelope of the frequency shifted copy of the low band audio signal by said at least one high band parameter.

Подробнее
20-09-2012 дата публикации

Sound processing based on a confidence measure

Номер: US20120239385A1
Принадлежит: Cochlear Ltd

A method for processing sound that includes, generating one or more noise component estimates relating to an electrical representation of the sound and generating an associated confidence measure for the one or more noise component estimates. The method further comprises processing, based on the confidence measure, the sound.

Подробнее
20-09-2012 дата публикации

Apparatus and method for supporting reading of document, and computer readable medium

Номер: US20120239390A1
Принадлежит: Toshiba Corp

According to one embodiment, an apparatus for supporting reading of a document includes a model storage unit, a document acquisition unit, a feature information extraction, and an utterance style estimation unit. The model storage unit is configured to store a model which has trained a correspondence relationship between first feature information and an utterance style. The first feature information is extracted from a plurality of sentences in a training document. The document acquisition unit is configured to acquire a document to be read. The feature information extraction unit is configured to extract second feature information from each sentence in the document to be read. The utterance style estimation unit is configured to compare the second feature information of a plurality of sentences in the document to be read with the model, and to estimate an utterance style of the each sentence of the document to be read.

Подробнее
20-09-2012 дата публикации

Erroneous detection determination device, erroneous detection determination method, and storage medium storing erroneous detection determination program

Номер: US20120239394A1
Автор: Chikako Matsumoto
Принадлежит: Fujitsu Ltd

An erroneous detection determination device includes: a signal acquisition unit configured to acquire, from each of microphones, a plurality of audio signals relating to ambient sound including sound from a sound source in a certain direction; a result acquisition unit configured to acquire a recognition result including voice activity information indicating the inclusion of a voice activity relating to at least one of the audio signals; a calculation unit configured to calculate, for each of audio signals on the basis of the signals in respective unit times and the certain direction, a speech arrival rate representing the proportion of the sound from the certain direction to the ambient sound in each of the unit times; and an error detection unit configured to determine, on the basis of the recognition result and the speech arrival rate, whether or not the voice activity information is the result of erroneous detection.

Подробнее
27-09-2012 дата публикации

Methods and apparatus for formatting text for clinical fact extraction

Номер: US20120245926A1
Принадлежит: Nuance Communications Inc

An original text that is a representation of a narration of a patient encounter provided by a clinician may be received and re-formatted to produce a formatted text. One or more clinical facts may be extracted from the formatted text. A first fact of the clinical facts may be extracted from a first portion of the formatted text, and the first portion of the formatted text may be a formatted version of a first portion of the original text. A linkage may be maintained between the first fact and the first portion of the original text.

Подробнее
04-10-2012 дата публикации

Noise removal device and noise removal program

Номер: US20120250883A1
Автор: Tomohiro Narita
Принадлежит: Mitsubishi Electric Corp

A noise removal unit 102 executes noise removal and flooring processing of an input signal, and a density calculating unit 104 calculates, as to a point of interest on a time-frequency plane of the input signal passing through the noise removal, a density of non-flooring processing points from the presence or absence of the flooring processing of individual points around the point of interest. A partial suppression unit 105 replaces, when the density is less than a threshold, the power of the point of interest with its flooring value by considering it as a musical noise component, thereby suppressing the musical noise component.

Подробнее
04-10-2012 дата публикации

Frame mapping approach for cross-lingual voice transformation

Номер: US20120253781A1
Принадлежит: Microsoft Corp

Frame mapping-based cross-lingual voice transformation may transform a target speech corpus in a particular language into a transformed target speech corpus that remains recognizable, and has the voice characteristics of a target speaker that provided the target speech corpus. A formant-based frequency warping is performed on the fundamental frequencies and the linear predictive coding (LPC) spectrums of source speech waveforms in a first language to produce transformed fundamental frequencies and transformed LPC spectrums. The transformed fundamental frequencies and the transformed LPC spectrums are then used to generate warped parameter trajectories. The warped parameter trajectories are further used to transform the target speech waveforms in the second language to produce transformed target speech waveform with voice characteristics of the first language that nevertheless retain at least some voice characteristics of the target speaker.

Подробнее
04-10-2012 дата публикации

Multi-mode audio codec and celp coding adapted therefore

Номер: US20120253797A1

In an embodiment, bitstream elements of sub-frames are encoded differentially to a global gain value so that a change of the global gain value results in an adjustment of an output level of the decoded representation of the audio content. Concurrently, the differential coding saves bits. Even further, the differential coding enables the lowering of the burden of globally adjusting the gain of an encoded bitstream. In another embodiment, a global gain control across CELP coded frames and transform coded frames is achieved by co-controlling the gain of the codebook excitation of the CELP codec, along with a level of the transform or inverse transform of the transform coded frames. In another embodiment, the gain value determination in CELP coding is performed in the weighted domain of the excitation signal.

Подробнее
04-10-2012 дата публикации

System and method for rapid customization of speech recognition models

Номер: US20120253799A1
Принадлежит: AT&T INTELLECTUAL PROPERTY I LP

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set of speech recognition models being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.

Подробнее
04-10-2012 дата публикации

Location-Based Conversational Understanding

Номер: US20120253802A1
Принадлежит: Microsoft Corp

Location-based conversational understanding may be provided. Upon receiving a query from a user, an environmental context associated with the query may be generated. The query may be interpreted according to the environmental context. The interpreted query may be executed and at least one result associated with the query may be provided to the user.

Подробнее
04-10-2012 дата публикации

Systems, methods, and media for generating hierarchical fused risk scores

Номер: US20120254243A1
Принадлежит: Victrio Inc

Systems, methods, and media for generating fused risk scores for determining fraud in call data are provided herein. Some exemplary methods include generating a fused risk score used to determine fraud from call data by generating a fused risk score for a leg of call data, via a fuser module of an analysis system, the fused risk score being generated by fusing together two or more uniquely calculated fraud risk scores, each of the uniquely calculated fraud risk scores being generated by a sub-module of the analysis system; and storing the fused risk score in a storage device that is communicatively couplable with the fuser module.

Подробнее
11-10-2012 дата публикации

Integrated psychoacoustic bass enhancement (pbe) for improved audio

Номер: US20120259626A1
Автор: Pei Xiang, Ren Li
Принадлежит: Qualcomm Inc

Psychoacoustic Bass Enhancement (PBE) is integrated with one or more other audio processing techniques, such as active noise cancellation (ANC), and/or receive voice enhancement (RVE), leveraging each technique to achieve improved audio output. This approach can be advantageous for improving the performance of headset speakers, which often lack adequate low-frequency response to effectively support ANC.

Подробнее
11-10-2012 дата публикации

Accelerometer vector controlled noise cancelling method

Номер: US20120259628A1
Автор: Georg Siotis
Принадлежит: SONY ERICSSON MOBILE COMMUNICATIONS AB

A telecommunication device is disclosed, comprising: a microphone array comprising a plurality of microphones, wherein each microphone receives an analogue acoustic signal; a position sensing device for determining how the telecommunication device is positioned in three-dimensions with respect to a user's mouth; at least one analogue/digital converter for converting each analogue acoustic signal into a digital signal; a digital signal processor for performing signal processing on the received digital signals comprising a controller, a plurality of delay circuits for delaying each received signal based on an input from the controller and a plurality of preamplifiers for adjusting the gain of each received signal based on a gain input from the controller, wherein the controller selects the appropriate delay and gain values applied to each received signal to remove noise from the received signals based on the determined position of the telecommunication device. A method for creating and controlling a location of a virtual microphone near a telecommunication device so as to reduce background noise in a speech signal is also disclosed.

Подробнее
11-10-2012 дата публикации

Voice control device and voice control method

Номер: US20120259640A1
Принадлежит: Fujitsu Ltd

A voice control unit controlling and outputting a first voice signal includes an analysis unit configured to calculate an average value of a gradient of spectrum at a high frequency of an inputted second voice signal as a voice characteristic, a determination unit configured to determine an amplification band and an amplification amount of a spectrum of the first voice signal based on the gradient, and an amplification unit configured to amplify the spectrum of the first voice signal to realize the determined amplification band and the determined amplification amount.

Подробнее
11-10-2012 дата публикации

Method and apparatus for encoding audio data

Номер: US20120259645A1
Принадлежит: Individual

A method for processing audio data includes determining a first common scalefactor value for representing quantized audio data in a frame. A second common scalefactor value is determined for representing the quantized audio data in the frame. A line equation common scalefactor value is determined from the first and second common scalefactor values.

Подробнее
18-10-2012 дата публикации

Apparatus and method for processing voice command

Номер: US20120265536A1
Принадлежит: Hyundai Motor Co

Disclosed is a technique for processing voice commands. In particular, the disclose technique increases a voice recognition rate without performing a process of inputting separate voice commands by updating a voice command table based on interaction with a user by storing similar commands input by the user once those commands have been confirmed by the user as similar command.

Подробнее
25-10-2012 дата публикации

Signal demultiplexing device, signal demultiplexing method and non-transitory computer readable medium storing a signal demultiplexing program

Номер: US20120269203A1
Принадлежит: NEC Corp

Provided is a signal demultiplexing system that can minimize losses in demultiplexing performance even if signals unsuited to demultiplexing are inputted. The provided signal demultiplexing device contains: an input signal analysis means for determining whether or not a plurality of input signals are suited to demultiplexing; a data memory means for storing data from frequency-domain input signals which result from transformation of the aforementioned input signals into frequency-domain signals; a selection control means for storing the frequency-domain input signals in the data memory means if the input signal analysis means has determined that the input signals are suited to the generation of a demultiplexing matrix for demultiplexing; and a demultiplexing matrix generation means for generating a demultiplexing matrix using frequency-domain input signals including the most recent and older frequency-domain input signals stored in the data memory means.

Подробнее
25-10-2012 дата публикации

Establishing a multimodal advertising personality for a sponsor of a multimodal application

Номер: US20120271642A1
Принадлежит: Nuance Communications Inc

Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.

Подробнее
01-11-2012 дата публикации

System and Method for Community Feedback and Automatic Ratings for Speech Metrics

Номер: US20120278075A1
Принадлежит: Individual

A system and method for collecting from an ASR, a first rating of an intelligibility of human speech, and collecting another intelligibility rating of such speech from networked listeners to such speech. The first rating and the second rating are weighed based on an importance to a user of the ratings, and a third rating is created from such weighted two ratings.

Подробнее
08-11-2012 дата публикации

Photo-realistic synthesis of image sequences with lip movements synchronized with speech

Номер: US20120284029A1
Автор: Frank Soong, Lijuan Wang
Принадлежит: Microsoft Corp

Audiovisual data of an individual reading a known script is obtained and stored in an audio library and an image library. The audiovisual data is processed to extract feature vectors used to train a statistical model. An input audio feature vector corresponding to desired speech with which a synthesized image sequence will be synchronized is provided. The statistical model is used to generate a trajectory of visual feature vectors that corresponds to the input audio feature vector. These visual feature vectors are used to identify a matching image sequence from the image library. The resulting sequence of images, concatenated from the image library, provides a photorealistic image sequence with lip movements synchronized with the desired speech.

Подробнее
15-11-2012 дата публикации

Method and apparatus for processing multi-channel de-correlation for cancelling multi-channel acoustic echo

Номер: US20120288100A1
Автор: Nam-gook CHO
Принадлежит: SAMSUNG ELECTRONICS CO LTD

Provided are a method and apparatus for multi-channel de-correlation processing for cancelling a multi-channel acoustic echo. The method includes: dividing an input multi-channel audio signal into units of frames to form multi-channel audio signals in units of frames; analyzing eigen values and eigen vectors related to the multi-channel audio signals by using the multi-channel audio signals in units of frames every time contents are modified; and separating the multi-channel audio signals in units of frames into a plurality of signal component spaces by using the analyzed eigen values and eigen vectors.

Подробнее
15-11-2012 дата публикации

Noise filling and audio decoding

Номер: US20120288117A1
Автор: Eun-mi Oh, Mi-young Kim
Принадлежит: SAMSUNG ELECTRONICS CO LTD

A noise filling method is provided that includes detecting a frequency band including a part encoded to 0 from a spectrum obtained by decoding a bitstream; generating a noise component for the detected frequency band; and adjusting energy of the frequency band in which the noise component is generated and filled by using energy of the noise component and energy of the frequency band including the part encoded to 0.

Подробнее
15-11-2012 дата публикации

Transform-Domain Codebook In A Celp Coder And Decoder

Номер: US20120290295A1
Автор: Vaclav Eksler
Принадлежит: VoiceAge Corp

Codebook Arrangement for use in coding an input sound signal includes First and Second Codebook Stages. First Codebook Stage includes one of a time-domain CELP codebook and a transform-domain codebook. Second Codebook Stage follows the first codebook stage and includes the other of the time-domain CELP codebook and the transform-domain codebook. Codebook Stage includes an adaptive codebook may be provided before First Codebook Stage. A selector may be provided to select an order of the time-domain CELP codebook and the transform-domain codebook in First and Second Codebook Stages, respectively, as a function of characteristics of the input sound signal. The selector may also be responsive to both the characteristics of the input sound signal and a bit rate of the codec using Codebook Arrangement to bypass Second Codebook Stage. Codebook Arrangement can be used in a coder of an input sound signal.

Подробнее
22-11-2012 дата публикации

Method and apparatus for reducing noise pumping due to noise suppression and echo control interaction

Номер: US20120294453A1

An input signal is processed through noise suppression (NS) and echo control (EC) via a multipath model that reduces noise pumping effects while maintaining EC performance. A copy of a “noisy” input signal is sent to an EC component before the noisy signal is sent to a NS component, which processes the signal first, when there is a consistent noise level for estimation. The copy of the pre-processing noisy signal is sent to the EC component along with a “clean” or “noise-suppressed” signal output from the NS component. The EC component analyzes the noisy signal as if the EC was the first component in the signal chain to determine what actions to take. The EC component then applies these actions to the clean signal received from the NS component.

Подробнее