Total found: 29772. Displayed: 200.
Publication date: 03-08-2017

ELECTRONIC DEVICE, SERVER, AND METHOD FOR CONTROLLING SUCH DEVICES

Number: RU2627117C2

The invention relates to means for controlling a display device. The technical result is a shorter time interval for recognizing a user command and performing the operation. It is determined whether a received voice command corresponds to a speech recognition command included in a stored first list of speech recognition commands. In response to determining that the received voice command corresponds to a speech recognition command in the stored first list, the device operates according to the control command information corresponding to that speech recognition command. In response to determining that the received voice command does not correspond to any speech recognition command in the stored first list, the received voice command is transmitted to a first server, and a second list of speech recognition commands is received, updated by adding the transmitted voice command and the corresponding control command ...
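
The lookup-then-fallback flow described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: `CommandCache`, `fake_server`, and the command strings are hypothetical names, and the first server is modeled as a plain function that returns an updated command list.

```python
from typing import Callable, Dict

# Hypothetical type: maps a recognized phrase to a control command.
CommandList = Dict[str, str]


class CommandCache:
    """Local list of speech recognition commands with a server fallback."""

    def __init__(self, commands: CommandList,
                 server: Callable[[str, CommandList], CommandList]):
        self.commands = dict(commands)
        self.server = server  # stand-in for the first server in the abstract

    def handle(self, phrase: str) -> str:
        # Fast path: the phrase is already in the stored list.
        if phrase in self.commands:
            return self.commands[phrase]
        # Slow path: send the phrase to the server and keep the updated list.
        self.commands = self.server(phrase, self.commands)
        return self.commands.get(phrase, "unknown")


def fake_server(phrase: str, known: CommandList) -> CommandList:
    """Assumed server behavior: add the phrase with a resolved command."""
    updated = dict(known)
    updated[phrase] = "CMD_" + phrase.upper().replace(" ", "_")
    return updated


cache = CommandCache({"volume up": "CMD_VOLUME_UP"}, fake_server)
first = cache.handle("mute")   # not in the local list, goes to the server
second = cache.handle("mute")  # answered from the updated local list
```

The second call never reaches the server, which is the source of the shorter recognition time the abstract claims.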

Publication date: 17-04-2017

AUDIO SEQUENCE RECOGNITION FOR DEVICE ACTIVATION

Number: RU2616553C2

The group of inventions relates to computer engineering and can be used to activate an electrical device from standby mode into full-power operating mode. The technical result is simpler activation of devices that are in standby mode. The method comprises the steps of: receiving an audio stream in the electrical device while in standby power mode; digitizing the audio stream into an audio sequence while in standby power mode; comparing, while in standby power mode, the audio sequence digitized in the previous step with a digitized activation phrase stored in non-volatile memory; activating the electrical device if the audio sequence matches the activation phrase within a specified tolerance; and confirming, after the electrical device is activated, that the audio sequence matches the activation phrase, using resources available to the electrical device when it is activated ...
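
A rough sketch of the two-stage check in this abstract: a cheap comparison in standby, then a stricter confirmation after wake-up. Everything here is an assumption made for illustration: the mean-absolute-difference metric, both tolerance values, the function names, and the return to standby on a failed confirmation are not taken from the patent.

```python
from typing import Sequence


def mean_abs_diff(a: Sequence[float], b: Sequence[float]) -> float:
    """Average per-sample difference between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def standby_match(seq, phrase, tolerance: float = 0.2) -> bool:
    """Cheap check run in standby: match within a loose tolerance."""
    return mean_abs_diff(seq, phrase) <= tolerance


def confirm_match(seq, phrase, tolerance: float = 0.05) -> bool:
    """Stricter check run after activation, with more resources available."""
    return mean_abs_diff(seq, phrase) <= tolerance


def handle_audio(seq, phrase) -> str:
    if not standby_match(seq, phrase):
        return "standby"
    # The device wakes up here; the match is then re-verified.
    return "active" if confirm_match(seq, phrase) else "back_to_standby"


activation_phrase = [0.0, 0.5, 1.0, 0.5]
close = handle_audio([0.0, 0.5, 1.0, 0.6], activation_phrase)
rough = handle_audio([0.1, 0.6, 0.9, 0.4], activation_phrase)
far = handle_audio([1.0, 0.0, 0.0, 1.0], activation_phrase)
```

A real device would compare acoustic features rather than raw samples; the point of the sketch is only the split between a low-power first pass and a full-power confirmation.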

Publication date: 12-10-2021

METHOD AND SYSTEM FOR PROCESSING A USER'S SPOKEN UTTERANCE

Number: RU2757264C2

The invention relates to computer engineering for natural language processing. The technical result is higher accuracy in determining the action that an electronic device must perform in accordance with a user's spoken utterance. The technical result is achieved by: receiving from the user an indication of the user's spoken utterance; generating a text representation hypothesis based on the user's spoken utterance; processing the text representation hypothesis, using a first trained scenario-based model and a second trained scenario-based model, to generate a first scenario hypothesis and a second scenario hypothesis, respectively, where the first and second trained scenario-based models are trained on at least partially different text corpora; analyzing, using a machine learning algorithm ...

Publication date: 17-05-2018

Number: RU2016144802A3

Publication date: 08-06-2018

Number: RU2016147907A3

Publication date: 18-05-2020

PRINTING SYSTEM, CONTROL METHOD, AND SERVER

Number: RU2721223C1

The invention relates to print control means. The technical result is improved functionality of print commands. If the target content data for printing cannot be specified based on the command data, a transmission unit of at least one server transmits message data requesting a content setting item. If the target content data for printing can be specified based on the command data, the transmission unit transmits data requesting a print setting item. A voice control device outputs a message by voice based on the transmitted message data. A specifying unit of the at least one server specifies the content data based on a voice command received by the voice control device after the message has been output by voice, and a printing device prints based on print data generated from the specified content data. 3 independent and 16 dependent claims, 12 drawings.

Publication date: 20-04-2015

METHOD AND DEVICE FOR PERFORMING A PRESET OPERATION MODE USING SPEECH RECOGNITION

Number: RU2013144921A

... 1. A method of performing a preset operation using speech recognition, the method comprising: performing a preset operation of a preset operation mode according to key input or touch input in the preset operation mode; and recognizing input speech while the preset operation of the preset operation mode is being performed, and assisting performance of the preset operation according to the recognized speech.
2. The method of claim 1, wherein the preset operation mode corresponds to a text entry mode, performing the preset operation comprises displaying input text according to key input or touch input in the text entry mode in a text display window, and assisting the preset operation comprises recognizing the input speech while the input text according to key input or touch input is displayed ...

Publication date: 10-10-2015

METHOD AND SYSTEM FOR A VOICE INTERFACE

Number: RU2014111971A

... 1. A method for processing user voice commands, comprising the following steps:
- obtaining a list of programs and a list of system commands and their handlers;
- obtaining a user request and the current context;
- processing the user request, wherein:
- if the request includes a system command, the handler for that command is executed immediately;
- otherwise, if the request includes a data-handling command and the context holds information about working with data, the command handler is executed on that data;
- otherwise, the program that best matches the user request, taking the context into account, is found and executed;
- updating the current context to reflect the request processed in the previous step;
- returning a response to the user based on the results of processing the request.
2. The method of claim 1, wherein the list of programs additionally contains at least the following attributes: a. name; b. synonyms; c. type.
3. The method of claim 1, wherein the user request is text obtained by recognizing ...
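
The three-way dispatch in claim 1 (system command first, then a data command applied to the context, then the best-matching program) might look like the sketch below. It is a toy under stated assumptions: the word-overlap `score` function, the `Context` fields, and the choice to store a program's output as the context data are all hypothetical and not taken from the claims.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Context:
    data: Optional[str] = None            # data the user is working with
    history: List[str] = field(default_factory=list)


def score(request: str, program: dict) -> int:
    """Toy relevance: request words that match the name or a synonym."""
    words = set(request.lower().split())
    return len(words & ({program["name"]} | set(program["synonyms"])))


def process(request, ctx, system_cmds, data_cmds, programs) -> str:
    words = request.lower().split()
    system_word = next((w for w in words if w in system_cmds), None)
    data_word = next((w for w in words if w in data_cmds), None)
    if system_word is not None:
        # 1. System commands run immediately.
        answer = system_cmds[system_word]()
    elif data_word is not None and ctx.data is not None:
        # 2. Data commands apply to the data held in the context.
        answer = data_cmds[data_word](ctx.data)
    else:
        # 3. Otherwise run the program that best matches the request.
        best = max(programs, key=lambda p: score(request, p))
        answer = best["run"](request)
        ctx.data = answer  # assumption: program output becomes context data
    ctx.history.append(request)  # update the context after each request
    return answer


system_cmds = {"stop": lambda: "stopped"}
data_cmds = {"repeat": lambda data: data}
programs = [
    {"name": "weather", "synonyms": ["forecast"], "type": "app",
     "run": lambda request: "sunny"},
    {"name": "music", "synonyms": ["song"], "type": "app",
     "run": lambda request: "playing"},
]
ctx = Context()
r1 = process("what is the forecast", ctx, system_cmds, data_cmds, programs)
r2 = process("repeat that", ctx, system_cmds, data_cmds, programs)
r3 = process("stop", ctx, system_cmds, data_cmds, programs)
```

The second request only makes sense because the first one left data in the context, which is the role the claim gives to the context update step.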

Publication date: 27-11-2015

AUDIO SEQUENCE RECOGNITION FOR DEVICE ACTIVATION

Number: RU2014119876A

... 1. A method of activating an electrical device from standby mode, comprising the steps of:
(a) receiving, in standby mode, an audio stream at the electrical device;
(b) digitizing, in standby mode, the audio stream into an audio sequence;
(c) comparing, in standby mode, the audio sequence digitized in step (b) with a digitized activation phrase stored in non-volatile memory; and
(d) activating the electrical device if the audio sequence matches the activation phrase within a specified tolerance.
2. The method of claim 1, wherein a microphone continuously monitors the environment for the audio stream of step (a).
3. The method of claim 1, further comprising a step of filtering noise from the received audio stream before comparing the digitized audio sequence with the digitized activation phrase.
4. The method of claim 1, wherein the digitized activation sequence is stored in the non-volatile memory of the electrical device before the initial ...

Publication date: 13-08-1992

Equipment for speech control of appts. - in which spoken command produces electrical measurement signal, with signal analysed and compared with sample signal for command

Number: DE0004103913A1

A microphone (2) converts the spoken command into electrical signals. A processing circuit produces measurement signals out of the electrical signals. A microprocessor (8) analyses the measurement signals and compares them with sample signals for the command. The microprocessor emits a control signal at an outlet. In a comparison circuit the amplitude of the measurement signal is compared with a reference voltage. At its outlet a logic 1 situation is present if the measurement signal exceeds the reference voltage. A logic 0 situation applies if the amplitude of the measurement signal is below the reference voltage. The microprocessor is used to compare the time pattern of the measurement signal at the outlet of the comparison circuit with a time pattern of a stored sample signal. When there is coincidence between the time patterns of both signals, the microprocessor issues a control signal at an outlet (10). ADVANTAGE - The control of appts. by spoken command.
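
The comparator and time-pattern comparison in this abstract can be mimicked in software. The sketch below is an assumption-laden illustration, not the circuit from the patent: the run-length encoding of the comparator output and the `tolerance` parameter (in samples) are hypothetical choices, since the abstract only speaks of "coincidence" between the time patterns.

```python
from typing import List, Sequence, Tuple


def comparator(samples: Sequence[float], reference: float) -> List[int]:
    """Logic 1 where the amplitude exceeds the reference voltage, else 0."""
    return [1 if abs(s) > reference else 0 for s in samples]


def run_lengths(bits: Sequence[int]) -> List[Tuple[int, int]]:
    """Collapse a bit sequence into (level, duration) runs: its time pattern."""
    runs: List[Tuple[int, int]] = []
    for b in bits:
        if runs and runs[-1][0] == b:
            runs[-1] = (b, runs[-1][1] + 1)
        else:
            runs.append((b, 1))
    return runs


def matches(pattern, stored, tolerance: int = 1) -> bool:
    """Assumed rule: same levels, durations within `tolerance` samples."""
    if len(pattern) != len(stored):
        return False
    return all(lp == ls and abs(dp - ds) <= tolerance
               for (lp, dp), (ls, ds) in zip(pattern, stored))


stored_pattern = [(0, 2), (1, 3), (0, 2), (1, 1)]  # stored sample signal
signal = [0.1, 0.0, 0.9, 0.8, 0.7, 0.6, 0.1, 0.2, 0.9]
bits = comparator(signal, reference=0.5)
control_signal = matches(run_lengths(bits), stored_pattern)
```

With `tolerance=0` the same signal would be rejected, because its second run lasts four samples instead of three.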

Publication date: 25-07-2019

Application processor with low-power voice trigger system and direct path for interruption, electronic device including the same, and method of operating the same

Number: DE102018128225A1
Author: KIM SUN-KYU, Kim, Sun-Kyu

An application processor may include a host processor, a voice trigger system, and an audio subsystem electrically connected to a system bus. The voice trigger system may be configured to perform a voice trigger operation and issue a trigger event. The audio subsystem may be configured to play back an audio output stream through an audio interface. A direct bus may be configured to provide a communication path between the voice trigger system and the audio subsystem during an interruption condition in which the voice trigger operation and the playback of the audio output stream are performed together. The application processor may be configured to generate compensated trigger data by performing echo cancellation on microphone data received from a microphone, and the voice trigger system may be configured to perform the voice trigger operation during the interruption condition ...

Publication date: 08-01-2015

Touchscreen user interface with voice input

Number: DE112012006165T5
Assignee: INTEL CORP, INTEL CORPORATION

An electronic device receives a touch selection of an element on a touchscreen. In response, the electronic device switches to a listening mode for voice commands spoken by a user of the device. The voice command specifies a function that the user wishes to perform with the selected element. Optionally, the listening mode is limited to a defined time period based on the touch selection. Such voice commands combined with touch selection make user interactions with the electronic device easier.

Publication date: 24-05-2018

Driving assistance device, driving assistance server, and driving assistance system

Number: DE112015006585T5

A driving or travel assistance device includes a speech information detection unit that captures and recognizes the user's uttered speech and outputs a recognition result; an information processing unit that generates evaluation information from the recognition result; a position information detection unit that detects the position where the evaluation information is generated; a reliability determination unit that determines the reliability of the evaluation information, using posting information posted on the Internet within a predetermined distance range based on the position at which the evaluation information is generated and within a predetermined period based on the current date and time, and determines whether the evaluation information is to be sent; a communication unit that transmits to the travel assistance server the evaluation information for which it is determined that ...

Publication date: 05-05-2017

Nondeterministic task initiation by a personal assistant module

Number: DE202016008238U1
Assignee: GOOGLE INC, GOOGLE INC.

A system comprising one or more processors and memory storing instructions that cause the one or more processors to: identify a user declaration received at a computing device; identify, based on the user declaration, a plurality of candidate responsive actions that can be initiated by the computing device to potentially satisfy the user declaration; non-deterministically select, based on the user declaration, a single candidate responsive action from the plurality of candidate responsive actions for exclusive initiation on the computing device in response to the user declaration; and exclusively initiate the single candidate responsive action on the computing device.

Publication date: 18-12-2017

Intelligent automated assistant

Number: DE202017004558U1
Assignee: APPLE INC, Apple Inc.

An electronic device for operating an automated assistant, the electronic device comprising a speaker and a microphone, the electronic device comprising means for: providing an audio output through the speaker of the electronic device; while providing the audio output through the speaker of the electronic device, receiving a natural language input through the microphone of the electronic device; deriving a representation of a user intent based on the natural language input and the audio output; identifying a task based on the derived user intent; and performing the identified task.

Publication date: 05-07-2018

Conversation-aware proactive notifications for a voice interface device

Number: DE102017129939A1

A method for proactive notifications in a voice interface device includes: receiving a first user voice request for an action with a future execution time; assigning the first user voice request to a voice assistant service for execution; subsequent to the receiving, receiving a second user voice request and, in response to the second user voice request, initiating a conversation with the user; and during the conversation: receiving a notification from the voice assistant service of the execution of the action; triggering a first audible announcement to the user to indicate a transition out of the conversation, and interrupting the conversation; triggering a second audible announcement to the user to indicate the execution of the action; and triggering a third audible announcement to the user to indicate a transition back into the conversation, and re-entering the conversation ...

Publication date: 09-05-2019

Voice control for a vehicle

Number: DE102017219616A1
Author: DUSIK JAN, Dusik, Jan

The invention relates to a human-machine interface (HMI) for a vehicle, with a microphone for capturing a command word spoken by a driver or another occupant of the vehicle, and a control unit that has a speech recognition module for recognizing several specific command words.

Publication date: 23-04-2015

Method and apparatus for processing multiple audio streams in an on-board computing system of a vehicle

Number: DE102014114604A1

A method is provided for processing a plurality of audio streams in an on-board computing system of a vehicle. The method comprises receiving the plurality of audio streams from a plurality of positions in a vehicle; prioritizing each of the plurality of audio streams to produce a prioritization result; and executing an application associated with each of the plurality of audio streams according to the prioritization result.

Publication date: 23-10-2003

Acoustic Internet access

Number: DE0060100784D1

Publication date: 23-11-2011

A system and method for interrupting an instructional prompt to signal upcoming input over a wireless communication link

Number: GB0002480576A

A voice interactive session includes detection of an input signaling an interrupt to the session. When the interrupt is detected, instructional and/or informational output is interrupted and detection of voice input begins. The voice input is not detected until the output is interrupted. Upon detection of a voice input (or other sound-based input), a determination may be made as to whether the input was valid. If the input was valid, the input is processed; otherwise, instructional and/or informational output may be relayed again and/or the voice input may be redetected.

Publication date: 15-03-2017

Speech recognition

Number: GB0002542268A

A speech recognition system (e.g. in a mobile phone) stores an input signal from a microphone (100 or 102) in a first buffer (110) and generates a noise-reduced signal (134) in a second block (146) before selecting (140) which of the signals to send to a speech recognition engine (132). Aspects of the invention include the detection (120) and validation (130) of a trigger phrase or pass phrase (e.g. "hello phone") which, when spoken by an authorised speaker (identified by speaker verification), wakes up the speech recognition engine, with a time delay being applied to either signal stored in buffers (110) and (146) in order to synchronise them. This multi-phase process allows computationally intensive speech recognition to be automatically powered down between uses.

Publication date: 12-05-2004

Route guidance system having voice guidance capability

Number: GB0000408079D0

Publication date: 13-12-2017

Nondeterministic task initiation by a personal assistant module

Number: GB0002551216A

Upon identifying a user declaration received at a computing device, a plurality of candidate responsive actions that can be initiated by the computing device in response to the user declaration may be identified. A single candidate responsive action may then be non-deterministically (e.g., randomly, stochastically) selected 564 to be exclusively initiated on the computing device in response to the user declaration. The selection of the candidate responsive action is preferably based on a probability of the action being appropriate based on a history of interaction 566 between the user and a computer device. The user declaration may be a user statement or enquiry input by voice 552 to a digital personal assistant and the action may be for example initiating a phone call.
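
One possible reading of "non-deterministically selected based on a probability" is a weighted random draw over the candidate actions. The sketch below assumes that the interaction history is a simple count of how often each action was previously accepted, and that add-one smoothing keeps unused candidates selectable; both are illustrative assumptions, as are the action names.

```python
import random
from typing import Dict, Optional


def select_action(history: Dict[str, int],
                  rng: Optional[random.Random] = None) -> str:
    """Pick exactly one candidate action, weighted by past acceptance."""
    rng = rng or random.Random()
    actions = list(history)
    # Add-one smoothing so never-used candidates can still be chosen.
    weights = [history[a] + 1 for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


# Hypothetical history for the declaration "call mum".
history = {"phone_call": 9, "video_call": 0}
rng = random.Random(0)
draws = [select_action(history, rng) for _ in range(2000)]
```

Over many draws the historically preferred action dominates, but the alternative is still initiated occasionally.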

Publication date: 05-07-2017

Systems and methods for autonomously soothing babies

Number: GB0201708214D0

Publication date: 22-03-2017

A spoken dialogue system, a spoken dialogue method and a method of adapting a spoken dialogue system

Number: GB0201701918D0

Publication date: 23-10-2013

Vehicle interface system

Number: GB0201316074D0

Publication date: 10-04-1996

Pattern matching method and apparatus

Number: GB0009602699D0

Publication date: 26-08-2020

Smart light bulb with integral virtual assistant device

Number: GB0002581500A

A mains-powered 36 smart light bulb 10 emitting light from elements 17 & 23 contains integrated virtual assistant functionality including microphones 25, loudspeaker 31, keyword speech recognition processor 44 (eg. “Assistant”) and antenna 22 which transmits subsequent audio commands (eg. “lights on/ play Jingle Bells”) to WiFi networks (27, fig. 12) and onwards to a cloud-based (ie. internet) software agent (64) which interprets the audio voice commands and replies with digital control/audio data. The Printed Circuit Board (PCB 21) and light emitting elements are connected to a heat conductor 18 and heat sink 19, and a conical acoustic reflector 33 may direct the speaker output away from the mics. The processor 44 may also process video from a camera.

Publication date: 30-08-2017

Facilitation of offline semantic processing in a resource-constrained device

Number: GB0002547744A

A voice-enabled, resource-constrained device 52 (eg. mobile phone) uses a stored offline grammar model 76 to semantically process long tail (ie. relatively unique) voice input queries or commands (eg. Im really in the mood for some [artist name]) which are also converted offline 74 in order to identify candidate response actions 72 which the device can perform, and updates an offline action model 78 to map queries to response actions. Online actions 80 can then be processed and performed when connectivity is restored. A phrase may be parsed using location information (eg. the store being near home or on the usual route from work), and user actions may inform statistics on future actions (eg. a performed action becomes a qualifying action if executed promptly in direct response to a query).

Publication date: 19-09-2018

Query endpointing based on lip detection

Number: GB0002560598A

Systems and methods are described for improving endpoint detection of a voice query submitted by a user. A synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output. This enables non-speech noise (e.g. background) to be filtered out after the user has stopped speaking.

Publication date: 21-02-2018

Sensor input recognition

Number: GB0002553040A

In, for example, a speech recognition system (e.g. in a mobile phone), a sensor 208 detects an input (e.g. sound) signal and determines whether it contains a selectable pattern (e.g. a trigger phrase such as "hello phone") via analysers 206 coupled to an analysis signal memory and a controllable pattern detector 220 acting on e.g. detected keywords (from 120, 5a). Aspects of the invention include the subsequent validation 120 of the trigger word or pass phrase which, when spoken by an authorised speaker (identified by speaker verification), wakes up 212 the speech recognition engine (SRE 132). Noise reduction may be tuned for speech recognition 134 or communication 136 depending on whether it is applied before or after the validation.

Publication date: 23-09-2020

Providing suggested voice-based action queries

Number: GB0002553936B
Assignee: GOOGLE LLC, Google LLC

Publication date: 05-06-2019

Natural Language processing for session establishment with service providers

Number: GB0002568983A

Routing packetized actions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate an action data structure. The system can transmit the action data structure to a third party provider device. The system can receive an indication from the third party provider device that a communication session was established with the device.

Publication date: 27-05-2020

Virtual assistant identification of nearby computing devices

Number: GB0002556993B
Assignee: GOOGLE LLC, Google LLC

Publication date: 18-03-2020

Method for providing VUI particular response and application thereof to intelligent sound box

Number: GB0002577157A
Author: XUDONG LIU, Xudong Liu

A voice user interface receives 10 a voice instruction Cv and uses 30 physiological information to determine whether the voice instruction is abnormal. If so, then a search instruction Cs is generated and used (40,50,500), along with the voice instruction, to obtain feedback information (F1,F2) for output 60. F1 and F2 correspond to the voice and search instructions, respectively. A voice characteristic profile for a user may be created using stored 20 voice instructions, each of which may be labelled as abnormal if appropriate. Abnormality of a voice instruction may be determined by comparing its reference waveform with that of an archive. The feedback may be generated by a cloud server and may be output in voice (61) or displayed (63) form. In an embodiment a digital assistant or smart speaker responds to spoken queries and also responds to detected physiological abnormalities such as swollen vocal cords or other illnesses.

Publication date: 20-01-2021

Disabling a digital assistant during a conference call based on security level

Number: GB0002585771A

A digital assistant and method are provided that allow for the digital assistant to be disabled during a conference call. An initial security level for a conference call is determined. A second conference device that is associated with the digital assistant is alerted of the initial security level. The second conference device sends the initial security level to the digital assistant. The functionality of the digital assistant is adjusted based at least in part upon the initial security level.

Publication date: 26-08-2020

Creating modular conversations using implicit routing

Number: GB0002581660A

A computer-implemented method of routing a verbal input to one of a plurality of handlers uses one or more processors adapted to execute code. The code is adapted for receiving a verbal input from a user and applying a plurality of verbal content identifiers to the verbal input. Each verbal content identifier is adapted to evaluate an association of the verbal input with a respective one of the handlers by computing a match confidence value for one or more features, such as an intent expressed by the user and/or an entity indicated by the user, extracted from the verbal input. The verbal input is routed to a selected one of the handlers based on the match confidence values computed by the verbal content identifiers. The selected handler is adapted to initiate one or more actions in response to the verbal input.

Publication date: 19-01-2022

End of speech detection using one or more neural networks

Number: GB0002597126A

An Automatic Speech Recognition/voice transcription system indicates an End of Speech segment based on one or more characters predicted to be within the segment, especially a particular percentage of blank (non-speech) characters within a sliding window 352, fig. 3B (e.g. 95% within 500 ms). Audio data is input to a Connectionist Temporal Classification (CTC) neural network model 304 to generate character probabilities based on extracted features (e.g. mel-spectrogram 204, fig. 2). The Start (11) and End (12) Of Speech segments are then detected 310 via a greedy (e.g. ArgMax) decoder 308.
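
The blank-percentage rule from this abstract can be sketched over a stream of greedy-decoded characters. The sketch assumes one character per 20 ms frame, so that a 25-frame window spans 500 ms, and uses `_` as the blank symbol; the frame duration, the symbol, and the requirement that some speech precede the end point are assumptions, not details from the patent.

```python
from collections import deque
from typing import Iterable, Optional

BLANK = "_"  # assumed symbol for the CTC blank label


def end_of_speech_index(chars: Iterable[str], window: int = 25,
                        blank_ratio: float = 0.95) -> Optional[int]:
    """Index of the first frame where the last `window` characters are at
    least `blank_ratio` blank after speech has started, else None."""
    recent = deque(maxlen=window)
    blanks = 0
    speech_started = False
    for i, ch in enumerate(chars):
        if len(recent) == window and recent[0] == BLANK:
            blanks -= 1  # the oldest frame is about to leave the window
        recent.append(ch)
        if ch == BLANK:
            blanks += 1
        else:
            speech_started = True
        if (speech_started and len(recent) == window
                and blanks / window >= blank_ratio):
            return i
    return None


# Greedy decoder output: some speech followed by 30 blank frames.
frames = list("__hi__there") + [BLANK] * 30
eos = end_of_speech_index(frames)
```

Keeping a running blank count makes each frame an O(1) update, which matters when the check runs on every decoder step of a live stream.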

Publication date: 29-05-2019

Processing and visualising audio signals

Number: GB0201905445D0

Publication date: 25-04-2018

NO DETAILS

Number: GB0201803881D0

Publication date: 30-06-2021

End of speech detection using one or more neural networks

Number: GB202107009D0

Publication date: 28-11-2018

Content playback system

Number: GB0201816363D0

Publication date: 02-10-2019

Selective sensor polling

Number: GB0002572316A

A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for the first field. The system can determine to obtain data in memory previously collected by the second sensor. The system can generate and transmit the action data structure with the data from the sensor and memory, and transmit the action data structure to a third party device.

Publication date: 29-06-2022

Emotion detection using speaker baseline

Number: GB0002602398A

Described herein is a system for emotion detection in audio data using a speaker's baseline. The baseline may represent a user's speaking style in a neutral emotional state. The system is configured to compare the user's baseline with input audio representing speech from the user to determine an emotion of the user. The system may store multiple baselines for the user, each associated with a different context (e.g., environment, activity, etc.), and select one of the baselines to compare with the input audio based on the contextual situation.

Publication date: 21-12-2022

Conversational AI platforms with closed domain and open domain dialog integration

Number: GB0002607985A

Open and closed domain dialogue systems are combined into an intelligent dialogue management system to provide more relevant responses to user queries. A user query 114 sent to a system such as a chatbot or virtual assistant, is analysed, e.g. by a natural language understanding model 120, to associate the query with a domain tag, and optionally intent classification, and/or input slots. The domain tag determines whether the query is transmitted to either a generalist open domain 150 dialogue system or a specialist closed domain 160 dialogue system. If the domain tag relates to a closed, domain-specific dialogue system, one or more request policies for the system are used to generate a request for the closed domain system. Responsive data from the dialogue system that received the query is used to generate an output 106, 108 for the user. If responsive data is received from the closed domain system, this may also be sent to the open domain system so it can track a dialogue state of the ...

Publication date: 22-06-2022

Account association with device

Number: GB0002602211A

Systems and methods for account data association with voice interface devices are disclosed. For example, when a host user/primary user and guest user have consented for account data to be associated with the primary user's devices, a request to associate the account data may be received. Voice and device-based authentication may be performed to confirm the identity of the guest user and the guest user's account data may be associated with the primary user's devices. During a guest session, voice recognition may be utilized to determine if a given user utterance is from the guest user or the primary user, and actions may be performed by the voice interface device accordingly.

Publication date: 13-09-2023

Cloud service platform system for speakers

Number: GB0002616512A

A cloud service platform system for smart speakers comprising a speech input module, network connection and player receives positioning data from working speakers, marks two speakers within a threshold distance of each other as “suspected same group” and sends a “suspected same group acknowledgment message” to the corresponding speakers. If the feedback result is “yes”, data for the same group of speakers is unified and transmitted to any speaker within the group, and any speaker transmits data within the group. If a “NO” result is fed back, play data is transmitted successively according to a weight priority and volume is controlled. This allows a single application to manage conflicts between nearby speakers.
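
The "suspected same group" marking can be illustrated with a short sketch. It assumes, purely for illustration, that positioning data arrives as 2-D coordinates in metres; the function name `suspected_same_group` and the sample speaker IDs are hypothetical.

```python
import math
from itertools import combinations


def suspected_same_group(positions, threshold_m):
    """Return pairs of speaker IDs located within threshold_m of each other.

    positions maps a speaker ID to an (x, y) coordinate in metres
    (an assumed format; the abstract does not specify one).
    """
    pairs = []
    for (id_a, pos_a), (id_b, pos_b) in combinations(sorted(positions.items()), 2):
        if math.dist(pos_a, pos_b) <= threshold_m:
            pairs.append((id_a, id_b))
    return pairs


positions = {
    "kitchen": (0.0, 0.0),
    "hall": (3.0, 4.0),      # 5 m from the kitchen speaker
    "garage": (30.0, 40.0),  # far from both
}
pairs = suspected_same_group(positions, threshold_m=6.0)
```

Each returned pair would then receive the acknowledgment message described above; the confirmation exchange itself is outside this sketch.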

Publication date: 15-05-1979

RETAINING STRIP

Number: ATA899676A

Publication date: 15-08-1977

PROCESS FOR THE PRODUCTION OF NEW P-METHYLTHIO-ACYLOPHENONE-O-(2-AMINOETHYL)-OXIME ETHERS AND THEIR ACID ADDITION SALTS

Number: AT0000027377A

Publication date: 15-07-1990

MOUNTING DEVICE FOR ADJUSTABLE FRONT PLATES

Number: AT0000039985A

Publication date: 15-07-2010

DEVICE FOR SEPARATING AND CONCENTRATING SULFURIC ACID AND SULFUR DIOXIDE

Number: AT0000507668B1

A process for separating sulfuric acid and sulfur dioxide from gas streams is described, in which the gas stream is cooled with a cooling medium in a first heat exchanger; the pre-cooled gas stream is cooled further in a second heat exchanger, through which the gas stream flows from bottom to top in countercurrent to a cooling gas stream; and the cooled gas stream then flows from bottom to top through a wet electrostatic precipitator to remove the remaining sulfuric acid droplets, with a scrubber for separating residual amounts of sulfur dioxide arranged downstream of the wet electrostatic precipitator ...

Publication date: 15-04-2020

Welding apparatus with assistance system

Number: AT0000514655B1

Welding device (1) having a welding unit (2) for welding a workpiece (W), which is operated by a welder (U), and having a welding control unit (3) that sets welding control parameters (SSP) of the welding unit (2) as a function of control commands generated by a welding assistance system (4) by evaluating a speech dialog conducted by the welding assistance system (4) with the welder (U); wherein the welding assistance system (4) outputs, via at least one loudspeaker (SC), speech fragments generated in accordance with the controlled speech dialog to the welder (U); and wherein the generated speech fragments of the controlled speech dialog are generated as a function of keywords that the welding assistance system (4) recognizes when evaluating the speech dialog conducted with the welder (U).

Publication date: 10-07-1940

Claimant by electric motors manual work equipment.

Number: AT0000159120B

Publication date: 04-07-2019

Intelligent automated assistant

Number: AU2017330209C1
Assignee: FPA Patent Attorneys Pty Ltd

Systems and processes for operating an automated assistant are disclosed. In one example process, an electronic device provides an audio output via a speaker of the electronic device. While providing the audio output, the electronic device receives, via a microphone of the electronic device, a natural language speech input. The electronic device derives a representation of user intent based on the natural language speech input and the audio output, identifies a task based on the derived user intent, and performs the identified task.

Publication date: 30-05-2019

Sequence dependent operation processing of packet based data message transmissions

Number: AU2017386093A1

Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine a thread based on the trigger keyword and the request that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.

Publication date: 29-07-2021

PROTOTYPING VOICE USER INTERFACES FOR LANGUAGES WITH NON-PHONETIC ALPHABETS

Number: AU2019268092B2

Voice command matching during testing of voice-assisted application prototypes for languages with non-phonetic alphabets is described. A visual page of an application prototype is displayed during a testing phase of the application prototype. A speech-to-text service converts a non-phonetic voice command spoken in a language with a non-phonetic alphabet, captured by at least one microphone during the testing phase of the application prototype, into a non-phonetic text string in the non-phonetic alphabet of the voice command. A phonetic language translator translates the non-phonetic text string of the voice command into a phonetic text string in a phonetic alphabet of the voice command. A comparison module compares the phonetic text string of the voice command to phonetic text strings in the phonetic alphabet of stored voice commands associated with the application prototype to identify a matching voice command. A performance module performs an action associated with the matching voice ...

Publication date: 15-08-2019

Telephone system, telephone, call forwarding method, and program

Number: AU2018297798A1

The present invention enables a user of a telephone as a forwarding destination to recognize a person from whom a call has been received when the call is forwarded from one telephone to another telephone. A first telephone (11) makes a call with another party. A second telephone (12) is a telephone to which the call is forwarded from the first telephone (11). A voice data acquisition means (13) acquires voice data of the other party in the call made by the first telephone (11). A voice recognition means (14) performs voice recognition of the voice data acquired by the voice data acquisition means (13), and generates text data. A forwarding destination text display means (15) displays the text data generated by the voice recognition means (14) to a user of the second telephone (12).

Publication date: 15-10-2020

Dynamic thresholds for always listening speech trigger

Number: AU2018241137B2
Assignee: FPA Patent Attorneys Pty Ltd

Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events.
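
A minimal sketch of the dynamic threshold idea follows. The class name `SpeechTrigger`, the clamping bounds, and the size of each adjustment are assumptions made for illustration; the abstract does not give concrete values.

```python
class SpeechTrigger:
    """Triggers when a confidence score exceeds an adjustable threshold."""

    def __init__(self, threshold=0.5, lowest=0.2, highest=0.9):
        self.threshold = threshold
        self.lowest = lowest      # assumed floor, limits false positives
        self.highest = highest    # assumed ceiling, limits missed triggers

    def adjust(self, delta):
        """Shift the threshold in response to a perceived event.

        A negative delta makes triggering easier (e.g. the user is likely
        to speak); a positive delta makes it harder.
        """
        self.threshold = min(self.highest, max(self.lowest, self.threshold + delta))

    def should_trigger(self, confidence):
        return confidence > self.threshold


trigger = SpeechTrigger(threshold=0.5)
before = trigger.should_trigger(0.45)  # below the initial threshold
trigger.adjust(-0.1)                   # hypothetical event: trigger likely
after = trigger.should_trigger(0.45)   # same score now exceeds the threshold
```

Clamping the threshold is one way to keep repeated adjustments from disabling the trigger or making it fire constantly; whether the disclosed system bounds the value is not stated.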

Publication date: 16-01-2020

Methods and systems of handling a dialog with a robot

Number: AU2018202162B2
Assignee: Spruson & Ferguson

There is disclosed a computer-implemented method of handling an audio dialog between a robot and a human user, the method comprising: during said audio dialog, receiving audio data and converting said audio data into text data; in response to the verification of one or more dialog mode execution rules of said text data, selecting a modified dialog mode; wherein a dialog mode comprises one or more dialog contents and one or more dialog voice skins; wherein a dialog content comprises a collection of predefined sentences, said collection comprising question sentences and answer sentences; and wherein a dialog voice skin comprises voice rendering parameters comprising frequency, tone, velocity and pitch. Described developments comprise modifying dialog contents and/or dialog voice skin, using dialog execution rules (for example depending on the environment perceived by the robot) and moderating dialog contents.

Publication date: 24-03-2003

Device for providing voice driven control of a media presentation

Number: AU2002324970A1

Publication date: 03-11-2016

Methods and systems of handling a dialog with a robot

Number: AU2015248796A1
Assignee: Spruson & Ferguson

There is disclosed a computer-implemented method of handling an audio dialog between a robot and a human user, the method comprising: during said audio dialog, receiving audio data and converting said audio data into text data; in response to the verification of one or more dialog mode execution rules of said text data, selecting a modified dialog mode; wherein a dialog mode comprises one or more dialog contents and one or more dialog voice skins; wherein a dialog content comprises a collection of predefined sentences, said collection comprising question sentences and answer sentences; and wherein a dialog voice skin comprises voice rendering parameters comprising frequency, tone, velocity and pitch. Described developments comprise modifying dialog contents and/or dialog voice skin, using dialog execution rules (for example depending on the environment perceived by the robot) and moderating dialog contents.

Publication date: 06-07-2017

Techniques for graph based natural language processing

Number: AU2014415625A1
Assignee: Cotters Patent & Trade Mark Attorneys

Techniques for graph based natural language processing are described. In one embodiment an apparatus may comprise a client service component operative on the processor circuit to receive a natural language user request from a device and to execute the natural language user request based on matched one or more objects and a social object relation component operative on the processor circuit to match the natural language user request to the one or more objects in an object graph, the object graph comprising token mappings for objects within the object graph, the token mappings based on data extracted from a plurality of interactions by a plurality of users of the network system, wherein the one or more objects are matched with the natural language user request based on the token mappings. Other embodiments are described and claimed.

Publication date: 05-02-2015

Image processing apparatus and control method thereof and image processing system

Number: AU2013200307B2

An image processing apparatus including: an image processor which processes a broadcasting signal to display an image based on the processed broadcasting signal; a communication unit which is connected to a server; a voice input unit which receives a user's speech; a voice processor which processes a performance of a preset corresponding operation according to a voice command corresponding to the speech; and a controller which processes the voice command corresponding to the speech through one of the voice processor and the server if the speech is input through the voice input unit. If the voice command includes a keyword relating to a call sign of a broadcasting channel, the controller controls one of the voice processor and the server to select a recommended call sign corresponding to the keyword according to a predetermined selection condition, and performs a corresponding operation under the voice command with respect to the broadcasting channel of the recommended call sign.

Publication date: 18-12-2014

Voice instructions during navigation

Number: AU2013271981A1

A method of providing navigation on an electronic device when the display screen is locked. The method receives a verbal request to start navigation while the display is locked. The method identifies a route from a current location to a destination based on the received verbal request. While the display screen is locked, the method provides navigational directions on the electronic device from the current location of the electronic device to the destination. Some embodiments provide a method for processing a verbal search request. The method receives a navigation-related verbal search request and prepares a sequential list of the search results based on the received request. The method then provides audible information to present a search result from the sequential list. The method presents the search results in a batch form until the user selects a search result, the user terminates the search, or the search items are exhausted.

Publication date: 17-11-2016

Plant control system using voice as a control mechanism

Number: AU2015275170A1
Assignee: Davies Collison Cave Pty Ltd

A system for controlling equipment involving a tangible material at a plant (100). A control computer (140) is coupled to a terminal computer (130). The control computer includes a storage device that stores voice data (340) for each of several authorized operators at the plant and a voice recognition and authenticated voice-activated control (VR/VAC) program (310). The control computer is programmed to implement the VR/VAC program. The control computer, responsive to receiving a voice-derived input (325), analyzes (408) the voice-derived input to determine if the voice-derived input matches the voice data for any of the authorized operators. Provided the voice input matches the voice data, the control computer determines (410) at least one command (350) from the voice-derived input. The control computer executes (424) the command to control the equipment.

Publication date: 08-12-2016

Electrically operated domestic appliance having a voice recognition device

Number: AU2015263408A1

The invention relates to an electrically operated domestic appliance (1), in particular a kitchen appliance, comprising a voice recognition device, which voice recognition device is designed to compare voice signals of a user (13) with known control commands for operating the domestic appliance (1), and comprising an activation device for activating the voice recognition device. In order to achieve convenient activation and use of the voice recognition device and in particular to overcome the disadvantages of the prior art, the activation device according to the invention has an optical detection device, wherein the activation device is designed to activate the voice recognition device in dependence on information captured by means of the optical detection device. The invention further relates to a method for operating a domestic appliance according to the invention.

Publication date: 16-08-2018

Intelligent automated assistant in a home environment

Number: AU2016409887A1
Assignee: FPA Patent Attorneys Pty Ltd

Systems and processes for operating an intelligent automated assistant are provided. In one example process, discourse input representing a user request can be received. The process can determine one or more possible device characteristics corresponding to the discourse input. Data structure representing a set of devices of an established location can be retrieved. The process can determine, based on the data structure, one or more candidate devices from the set of devices. The one or more candidate devices can correspond to the discourse input. The process can determine, based on the one or more possible device characteristics and one or more actual device characteristics of the one or more candidate devices, a user intent corresponding to the discourse input. Instructions that cause a device of the one or more candidate devices to perform an action corresponding to the user intent can be provided.

Publication date: 08-03-2018

Zero latency digital assistant

Number: AU2016320585A1
Assignee: FPA Patent Attorneys Pty Ltd

An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
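
The buffering scheme can be sketched with a fixed-size ring buffer. This is only an illustration: the two processors are modelled as ordinary function calls, and the press-duration criterion, `AudioRingBuffer`, and `on_user_input` are hypothetical choices not taken from the abstract.

```python
from collections import deque


class AudioRingBuffer:
    """Keeps only the most recent max_frames audio frames."""

    def __init__(self, max_frames):
        self._frames = deque(maxlen=max_frames)

    def write(self, frame):
        # Role of the "first processor": continuously record captured audio.
        self._frames.append(frame)

    def snapshot(self):
        return list(self._frames)


def on_user_input(buffer, press_duration_s, min_press_s=0.3):
    """Role of the "second processor": act only if the input qualifies.

    The criterion (a sufficiently long button press) is an assumption.
    Returns the buffered frames to process, or None if the input is ignored.
    """
    if press_duration_s >= min_press_s:
        return buffer.snapshot()
    return None


buffer = AudioRingBuffer(max_frames=3)
for frame in range(5):          # frames 0..4; the two oldest are dropped
    buffer.write(frame)
accepted = on_user_input(buffer, press_duration_s=0.5)
ignored = on_user_input(buffer, press_duration_s=0.1)
```

Because audio is already in the buffer when the input arrives, speech uttered just before or during the press is available immediately, which is the sense in which latency is avoided.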

Publication date: 02-01-2020

METHOD FOR REAL-TIME AUTHORIZATION WITHIN A PUSH TO TALK FOR THE INTERNET OF THINGS SYSTEM

Number: AU2019203987A1
Assignee: Phillips Ormonde Fitzpatrick

A method and apparatus for PTT over IoT is described herein. During operation each IoT device will be assigned to a talkgroup. Some talkgroups may have a single IoT device assigned, and other talkgroups may have multiple IoT devices assigned. During operation, an action command is received over a first talkgroup and a first command type is issued to a first IoT device assigned to that talkgroup. A second action command is received over a second talkgroup and a second command type is issued to a second IoT device assigned to the second talkgroup. (Flowchart: receive a talkgroup identification from the over-the-air transmission of each radio (701, 703); map each talkgroup identification to an Internet-of-Things (IoT) device; determine a control command based on each IoT device (709) ...)
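
The talkgroup-to-device mapping lends itself to a small lookup sketch. All identifiers and command types below (`TALKGROUP_TO_DEVICES`, `DEVICE_COMMAND_TYPE`, `commands_for`, the sample talkgroup and device names) are hypothetical and only illustrate the idea that the same action command yields different command types depending on the talkgroup it arrives on.

```python
# Hypothetical assignment of IoT devices to talkgroups.
TALKGROUP_TO_DEVICES = {
    "tg-lights": ["lamp-1", "lamp-2"],   # talkgroup with several devices
    "tg-doors": ["door-lock-1"],         # talkgroup with a single device
}

# Hypothetical command type understood by each device.
DEVICE_COMMAND_TYPE = {
    "lamp-1": "lighting",
    "lamp-2": "lighting",
    "door-lock-1": "access",
}


def commands_for(talkgroup_id, action):
    """Map an action received on a talkgroup to per-device commands."""
    devices = TALKGROUP_TO_DEVICES.get(talkgroup_id, [])
    return [(device, f"{DEVICE_COMMAND_TYPE[device]}:{action}") for device in devices]


door_commands = commands_for("tg-doors", "activate")
light_commands = commands_for("tg-lights", "activate")
```

An unknown talkgroup yields an empty list here; how the disclosed system treats unmapped talkgroups is not stated.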

Publication date: 26-08-2021

Audio response playback

Number: AU2021212112A1

A computing device, comprising: at least one processor; and a non-transitory computer-readable medium comprising program instructions that, when executed by the at least one processor, cause the first playback device to perform functions comprising: receiving via a network microphone device of a media playback system, a voice command detected by at least one microphone of the network microphone device, wherein the media playback system comprises a plurality of zones, and wherein the network microphone device is a member of a default playback zone; dynamically selecting an audio response zone from the plurality of zones to play an audio response to the voice input and foregoing selection of the default playback zone, wherein the selected zone comprises a playback device, and wherein the dynamically selecting comprises determining that the network microphone device is paired with the playback device; and causing the playback device of the selected zone to ...

Publication date: 06-05-2021

SYSTEMS AND METHODS FOR AN INTERACTIVE VIRTUAL PERSONAL ASSISTANT

Number: AU2021101147A4

The present disclosure relates to autonomous digital assistants, and in particular to an interactive smart virtual personal assistant that assists in conducting tasks autonomously. Aspects of the present invention disclose a system for a smart interactive virtual personal assistant comprising: an echobot consisting of a microphone and a sound synthesizer; a microprocessor coupled to said echobot to identify the type and nature of commands and execute said commands; a user interaction and output display window displayed on the desktop; an active internet connection to execute web-based commands and tasks; and a speaker module to provide output after execution of the obtained command. (Figure 2: Data flow diagram of a smart interactive personal assistant.) ...

Publication date: 13-05-2021

Device control using gaze information

Number: AU2021202352A1

The present disclosure generally relates to controlling electronic devices. In some examples, the electronic device uses gaze information to activate a digital assistant. In some examples, the electronic device uses gaze information to identify an external device on which to act. In some examples, the electronic device provides an indication that distinguishes between different speakers.

Publication date: 13-02-2001

System and method to facilitate speech enabled user interfaces

Number: AU0005222199A

Publication date: 16-01-1973

FLEXIBLE FASTENING CLAMP

Number: CA919150A

Publication date: 17-04-2021

MAINTAINING DATA CONFIDENTIALITY IN COMMUNICATIONS INVOLVING VOICE-ENABLED DEVICES IN A DISTRIBUTED COMPUTING ENVIRONMENT

Number: CA3059029A1

The disclosed exemplary embodiments include computer-implemented systems, devices, apparatuses, and processes that maintain data confidentiality in communications involving voice-enabled devices operating within a distributed computing environment. By way of example, an apparatus may receive, from a communications system across a public communications network, a request for an element of data generated by the computing system based on first audio content obtained at a device. The apparatus may obtain the requested data element and further, may generate acoustic data representative of at least a portion of the requested data element. The apparatus may also generate an encrypted response to the received request that includes the acoustic data, and transmit the encrypted response to the device across the public communications network. The device may execute an application program that causes the device to decrypt the encrypted response and to perform operations that present the acoustic data ...

Publication date: 14-03-2019

VOICE-ACTIVATED ENERGY MANAGEMENT SYSTEM

Number: CA0003073757A1
Assignee: FINLAYSON & SINGLEHURST

A method for responding to a voice activated request includes receiving a speech input request from a smart speaker requesting energy management data associated with energy consumption at a premises of the smart speaker. The method also includes generating a voice service request including a first query for a first data source. The first query includes a request for the energy management data. Additionally, the method includes communicating the first query to the first data source and receiving a first response to the first query from the first data source. Further, the method includes generating an audible speech output in response to the speech input request based on the first response to the first query and transmitting the audible speech output to the smart speaker. The smart speaker audibly transmits the audible speech output.

Publication date: 31-05-2018

DETECTION OF AUTHORIZED USER PRESENCE AND HANDLING OF UNAUTHENTICATED MONITORING SYSTEM COMMANDS

Number: CA0003044602A1
Assignee: SMART & BIGGAR

Techniques are described for detecting and handling unauthenticated commands in a property monitoring system. In some implementations, a monitoring system may include sensors located throughout a property, a monitoring control unit, and an input device. The monitoring control unit may be configured to receive data collected by the sensors, as well as an input command detected by the input device. For an input command that does not include authentication information, the monitoring control unit may generate property state information based on the sensor data, then analyze the property state data and the input command against one or more rules that relate to authorization of unauthenticated commands. Based on the analysis, the monitoring control unit may determine whether to perform the action corresponding to the input command or whether to perform another action, for example, generating and providing a notification or authorization request to a user.

Publication date: 26-07-2018

TAKING ACTION BASED ON PHYSICAL GRAPH

Number: CA0003046332A1
Author: MITAL, VIJAY
Assignee: SMART & BIGGAR

Taking action based on a physical graph. The taking of actions occurs with the use of an agent that interprets command(s) (such as natural language commands) from a user. The agent responds to the command(s) by formulating at least one query against a physical graph that represents state of one or more physical entities within a physical space and observed by a plurality of sensors. The agent then uses the query or queries against the physical graph. In response to the responses thereto, the agent identifies actions to take. Such actions could include actions such as presenting information to the user, and sending communications out to others. However, the actions could even include physical actions. For instance, the agent might include a physical action engine that performs physical actions (such as via a robot or drone).

Publication date: 23-11-2017

ROBOT

Number: CA0002995671A1

In the case that the result of image recognition by an external server of an image of a recognition target is needed, this robot rotates a set of drive wheels in opposite directions to rotate a sphere-shape housing and, if a result of image recognition of the image of the recognition target is received from the external server, then the robot turns a display unit towards the user and stops rotation of the sphere-shape housing.

Publication date: 17-10-2020

PROCESSING AND VISUALISING AUDIO SIGNALS

Number: CA0003092604A1
Assignee: BERESKIN & PARR LLP/S.E.N.C.R.L.,S.R.L.

A method of processing a voice audio signal is provided. The method comprises dividing a voice audio signal into a plurality of segments based on identified spoken phrases within the signal. If it is determined that a selected segment of the plurality of segments has a duration longer than a threshold duration, the method comprises identifying a most likely location of a breath in the audio associated with the selected segment. The selected segment is then divided into sub-segments based on the identified most likely location of a breath.
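
The splitting rule can be sketched as follows. The abstract does not say how the most likely breath location is identified; this sketch assumes, as a stand-in, that it is the quietest interior frame of the segment. Segments are `(start, end)` frame indices with an exclusive end, and `split_long_segments` is a hypothetical name.

```python
def split_long_segments(segments, energy, max_frames):
    """Split each segment longer than max_frames at its quietest interior frame.

    energy[i] is the signal energy of frame i. Using the minimum-energy
    frame as the breath location is an assumption made for this sketch.
    """
    result = []
    for start, end in segments:
        interior = range(start + 1, end - 1)  # candidate cut points
        if end - start <= max_frames or not interior:
            result.append((start, end))
            continue
        cut = min(interior, key=lambda i: energy[i])
        result.extend([(start, cut), (cut, end)])
    return result


energy = [5, 6, 7, 8, 6, 1, 7, 6]
segments = [(0, 2), (2, 8)]  # the second segment exceeds the threshold
split = split_long_segments(segments, energy, max_frames=4)
```

Each over-long segment is divided once, matching the single division described above; applying the function repeatedly would be a different design.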

Publication date: 31-05-2019

FAUCET INCLUDING A WIRELESS CONTROL MODULE

Number: CA0003093319A1
Assignee: PIASETZKI NENNIGER KVAS LLP

An electronic faucet including a wireless module facilitating remote control of an electrically operable valve. Illustratively, the wireless module includes a body defining a fluid passageway in fluid communication with the electrically operable valve, and a receiver configured to receive wireless signals from a remote transmitter. The remote transmitter may comprise a voice recognition and conversion device to facilitate voice control of the electrically operable valve.

Publication date: 27-08-2020

VOICE COMMAND DETECTION AND PREDICTION

Number: CA0003073700A1
Assignee: GOWLING WLG (CANADA) LLP

Methods, systems, and apparatuses for predicting an end of a command in a voice recognition input are described herein. The system may receive data comprising a voice input. The system may detect, in the voice input, data that is associated with a first portion of a command. The system may predict, based on the first portion and while the voice input is being received, a second portion of the command. The prediction may be generated by a machine learning algorithm that is trained based at least in part on historical data comprising user input data. The system may cause execution of the command, based on the first portion and the predicted second portion, prior to an end of the voice input.
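
The prediction step can be illustrated with a deliberately simple stand-in. Instead of the trained machine learning algorithm mentioned in the abstract, this sketch completes the command by frequency counting over past commands; `predict_command` and the sample history are hypothetical.

```python
from collections import Counter


def predict_command(first_portion, history):
    """Return the most frequent past command starting with first_portion.

    A frequency count replaces the trained model purely for illustration.
    Returns None when no past command matches.
    """
    counts = Counter(cmd for cmd in history if cmd.startswith(first_portion))
    if not counts:
        return None
    return counts.most_common(1)[0][0]


history = [
    "turn on the tv",
    "turn on the tv",
    "turn on the lights",
    "play music",
]
predicted = predict_command("turn on the", history)
unknown = predict_command("open", history)
```

With a prediction available, execution could begin before the user finishes speaking; a real system would presumably also weigh how confident the prediction is before acting.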

Publication date: 13-12-2018

SYSTEM AND METHOD FOR ASYNCHRONOUS MULTI-MODE MESSAGING

Number: CA0003066344A1
Assignee: RIDOUT & MAYBEE LLP

Systems and methods for providing and facilitating multi-mode communication are disclosed. Users may initiate, receive and/or respond to messages and message notifications on a computing device using multi-mode interactions executed through either a device display or a wearable device such as a headset with enhanced functionality. Contextual prompts guide the user interaction with the computing device using on-board or remote voice recognition text-to-speech and speech-to-text processing and playback. Voice and text data are packaged and transmitted to the network.

Publication date: 02-12-2019

DISPLAY DEVICE

Number: CA0003063019A1
Assignee: AGENCE DE BREVETS FOURNIER

... [Problem to be solved] To provide a presentation system for effectively displaying a keyword for effecting the selection of the next slide during a presentation. [Solution] Provided is a display device comprising: a voice recognition means 53; a conversation-derived term extraction means 55; a search keyword storage means 57; a search keyword extraction means 59; a material storage unit 61; a relevant page information extraction means 63; a selection term extraction means 65; and a selection term display means 71.

Publication date: 29-11-2018

VOICE ACTIVATED LIFTGATE

Number: CA0003064998A1
Assignee: GOWLING WLG (CANADA) LLP

An independent add-on automated vehicle lift gate system utilizing existing key fob authentication circuits in combination with an independent voice control system. The system uses microphones in connection with audio acquisition hardware and voice recognition hardware that actively listen for one or more voiced commands from a user outside of a vehicle. Before activating the mechanical system, which is for example the actuator of the lift gate and lock mechanism for the lift gate, the system will wait for confirmation from a separate vehicle system that monitors and notifies the vehicle lift gate system when an identification code is received from a key fob transponder located in a predetermined proximity of the vehicle, thereby authenticating the one or more voiced commands detected by the microphones.

Publication date: 05-07-2018

SYSTEM AND METHOD FOR VARYING VERBOSITY OF RESPONSE BASED ON CHANNEL PROPERTIES IN A GROUP COMMUNICATION USING ARTIFICIAL INTELLIGENCE

Number: CA0003048402A1
Assignee: PERRY + CURRIER

Efficient use of channel bandwidth response, response timing, along with the ability to acquire the most accurate and up to date response are provided for management of virtual assistant search queries within a communication system (100). Improved management is obtained using an artificial intelligence (AI) server (104) controlling response activity to a query communication device (102) by incorporating one or more of: adjusting verbosity of responses (158), redirecting queries from the AI server to alternate resources (412), and/ or prioritizing of a response (506) based on wait time.

Publication date: 03-07-2018

DEVICES, SYSTEMS, AND METHODS FOR RELAYING VOICE MESSAGES TO OPERATOR CONTROL UNITS OF REMOTE CONTROL LOCOMOTIVES

Number: CA0002990542A1
Owner:

According to various aspects, exemplary embodiments are disclosed of devices, systems, and methods related to relaying voice messages to operator control units of a locomotive. In an exemplary embodiment, an operator control unit generally includes a user interface configured to receive one or more commands from an operator for controlling a locomotive, a wireless communication interface configured to transmit data to and receive data from a locomotive control unit of the locomotive, and memory configured to store multiple voice messages corresponding to the locomotive. The operator control unit also includes a processor configured to receive a voice message number from the locomotive control unit of the locomotive via the wireless communication interface, retrieve one of the multiple stored voice messages from memory corresponding to the received voice message number, and transmit the retrieved voice message to an earpiece of the operator via a wired transmission and/or near-field wireless ...

Publication date: 14-07-2016

HEADLESS TASK COMPLETION WITHIN DIGITAL PERSONAL ASSISTANTS

Number: CA0002970725A1
Owner:

Techniques are described for headlessly completing a task of an application in the background of a digital personal assistant. For example, a method can include receiving a voice input via a microphone. Natural language processing can be performed using the voice input to determine a user voice command. The user voice command can include a request to perform a task of the application. The application can be caused to execute the task as a background process without a user interface of the application appearing. A user interface of the digital personal assistant can provide a response to the user, based on a received state associated with the task, so that the response comes from within a context of the user interface of the digital personal assistant without surfacing the user interface of the application.

Publication date: 04-08-2016

UPDATING LANGUAGE UNDERSTANDING CLASSIFIER MODELS FOR A DIGITAL PERSONAL ASSISTANT BASED ON CROWD-SOURCING

Number: CA0002970728A1
Owner:

A method for updating language understanding classifier models includes receiving via one or more microphones of a computing device, a digital voice input from a user of the computing device. Natural language processing using the digital voice input is used to determine a user voice request. Upon determining the user voice request does not match at least one of a plurality of pre-defined voice commands in a schema definition of a digital personal assistant, a GUI of an end-user labeling tool is used to receive a user selection of at least one of the following: at least one intent of a plurality of available intents and/or at least one slot for the at least one intent. A labeled data set is generated by pairing the user voice request and the user selection, and is used to update a language understanding classifier.

Publication date: 15-03-2017

AUTOMATIC VOICE RECOGNITION WITH DETECTION OF AT LEAST ONE CONTEXTUAL ELEMENT, APPLICATION TO STEERING AND MAINTENANCE OF AN AIRCRAFT

Number: CA0002942116A1
Owner:

This automatic speech recognition device (30) comprises a unit (32) for acquiring an audio signal, a detection device (36) for detecting the state of at least one contextual element, and a language decoder (38) for determining a spoken instruction corresponding to the audio signal. The language decoder (38) comprises at least one acoustic model (42) defining an acoustic probability law and at least two syntactic models (44) each defining a syntactic probability law. The language decoder (38) also comprises a spoken-instruction construction algorithm (46) implementing the acoustic model (42) and a plurality of active syntactic models taken from among the syntactic models (44), and a contextualization processor (48) for selecting, as a function of the state of the or each contextual element detected by the detection device (36), at least one selected syntactic model from among the plurality of syntactic models ...

Publication date: 07-10-1997

METHOD AND APPARATUS FOR SPEECH RECOGNITION

Number: CA0002056347C
Inventor: SAKO KAZUYA, SAKO, KAZUYA
Owner: FUJITSU TEN LTD, FUJITSU TEN LIMITED

A speech recognizing apparatus compares a speech command from a user, in turn, with the registration patterns stored in a storage unit. If the speech command coincides with one of the registration patterns, the apparatus controls a predetermined electronic apparatus to perform the operation related to that registration pattern. If the speech command does not coincide with any of the registration patterns, the apparatus stores the speech command in the storage unit as a new registration pattern, relating it to the manipulation of the electronic apparatus that the user performs immediately after producing the speech command.
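
The match-or-learn behaviour in this abstract can be illustrated with a short sketch. Everything here is an assumption for illustration: the class and method names are hypothetical, and exact string equality stands in for acoustic pattern comparison.

```python
class LearningRecognizer:
    """Toy match-or-learn recognizer (string equality replaces acoustic matching)."""

    def __init__(self):
        self.patterns = {}    # registration pattern -> related operation
        self.pending = None   # unmatched command waiting for a manual operation

    def hear(self, command):
        """Return the related operation, or None if the command is unknown."""
        operation = self.patterns.get(command)
        if operation is not None:
            self.pending = None
            return operation
        # No registration pattern coincides: remember the command for learning.
        self.pending = command
        return None

    def manual_operation(self, operation):
        """The user manipulates the device right after an unmatched command."""
        if self.pending is not None:
            self.patterns[self.pending] = operation
            self.pending = None
```

After one unmatched utterance followed by a manual operation, the same utterance triggers that operation directly.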

Publication date: 02-05-1996

VOICE-OPERATED SERVICES

Number: CA0002202663A1
Owner:

A method and apparatus for accessing a database where entries are linked to at least two sets of patterns. Recognition means recognise within a received signal one or more patterns of a first set of patterns. The recognised patterns are used to identify entries and compile a list of patterns in a second set of patterns to which those entries are also linked. The list is then used to recognise a second received signal. The received signals may, for example, be voice signals or signals indicating the origin or destination of the received signals.
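
One way to picture the two-stage lookup: the first recognition result narrows the database entries, and only the second-set patterns linked to those entries are offered to the second recognition pass. The sketch below assumes a directory-style database with hypothetical `town` and `surname` fields and uses normalized text equality as a stand-in recogniser.

```python
# Hypothetical entries; each is linked to a pattern in two sets (town, surname).
ENTRIES = [
    {"town": "ipswich", "surname": "smith"},
    {"town": "ipswich", "surname": "jones"},
    {"town": "norwich", "surname": "brown"},
]


def recognise(signal, vocabulary):
    """Stand-in recogniser: vocabulary items equal to the normalized signal."""
    token = signal.strip().lower()
    return {word for word in vocabulary if word == token}


def lookup(first_signal, second_signal, entries=ENTRIES):
    # Stage 1: recognise the first signal against the full first pattern set.
    towns = {e["town"] for e in entries}
    matched_towns = recognise(first_signal, towns)
    candidates = [e for e in entries if e["town"] in matched_towns]
    # Compile the list of second-set patterns linked to the identified entries.
    surname_list = {e["surname"] for e in candidates}
    # Stage 2: recognise the second signal against that restricted list only.
    matched = recognise(second_signal, surname_list)
    return [e for e in candidates if e["surname"] in matched]
```

Restricting the second vocabulary is the point of the design: a surname that exists only under a different town can no longer be matched by mistake.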

Publication date: 19-01-2012

Intelligent Automated Assistant

Number: US20120016678A1
Owner: Apple Inc

An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionality powered by external services with which the system can interact.

Publication date: 26-01-2012

Speech to Text Conversion

Number: US20120022867A1
Owner: Google LLC

Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote to the electronic device. The textual output is transmitted to the electronic device.
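
The weighted combination of base language models can be sketched as a linear interpolation. This is an illustrative assumption, since the abstract does not specify the interpolation form: here each base model is a unigram distribution stored as a dict, and the weights are taken as given (in the described system they would be derived from the contextual metadata).

```python
def interpolate(base_models, weights):
    """Linearly interpolate unigram models.

    base_models: {model_name: {word: probability}}
    weights:     {model_name: non-negative weight}, normalized here.
    """
    total = sum(weights.get(name, 0.0) for name in base_models)
    if total <= 0:
        raise ValueError("at least one base model needs a positive weight")
    vocab = set()
    for model in base_models.values():
        vocab.update(model)
    return {
        word: sum(
            weights.get(name, 0.0) / total * model.get(word, 0.0)
            for name, model in base_models.items()
        )
        for word in vocab
    }
```

For example, with a hypothetical "maps" corpus weighted three times as heavily as a "search" corpus, a word's interpolated probability is 0.75 times its maps probability plus 0.25 times its search probability.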

Publication date: 23-02-2012

Method and Apparatus for Telephonically Accessing and Navigating the Internet

Number: US20120047216A1
Owner: Ben Franklin Patent Holding LLC

A method for accessing and browsing the Internet through the use of a telephone and the associated DTMF signals is disclosed. The preferred embodiment provides a system that converts the information content of a web page from text to speech (voice signals), signals the hyperlink selections of a web page in an audio manner, and allows selection of the hyperlinks through the use of DTMF signals generated from a telephone keypad. Upon receiving a DTMF signal corresponding to a hyperlink, the corresponding web page is fetched and again delivered to the user via one of the available delivery methods such as voice, fax-on-demand, electronic mail, or regular mail.

Publication date: 10-05-2012

System for voice control of a medical implant

Number: US20120116774A1
Inventor: Peter Forsell
Owner: MILUX HOLDING SA

An implantable system (11) for control of and communication with an implant (17) in a body, comprising a command input device (12) and a processing device (13) coupled thereto, the processing device (13) being adapted to generate input to a command generator (16) which is comprised in the system (11) coupled to the processing device (13) and which is adapted to generate and communicate commands to the medical implant (17) in response to input received from the processing device (13), the system (11) further comprising a memory unit (15) connected to at least one of said devices in the system (11) for storing a memory bank of commands. The command input device (12) is adapted to receive commands from a user as voice commands, and the processing device (13) comprises a filter adapted to filter voice commands against high frequency losses and frequency distortion caused by the mammal body (10).

Publication date: 30-08-2012

Network apparatus and methods for user information delivery

Number: US20120221412A1
Inventor: Robert F. Gazdzinski
Owner: Individual

A network apparatus useful for providing directions and other information to a user of a client device in wireless communication therewith. In one embodiment, the apparatus includes one or more wireless interfaces and a network interface for communication with a server. User speech inputs in the form of digitized representations are received by the apparatus and used by the server as the basis for retrieving information including graphical representations of location or entities that the user wishes to find.

Publication date: 13-09-2012

Wireless synchronization of data and software components over a wireless network compatible to ieee802.11 standard(s) for mobile devices

Number: US20120230315A1
Owner: Flexiworld Technologies Inc

Wireless synchronization of data and software components over IEEE802.11 standard(s) are herein disclosed and enabled. An information apparatus, which includes a wireless communication unit compatible with IEEE802.11, may access a wireless local area network (WLAN). To set up the wireless synchronization, the user connects the information apparatus to a wireless output device over a wired connection (e.g., USB) and selects the wireless output device. Information associated with the wireless output device is saved in the mobile information apparatus for enabling wireless synchronization. Next, the user connects the mobile information apparatus to the WLAN, and, depending on the availability of the wireless output device in the network, the information apparatus may lock a wireless connection to the wireless output device for wireless synchronization. A client application in the mobile information apparatus and output controller software in the wireless output device may be required to facilitate the wireless synchronization over the WLAN.

Publication date: 13-12-2012

Voice recognition grammar selection based on context

Number: US20120316878A1
Owner: Google LLC

The subject matter of this specification can be embodied in, among other things, a method that includes receiving geographical information derived from a non-verbal user action associated with a first computing device. The non-verbal user action implies an interest of a user in a geographic location. The method also includes identifying a grammar associated with the geographic location using the derived geographical information and outputting a grammar indicator for use in selecting the identified grammar for voice recognition processing of vocal input from the user.

Publication date: 18-04-2013

Voice-Activated Pulser

Number: US20130093445A1
Inventor: David Edward Newman
Owner: ZANAVOX

A voice-activated pulser can trigger an oscilloscope or a meter, upon a simple voice command, thereby enabling hands-free signal measurements. The pulser can also be used to control the circuit under test, activating it or changing parameters, all under voice control. The pulser includes numerous switch-selectable output modes that allow users to generate complex, tightly-controlled diagnostic sequences, all activated upon a voice command and hands-free. The invention includes a fast, robust command-interpretation protocol that completely eliminates the expense and complexity of word recognition. Visual indicators display the device status and various operating modes, and also confirm each output pulse. The device receives voice commands directly through an internal microphone, or through a detachable headset, and confirms each command with an acoustical signal in the headset.

Publication date: 09-05-2013

Personalized Vocabulary for Digital Assistant

Number: US20130117022A1
Owner: Apple Inc

Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A text string is obtained from a speech input received from a user. The received text string is interpreted to derive a representation of user intent based at least in part on a plurality of words associated with a user and stored in memory associated with the user, the plurality of words including words from a plurality of user interactions with an automated assistant. At least one domain, a task, and at least one parameter for the task, are identified based at least in part on the representation of user intent. The identified task is performed. An output is provided to the user, where the output is related to the performance of the task.

Publication date: 06-06-2013

System and method for continuous multimodal speech and gesture interaction

Number: US20130144629A1
Owner: AT&T INTELLECTUAL PROPERTY I LP

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window.
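
The windowing and averaging steps named in this abstract can be sketched as follows. The data shapes are assumptions made for illustration: gesture samples are `(timestamp, x, y)` tuples, the speech event is a dict with `time` and `text`, and the window is symmetric around the speech time with a hypothetical default half-width of 0.5 s.

```python
def fuse(speech_event, gesture_stream, window=0.5):
    """Combine a speech event with the gesture samples near it in time.

    speech_event:   {"time": seconds, "text": recognized words}
    gesture_stream: iterable of (timestamp, x, y) samples
    window:         half-width of the temporal window in seconds (assumed)
    """
    start = speech_event["time"] - window
    end = speech_event["time"] + window
    points = [(x, y) for (t, x, y) in gesture_stream if start <= t <= end]
    if not points:
        return None  # no gesture event inside the temporal window
    n = len(points)
    # Gesture event = average of the gesture coordinates within the window.
    target = (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)
    return {"command": speech_event["text"], "target": target}
```

Averaging smooths out jitter in a remote pointing gesture, at the cost of blurring fast movements inside the window.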

Publication date: 13-06-2013

Generic virtual personal assistant platform

Number: US20130152092A1
Inventor: Osher Yadgar
Owner: SRI International Inc

A method for assisting a user with one or more desired tasks is disclosed. For example, an executable, generic language understanding module and an executable, generic task reasoning module are provided for execution in the computer processing system. A set of run-time specifications is provided to the generic language understanding module and the generic task reasoning module, comprising one or more models specific to a domain. A language input is then received from a user, an intention of the user is determined with respect to one or more desired tasks, and the user is assisted with the one or more desired tasks, in accordance with the intention of the user.

Publication date: 04-07-2013

Electronic apparatus and method for controlling the same

Number: US20130169525A1
Owner: SAMSUNG ELECTRONICS CO LTD

An electronic apparatus and a control method thereof are provided. The apparatus displays first voice guide information indicating voice commands available to control the electronic apparatus and, if a command to control an external device connected to the electronic apparatus is received, changes the first voice guide information and displays second voice guide information indicating voice commands available to control the external device.

Publication date: 25-07-2013

Computerized information and display apparatus

Number: US20130191750A1
Inventor: Robert F. Gazdzinski
Owner: West View Research LLC

Apparatus useful for obtaining and displaying information. In one embodiment, the apparatus includes a network interface, display device, and speech recognition apparatus configured to receive user speech input and enable performance of various tasks via a remote entity, such as obtaining desired information relating to directions, sports, finance, weather, or any number of other topics. The downloaded information may also, in one variant, be transmitted to a personal user device, such as via a data interface.

Publication date: 19-09-2013

Method of enabling voice input for a visually based interface

Number: US20130246920A1
Owner: Research in Motion Ltd

A method of enabling voice input for a graphical user interface (GUI) based application on an electronic device. The method includes: obtaining required properties of one or more user interface objects of the GUI-based application, wherein the one or more user interface objects include one or more input objects; receiving a voice input; extracting from the voice input one or more elements; associating the one or more elements with the one or more input objects; identifying, based on said associating, an input object having a required property which is not satisfied; and outputting, based on the required property, audio output for a prompt for a further voice input.

Publication date: 02-01-2014

Computer implemented methods and apparatus for selectively interacting with a server to build a local dictation database for speech recognition at a device

Number: US20140006028A1
Inventor: Minzhi Hu
Owner: Salesforce com Inc

Disclosed are methods, apparatus, systems, and computer-readable storage media for selectively interacting with a server to build a local dictation database for speech recognition at a device. In some implementations, a computing device receives an audio sample. The computing device may determine that the received audio sample does not match any of one or more existing audio samples stored in the local dictation database of the computing device. The received audio sample may be transmitted to a remote server for detection of one or more words indicated by the received audio sample. The computing device may receive data identifying the one or more words, and update the local dictation database to store the received audio sample in association with the one or more words.
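
The local-first flow in this abstract resembles a cache in front of a remote recognizer. The sketch below is an assumption-laden illustration: `server_transcribe` is a hypothetical callable standing in for the remote server, and exact byte equality replaces whatever audio-sample matching the real system uses.

```python
class LocalDictation:
    """Toy local dictation database with server fallback (illustrative only)."""

    def __init__(self, server_transcribe):
        self.database = {}                  # audio sample -> recognized words
        self.server_transcribe = server_transcribe
        self.server_calls = 0               # counts round trips to the server

    def transcribe(self, sample: bytes) -> str:
        # A matching sample in the local database avoids the server entirely.
        if sample in self.database:
            return self.database[sample]
        # No local match: send the sample to the remote server for detection.
        self.server_calls += 1
        words = self.server_transcribe(sample)
        # Store the sample in association with the identified words.
        self.database[sample] = words
        return words
```

Repeating the same sample costs one server round trip in total; every later request is served from the local database.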

Publication date: 10-04-2014

Method and apparatus for performing preset operation mode using voice recognition

Number: US20140100850A1
Inventor: Sung-Joon Won
Owner: SAMSUNG ELECTRONICS CO LTD

A method and apparatus of performing a preset operation by using voice recognition are provided. The method includes performing the preset operation of a preset operation mode according to a key input or a touch input in the preset operation mode; and recognizing an input voice during performance of the preset operation of the preset operation mode and assisting the performance of the preset operation according to the recognized voice.

Publication date: 06-01-2022

CONTROLLING VISUAL INDICATORS IN AN AUDIO RESPONSIVE ELECTRONIC DEVICE, AND CAPTURING AND PROVIDING AUDIO USING AN API, BY NATIVE AND NON-NATIVE COMPUTING DEVICES AND SERVICES

Number: US20220004349A1
Owner: ROKU, INC.

Disclosed herein are embodiments for controlling visual indicators of an audio responsive electronic device. In some embodiments, an audio responsive electronic device operates by receiving audio input, and then analyzing the audio input to identify an intended target of the audio input. The intended target may be one of a plurality of electronic devices or services which are native or non-native to the audio responsive electronic device. The audio responsive electronic device transmits the audio input to the identified intended target. A reply message is received from the intended target. Then, the audio responsive electronic device controls its visual indicators using information in the reply message, to thereby provide visual feedback to a user. Also disclosed herein are embodiments for capturing and providing audio to an application according to an application programming interface of a media device.
1-20. (canceled)
21. A method for enhancement of an audio stream from a command source and de-enhancement of a background audio stream from a background source, comprising:
determining, by an audio responsive remote control, a first position of the background source relative to the audio responsive remote control;
performing, based on the first position of the background source, the de-enhancement of the background audio stream from the background source;
receiving, by the audio responsive remote control, a trigger command;
determining, by the audio responsive remote control and based on receiving the trigger command, a second position of the command source; and
performing, based on the second position of the command source, the enhancement of the audio stream from the command source to form an enhanced audio stream.
22. The method of claim 21, wherein the first position of the background source is determined based on a first configuration setting and the second position of the command source is determined based on a second configuration setting, wherein the ...

Publication date: 06-01-2022

OPTIMIZATION APPARATUS, OPTIMIZATION METHOD, AND PROGRAM

Number: US20220005471A1

To perform optimization processing of parameters with various structures without having to manually redesign processing contents of encoding and decoding. An evaluation step of obtaining an evaluated value representing an evaluation result of signal processing using a first signal processing parameter value that is a signal processing parameter; a coding step of converting, based on at least a definition file that defines an attribute of the signal processing parameter, the first signal processing parameter value into a first external parameter value that is an external parameter; a generation step of generating a second external parameter value that is the external parameter of which a value differs from the first external parameter value based on the evaluated value and the first external parameter value; and a decoding step of converting, based on the definition file, the second external parameter value into a second signal processing parameter value that is the signal processing parameter are executed.
1. An optimization apparatus, comprising processing circuitry configured to implement:
an evaluating unit which obtains an evaluated value representing an evaluation result of signal processing using a first signal processing parameter value that is a signal processing parameter;
a coding unit which converts, based on at least a definition file that defines an attribute of the signal processing parameter, the first signal processing parameter value into a first external parameter value that is an external parameter;
a generating unit which generates a second external parameter value that is the external parameter of which a value differs from the first external parameter value based on the evaluated value and the first external parameter value; and
a decoding unit which converts, based on the definition file, the second external parameter value into a second signal processing parameter value that is the signal processing parameter.
2. The optimization apparatus ...

Publication date: 06-01-2022

ISOLATING A DEVICE, FROM MULTIPLE DEVICES IN AN ENVIRONMENT, FOR BEING RESPONSIVE TO SPOKEN ASSISTANT INVOCATION(S)

Number: US20220005475A1
Owner:

Methods, apparatus, systems, and computer-readable media are provided for isolating at least one device, from multiple devices in an environment, for being responsive to assistant invocations (e.g., spoken assistant invocations). A process for isolating a device can be initialized in response to a single instance of a spoken utterance, of a user, that is detected by multiple devices. One or more of the multiple devices can be caused to query the user regarding identifying a device to be isolated for receiving subsequent commands. The user can identify the device to be isolated by, for example, describing a unique identifier for the device. Unique identifiers can be generated by each device of the multiple devices and/or by a remote server device. The unique identifiers can be presented graphically and/or audibly to the user, and user interface input. Any device that is not identified can become temporarily unresponsive to certain commands, such as spoken invocation commands.
1. A method implemented by one or more processors, the method comprising:
receiving an instance of a spoken utterance at a client device that is operating in an environment with one or more additional client devices that also received the instance of the spoken utterance,
wherein each of the client device and the one or more additional client devices includes an assistant application that is responsive to the spoken utterance;
providing, by the client device and based on receiving the instance of the spoken utterance at the client device, user interface output that provides a prompt, to a user, related to whether the client device is to be responsive to invocations of the ...,
wherein each of the one or more additional client devices provides a respective prompt in response to receiving the instance of the spoken utterance, and
wherein the prompt provided at the client device is unique relative to each respective prompt provided at each of the one or more additional client devices;

Publication date: 06-01-2022

SYSTEMS, METHODS, AND APPARATUSES FOR MANAGING INCOMPLETE AUTOMATED ASSISTANT ACTIONS

Number: US20220005476A1
Owner:

Methods, apparatus, systems, and computer-readable media are provided for resuming a partially completed action that is to be performed by an automated assistant. The action can require the automated assistant to prompt the user to provide information that the automated assistant can use to complete the action. During a dialog session in which the user is providing the information, an event can occur that interferes with the completion of the action. In response, the automated assistant can cause any information obtained during the dialog session to be stored locally, in order that the automated assistant can resume completing the action at a later time. For instance, the user can be prompted by the automated assistant to complete the action, or the user can independently invoke the automated assistant to complete the action at a time that is convenient for the user.
1. A method implemented by one or more processors, the method comprising:
receiving, at a microphone of a client device during a dialog session between a user and an automated assistant, a spoken request for an action to be performed by the automated assistant, wherein the automated assistant is configured to complete the action based on one or more slot values obtained from the user during the dialog session;
detecting, at the client device, an interruption that occurs during the dialog session;
causing the one or more slot values obtained from the user during the dialog session to be stored in memory of the client device, wherein the stored one or more slot values are subsequently retrievable from the memory by the automated assistant in furtherance of completing the action;
subsequent to the interruption, receiving, at the microphone of the client device or at another microphone of another client device, another spoken request for another action to be performed by the automated assistant; and
subsequent to receiving the another spoken request, causing the automated assistant to provide a natural language ...

Publication date: 02-01-2020

REFRIGERATOR WITH SOUND REPRODUCING CAPABILITY

Number: US20200003486A1
Owner: LG ELECTRONICS INC.

A refrigerator with a sound reproducing capability is provided. The refrigerator may include a cabinet forming an exterior of the refrigerator and having an upper surface portion, a lower surface portion, a side surface portion, and a rear surface portion, a door coupled to the cabinet, a vibration module attached to an inner surface of at least one among the door, the upper surface portion, the side surface portion, the rear surface portion, and the lower surface portion, and a controller configured to control vibration of the vibration module. The vibration strength of the vibration module and power supplied to the vibration module may be determined by a trained model trained through machine learning, an area of artificial intelligence (AI). In addition, the refrigerator may include a communicator, and may operate in conjunction with an external device via 5G communication.
1. A refrigerator with a sound reproducing capability, the refrigerator comprising:
a cabinet forming an exterior of the refrigerator, the cabinet having an upper surface portion, a lower surface portion, a side surface portion, and a rear surface portion;
a door coupled to the cabinet;
a first vibrator attached to an inner surface of one of the door, the upper surface portion, the lower surface portion, the side surface portion, or the rear surface portion; and
a controller configured to control vibration of the first vibrator,
wherein the first vibrator vibrates the inner surface of the one of the door, the upper surface portion, the lower surface portion, the side surface portion, or rear surface portion so as to output sound.
2. The refrigerator according to claim 1, further comprising a frame configured to mechanically support the first vibrator such that the first vibrator is fixed while being in contact with the inner surface of the one of the door, the upper surface portion, the side surface portion, or the rear surface portion, and
wherein a front surface ...

Publication date: 05-01-2017

INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM

Number: US20170003933A1
Inventor: KOBAYASHI Kenichiro
Owner: SONY CORPORATION

There is provided an information processing device capable of deciding process content of image information according to content of language information input by users, the information processing device including: an image region specifying unit configured to specify a region in an image based on input language information, and a process content specifying unit configured to specify content of a process using the image in regard to the region specified in the image by the image region specifying unit based on the input language information.
1. An information processing device comprising:
an image region specifying unit configured to specify a region in an image based on input language information; and
a process content specifying unit configured to specify content of a process using the image in regard to the region specified in the image by the image region specifying unit based on the input language information.
2. The information processing device according to claim 1,
wherein the process content specifying unit specifies that a recognition process for an object in the region specified in the image by the image region specifying unit is performed based on the input language information.
3. The information processing device according to claim 2,
wherein the image region specifying unit specifies a region in the image based on further input language information using the object recognized in the specified region in the image as a standard.
4. The information processing device according to claim 1,
wherein the process content specifying unit specifies that a process of acquiring information regarding an object included in the region specified in the image by the image region specifying unit is performed based on the input language information.
5. The information processing device according to claim 4,
wherein the process content specifying unit specifies that a process of acquiring a name of the object as the information regarding the object is performed.
6. The ...

More details
Publication date: 07-01-2016

AUDIO COMMAND INTENT DETERMINATION SYSTEM AND METHOD

Number: US20160004501A1
Assignee: HONEYWELL INTERNATIONAL INC.

Methods and apparatus are provided for generating aircraft cabin control commands from verbal speech onboard an aircraft. An audio command supplied to an audio input device is processed. Each word of the processed audio command is compared to words stored in a vocabulary map to determine a word type of each word. Each determined word type is processed to determine if an intent of the audio command is discernable. If the intent is discernable, an aircraft cabin control command is generated based on the discerned intent. If a partial intent is discernable, feedback is generated.
1. A method of generating aircraft cabin control commands from verbal speech onboard an aircraft, comprising the steps of: processing an audio command supplied to an audio input device, the audio command including at least one word; comparing each word of the processed audio command to words stored in a vocabulary map to determine a word type of each word, the vocabulary map comprising a predetermined set of word types; and processing each determined word type to determine if an intent of the audio command is discernable; if the intent is discernable, generating an aircraft cabin control command based on the discerned intent; and generating feedback if no or only a partial intent of the audio command is discernable.
2. The method of claim 1, wherein the step of processing each determined word type to determine if the intent of the audio command is discernable comprises: determining if the audio command includes at least a context word type and an action word type; identifying an anchor node in a normalized intent rules tree structure that corresponds to the context word type; determining if the action word type is associated with the anchor node and, if so, determining the intent therefrom.
3. The method of claim 2, wherein the normalized intent rules tree structure comprises: a root node, the root node associated with the aircraft; a plurality of context nodes, each context node corresponding to a ...
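The word-type lookup and intent check described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the vocabulary entries, the flat `INTENT_RULES` table (standing in for the normalized intent rules tree) and the function name `determine_intent` are all assumptions made for the example.

```python
# Hypothetical vocabulary map: word -> word type (entries are illustrative only).
VOCABULARY_MAP = {
    "lights": "context",
    "shade": "context",
    "dim": "action",
    "open": "action",
    "close": "action",
}

# Hypothetical rules: each context word lists the actions associated with it.
# A flat dict stands in for the tree structure mentioned in the claims.
INTENT_RULES = {
    "lights": {"dim"},
    "shade": {"open", "close"},
}


def determine_intent(command: str) -> dict:
    """Classify each word, then report a full intent, a partial intent, or none."""
    words = command.lower().split()
    contexts = [w for w in words if VOCABULARY_MAP.get(w) == "context"]
    actions = [w for w in words if VOCABULARY_MAP.get(w) == "action"]

    # A full intent needs a context word and an action associated with it.
    for context in contexts:
        for action in actions:
            if action in INTENT_RULES.get(context, set()):
                return {"status": "command", "context": context, "action": action}

    # Some recognized words but no valid pairing: only a partial intent.
    if contexts or actions:
        return {"status": "feedback", "reason": "partial intent"}
    return {"status": "feedback", "reason": "no intent"}
```

Under these assumptions, "please dim the lights" yields a command, while "open the lights" only yields feedback because "open" is not associated with "lights" in the illustrative rules.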

More details
Publication date: 07-01-2016

SYSTEM AND METHOD FOR CORRECTING SPEECH INPUT

Number: US20160004502A1
Assignee:

A system and method for correcting speech input are disclosed. A particular embodiment includes: receiving a base input string; detecting a correction operation; receiving a replacement string in response to the correction operation; generating a base object set from the base input string and a replacement object set from the replacement string; identifying a matching base object of the base object set that is most phonetically similar to a replacement object of the replacement object set; and replacing the matching base object with the replacement object in the base input string.
1. A system comprising: a data processor; and a speech input processing module, executable by the data processor, the speech input processing module being configured to: receive a base input string; detect a correction operation; receive a replacement string in response to the correction operation; generate a base object set from the base input string and a replacement object set from the replacement string; identify a matching base object of the base object set that is most phonetically similar to a replacement object of the replacement object set; and replace the matching base object with the replacement object in the base input string.
2. The system of wherein the base input string is received as a spoken utterance.
3. The system of claim 1, wherein the correction operation is explicitly initiated by use of an input mechanism from the group consisting of: clicking an icon, activating a softkey, pressing a physical button, providing a keyboard input, manipulating a user interface, and uttering a separate audible command.
4. The system of wherein the correction operation is implicitly initiated by detection of a speaker audibly spelling out a word or phrase.
5. The system of wherein the replacement is received as a spoken utterance.
6. The system of being further configured to generate a phonetic representation of each of a plurality of ...
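The replace-the-most-similar-object step can be illustrated with a short sketch. It assumes that objects are single whitespace-separated words and uses a crude vowel-dropping key as a stand-in for a phonetic representation; the abstract does not say which phonetic measure is used, so `phonetic_key` and `correct` are hypothetical names and the similarity score (Python's `difflib.SequenceMatcher`) is a swapped-in technique.

```python
from difflib import SequenceMatcher


def phonetic_key(word: str) -> str:
    """Crude stand-in for a phonetic form: lowercase, drop vowels after the first letter."""
    word = word.lower()
    if not word:
        return ""
    return word[0] + "".join(c for c in word[1:] if c not in "aeiou")


def correct(base: str, replacement: str) -> str:
    """Replace the base word whose key is most similar to the replacement's key."""
    base_words = base.split()
    if not base_words:
        return base
    target = phonetic_key(replacement)
    # Index of the base word with the highest similarity ratio to the replacement.
    best = max(
        range(len(base_words)),
        key=lambda i: SequenceMatcher(None, phonetic_key(base_words[i]), target).ratio(),
    )
    base_words[best] = replacement
    return " ".join(base_words)
```

For instance, correcting "call John Smyth now" with the replacement "Smith" swaps out "Smyth", since its key is the closest match. A production system would presumably use a real phonetic encoding instead of this key.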

More details
Publication date: 02-01-2020

APPARATUS AND METHOD FOR VIRTUAL HOME SERVICE

Number: US20200004237A1
Assignee:

An embodiment of the present disclosure is a virtual home service apparatus including a communicator, a home information collector for obtaining a design drawing of the home, and obtaining a 3D drawing by converting the design drawing, a home appliance identifier for obtaining an internal image and SLAM information of the home, and identifying the location and state of the home appliance based on the internal image and the SLAM information, and a virtual home implementator for generating virtual home information by reflecting the location and state of the home appliance to the 3D drawing.
1. A virtual home service apparatus for providing data for supporting a virtual home interface to a vehicle apparatus for providing the virtual home interface for controlling an operation of a home appliance installed in a home, comprising: a communicator; a home information collector for obtaining a design drawing of the home through the communicator based on user identification information of the home appliance, and obtaining a 3D drawing by converting the design drawing; a home appliance identifier for obtaining an internal image and SLAM information of the home through the communicator, and identifying the location and state of the home appliance based on the internal image and the SLAM information; and a virtual home implementator for generating virtual home information by reflecting the location and state of the home appliance to the 3D drawing, wherein the virtual home information is provided to the vehicle apparatus as the data for supporting the virtual home interface.
2. The virtual home service apparatus of claim 1, wherein the home information collector generates a design drawing request signal that requests the design drawing by using an address registered at the time of sale of the home appliance as the user identification information, and transmits the generated design drawing request signal to a real estate brokerage server through the communicator, and wherein the ...

More details
Publication date: 04-01-2018

MULTI-DIMENSIONAL REFERENCE ELEMENT FOR MIXED REALITY ENVIRONMENTS

Number: US20180004481A1
Assignee:

Approaches provide for controlling, managing, and/or otherwise interacting with mixed (e.g., virtual and/or augmented) reality content in response to input from a user, including voice input, device input, among other such inputs, in a mixed reality environment. For example, a mixed reality device, such as a headset or other such device can perform various operations in response to a voice command or other such input. In one such example, the device can receive a voice command and an application executing on the device or otherwise in communication with the device can analyze audio input data of the voice command to control the view of content in the environment, as may include controlling a user's “position” in the environment. The position can include, for example, a specific location in time, space, etc., as well as directionality and field of view of the user in the environment. A reference element can be displayed as an overlay to the mixed reality content, and can provide a visual reference to the user's position in the environment.
1. A mixed reality display system, comprising: a display; a device processor; display virtual reality content on the display; display a multi-dimensional reference element as an overlay to the virtual reality content, the multi-dimensional reference element operable to provide a first visual reference to a first position within a virtual environment and a first field of view (orientation) in the virtual environment; receive audio input data, the audio input data corresponding to an utterance received by a microphone of the virtual reality display device; use automatic speech recognition (ASR) techniques on the audio input data to generate text data that represents words; use natural language understanding (NLU) techniques on the text data to identify a command to display virtual reality content for a second position within the virtual environment and a second field of view within the virtual environment; update the ...

More details
Publication date: 04-01-2018

SYSTEM AND METHOD FOR CONTINUOUS MULTIMODAL SPEECH AND GESTURE INTERACTION

Number: US20180004482A1
Assignee:

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window.
1. A method comprising: monitoring an audio stream associated with a non-tactile gesture input stream; identifying a speech event in an audio stream; determining a temporal window associated with a time of the speech event, wherein the temporal window extends forward and backward from the time of the speech event; analyzing, via a processor, data from the non-tactile gesture input stream within the temporal window to identify, based on the speech event, a non-tactile gesture event; and processing the speech event and the non-tactile gesture event to produce a multimodal command.
The present application is a continuation of U.S. patent application Ser. No. 14/875,105, filed Oct. 5, 2015, which is a continuation of U.S. patent application Ser. No. 13/308,846, filed Dec. 1, 2011, now U.S. Pat. No. 9,152,376, issued Oct. 6, 2015, which was also filed as PCT Application No. PCT/US12/67309, filed Nov. 30, 2012, the contents of which is incorporated herein by reference in its entirety. The present disclosure relates to human-computer interaction and more specifically to incorporating a continuous speech input stream and a continuous gesture input stream. Currently deployed ...
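The temporal-window step lends itself to a short sketch: collect the gesture samples that fall within a window extending backward and forward from the speech event, then average their coordinates. The `GestureSample` type, the function name and the half-second window bounds are assumptions made for illustration; the abstract gives no concrete values.

```python
from dataclasses import dataclass


@dataclass
class GestureSample:
    t: float  # timestamp in seconds
    x: float  # horizontal coordinate
    y: float  # vertical coordinate


def gesture_for_speech(samples, speech_time, before=0.5, after=0.5):
    """Average the gesture coordinates inside a window around the speech event.

    The window extends `before` seconds backward and `after` seconds forward
    from `speech_time` (both defaults are illustrative assumptions).
    """
    window = [s for s in samples if speech_time - before <= s.t <= speech_time + after]
    if not window:
        return None  # no gesture data close enough to the speech event
    n = len(window)
    return (sum(s.x for s in window) / n, sum(s.y for s in window) / n)
```

The averaged point could then be paired with the recognized speech (for example, "zoom in here") to form the multimodal command.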

More details
Publication date: 02-01-2020

Electronic apparatus, document displaying method thereof and non-transitory computer readable recording medium

Number: US20200004493A1
Assignee: SAMSUNG ELECTRONICS CO LTD

The disclosure relates to an artificial intelligence (AI) system using a machine learning algorithm such as deep learning, and an application thereof. In particular, an electronic apparatus, a document displaying method thereof, and a non-transitory computer readable recording medium are provided. An electronic apparatus according to an embodiment of the disclosure includes a display unit displaying a document, a microphone receiving a user voice, and a processor configured to acquire at least one topic from contents included in a plurality of pages constituting the document, recognize a voice input through the microphone, match the recognized voice with one of the acquired at least one topic, and control the display unit to display a page including the matched topic.

More details
Publication date: 05-01-2017

TESTING WORDS IN A PRONUNCIATION LEXICON

Number: US20170004823A1
Assignee:

A method for testing words defined in a pronunciation lexicon used in an automatic speech recognition (ASR) system is provided. The method includes obtaining test sentences which can be accepted by a language model used in the ASR system. The test sentences cover words defined in the pronunciation lexicon. The method further includes obtaining variations of speech data corresponding to each test sentence, and obtaining a plurality of texts by recognizing the variations of speech data, or a plurality of texts generated by recognizing the variation of speech data. The method also includes constructing a word graph, using the plurality of texts, for each test sentence, where each word in the word graph corresponds to each word defined in the pronunciation lexicon; and determining whether or not all or parts of words in a test sentence are present in a path of the word graph derived from the test sentence.
1. A method performed in one or more of computers, for testing words defined in a pronunciation lexicon used in an automatic speech recognition system, wherein the method comprises the following steps: obtaining a plurality of test sentences which can be accepted by a language model used in the automatic speech recognition system, wherein the test sentences cover the words defined in the pronunciation lexicon; obtaining variations of speech data corresponding to each of the test sentences; obtaining a plurality of texts by recognizing the variations of speech data, or a plurality of texts generated by recognizing the variation of speech data; constructing a word graph, using the plurality of texts, for each of the test sentences, wherein each word in the word graph corresponds to each of the words defined in the pronunciation lexicon; and determining whether or not all or parts of words in a test sentence of the test sentences are present in a path of the word graph derived from the test sentence.
2. The method according to claim 1, wherein the generated test ...
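One way to picture the word-graph check is sketched below. It is a simplification under stated assumptions: the graph is built from adjacent word pairs in the recognized texts, with hypothetical `<s>` and `</s>` boundary markers, and the function names are invented for the example. The patent may construct and traverse its word graph differently.

```python
def build_word_graph(texts):
    """Directed graph: each recognized word points to the words that followed it."""
    graph = {}
    for text in texts:
        tokens = ["<s>"] + text.lower().split() + ["</s>"]
        for a, b in zip(tokens, tokens[1:]):
            graph.setdefault(a, set()).add(b)
    return graph


def check_sentence(sentence, graph):
    """Return (whole sentence forms a path, words absent from the graph)."""
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    # The sentence is a path if every adjacent word pair is an edge.
    full_path = all(b in graph.get(a, set()) for a, b in zip(tokens, tokens[1:]))
    # Every word seen anywhere in the graph, as a source or as a successor.
    nodes = set(graph) | {w for successors in graph.values() for w in successors}
    missing = [w for w in tokens[1:-1] if w not in nodes]
    return full_path, missing
```

A word that never appears in any recognition result for its test sentence would show up in `missing`, which is one plausible signal that its lexicon entry deserves a closer look.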

More details
Publication date: 05-01-2017

ADAPTIVE BEAM FORMING DEVICES, METHODS, AND SYSTEMS

Number: US20170004826A1
Assignee:

Devices, methods, systems, and computer-readable media for adaptive beam forming are described herein. One or more embodiments include a method for adaptive beam forming, comprising: receiving a voice command at a number of microphones, determining an instruction based on the received voice command, calculating a confidence level of the determined instruction, determining feedback based on the confidence level of the determined instruction, and altering a beam of the number of microphones based on the feedback.
1. A method for adaptive beam forming, comprising: determining an instruction based on a received voice command; calculating a confidence level of the determined instruction; determining feedback based on the confidence level of the determined instruction; and altering a beam of the number of microphones based on the feedback.
2. The method of claim 1, wherein the feedback includes information relating to optimizing a defined beam width and a defined beam direction.
3. The method of claim 1, wherein the confidence level includes a percentage that corresponds to a likelihood the determined instruction correctly corresponds to the received voice command.
4. The method of claim 1, wherein determining feedback includes determining a location where the voice command originated.
5. The method of claim 1, wherein the feedback includes information relating to noise not relating to the received voice command.
6. The method of claim 1, wherein determining the instruction includes utilizing a predetermined vocabulary to determine the instruction of the received voice command.
7. The method of claim 1, comprising determining to utilize the feedback for a different received voice command when the different received voice command has similar properties to the received voice command.
8. A non-transitory computer readable medium, comprising instructions to: send a first voice command to a cloud computing network to determine a first instruction of the first voice command; ...
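The confidence-driven feedback loop in claims 1 to 4 might look something like the sketch below. Everything concrete here is an assumption chosen for illustration: the `Beam` type, the 0.8 confidence threshold, the halving of the beam width and the 20-degree floor do not come from the source.

```python
from dataclasses import dataclass


@dataclass
class Beam:
    direction_deg: float  # where the microphone beam points
    width_deg: float      # angular width of the beam


def adapt_beam(beam, confidence, source_direction_deg, threshold=0.8, min_width_deg=20.0):
    """Return an adjusted beam when confidence is low; otherwise keep the beam.

    `confidence` is the likelihood (0.0 to 1.0) that the determined instruction
    matches the spoken command. The threshold, the halving rule and the minimum
    width are illustrative assumptions.
    """
    if confidence >= threshold:
        return beam
    # Low confidence: steer toward the estimated source and narrow the beam.
    new_width = max(min_width_deg, beam.width_deg / 2)
    return Beam(direction_deg=source_direction_deg, width_deg=new_width)
```

With these values, a 90-degree beam pointing at 0 degrees would be re-aimed at the estimated source and narrowed to 45 degrees after a low-confidence recognition, and left untouched after a high-confidence one.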

More details
Publication date: 05-01-2017

Data Collection and Reporting System and Method

Number: US20170004827A1
Assignee:

A wearable device is provided that is configured to carry out an inspection of a gas turbine or other type of system. The device may include a processor, a data store, at least one output device, and at least one input device. The processor in the wearable device is responsive to inputs through the at least one input device and set of tasks stored in the data store corresponding to an inspection of a system to provide outputs through the at least one output device that prompt a user to gather data associated with the system being inspected using the wearable device while the wearable device is mounted to the user without the user holding the wearable device with the hand of the user. Also, the processor in the wearable device is configured to generate and output an inspection report responsive to the set of tasks and the data gathered using the wearable device.
1. A system for data collection and reporting for inspections of equipment comprising: wherein the processor in the wearable device is responsive to inputs through the at least one input device and a set of tasks stored in the data store corresponding to an inspection of a system to provide outputs through the at least one output device that prompt a user to gather data associated with the system being inspected using the wearable device while the wearable device is mounted to the user without the user holding the wearable device with the hand of the user, wherein the processor in the wearable device is configured to generate and output an inspection report responsive to the set of tasks and the data gathered using the wearable device, and wherein the processor in the wearable device is configured to detect that the wearable device has moved to a noisy environment and responsive to the detection that the wearable device has moved to a noisy environment, cause the wearable device to provide outputs through the at least one output device corresponding to at least one task request that requests an input ...

More details
Publication date: 05-01-2017

SMART HOME APPLIANCES, OPERATING METHOD OF THEREOF, AND VOICE RECOGNITION SYSTEM USING THE SMART HOME APPLIANCES

Number: US20170004828A1
Assignee:

Provided is a smart home appliance. The smart home appliance includes: a voice input unit collecting a voice; a voice recognition unit recognizing a text corresponding to the voice collected through the voice input unit; a capturing unit collecting an image for detecting a user's visage or face; a memory unit mapping the text recognized by the voice recognition unit and a setting function and storing the mapped information; and a control unit determining whether to perform a voice recognition service on the basis of at least one information of image information collected by the capturing unit and voice information collected by the voice input unit.
1. A smart home appliance comprising: a voice input unit collecting a voice; a voice recognition unit recognizing a text corresponding to the voice collected through the voice input unit; a capturing unit collecting an image for detecting a user's visage; a memory unit mapping the text recognized by the voice recognition unit and a setting function and storing the mapped information; and a control unit determining whether to perform a voice recognition service on the basis of at least one information of image information collected by the capturing unit and voice information collected by the voice input unit.
2. The smart home appliance according to claim 1, wherein the control unit comprises a face detection unit recognizing that a user is in a staring state for voice input when image information on a user's visage is collected for more than a setting time through the capturing unit.
3. The smart home appliance according to claim 2, wherein the control unit determines that a voice recognition service standby state is entered when it is recognized that there is keyword information in a voice through the voice input unit and a user is in the staring state through the face detection unit.
4. The smart home appliance according to claim 1, further comprising: a filter unit removing a noise sound from the voice inputted through the ...

More details
Publication date: 05-01-2017

TERMINAL APPARATUS, PROGRAM, AND SERVER APPARATUS FOR PROVIDING INFORMATION ACCORDING TO USER DATA INPUT

Number: US20170004829A1
Assignee: NTT DOCOMO, INC.

Provided is a method of alleviating difficulty experienced by a user when issuing an instruction by speech. When the user performs a predetermined operation on a terminal apparatus, the terminal apparatus displays a dialogue screen to wait for a speech instruction. If a predetermined period has elapsed without issuance of a speech instruction by the user since the start of display of the dialogue screen for the wait state, the terminal apparatus displays a sentence prompting a speech instruction corresponding to the attributes of the user or the attributes of the environment surrounding the user. Even if the user is at a loss about the content of a speech instruction, the user can issue a speech instruction in accordance with the displayed prompt. Therefore, a speech instruction can be issued smoothly.
1-10. (canceled)
11. A terminal apparatus, comprising: an attribute acquisition unit that acquires attribute data indicating an attribute of a user or an environment surrounding the user; a sentence acquisition unit that acquires prompt sentence data indicating a sentence that prompts the user to issue a speech instruction, the prompt sentence data corresponding to the attribute indicated by the attribute data; a display control unit that causes a display apparatus to display the sentence indicated by the prompt sentence data; a speech data acquisition unit that acquires speech data indicating a speech made by the user in response to the display apparatus displaying the sentence indicated by the prompt sentence data; a processing ID acquisition unit that acquires processing identification data identifying processing corresponding to an instruction indicated by the speech data; and a processing execution unit that executes the processing identified by the processing identification data.
12. The terminal apparatus according to claim 11, further comprising: a transmission unit that transmits the attribute data and the speech data to a server apparatus, wherein the sentence ...

More details
Publication date: 05-01-2017

METHOD FOR CONTROLLING OPERATION OF AN AGRICULTURAL MACHINE AND SYSTEM THEREOF

Number: US20170004830A1
Assignee:

A method for controlling operation of an agricultural machine and system thereof are disclosed. The method may comprise providing a portable device that has an input device, a processing unit, a storage unit, an output device, and a transceiver device configured for wireless data transmission; receiving a voice control command over a microphone device of the input device of the portable device; determining command text data from the voice control command by processing the voice control command by a speech recognition application running on the processing unit of the portable device; providing machine control signals assigned to a machine control function in a control device of an agricultural machine located remotely from the portable device; and controlling the operation of the agricultural machine according to the machine control signals.
1. A method for controlling operation of an agricultural machine, comprising: providing a portable device, the portable device comprising an input device, a processing unit, a storage unit, an output device, and a transceiver device configured for wireless data transmission; receiving a voice control command over a microphone device of the input device of the portable device; determining command text data from the voice control command by processing the voice control command by a speech recognition application running on the processing unit of the portable device; and providing machine control signals assigned to a machine control function in a control device of an agricultural machine located remotely from the portable device, the control device of the agricultural machine comprising a processing unit and a transceiver device configured for ... determining control function data indicating the machine control function from the command text data, and processing the control function data for generating the machine control signals, and controlling the operation of the agricultural machine according to the machine control signals.

More details
Publication date: 04-01-2018

STATE MACHINE BASED CONTEXT-SENSITIVE SYSTEM FOR MANAGING MULTI-ROUND DIALOG

Number: US20180004729A1
Author: Qiu Nan, Wang Haofen
Assignee:

The present invention discloses a state machine based context-sensitive multi-round dialog management system, comprising: an input module, for receiving multi-modal input information from a user; an intention identification engine module, for identifying intention information in the multi-modal input information; an intention module, for bringing multiple intention information identified by the intention identification engine module into one-to-one correspondence with multiple intention sub-modules at back ends; a state machine module, comprising a plurality of state machines for managing a relevant context in the dialog management system and providing support for an output result; an instruction parsing engine module, comprising a plurality of instruction parsing engine sub-modules for parsing corresponding intention information and acquiring the parsed multiple intention information; and an output module, for acquiring policy information according to the results from the parsing engine module and the intention identification module, and transmitting the policy information to the state machine module.
1. A state machine based context-sensitive multi-round dialog management system, comprising: an input module, for receiving multi-modal input information from a user; an intention identification engine module, for identifying intention information in the multi-modal input information; an intention module, for bringing multiple intention information identified by the intention identification engine module into one-to-one correspondence with multiple intention sub-modules at back ends; a state machine module, comprising a plurality of state machines for managing a relevant context in the dialog management system and providing support for an output result; an instruction parsing engine module, comprising a plurality of instruction parsing engine sub-modules for parsing corresponding intention information and acquiring the parsed multiple intention information; and an output ...

More details
Publication date: 13-01-2022

SYSTEMS, APPARATUS, AND METHODS OF USING A SELF-AUTOMATED MAP TO AUTOMATICALLY GENERATE A QUERY RESPONSE

Number: US20220012289A1
Author: IBRAHEEM Remi Muinatu
Assignee:

A method includes receiving, at a processor and via a graphical user interface (GUI), input data including a representation of at least one behavioral pattern. The at least one behavioral pattern is correlated to pattern data associated with a subset of detectors from a set of detectors. A first matrix is generated for a first point in time based on the correlation. Interactive objects are generated for presentation via the GUI, and each is associated with the set of detectors from the plurality of detectors. In response to detecting a user interaction with at least one of the interactive objects a relationship between each detector from the set of detectors in the first matrix and the input data is defined and stored. The first matrix is transformed based on the relationship, and the transformed matrix is synthesized to generate a motif of the behavioral pattern of the input data.
1. A method, comprising: receiving, at a processor and via a graphical user interface (GUI), input data including a representation of at least one behavioral pattern; correlating, via the processor, the at least one behavioral pattern to pattern data associated with a set of detectors from a plurality of detectors; generating a first matrix for a first point in time based on the correlation between the at least one behavioral pattern and the pattern data associated with each detector from the set of detectors, the first matrix including at least the set of detectors; generating a plurality of interactive objects for presentation via the GUI, each interactive object from the plurality of interactive objects associated with the set of detectors from the plurality of detectors; in response to detecting a user interaction with at least one interactive object from the plurality of interactive objects, defining and storing a representation of a relationship between each detector from the set of detectors in the first matrix and the input data; transforming the first matrix based on the relationship, ...

More details
Publication date: 13-01-2022

TRANSFERRING AN AUTOMATED ASSISTANT ROUTINE BETWEEN CLIENT DEVICES DURING EXECUTION OF THE ROUTINE

Number: US20220013121A1
Author: Ni Yuzhao
Assignee:

Transferring (e.g., automatically) an automated assistant routine between client devices during execution of the automated assistant routine. The automated assistant routine can correspond to a set of actions to be performed by one or more agents and/or one or more devices. While content, corresponding to an action of the routine, is being rendered at a particular device, the user may walk away from the particular device and toward a separate device. The automated assistant routine can be automatically transferred in response, and the separate device can continue rendering the content for the user.
1. A method implemented by one or more processors, the method comprising: receiving, at a remote server device, data transmitted from a first client device; determining, at the remote server device, that the data corresponds to a request for initialization of an automated assistant routine that corresponds to a set of automated assistant actions; in response to determining that the data corresponds to the request, generating, at the remote server device, content for an action of the set of automated assistant actions; in response to the data that corresponds to the request for initialization of the automated assistant routine being received from the first client device: transmitting the content for the action to the first client device to cause the first client device to render the content for the action; determining, at the remote server device during rendering of the content for the action by the first client device, that a user has directly or indirectly indicated an interest in the automated assistant routine being continued at a second client device; and in response to determining that the user has indicated the interest in the automated assistant routine being continued at the second client device: rendering, at the second client device, additional data that is in furtherance of the automated assistant routine.
2. The method of claim 1, wherein in response to ...

More details
Publication date: 07-01-2021

System and method for automated agent assistance within a cloud-based contact center

Number: US20210004817A1
Assignee: Talkdesk Inc

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks. The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.

More details
Publication date: 07-01-2021

System and method for automated scheduling using agent assist within a cloud-based contact center

Number: US20210004824A1
Assignee: Talkdesk Inc

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks. The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.

More details
Publication date: 02-01-2020

Customer voice order triggered mutual affinity merchant donation

Number: US20200005276A1
Assignee: Edatanetworks Inc

A customer uses a mobile device to verbally request an offer that includes an incentive to transact at a merchant's brick and mortar store in the customer's local community in exchange for the merchant's agreement to make an auditable donation to a charity serving the local community. Business rules limit the merchant's charitable donations over calendar periods, which donations can be made directly by the merchant to the community charity, or indirectly to the charity by way of a blind donation made by the merchant to a donation disbursement agency acting on the merchant's behalf to satisfy the merchant's commitment to donate.

More details
Publication date: 07-01-2021

MULTISTREAM ACOUSTIC MODELS WITH DILATIONS

Number: US20210005182A1
Assignee:

Audio signals of speech may be processed using an acoustic model. An acoustic model may be implemented with multiple streams of processing where different streams perform processing using different dilation rates. For example, a first stream may process features of the audio signal with one or more convolutional neural network layers having a first dilation rate, and a second stream may process features of the audio signal with one or more convolutional neural network layers having a second dilation rate. Each stream may compute a stream vector, and the stream vectors may be combined to a vector of speech unit scores, where the vector of speech unit scores provides information about the acoustic content of the audio signal. The vector of speech unit scores may be used for any appropriate application of speech, such as automatic speech recognition.
1. A computer-implemented method for processing speech, comprising: receiving a sequence of feature vectors computed from an audio signal; computing a first stream vector by processing the sequence of feature vectors in a first stream, wherein the first stream comprises a first convolutional neural network layer having a first dilation rate; computing a second stream vector by processing the sequence of feature vectors in a second stream, wherein the second stream comprises a second convolutional neural network layer having a second dilation rate, wherein the second dilation rate is different from the first dilation rate; and computing a vector of speech unit scores by processing the first stream vector and the second stream vector.
2. The computer-implemented method of claim 1, comprising processing the vector of speech unit scores to determine one or more words spoken in the audio signal.
3. The computer-implemented method of claim 1, wherein the sequence of feature vectors comprise a sequence of vectors of Mel-frequency cepstral coefficients.
4. The computer-implemented method of claim 1, wherein the sequence of feature ...
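The two-stream idea in the abstract above can be illustrated with a deliberately small sketch. It assumes scalar features per frame (the patent speaks of feature vectors), a shared hand-written kernel, mean pooling to obtain each stream's value, and a linear mixing matrix; all function names and parameters here are hypothetical and not taken from the patent.

```python
from typing import List, Sequence


def dilated_conv1d(frames: Sequence[float], kernel: Sequence[float], dilation: int) -> List[float]:
    """Valid 1-D convolution whose kernel taps are `dilation` frames apart."""
    span = (len(kernel) - 1) * dilation  # frames covered by one kernel placement
    return [
        sum(w * frames[t + k * dilation] for k, w in enumerate(kernel))
        for t in range(len(frames) - span)
    ]


def stream_value(frames: Sequence[float], kernel: Sequence[float], dilation: int) -> float:
    """Reduce one stream's output to a single number by mean pooling over time (an assumption)."""
    out = dilated_conv1d(frames, kernel, dilation)
    return sum(out) / len(out)


def speech_unit_scores(
    frames: Sequence[float],
    kernel: Sequence[float],
    dilations: Sequence[int],
    mixing: Sequence[Sequence[float]],
) -> List[float]:
    """Run one stream per dilation rate, then mix the stream values linearly into unit scores."""
    streams = [stream_value(frames, kernel, d) for d in dilations]
    return [sum(w * s for w, s in zip(row, streams)) for row in mixing]


if __name__ == "__main__":
    frames = [0, 1, 2, 3, 4, 5, 6, 7]
    # Stream 1 uses dilation 1, stream 2 uses dilation 2; three hypothetical speech units.
    print(speech_unit_scores(frames, [1, 0, -1], [1, 2], [[1, 0], [0, 1], [1, 1]]))
```

A larger dilation rate lets a stream see a wider time context with the same number of kernel weights, which is why the two streams produce different values for the same input.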

More details
Publication date: 07-01-2021

SERVICE DATA PROCESSING METHOD AND APPARATUS AND RELATED DEVICE

Number: US20210005185A1
Author: Fang Xuewei, MA JINGLIN

In a service data processing method performed by a server, user speech information collected by a first terminal is received. A target service operation code according to the user speech information is obtained. The target service operation code is used for identifying target service operation information. The target service operation code is transmitted from the server to the first terminal, so that the first terminal plays the target service operation code by using a speech. The target service operation code obtained by a second terminal is received. A target execution page corresponding to the target service operation code is searched for. The target execution page is transmitted to the second terminal, so that the second terminal executes a service operation corresponding to the target ...
1. A service data processing method, comprising: receiving, by circuitry of a server, user speech information collected by a first terminal; obtaining, by the circuitry of the server, a target service operation code according to the user speech information, the target service operation code being used for identifying target service operation information; transmitting, by the circuitry of the server, the target service operation code to the first terminal, so that the first terminal plays the target service operation code by using a speech; receiving, by the circuitry of the server, the target service operation code obtained by a second terminal; searching, by the circuitry of the server, for a target execution page corresponding to the target service operation code; and transmitting, by the circuitry of the server, the target execution page to the second terminal, so that the second terminal executes a service operation corresponding to the target service operation information in the target execution page.
2. The method according to claim 1, wherein the obtaining a target service operation code according to the user speech information comprises: performing, by the circuitry of the ...

More details
Publication date: 07-01-2021

SPEECH RECOGNITION SYSTEM PROVIDING SECLUSION FOR PRIVATE SPEECH TRANSCRIPTION AND PRIVATE DATA RETRIEVAL

Number: US20210005190A1
Assignee:

A method includes receiving a voice input via a microphone of an electronic device, and determining whether the voice input contains speech from an authorized user of the electronic device or speech from an unauthorized user. The method includes in response to determining that the voice input contains speech from the authorized user: determining whether the speech contains private speech or public speech; in response to determining that the speech contains private speech, processing the voice input through a local automatic speech recognition (ASR) engine within the electronic device, the local ASR engine converting the voice input from audio format to text format and outputting a text transcription of the private speech; and in response to determining that the speech does not contain private speech, forwarding the voice input through a communication interface associated with a network-connected external device for processing the voice input at the network-connected external device.
1. A method comprising: receiving a voice input via a microphone of an electronic device; determining whether the voice input contains speech from an authorized user of the electronic device or speech from an unauthorized user; in response to determining that the voice input contains speech from the authorized user: determining whether the speech contains private speech or public speech; in response to determining that the speech contains private speech, processing the voice input through a local automatic speech recognition (ASR) engine within the electronic device, the local ASR engine converting the voice input from audio format to text format and outputting a text transcription of the private speech; and in response to determining that the speech does not contain private speech, forwarding the voice input through a communication interface associated with a network-connected external device for processing the voice input at the network-connected external device.
2. The method of ...
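The routing decision described in the abstract above can be sketched as a small function. This is an illustration only: the `VoiceInput` fields, the callables `local_asr` and `forward_to_remote`, and the "ignored" outcome for unauthorized speakers are assumptions, since the abstract does not say how speakers are identified, how private speech is detected, or what happens to unauthorized speech.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Set, Tuple


@dataclass
class VoiceInput:
    speaker_id: str   # assumed output of a speaker-verification step
    audio: bytes
    is_private: bool  # assumed output of a private-speech detector


def route_voice_input(
    voice: VoiceInput,
    authorized_ids: Set[str],
    local_asr: Callable[[bytes], str],
    forward_to_remote: Callable[[bytes], None],
) -> Tuple[str, Optional[str]]:
    """Return (route, transcript); a transcript is produced only on the local path."""
    if voice.speaker_id not in authorized_ids:
        return ("ignored", None)  # assumption: unauthorized speech is simply dropped
    if voice.is_private:
        # Private speech never leaves the device: transcribe it locally.
        return ("local", local_asr(voice.audio))
    # Public speech is handed to the network-connected external device.
    forward_to_remote(voice.audio)
    return ("remote", None)
```

Keeping the two recognizers behind plain callables makes the privacy property easy to check: the remote callable is never invoked on the private branch.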

More details
Publication date: 07-01-2021

SYSTEM, SERVER, AND METHOD FOR SPEECH RECOGNITION OF HOME APPLIANCE

Number: US20210005191A1
Assignee:

Provided is a system, server, and method for speech recognition capable of collectively setting a plurality of setting items for device control through an utterance of a single sentence provided in the form of natural language. The system includes: a home appliance configured to receive a speech command that is generated through an utterance of a single sentence for control of the home appliance; and a server configured to receive the speech command in the single sentence from the home appliance and interpret the speech command of the single sentence through multiple intent determination.
1. A speech recognition system for a home appliance, comprising: a home appliance configured to receive a speech command that is generated through an utterance of a single sentence for control of the home appliance; and a server configured to receive the speech command in the single sentence from the home appliance and interpret the speech command in the single sentence through multiple intent determination.
2. The speech recognition system of claim 1, wherein the speech command generated through the utterance of the single sentence includes a plurality of intents, and the server interprets the speech command on the basis of the plurality of intents.
3. The speech recognition system of claim 2, wherein the server is configured to: generate a plurality of instruction sentence formulas by combining the plurality of intents; generate a plurality of derivative sentences on the basis of the plurality of instruction sentence formulas; and compare the plurality of derivative sentences with a plurality of pieces of speech command data registered in the server, to find matching speech command data in the comparison.
4. The speech recognition system of claim 3, wherein the server is configured to: generate a plurality of scenarios operable by the home appliance on the basis of a function and a specification of the home appliance; and generate the plurality of instruction sentence formulas ...

More details
Publication date: 07-01-2021

Method for Exiting a Voice Skill, Apparatus, Device and Storage Medium

Number: US20210005193A1

A method for exiting a voice skill, an apparatus, a device, and a storage medium are provided by embodiments of the present disclosure, wherein a user voice instruction is received; a target exit intention corresponding to the user voice instruction is identified according to the user voice instruction and a grammar rule of a preset exit intention; and a corresponding operation is executed on a current voice skill of a device according to the target exit intention. The embodiments of the present disclosure refine and expand the user's exit intention. After the target exit intention to which the user voice instruction belongs is identified, the corresponding operation is executed according to the target exit intention so as to meet the users' different exit requirements for the voice skills, enhance the fluency and convenience of user interaction with the device and improve the user's exit experience when using the voice skills.
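As a rough illustration of matching a voice instruction against grammar rules for preset exit intentions, the sketch below uses regular expressions. The three intention names, the patterns, and the state changes are invented for the example; the abstract does not list the actual intentions or rules.

```python
import re
from typing import Dict, Optional

# Hypothetical grammar rules, checked in order; more specific intentions come first.
EXIT_GRAMMAR = [
    ("pause", re.compile(r"\b(pause|hold on)\b", re.IGNORECASE)),
    ("exit_and_power_off", re.compile(r"\b(shut ?down|power off)\b", re.IGNORECASE)),
    ("exit_skill", re.compile(r"\b(exit|quit|stop|close)\b", re.IGNORECASE)),
]


def identify_exit_intention(utterance: str) -> Optional[str]:
    """Return the first exit intention whose grammar rule matches, or None."""
    for intention, pattern in EXIT_GRAMMAR:
        if pattern.search(utterance):
            return intention
    return None


def apply_exit_intention(intention: Optional[str], state: Dict[str, object]) -> Dict[str, object]:
    """Return a new device state after executing the operation for the intention."""
    new_state = dict(state)
    if intention == "pause":
        new_state["skill"] = "paused"
    elif intention == "exit_skill":
        new_state["skill"] = "closed"
    elif intention == "exit_and_power_off":
        new_state["skill"] = "closed"
        new_state["device_on"] = False
    return new_state
```

Ordering the rules from most to least specific is one simple way to keep a phrase such as "stop and power off" from being treated as a plain exit.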

More details
Publication date: 07-01-2021

Electric device, control method of electric device, and storage medium

Number: US20210005194A1
Author: Yuichi Nishii
Assignee: Sharp Corp

Provided is an electric device including a function executer that executes a predetermined function, a communicator that communicates with a voice recognition server and receives an execution command corresponding to a voice command from the voice recognition server, an execution state detector that detects an execution state of the predetermined function, a message outputter that outputs a message relating to the execution state of the function, and a controller that controls the function executer, the communicator, the execution state detector, and the message outputter. If the communicator receives an execution command from the voice recognition server, the controller causes the execution state detector to detect the execution state of the predetermined function, and if the predetermined function is executable, the controller causes the function executer to execute the predetermined function, whereas if the predetermined function is not executable, the controller causes the message outputter to output a related message.
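The controller logic in the abstract above (check the execution state first, then either execute or output a message) can be sketched in a few lines. The callables and the `"executable"` state label are hypothetical stand-ins for the execution state detector, function executer, and message outputter.

```python
from typing import Callable


def handle_execution_command(
    command: str,
    detect_state: Callable[[str], str],
    execute: Callable[[str], None],
    output_message: Callable[[str], None],
) -> bool:
    """Execute the command if its function is executable; otherwise report why not."""
    state = detect_state(command)  # queried only after a command arrives from the server
    if state == "executable":
        execute(command)
        return True
    output_message(f"Cannot run '{command}': {state}")
    return False
```

For example, with a detector that reports "door open" for a drying cycle, the call returns `False` and passes a single message to `output_message` instead of starting the function.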

More details
Publication date: 07-01-2021

COMMUNICATION SYSTEM, CONTROL METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM

Number: US20210005196A1
Author: Nomoto Masakazu
Assignee:

A communication system comprises a communication device and a server system. The communication system obtains permission to perform a function related to the communication device from a user, performs a predetermined process of obtaining permission to perform a predetermined function from the user, if the predetermined function that the user does not permit the server system to perform is added as the function, performs the function that the user permits the server system to perform in advance, if an instruction for performing the function that the user permits the server system to perform in advance is inputted into a voice control device with a voice, after the predetermined process is performed and in a state where the permission to perform the predetermined function is not obtained from the user, and performs a process corresponding to the function.
1. A communication system including a communication device and a server system, the communication system comprising: an obtaining unit that obtains permission to perform a function related to the communication device using the server system from a user; a first performing unit that performs a predetermined process of obtaining permission to perform a predetermined function using the server system from the user, in a case where the predetermined function that the user does not permit the server system to perform is added as the function related to the communication device; a second performing unit that performs the function related to the communication device that the user permits the server system to perform in advance, in a case where an instruction for performing the function related to the communication device that the user permits the server system to perform in advance is inputted into a voice control device with a voice, after the predetermined process is performed and in a state where the permission to perform the predetermined function by using the server system is not obtained from the user; and a third ...

More details
Publication date: 07-01-2021

Focus Session at a Voice Interface Device

Number: US20210005202A1
Assignee:

A first electronic device of a local group of connected electronic devices receives a first voice command including a request for a first operation, assigns a first target device from among a local group of connected electronic devices as an in-focus device for performing the first operation, causes the first operation to be performed by the first target device via operation of a server-implemented common network service, receives a second voice command including a request for a second operation, and, based on a determination that the second voice command does not include an explicit designation of a second target device and a determination that the second operation can be performed by the first target device, assigns the first target device as the in-focus device for performing the second operation.
1. A method, comprising: receiving a first voice command including a request for a first operation; assigning a first target device from among the local group of connected electronic devices as an in-focus device for performing the first operation; in accordance with the assigning of the first target device as the in-focus device for performing the first operation, causing the first operation to be performed by the first target device via operation of the server-implemented common network service; receiving a second voice command including a request for a second operation; determining that the second voice command does not include an explicit designation of a second target device; determining that the second operation can be performed by the first target device; and in accordance with (i) the ...: assigning the first target device as the in-focus device for performing the second operation; and in accordance with the assigning of the first target device as the in-focus device for performing the second operation, causing the second operation to be performed by the first target device via operation of the server-implemented common network service.
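The in-focus rule described above can be sketched as a small state holder. The class name, the capability table, and the choice to return `None` when no target can be inferred are assumptions made for illustration; the abstract only covers the case where the in-focus device can perform the second operation.

```python
from typing import Dict, Optional, Set


class FocusSession:
    """Tracks which device in a local group is currently 'in focus'."""

    def __init__(self, capabilities: Dict[str, Set[str]]) -> None:
        self.capabilities = capabilities  # device name -> operations it can perform
        self.in_focus: Optional[str] = None

    def resolve_target(self, operation: str, explicit_target: Optional[str] = None) -> Optional[str]:
        """Pick the device for a voice command, updating the focus when one is named."""
        if explicit_target is not None:
            self.in_focus = explicit_target
            return explicit_target
        # No explicit designation: reuse the in-focus device if it can do the operation.
        if self.in_focus is not None and operation in self.capabilities.get(self.in_focus, set()):
            return self.in_focus
        return None  # assumption: the caller would ask the user to name a device
```

For instance, after "play on the TV" puts the TV in focus, a bare "pause" resolves to the TV, while a bare "dim" resolves to nothing because the TV has no such operation in this hypothetical table.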

More details
Publication date: 07-01-2021

System and method for speech-enabled automated agent assistance within a cloud-based contact center

Number: US20210005206A1
Assignee: Talkdesk Inc

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks. The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.

More details
Publication date: 04-01-2018

DEVICE INCLUDING SPEECH RECOGNITION FUNCTION AND METHOD OF RECOGNIZING SPEECH

Number: US20180005627A1
Assignee:

A device including a speech recognition function which recognizes speech from a user, includes: a loudspeaker which outputs speech to a space; a microphone which collects speech in the space; a first speech recognition unit which recognizes the speech collected by the microphone; a command control unit which issues a command for controlling the device, based on the speech recognized by the first speech recognition unit; and a control unit which prohibits the command issuance unit from issuing the command, based on the speech to be output from the loudspeaker.
1-7. (canceled)
8. A device including a speech recognition function which recognizes user speech which is speech from a user, the device comprising: a loudspeaker which outputs speech to a space; a microphone which collects speech in the space; a speech output unit configured to output a speech signal; a first speech recognition unit configured to recognize the speech collected by the microphone; a first control unit configured to generate a command for controlling the speech output unit, based on the speech recognized by the first speech recognition unit; and a second control unit configured to permit or prohibit issuance of the command to the speech output unit, based on the speech signal, wherein the second control unit includes a second speech recognition unit configured to analyze the speech signal to recognize the speech to be output from the loudspeaker, and the second control unit is configured to determine whether or not the speech recognized by the second speech recognition unit matches a predetermined keyword, and when the speech recognized by the second speech recognition unit matches the predetermined keyword, prohibit the issuance of the command and when the speech recognized by the second speech recognition unit does not match the predetermined keyword, permit issuance of the command.
9. A method of recognizing user speech using a device including a speech recognition function, the user speech being ...
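The permit-or-prohibit check in claim 8 can be sketched as follows. The sketch assumes both recognizers have already produced text, and it treats "matches a predetermined keyword" as a case-insensitive substring test; the function names and that matching rule are illustrative assumptions.

```python
from typing import Iterable, Optional


def may_issue_command(loudspeaker_text: str, keywords: Iterable[str]) -> bool:
    """Permit issuance only if the speech about to be played contains no command keyword."""
    spoken = loudspeaker_text.lower()
    return not any(keyword.lower() in spoken for keyword in keywords)


def command_from_microphone(
    microphone_text: str, loudspeaker_text: str, keywords: Iterable[str]
) -> Optional[str]:
    """Return a normalized command, or None when the device's own output would trigger it."""
    if not may_issue_command(loudspeaker_text, keywords):
        return None
    return microphone_text.strip().lower()
```

The point of the second recognizer is to keep the device from obeying itself: if a programme being played says "volume up", the microphone will pick it up, and the check blocks the resulting command.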

More details
Publication date: 04-01-2018

POLICY AUTHORING FOR TASK STATE TRACKING DURING DIALOGUE

Number: US20180005629A1
Assignee: Microsoft Technology Licensing, LLC

Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.
1. A method comprising: receiving, from a task owner resource, a process flowchart providing a process flow for collection of information that is needed for the task owner resource to execute the task, wherein the task owner resource is independent of a conversational understanding system that executes a task state tracking of the parameter data; creating a data exchange task definition, wherein the data exchange definition adapts the process flow chart for the task state tracking by the conversational understanding system based on a state of a dialogue with a user computing device, and wherein the creating comprises: generating a per-turn policy for interacting with the user computing device based on the state of the dialogue and an evaluation of the process flow chart; and storing the data exchange task definition for recall to assist the conversational understanding system in the task state tracking.
2. The method according to claim 1, further comprising: identifying one or more data representations that are sharable from the data ...

More details
Publication date: 04-01-2018

PERFORMING TASKS AND RETURING AUDIO AND VISUAL ANSWERS BASED ON VOICE COMMAND

Number: US20180005631A1
Assignee: KT CORPORATION

An artificial intelligence voice interactive system may provide various services to a user in response to a voice command by providing an interface between the system and a legacy system to enable providing various types of existing services in response to user speech without modifying systems for the existing services. Such system includes a central server, and the central server may perform operations of registering a plurality of service servers at the central server and storing registration information of each service server, analyzing voice command data from the user device and determining at least one task and corresponding service servers based on the analysis results, generating an instruction message based on the voice command data, the determined at least one task, and the registration information of the selected service servers, and transmitting the generated instruction message to the selected service servers, and receiving task results including audio and video data from the selected service servers and outputting the task results through at least one device associated with the user device.
1. A method for providing voice interactive services to a user through an artificial intelligence voice interactive system including a central server and a plurality of service servers each connected to the central server through a communication network, the method of the central server comprising: performing a registration operation upon generation of a predetermined event for registering the service servers and storing registration information of each service server; receiving voice command data, determining at least one task to perform based on the voice command data, and selecting at least one of the service servers based on the determined task; and generating an instruction message based on the voice command data, the determined at least one task, and the registration information of the selected service server, and transmitting the generated instruction message to ...

More details
Publication date: 04-01-2018

SYSTEM AND METHOD FOR RECORDING A VIDEO SCENE WITHIN A PREDETERMINED VIDEO FRAMEWORK

Number: US20180005665A1
Author: Knutt James Karl
Assignee:

A computer-implemented video method includes receiving a first digital video file and a second digital video file; recognizing the first digital video file as a beginning scene and the second digital video file as an ending scene; receiving a user input to record a middle scene, wherein the beginning scene, the middle scene, and the ending scene being configured to form a full video; and responsive to a user input to record, providing a real-time queue for the recording by sequentially, in real-time: 1) first, playing the beginning scene within a first preview window on the video display; 2) second, recording the middle scene and simultaneously displaying the middle scene within a video capture window on the video display; and 3) third, playing the ending scene within a second preview window on the video display.
1. A method comprising steps of: receiving, by a processor communicatively coupled to a video display and a front-facing camera, a first digital video file and a second digital video file; recognizing, by the processor, the first digital video file as a beginning scene and the second digital video file as an ending scene; after receiving the first digital video file and the second digital video file, receiving, by the processor, a user input to record a middle scene, wherein the beginning scene, the middle scene, and the ending scene are configured to form a full video; and responsive to the user input to record, the processor: 1) first, playing the beginning scene within at least one preview window on the video display; 2) second, automatically after an end of the beginning scene, recording the middle scene via the front-facing camera while simultaneously displaying the middle scene within a video capture window on the video display; and 3) third, automatically after recording an end of the middle scene, playing the ending scene within the at least one preview window on the video display.
2. The method in accordance with claim 1, wherein: the front- ...

More details
Publication date: 02-01-2020

PHONIC FIRES TRAINER

Number: US20200005661A1
Assignee: Cubic Corporation

A voice-controlled training unit for conducting fire training and/or operation of an artillery unit may include a communication interface, a memory, and a processing unit communicatively coupled with the communication interface and the memory. The processing unit may be configured to cause the voice-controlled training unit to detect spoken speech, and to determine that the spoken speech includes a command that is related to operation of the artillery unit. The processing unit may be further configured to cause the voice-controlled training unit to generate a message indicative of the command, in accordance with a protocol of a distributed computer simulation standard, and send, via the communication interface, the message indicative of the command to a remote simulation system.
1. A voice-controlled training unit for conducting fire training or operations of an artillery unit, the voice-controlled training unit comprising: a communication interface; a memory; and a processing unit communicatively coupled with the communication interface and the memory, and configured to cause the voice-controlled training unit to: detect spoken speech; determine that the spoken speech includes a command that is related to operation of the artillery unit; generate a message indicative of the command, in accordance with a protocol of a distributed computer simulation standard; and send, via the communication interface, the message indicative of the command to a remote simulation system.
2. The voice-controlled training unit of claim 1, wherein the processing unit is further configured to cause the voice-controlled training unit to: determine that the command is related to firing of the artillery unit; calculate a firing solution for the artillery unit; generate a message indicative of the firing solution, in accordance with the protocol of the distributed computer simulation standard, and send, via the communication interface, the message indicative of the firing solution to ...

More details
Publication date: 13-01-2022

DISTRIBUTED TYPE TRAFFIC LINE TRACING APPARATUS, AND METHOD USING THE SAME

Number: US20220014876A1
Author: YOO Jae-Chern

A distributed type traffic line tracing apparatus of the present disclosure is personally installed in a residence of a tracing target person, to automatically verify whether a traffic line of the tracing target person coincides with a traffic line of a confirmed person and automatically report a result to a disease management authority only when the traffic lines coincide with each other in an emergency situation such as an infectious disease pandemic, so that a quarantining target person is discovered and found early while protecting a privacy of an individual much more when compared to a centralized type, thereby rapidly establishing an infectious disease management system for patients with suspected diseases and efficiently managing the patients from the spread of infectious diseases.
1. A distributed type traffic line tracing apparatus comprising: a digital communication module installed in a region in a residence of a tracing target person; a traffic line tracing management application installed in a mobile phone of the tracing target person, the traffic line tracing management application configured to store mobile phone location information of the tracing target person provided from a plurality of location information providers in a resident memory of the mobile phone, and wirelessly transmit the mobile phone location information to the digital communication module or a remote server when necessary; communication connection verifier, resident in the traffic line tracing management application, configured to verify whether the digital communication module and the mobile phone are connected through short-range communication; a tracing target person traffic line information memory configured to store traffic line history information of the mobile phone; a memory scheduler configured to move and store the mobile phone location information stored in the resident memory of the mobile phone in the tracing target person traffic line information memory through wireless ...

More details
Publication date: 02-01-2020

INTERACTIVE METHOD AND DEVICE OF ROBOT, AND DEVICE

Number: US20200005772A1
Author: DAI Jun, Liu Ying
Assignee:

Embodiments of the present disclosure provide an interactive method of a robot, an interactive device of a robot and a device. The method includes: obtaining voice information input by an interactive object, and performing semantic recognition on the voice information to obtain a conversation intention; obtaining feedback information corresponding to the conversation intention based on a conversation scenario knowledge base pre-configured by a simulated user; and converting the feedback information into a voice of the simulated user, and playing the voice to the interactive object.
1. An interactive method for a robot, comprising: obtaining voice information input by an interactive object, and performing semantic recognition on the voice information to obtain a conversation intention; obtaining feedback information corresponding to the conversation intention based on a conversation scenario knowledge base pre-configured by a simulated user; and converting the feedback information into a voice of the simulated user, and playing the voice to the interactive object.
2. The method according to claim 1, wherein obtaining the feedback information corresponding to the conversation intention based on the conversation scenario knowledge base pre-configured by the simulated user comprises: querying the conversation scenario knowledge base based on the conversation intention, to obtain a query path; when the query path shows a preset path, querying rich media knowledge pre-configured by the simulated user and/or structured knowledge related to user characteristics and pre-configured by the simulated user, to obtain the feedback information corresponding to the conversation intention.
3. The method according to claim 2, wherein after querying the conversation scenario knowledge base based on the conversation intention to obtain the query path, the method further comprises: when the query path shows an external path, querying a search engine pre-configured by the simulated ...

Publication date: 02-01-2020

AUTOMATIC SKILL ROUTING IN CONVERSATIONAL COMPUTING FRAMEWORKS

Number: US20200005776A1

An utterance is analyzed to identify an absence of a known invocation phrase. A skill set is constructed in response to the absence, the skill set including a first skill corresponding to the utterance and a first skill score corresponding to a likelihood that the first skill corresponds to the utterance. The first skill score is adjusted, based on the presence of the first skill in a skill history, where the skill history stores a set of history skills in an order of recency of use of each history skill in the set of history skills. The first skill score is adjusted, based on an association of the first skill with a default installed skill. An installed skill is selected, based on the adjusted first skill score, the installed skill performing an action in response to the utterance.

1. A method comprising: analyzing an utterance to identify an absence of a known invocation phrase; constructing, in response to the absence, using a processor and a memory, a skill set, the skill set including a first skill corresponding to the utterance and a first skill score corresponding to a likelihood that the first skill corresponds to the utterance; adjusting, based on the presence of the first skill in a skill history, the first skill score, wherein the skill history stores a set of history skills in an order of recency of use of each history skill in the set of history skills; adjusting, based on an association of the first skill with a default installed skill, the first skill score; and selecting, based on the adjusted first skill score, an installed skill, wherein the installed skill performs an action in response to the utterance.
2. The method of claim 1, wherein the adjusting, based on the presence of the first skill in a skill history, further comprises: determining that the first skill matches a history skill associated with an installed skill other than the default installed skill; and raising, responsive to the determining, the first skill score.
3. The ...
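The score adjustment described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `Candidate` type, the `route` function, the boost sizes, and the choice to raise (rather than lower) the score for default skills are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    skill: str
    score: float  # likelihood that the skill matches the utterance


def route(candidates, history, installed, default_skills,
          history_boost=0.2, default_boost=0.1):
    """Pick an installed skill for an utterance without an invocation phrase.

    `history` is ordered most-recent-first. The boost values and the
    rank-based scaling are hypothetical, chosen only for illustration.
    """
    adjusted = {}
    for c in candidates:
        score = c.score
        if c.skill in history:
            # More recently used skills get a larger share of the boost.
            rank = history.index(c.skill)
            score += history_boost / (rank + 1)
        if c.skill in default_skills:
            # Assumed direction: association with a default skill raises the score.
            score += default_boost
        adjusted[c.skill] = score
    eligible = [s for s in adjusted if s in installed]
    if not eligible:
        return None
    return max(eligible, key=adjusted.get)
```

With two close candidates, a skill that appears in the recent history can overtake one with a slightly higher raw score, which is the behavior the abstract describes.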

Publication date: 02-01-2020

OBJECT SEARCHING METHOD, OBJECT SEARCHING DEVICE AND OBJECT SEARCHING SYSTEM

Number: US20200005779A1

An object searching method includes the following operations: receiving, by an object searching device, a user message; analyzing, by the object searching device, an object name from the user message; obtaining, by the object searching device, a locator corresponding to the object name according to a locator mapping table; detecting, by the object searching device, a locator distance and a locator direction of the locator; generating, by the object searching device, a description string according to the locator distance, the locator direction, and a feature direction map; generating, by the object searching device, a voice message according to the description string and the object name; and broadcasting, by the object searching device, the voice message.

1. An object searching method, comprising: receiving, by an object searching device, a user message; analyzing, by the object searching device, an object name from the user message; obtaining, by the object searching device, a locator corresponding to the object name according to a locator mapping table; detecting, by the object searching device, a locator distance and a locator direction of the locator; generating, by the object searching device, a description string according to the locator distance, the locator direction, and a feature direction map; generating, by the object searching device, a voice message according to the description string and the object name; and broadcasting, by the object searching device, the voice message.
2. The object searching method of claim 1, further comprising: obtaining, by the object searching device, an environmental object distance and an environmental object direction of at least one environmental object in a space; and establishing the feature direction map, by the object searching device, according to the environmental object distance and the environmental object direction.
3. The object searching method of claim 2, wherein the at least one environmental object comprises a ...
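One way to picture the description-string step is sketched below. It assumes that the feature direction map is a dictionary of environmental objects keyed by name, each stored as a (distance, direction) pair relative to the device, and that the description names the nearest environmental object. The functions `to_xy` and `describe` and the wording of the string are hypothetical.

```python
import math


def to_xy(distance, direction_deg):
    """Convert a polar reading (metres, degrees) to Cartesian coordinates."""
    rad = math.radians(direction_deg)
    return distance * math.cos(rad), distance * math.sin(rad)


def describe(object_name, locator, feature_map):
    """Build a description string for the located object.

    locator: (distance_m, direction_deg) of the locator tag.
    feature_map: {name: (distance_m, direction_deg)} of environmental objects.
    """
    position = to_xy(*locator)
    # Pick the environmental object closest to the locator's position.
    nearest = min(
        feature_map,
        key=lambda name: math.dist(position, to_xy(*feature_map[name])),
    )
    return f"{object_name} is about {locator[0]:.1f} m away, near the {nearest}"
```

The resulting string would then be handed to a text-to-speech step to produce the broadcast voice message.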

Publication date: 02-01-2020

Voice interaction method and device

Number: US20200005780A1
Author: Yongshuai LU

Embodiments of the present disclosure provide a voice interaction method and device. The method includes: determining whether a first query statement currently received is a query statement first received within a preset time period; if not, obtaining a second query statement, where the second query statement is the query statement last received before receiving the first query statement; obtaining a third sentence vector according to a first sentence vector of the first query statement and a second sentence vector of the second query statement; and obtaining, from a bottom corpus, a first question and answer result corresponding to a fourth sentence vector whose similarity to the third sentence vector satisfies a preset condition, and returning the first question and answer result. The method provided in the embodiment can return a bottom reply irrelevant to the query statement to the user, thereby improving the user experience.
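The vector-combination and lookup steps can be illustrated with a small sketch. The abstract does not say how the third sentence vector is derived or what the preset condition is, so the weighted average in `combine`, the cosine-similarity threshold, and the corpus layout (a list of `(vector, answer)` pairs) are all assumptions.

```python
import math


def combine(current_vec, previous_vec, weight=0.5):
    """Blend the current and previous query vectors (assumed: weighted mean)."""
    return [weight * a + (1 - weight) * b
            for a, b in zip(current_vec, previous_vec)]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def fallback_answer(current_vec, previous_vec, corpus, threshold=0.8):
    """Return the answer whose vector is most similar to the blended query,
    or None when no entry reaches the (hypothetical) similarity threshold."""
    query = combine(current_vec, previous_vec)
    best_answer, best_sim = None, threshold
    for vec, answer in corpus:
        sim = cosine(query, vec)
        if sim >= best_sim:
            best_answer, best_sim = answer, sim
    return best_answer
```

Blending in the previous query gives a short follow-up question some of the context of the turn before it, which is the apparent purpose of the third sentence vector.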

Publication date: 02-01-2020

HUMAN-MACHINE INTERACTION PROCESSING METHOD AND APPARATUS THEREOF

Number: US20200005781A1

Embodiments of the present disclosure provide a human-machine interaction processing method, an apparatus thereof, a user terminal, a processing server and a system. On the user terminal side, the method includes: receiving an interaction request voice inputted from a user, and collecting video data of the user when inputting the interaction request voice; obtaining an interaction response voice corresponding to the interaction request voice, where the interaction response voice is obtained according to expression information of the user when inputting the interaction request voice and included in the video data; and outputting the interaction response voice to the user. The method imbues the interaction response voice with an emotional tone that matches the current emotion of the user, so that the human-machine interaction process is no longer monotonous, greatly enhancing the user experience.

1. A human-machine interaction processing method, comprising: receiving an interaction request voice inputted from a user, and collecting video data of the user when inputting the interaction request voice; obtaining an interaction response voice corresponding to the interaction request voice, wherein the interaction response voice is obtained according to expression information of the user when inputting the interaction request voice and included in the video data; and outputting the interaction response voice to the user.
2. The method according to claim 1, wherein the collecting the video data of the user when inputting the interaction request voice comprises: collecting, via a binocular camera, the video data of the user when inputting the interaction request voice.
3. The method according to claim 1, wherein the obtaining the interaction response voice corresponding to the interaction request voice, wherein the interaction response voice is obtained according to expression information of the user when inputting the interaction request voice and included in the ...

Publication date: 02-01-2020

VOICE RECOGNITION FOR PATIENT CARE ENVIRONMENT

Number: US20200005783A1

A location monitoring system tracks a location of a user within a healthcare facility. When the user is detected in a patient room an electronic controller activates a voice command database having a plurality of voice commands specific to the user. A microphone located in the patient room receives one of the plurality of voice commands. The electronic controller transmits the one of the plurality of voice commands to a remote device positioned outside of the patient room.

1. A method of providing communication in a healthcare facility, the method comprising: determining with a location device whether a person is in a patient room, activating a voice command database having a plurality of voice commands specific to the person in the patient room, receiving one of the plurality of voice commands at an electronic device located in the patient room, and transmitting the one of the plurality of voice commands to a remote device positioned outside of the patient room.
2. The method of claim 1, wherein the person in the patient room is a caregiver, and activating a voice command database includes activating a voice command database having a plurality of voice commands specific to the caregiver.
3. The method of claim 1, wherein the person in the patient room is a patient and activating a voice command database includes activating a voice command database having a plurality of voice commands specific to the patient.
4. The method of claim 1, further comprising monitoring vital signs of a patient in the patient room.
5. The method of claim 4, further comprising transmitting a notification to a caregiver when it is determined that a caregiver is in the patient room.
6. The method of claim 1, further comprising: tracking the voice command transmitted to the remote device, and generating an action item in response to the voice command.
7. The method of claim 6, further comprising removing the action item when a caregiver responds to the voice command.
8. The method of claim ...

Publication date: 02-01-2020

ELECTRONIC DEVICE AND OPERATING METHOD THEREOF FOR OUTPUTTING RESPONSE TO USER INPUT, BY USING APPLICATION

Number: US20200005784A1

A method of outputting a response to a user input in an electronic device is provided. The method includes receiving a user input from a user and, in response to receiving the user input, generating a first response comprising first content based on the user input, obtaining contextual information of the user, generating a second response comprising second content based on the contextual information, the second content being different from the first content, generating a combined response based on the first response and the second response, and outputting the combined response.

1. A method of an electronic device, comprising: receiving a user input from a user; and in response to receiving the user input, generating a first response comprising first content based on the user input, obtaining contextual information of the user, generating a second response comprising second content based on the contextual information, the second content being different from the first content, generating a combined response based on the first response and the second response, and outputting the combined response.
2. The method of claim 1, wherein a type of the first content comprises at least one of text, a moving picture, an image, or audio content, and wherein a type of the second content comprises at least one of text, a moving picture, an image, audio content, a light-emitting diode (LED) output, a vibration output, a visual effect, an audible effect, or a user interface.
3. The method of claim 1, further comprising, in response to the user input, obtaining first data, wherein the first response is generated based on the first data, and wherein the second response is generated based on second data obtained by modifying the first data based on the contextual information.
4. The method of claim 1, further comprising inputting the contextual information into a generative model to generate the second response.
5. The method of claim 4, further ...

Publication date: 02-01-2020

Method for Term-Dependent Output of Information Based on a Voice Input to a Specific Group, and System

Number: US20200005785A1

A method for term-dependent output of information based on a voice input to a specific group includes the steps of: capturing the voice input; analyzing the captured voice input for the presence of a group-specific key term, associated with the specific group; and on detection of the group-specific key term in the analyzed voice input, outputting the information based on the voice input to the specific group.

1. A method for term-dependent output of information based on a voice input to a specific group, the method comprising the steps of: a) capturing the voice input; b) analyzing the captured voice input for a presence of a group-specific key term, associated with the specific group; and c) on detection of the group-specific key term in the analyzed voice input, outputting the information based on the voice input to the specific group.
2. The method according to claim 1, wherein the specific group is one or more of: a garden and/or forestry worker group, and the group-specific key term is a garden and/or forestry term, a hauler group, and the group-specific key term is a hauler term, a forestry and/or work management group, and the group-specific key term is a forestry and/or work management term, and a rescue group, and the group-specific key term is an emergency term and/or a rescue term.
3. The method according to claim 1, further comprising the step of: ascertaining a direction from an output location to a capture location, and wherein the information has the ascertained direction.
4. The method according to claim 1, further comprising the steps of: ascertaining a distance between a capture location and an output location; and if the ascertained distance is below a distance limit value, outputting the information in group-aspecific fashion.
5. The method according to claim 4, wherein the distance is ascertained by a time-of-flight method.
6. The method according to claim 1, further comprising the step of: wirelessly transmitting the captured voice input and/or the ...
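The key-term analysis in steps b) and c) amounts to matching words in the recognized transcript against per-group vocabularies. The sketch below assumes the voice input has already been transcribed to text; the `GROUP_TERMS` table and the `target_groups` function are hypothetical and use simple whole-word matching.

```python
import re

# Hypothetical vocabulary: group name -> key terms that address that group.
GROUP_TERMS = {
    "rescue": {"help", "emergency"},
    "hauler": {"load", "pickup"},
}


def target_groups(transcript, group_terms):
    """Return the groups whose key terms occur in the transcript."""
    words = set(re.findall(r"[a-z]+", transcript.lower()))
    return sorted(group for group, terms in group_terms.items()
                  if words & terms)
```

An empty result means that no group-specific key term was detected, so nothing is routed to a specific group.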

Publication date: 02-01-2020

INFORMATION PROCESSING APPARATUS AND INFORMATION PROCESSING METHOD

Number: US20200005786A1

Achieving voice utterance that can attract an interest of a target further effectively. There is provided an information processing apparatus including an utterance control unit that controls output of voice utterance, in which the utterance control unit determines a target on the basis of an analyzed context, and controls an output device to output an attracting utterance that attracts an interest of the target. Furthermore, there is provided an information processing method including executing, by a processor, output control of voice utterance, in which the execution of the output control further includes: determining a target on the basis of an analyzed context; and controlling an output device to output an attracting utterance that attracts an interest of the target.

1. An information processing apparatus comprising an utterance control unit that controls output of voice utterance, wherein the utterance control unit determines a target on a basis of an analyzed context, and controls an output device to output an attracting utterance that attracts an interest of the target.
2. The information processing apparatus according to claim 1, wherein the utterance control unit determines a first target and a second target on a basis of the context, and controls the output device to output the attracting utterance toward the first target.
3. The information processing apparatus according to claim 2, wherein the utterance control unit controls the output device to output the attracting utterance that attracts an interest of the second target, toward the first target.
4. The information processing apparatus according to claim 2, wherein the context includes a conversation status between users, and the utterance control unit determines the first target and the second target on a basis of the conversation status.
5. The information processing apparatus according to claim 4, wherein, on a basis of a fact that a target user being a target of an utterance makes no response to the ...

Publication date: 02-01-2020

GUIDE ROBOT AND METHOD FOR OPERATING THE SAME

Number: US20200005787A1
Author: MAENG Jichan, SHIN Wonho
Assignee: LG ELECTRONICS INC.

The present disclosure relates to a guide robot and a method of operating the same. A guide robot according to the present disclosure includes a voice receiving unit to receive a voice, a control unit to determine whether the received voice includes a preset wake-up word, and a wireless communication unit to perform communication with an artificial intelligence (AI) server set to be activated by the preset wake-up word. When the received voice includes the preset wake-up word, the control unit transmits the received voice to the artificial intelligence server, receives result information from the artificial intelligence server, and outputs the received result information. When the received voice does not include the preset wake-up word, the control unit outputs a response voice selected according to a predetermined reference.

1. A guide robot, comprising: a voice receiving unit configured to receive a voice; a control unit configured to determine whether the received voice includes a preset wake-up word; and a wireless communication unit configured to perform communication with an artificial intelligence (AI) server set to be activated by the preset wake-up word, wherein the control unit transmits the received voice to the artificial intelligence server, receives result information from the artificial intelligence server, and outputs the received result information, when the received voice includes the preset wake-up word, and outputs a response voice selected according to a predetermined reference when the received voice does not include the preset wake-up word.
2. The guide robot of claim 1, wherein the control unit performs a greeting recognition operation when the received voice does not include the preset wake-up word, and determines whether the received voice is recognized as a greeting based on a sensing signal received from at least one sensor in the greeting recognition operation.
3. The guide robot of claim 2, wherein the control unit ...
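The two branches of the control unit can be sketched as a small dispatch function. This assumes the received voice is already transcribed; `handle_voice`, the `ask_server` callback standing in for the AI server, and the keyword-based choice of a local response (the "predetermined reference" is not specified in the text) are all hypothetical.

```python
def handle_voice(text, wake_word, ask_server, local_responses):
    """Route an utterance either to the AI server or to a local response.

    ask_server: callable that takes the utterance and returns result text.
    local_responses: list of (keyword, response) pairs checked in order.
    """
    lowered = text.lower()
    if wake_word.lower() in lowered:
        # Wake-up word present: forward the utterance and relay the result.
        return ask_server(text)
    # No wake-up word: pick a canned response (assumed rule: first keyword hit).
    for keyword, response in local_responses:
        if keyword in lowered:
            return response
    return None
```

Passing the server call in as a parameter keeps the sketch runnable without any network dependency.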

Publication date: 02-01-2020

IMAGE DISPLAY APPARATUS AND METHOD OF CONTROLLING THE SAME

Number: US20200005790A1
Assignee: SAMSUNG ELECTRONICS CO., LTD.

Provided are an image display apparatus and a method of controlling the same. The image display apparatus enabling voice recognition includes: a first voice inputter which receives a user-side audio signal; an audio outputter which outputs an audio signal processed by the image display apparatus; a first voice recognizer which recognizes the user-side audio signal received through the first voice inputter; and a controller which decreases a volume of the audio signal output through the audio outputter to a predetermined level if a voice recognition start command is received.

1. A display apparatus comprising: a display; a microphone configured to receive a user-side audio; and a controller configured to: obtain a candidate command word from a first user-side audio received through the microphone to register a command word for an execution of a function of the display apparatus, based on the candidate command word being a registrable command word according to a predetermined criterion, register the candidate command word as the command word, based on the candidate command word not being a registrable command word according to the predetermined criterion, receive a second user-side audio through the microphone to obtain another candidate command word, and based on the registered command word being obtained from a third user-side audio received through the microphone after the candidate command word is registered as the command word, execute the function.
2. The display apparatus according to claim 1, wherein the controller is configured to: based on the candidate command word being the registrable command word according to the predetermined criterion, display, on the display, a message which indicates the candidate command word being registrable, and based on the candidate command word not being the registrable command word according to the predetermined criterion, display, on the display, a message which indicates the candidate command word not being ...

Publication date: 02-01-2020

Method and apparatus for processing speech

Number: US20200005793A1
Author: Ya Wu

Embodiments of a method and apparatus for processing speech are provided. The method can include: acquiring, in response to determining that at least one speech interaction device in a target speech interaction device set has received an input speech, a speech feature of the input speech received by a speech interaction device of the at least one speech interaction device; and selecting, based on the speech feature of the input speech received by the speech interaction device in the at least one speech interaction device, a first speech interaction device from the at least one speech interaction device to process the input speech. Some embodiments realize the selection of a targeted speech interaction device.
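The abstract does not name the speech feature used for selection. As one possible illustration, the sketch below assumes each device reports a loudness value for the input speech it received and that the loudest device is chosen; the `select_device` function and the `"loudness"` key are hypothetical.

```python
def select_device(features):
    """Choose the device that should process the input speech.

    features: {device_id: {"loudness": float}} for every device that
    received the speech. Assumed rule: the highest loudness wins.
    """
    if not features:
        return None
    return max(features, key=lambda device: features[device]["loudness"])
```

Other features (for example, signal-to-noise ratio) could be swapped in by changing the key function.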

Publication date: 03-01-2019

INTENTION ESTIMATION DEVICE AND INTENTION ESTIMATION METHOD

Number: US20190005950A1
Author: ISHII Jun, JING Yi
Assignee: Mitsubishi Electric Corporation

When among simple sentences which are estimation targets for an intention estimator, there is a simple sentence whose intention estimation has failed, a supplementary information estimator estimates supplementary information from the simple sentence by using a supplementary information estimation model stored in a supplementary information estimation model storage. When among the simple sentences which are the estimation targets for the intention estimator, there is a simple sentence from which an imperfect intention estimation result is provided, an intention supplementation unit supplements the imperfect intention estimation result by using the supplementary information estimated by the supplementary information estimator.

1-11. (canceled)
12. An intention estimation device comprising: processing circuitry to carry out a morphological analysis on a complex sentence including plural intentions, to carry out a syntactic analysis on the complex sentence on which the morphological analysis is carried out, to divide the complex sentence into plural simple sentences, to estimate an intention included in each of the plural simple sentences, when among the simple sentences which are estimation targets, there is a simple sentence whose intention estimation has failed, to estimate supplementary information from the simple sentence whose intention estimation has failed, and when among the simple sentences which are the estimation targets, there is a simple sentence from which an imperfect intention estimation result is provided, to supplement the imperfect intention estimation result by using the estimated supplementary information.
13. The intention estimation device according to claim 12, wherein the processing circuitry holds a supplementary information estimation model showing a relation between simple sentences and pieces of supplementary information, wherein the processing circuitry estimates the supplementary information by using the ...

Publication date: 03-01-2019

Hands free always on near field wakeword solution

Number: US20190005953A1
Assignee: Amazon Technologies Inc

Apparatuses and systems for conserving power for a portable electronic device that monitors local audio for a wakeword are described herein. In a non-limiting embodiment, a portable electronic device may have two phases. The first phase may be a first circuit that stores an audio input while determining whether human speech is present in the audio input. The second phase may be a second circuit that activates when the first circuit determines that human speech is present in the audio input. The second circuit may receive the audio input from the first circuit, store the audio input, and determine whether a wakeword is present within the audio input.
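The two-phase arrangement can be modeled in software as a cheap gate in front of an expensive detector. This is only a sketch of the idea: the abstract describes hardware circuits, whereas the `TwoStageDetector` class, the energy-threshold speech check, and the injected `wakeword_detector` callable are assumptions made for the example.

```python
from collections import deque


class TwoStageDetector:
    """Stage one buffers audio and checks for speech; stage two runs only
    when speech is present and inspects the buffered audio for a wakeword."""

    def __init__(self, wakeword_detector, energy_threshold=0.01, buffer_frames=50):
        self.buffer = deque(maxlen=buffer_frames)  # stage-one audio store
        self.detect = wakeword_detector            # stage-two check (injected)
        self.threshold = energy_threshold          # hypothetical speech threshold
        self.stage_two_runs = 0                    # counts costly activations

    def push(self, frame):
        """Feed one frame of samples; return True if a wakeword is detected."""
        self.buffer.append(frame)
        energy = sum(s * s for s in frame) / len(frame) if frame else 0.0
        if energy < self.threshold:
            return False  # no speech: stage two stays idle
        self.stage_two_runs += 1
        return self.detect(list(self.buffer))
```

The `stage_two_runs` counter makes the power-saving intent visible: silent frames never reach the second stage.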

Publication date: 03-01-2019

WAKE-ON-VOICE METHOD, TERMINAL AND STORAGE MEDIUM

Number: US20190005954A1

The present disclosure provides a wake-on-voice method, a terminal and a storage medium. The method includes: acquiring a wake-up voice configured to wake up a smart terminal; performing an analysis on an acoustic feature of the wake-up voice by using a preset acoustic model and a preset wake-up word recognition network of the smart terminal, so as to acquire a confidence coefficient of the acoustic feature of the wake-up voice with respect to an acoustic feature of a preset wake-up word; determining whether the confidence coefficient falls in a preset range of moderate confidence coefficients, if yes, uploading the wake-up voice to a remote server; and determining whether a linguistic feature obtained by analyzing the wake-up voice using a linguistic model matches a linguistic feature of the preset wake-up word, if yes, receiving an instruction to wake up the smart terminal generated by the remote server.

1. A wake-on-voice method, comprising: acquiring a wake-up voice configured to wake up a smart terminal; performing an analysis on an acoustic feature of the wake-up voice by using a preset acoustic model and a preset wake-up word recognition network of the smart terminal, so as to acquire a confidence coefficient of the acoustic feature of the wake-up voice with respect to an acoustic feature of a preset wake-up word; determining whether the confidence coefficient falls in a preset range of moderate confidence coefficients, and if yes, uploading the wake-up voice to a remote server; and determining whether a linguistic feature obtained by analyzing the wake-up voice using a linguistic model in the remote server matches a linguistic feature of the preset wake-up word, and if yes, receiving an instruction to wake up the smart terminal generated by the remote server.
2. The wake-on-voice method according to claim 1, wherein after it is determined that the linguistic feature obtained by analyzing the wake-up voice using the linguistic model in the remote server ...
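The confidence-band decision can be sketched as below. The abstract only covers the moderate band, where the audio is uploaded for a server-side linguistic check. Waking locally above the band and ignoring audio below it are assumptions added to make the example complete, as are the `should_wake` name, the band limits, and the `remote_verify` callback.

```python
def should_wake(confidence, audio, remote_verify, low=0.4, high=0.8):
    """Decide whether to wake the terminal.

    remote_verify: callable standing in for the remote server's
    linguistic-model check; it returns True if the wake word is confirmed.
    """
    if low <= confidence <= high:
        # Moderate confidence: defer to the server-side check.
        return remote_verify(audio)
    # Assumption beyond the abstract: wake locally when clearly confident,
    # ignore the audio when clearly not.
    return confidence > high
```

Only ambiguous cases incur the network round trip, which limits both latency and the amount of audio leaving the device.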

Publication date: 03-01-2019

METHODS, SYSTEMS, AND MEDIA FOR VOICE-BASED CALL OPERATIONS

Number: US20190005955A1

Methods, systems, and media for voice-based call operations are provided. In some embodiments, a method comprises: receiving, at a first user device, a communication; detecting a voice command, using the first user device, that includes a keyword; and in response to detecting the voice command, causing the communication to be transferred to a second user device that is associated with the keyword.

1. A method comprising: receiving, at a first user device, a communication; detecting a voice command, using the first user device, that includes a keyword; and in response to detecting the voice command, causing the communication to be transferred to a second user device that is associated with the keyword.
2. The method of claim 1, wherein the communication is an invitation to join a telecommunication channel.
3. The method of claim 2, wherein the telecommunication channel is a telephone channel.
4. The method of claim 2, wherein the telecommunication channel is an Internet videotelephony channel.
5. The method of claim 2, further comprising: in response to receiving the invitation to join the telecommunication channel, presenting a notification of the invitation during a period of time, wherein the period of time is terminated upon acceptance of the invitation, and wherein the voice command is detected during the period of time.
6. The method of claim 2, further comprising causing the second user device to join the telecommunication channel.
7. The method of claim 1, wherein causing the communication to be transferred to the second user device is further in response to detecting a second keyword in the voice command, wherein the second keyword corresponds to a communication transfer operation.
8. The method of claim 1, wherein the communication is an SMS message or an MMS message.
9. The method of claim 1, further comprising: receiving an indication of the keyword; and determining that the keyword is to be associated with the second user device.
10. A system ...

Publication date: 03-01-2019

METHODS, SYSTEMS, AND MEDIA FOR CONNECTING AN IoT DEVICE TO A CALL

Number: US20190005956A1

Methods, systems, and media for connecting an IoT device to a call are provided. In some embodiments, a method is provided, the method comprising: establishing, at a first end-point device, a telecommunication channel with a second end-point device; subsequent to establishing the telecommunication channel, and prior to a termination of the telecommunication channel, detecting, using the first end-point device, a voice command that includes a keyword; and in response to detecting the voice command, causing information associated with an IoT device that corresponds to the keyword to be transmitted to the second end-point device.

1. A method comprising: establishing, at a first end-point device, a telecommunication channel with a second end-point device; subsequent to establishing the telecommunication channel, and prior to a termination of the telecommunication channel, detecting, using the first end-point device, a voice command that includes a keyword; and in response to detecting the voice command, causing information associated with an IoT device that corresponds to the keyword to be transmitted to the second end-point device.
2. The method of claim 1, further comprising: determining that the IoT device is available for wireless communication; receiving an indication of the keyword; and determining that the keyword is to be associated with the IoT device based on the indication of the keyword.
3. The method of claim 2, wherein the indication of the keyword is based on a user input.
4. The method of claim 2, wherein the indication of the keyword is received from a server device that is associated with a user account with which the IoT device is associated.
5. The method of claim 1, further comprising determining that the first end-point device and the IoT device are connected to a particular local area network.
6. The method of claim 1, further comprising: causing the IoT device to join the telecommunication channel, wherein the information associated with the IoT device ...

Publication date: 03-01-2019

DEVICE AND METHOD FOR CONTROLLING APPLICATION PROGRAM USING VOICE COMMAND UNDER PRESET CONDITION

Number: US20190005957A1

Disclosed are an application control device and method that activates a speech recognition mode only under a specific condition and allows control of a smart terminal by a voice input or a touch input under the specific condition. The device includes a condition setting module for setting a speech recognition control condition including an event occurrence or a device status, a command setting module for designating a set of voice commands to be recognized under the set speech recognition control condition, a command recognition module for activating the speech recognition mode and recognizing the voice commands designated by the command setting module when an event included in the speech recognition control condition occurs, and an application control module for converting the recognized voice command into an application control signal to control an application when the designated voice command is recognized under the speech recognition control condition.

1. A device for controlling an application program, the device comprising: a condition setting module for setting a speech recognition control condition in which a speech recognition mode is activated and a smart terminal is controlled by a voice command or a touch input, the speech recognition control condition being at least one of an event condition occurring in the smart terminal and a device status condition; a command setting module for designating a set of voice commands to be recognized in the speech recognition control condition which is set by the condition setting module; a command recognition module for activating the speech recognition mode and recognizing a voice command included in the set of voice commands prepared by the command setting module when an event included in the speech recognition control condition set by the condition setting module occurs; and an application control module for converting a voice command into a smart terminal control signal to control an application program when at least ...

Publication date: 07-01-2016

Bluetooth headset and voice interaction control thereof

Number: US20160006849A1
Assignee: Zgmicro Wuxi Corp

Techniques for a personalized Bluetooth headset and a voice interaction control method thereof are described. According to one aspect of the present invention, the Bluetooth headset is caused to maintain a voice contact list. Each item in the voice contact list corresponds to a phone number associated with a set of audio data (e.g., a voice or a predefined audio). When a paired mobile device receives a call, the voice contact list is searched per the caller number. A corresponding audio is played back when an item is located in the voice contact list. As such a user of the Bluetooth headset knows who is calling and determines whether the call shall be answered or not.
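
The lookup described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the digit-only normalization, the `audio_for_caller` helper, and the file name are all assumptions made for the example.

```python
from typing import Optional

# Hypothetical voice contact list: normalized phone number -> audio clip name.
VoiceContactList = dict[str, str]


def normalize(number: str) -> str:
    """Keep digits only, so differently formatted numbers compare equal."""
    return "".join(ch for ch in number if ch.isdigit())


def audio_for_caller(contacts: VoiceContactList, caller_number: str) -> Optional[str]:
    """Return the audio clip to play for the caller, or None if no item matches."""
    return contacts.get(normalize(caller_number))


# Assumed sample data for illustration only.
contacts: VoiceContactList = {normalize("+1 (555) 010-0199"): "alice_name.wav"}
```

When `audio_for_caller` returns `None`, the headset would presumably fall back to a default ring tone; the abstract does not say what happens in that case.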

Publication date: 07-01-2021

Assistance during audio and video calls

Number: US20210006523A1
Assignee: Google LLC

Implementations relate to providing information items for display during a communication session. In some implementations, a computer-implemented method includes receiving, during a communication session between a first computing device and a second computing device, first media content from the communication session. The method further includes determining a first information item for display in the communication session based at least in part on the first media content. The method further includes sending a first command to at least one of the first computing device and the second computing device to display the first information item.

Publication date: 04-01-2018

Personal Voice-Based Information Retrieval System

Number: US20180007201A1
Author: Kurganov Alexander
Assignee:

The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
1. A method, comprising: (a) receiving a speech command from a voice-enabled device, over a network, by a speech-recognition engine coupled to a media server by an interactive voice response application including a user-defined search, the speech-recognition engine adapted to convert the speech command into a data message, the media server adapted to identify and access at least one or more websites containing information of interest to a particular user, the speech-recognition engine adapted to select particular speech-recognition grammar describing the speech command received and assigned to fetching content relating to the data message converted from the speech command and assigned to the user-defined search including a web request, along with a uniform resource locator of an identified web site from the one or more websites containing information of interest to the particular user and responsive to the web request; (b) selecting, by the media server, at least one information-source-retrieval instruction stored for the particular speech-recognition grammar in a database coupled to the media server and adapted to retrieve information from the at least one or more websites; (c) accessing, by a web-browsing server, a portion of the information source to retrieve information relating to the speech command, by using a processor of the web-browsing server,
...

Publication date: 04-01-2018

VOICE-CONTROLLED AUDIO COMMUNICATION SYSTEM

Number: US20180007210A1
Author: Todasco Michael C.
Assignee:

Systems and methods for providing an audio communication system include receiving from a first user, by a microphone of a first voice-controlled device of a plurality of voice-controlled devices in an audio communication system, a first audio message and an audio command to provide the audio message to a second user. An identity of the second user, associated with a user profile, is determined based on the audio command. The first audio message is provided to a second voice-controlled device of the plurality of voice-controlled devices to output the audio message at a speaker of the voice-controlled device to the second user that is in proximity to the second voice-controlled device.
1. An audio communication system, comprising: a first voice-controlled device comprising: a microphone configured to capture audio from a surrounding location; a speaker configured to output audio to the surrounding location; a first communication interface configured to provide wireless communications with a user device; a non-transitory memory; and one or more hardware processors coupled to the non-transitory memory, the microphone, the first communication interface, and the speaker, wherein the one or more hardware processors are configured to execute instructions from the non-transitory memory to cause the system to perform operations comprising: receiving, via the microphone from a first user, a first audio message and an audio command to provide the first audio message to a second user; receiving, via the first communication interface, a user device identifier of the user device that is within a predetermined range of the first voice-controlled device; determining an identity of the first user based on the user device identifier; determining an identity of the second user based on the audio command to provide the first audio message to the second user and the identity of the first user; and providing the first audio message to the second user.
2. The system of ...

Publication date: 04-01-2018

Speech and Computer Vision-Based Control

Number: US20180007250A1
Assignee:

The present disclosure relates to a method for controlling a digital photography system. The method includes obtaining, by a device, image data and audio data. The method also includes identifying one or more objects in the image data and obtaining a transcription of the audio data. The method also includes controlling a future operation of the device based at least on the one or more objects identified in the image data, and the transcription of the audio data.
1-20. (canceled)
21. A computer-implemented method comprising: obtaining, by a computing device operable to capture images, (i) image data that describes a first scene that includes one or more objects and (ii) audio data that describes a human speech utterance, wherein the human speech utterance refers to at least a first object of the one or more objects included in the first scene; identifying, by the computing device based at least in part on the audio data and based at least in part on the image data, at least the first object that is included in the first scene and that is referred to by the human speech utterance; defining, by the computing device, a new rule that specifies a behavior of the computing device in response to future instances of identification of the first object in future image data that is different than the current image data; and controlling, by the computing device, a future operation of the computing device to comply with the new rule.
22. The method of claim 21, wherein controlling, by the computing device, the future operation of the computing device comprises determining, by the computing device, whether to store the future image data.
23. The method of claim 21, wherein controlling, by the computing device, the future operation of the computing device comprises determining, by the computing device, whether to automatically upload the future image data to cloud storage.
24. The method of claim 21, ...

Publication date: 07-01-2021

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, TRANSMISSION APPARATUS, AND TRANSMISSION METHOD

Number: US20210006862A1
Author: TSURU Takumi
Assignee: SONY CORPORATION

The present technology relates to an information processing apparatus, information processing method, transmission apparatus, and transmission method, capable of improving the convenience of a voice AI assistance service used in cooperation with content.
1. An information processing apparatus comprising: a control unit configured to control a timing of a voice response upon using a voice AI assistance service in cooperation with content on a basis of voice response time information indicating time suitable for the voice response to an utterance of a viewer watching the content.
2. The information processing apparatus according to claim 1, wherein the voice response time information is information indicating the time suitable for the voice response on a playback time axis of the content.
3. The information processing apparatus according to claim 2, wherein the voice response time information is acquired via communication.
4. The information processing apparatus according to claim 3, wherein the content is played back by a first device, the voice response time information is delivered by a second device via communication, the second device extracts the voice response time information indicating the time suitable for the voice response to the content being played in the first device from metadata including the voice response time information intended for an entirety or a part of time on the playback time axis of the content, and the control unit controls the timing of the voice response on a basis of the voice response time information delivered via communication.
5. The information processing apparatus according to claim 2, wherein the voice response time information is acquired via broadcasting.
6. The information processing apparatus according to claim 5, wherein the content is played back by a first device, the voice response time information is delivered by a second device via broadcasting, the second device delivers metadata including the voice response time information ...

Publication date: 20-01-2022

Component libraries for voice interaction services

Number: US20220019406A1
Assignee: Google LLC

The disclosed embodiments include computerized methods, systems, and devices, including computer programs encoded on a computer storage medium, for integrating voice-based interaction and control into a native graphical user interface (GUI) of an executed application. For example, a communications device may obtain component data identifying a plurality of components of a voice-user interface from a computing system maintained by a voice-service provider, and may execute an application linked to a corresponding one of the components of the voice-user interface. The communications device may generate the native GUI based on an output of the executed application, and may generate an interface element representative of the corresponding one of the components of the voice-user interface. The communications device may present the generated interface element within the native GUI, which may embed the corresponding component of the voice-user interface into the native GUI.

Publication date: 02-01-2020

RESOURCE PUSHING METHOD, DEVICE, AND STORAGE MEDIUM FOR SMART DEVICE

Number: US20200007461A1
Assignee:

The present disclosure provides a resource pushing method, a device and a storage medium for a smart device. The method includes: acquiring a first pushing resource according to a preset rule, where the first pushing resource is used by the smart device to interact with a user; and pushing the first pushing resource to the user. In the solution, relevant resources are actively acquired according to behavior information of the user, triggering and hotspots, and are pushed to the user, thereby saving the cost in resource querying by the user, and increasing the exposure of platform resources.
1. A resource pushing method for a smart device, comprising: acquiring a first pushing resource according to a preset rule, wherein the first pushing resource is used by the smart device to interact with a user; and pushing the first pushing resource to the user.
2. The method according to claim 1, wherein the preset rule comprises at least one of following: determining a pushing resource according to a news hotspot; determining a pushing resource according to behavior information of the user; determining a pushing resource according to a keyword of user feedback; determining a pushing resource according to an identity of the user; and determining a pushing resource according to subscription information of the user.
3. The method according to claim 2, wherein before the acquiring a first pushing resource according to a preset rule, the method comprises: acquiring a first voice of the user; the acquiring a first pushing resource according to a preset rule comprises: acquiring the first pushing resource according to the first voice of the user and the preset rule.
4. The method according to claim 2, wherein the behavior information of the user comprises at least one of following: time information, location information, historical resource access information, and weather information.
5. The method according to claim 1, further comprising: acquiring feedback information for the first ...

Publication date: 20-01-2022

Virtual Assistant Host Platform Configured for Interactive Voice Response Simulation

Number: US20220019985A1
Assignee: Bank of America Corp

Aspects of the disclosure relate to using machine learning to simulate an interactive voice response system. A computing platform may establish a virtual assistant session with a mobile banking application executing on a mobile device, which may include authenticating at least one authentication credential associated with an online banking account. The computing platform may receive an assistance message from the mobile device requesting assistance. Using a machine learning model, the computing platform may identify an intent of the assistance message. The computing platform may generate a response message based on the intent of the assistance message. The computing platform may send the response message and one or more commands directing the mobile device to output an audio response file based on the response message to the mobile device, which may cause the mobile device to convert the response message into the audio response file and output the audio response file.

Publication date: 03-01-2019

COMMUNICATION HEADSET COMPRISING WIRELESS COMMUNICATION WITH PERSONAL PROTECTION EQUIPMENT DEVICES

Number: US20190007540A1
Assignee:

Embodiments of the disclosure include a communication headset, which may comprise active noise cancellation or reduction, configured to wirelessly communicate with one or more PPE devices using voice recognition to process voice inputs from a user. The headset may comprise a voice recognition module configured to receive voice inputs from a user (via a microphone on the headset) and convert the voice inputs to text. The voice inputs may comprise commands that may be sent to the PPE devices, wherein the commands may request information from the PPE devices, such as pressure level, temperature, gas content and level, battery life, etc., and wherein the PPE devices may send a response to the headset comprising that information. The headset may then convert the response to a voice output, which may then be sent to the user via speakers in the headset.
1-15. (canceled)
16. A communication headset comprising: one or more inward facing microphones; one or more inward facing speakers; a voice recognition module configured to receive voice input from the user via the one or more microphones, and convert the voice input from the user into text; a processor configured to: receive the text output from the voice recognition module; determine if the text output is associated with a command; direct the command to an indicated destination; receive a response to the command; convert the response to a voice output; and send the voice output to the one or more speakers to be communicated to the user; a wireless module configured to, when the indicated destination is an external personal protection equipment (PPE) device, communicate the command wirelessly to the PPE device, and receive a response from the PPE device; and a local control unit configured to, when the indicated destination is the local headset, process the command and generate a response to the command.
17. The headset of claim 16, further comprising one or more active noise reduction (ANR) modules connected to ...

Publication date: 02-01-2020

METHOD AND APPARATUS FOR SENDING INFORMATION

Number: US20200007656A1
Author: Zhu Ziqiang
Assignee:

A method and apparatus for sending information are provided. An embodiment of the method comprises: determining, in response to acquiring a user speech audio through a target application, a user command instructed by the user speech audio, and determining whether the user command satisfies a preset trigger condition for plug-in downloading; and sending, in response to the user command satisfying the trigger condition for plug-in downloading, a request for downloading a target plug-in to a target server, the target plug-in being a plug-in in a preset plug-in set for the target application, and the plug-in being not locally installed. According to the embodiment, a terminal device may be triggered to download the plug-in based on the content instructed by the user speech audio, to implement the functional upgrading. Therefore, the self-learning capability and the self-upgrading capability of the terminal device are improved, which makes the response to the user command more accurate and more pertinent.
1. A method for sending information, comprising: determining, in response to acquiring a user speech audio through a target application, a user command instructed by the user speech audio, and determining whether the user command satisfies a preset trigger condition for plug-in downloading; and sending, in response to the user command satisfying the trigger condition for plug-in downloading, a request for downloading a target plug-in to a target server, the target plug-in being a plug-in in a preset plug-in set for the target application, and the plug-in being not locally installed.
2. The method according to claim 1, wherein the user command instructs to execute an operation, the trigger condition for plug-in downloading includes that the operation the user command instructs to execute is inexecutable, and the target plug-in supports the operation.
3. The method according to claim 1, wherein the trigger condition for plug-in downloading includes that ...
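
The trigger condition of claim 2 (the requested operation cannot be executed locally, and a plug-in that is not yet installed supports it) might be sketched as below. The plug-in names, the `PLUGIN_SET` table, and the `plugin_to_download` helper are hypothetical and chosen only to illustrate the decision; the source does not specify any data structures.

```python
from typing import Optional

# Hypothetical preset plug-in set: plug-in name -> operations it supports.
PLUGIN_SET: dict[str, set[str]] = {
    "weather_plugin": {"get_forecast"},
    "music_plugin": {"play_song"},
}


def plugin_to_download(
    operation: str, local_ops: set[str], installed: set[str]
) -> Optional[str]:
    """Return the plug-in to request from the server, or None if no download is triggered."""
    if operation in local_ops:
        return None  # the operation is executable, so the trigger condition fails
    for name, ops in PLUGIN_SET.items():
        # The target plug-in must support the operation and not be installed locally.
        if operation in ops and name not in installed:
            return name
    return None
```

A caller would send the download request only when the function returns a name, which mirrors the "in response to the user command satisfying the trigger condition" step of claim 1.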

Publication date: 20-01-2022

Wireless audio testing

Number: US20220020370A1
Author: Jonathan D. Hurwitz
Assignee: Google LLC

A method includes outputting, by a computing device, to a remote computing device, test audio data; determining, by the computing device, whether audio data detected by an audio input device includes the test audio data; and responsive to determining that the test audio data was not detected by the audio input device, temporarily refraining, by the computing device, from outputting advisory audio data indicating the audio input device is ready to receive a spoken audio command.
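
As a rough sketch of the decision in this abstract, the check below treats audio as raw byte strings and uses a plain substring test as a stand-in for real audio matching. Both simplifications, and the `may_play_ready_prompt` name, are assumptions; the source does not describe how the test audio is detected.

```python
def may_play_ready_prompt(detected_audio: bytes, test_audio: bytes) -> bool:
    """Return True only if the test audio was heard by the audio input device.

    When this returns False, the device temporarily refrains from playing the
    advisory "ready for a spoken command" audio.
    """
    # Assumption: a byte-level substring match stands in for audio detection.
    return len(test_audio) > 0 and test_audio in detected_audio
```

A production system would compare audio far more robustly, for example with correlation against the known test signal, since captured audio never matches the emitted samples byte for byte.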

Publication date: 20-01-2022

METHOD, DEVICE, AND PROGRAM FOR CUSTOMIZING AND ACTIVATING A PERSONAL VIRTUAL ASSISTANT SYSTEM FOR MOTOR VEHICLES

Number: US20220020374A1
Author: BEN ABDELAZIZ Omar
Assignee:

A method for customizing and activating a personal virtual assistant (PVA) in a motor vehicle. The method includes: activating a PVA management system, determining a customized mode of use of the personal virtual assistant, and activating the customized mode of use of the personal virtual assistant. A device for carrying out the method is also included.
1. A method for customizing and activating a personal virtual assistant for a motor vehicle, comprising: at least one seat suitable for accommodating a user, at least one imaging device having a field of view configured to include said user, at least one central processing unit communicating at least with said at least one imaging device, the method comprising at least: activating a PVA management system, from said central processing unit, determining a customized mode of use of the personal virtual assistant, and activating said customized mode of use of the personal virtual assistant.
2. Method according to claim 1, wherein said PVA management system is activated by performing: activation of said imaging device, and detection of at least one event indicating the presence of said user.
3. Method according to claim 2, wherein said determination of a customized mode of use of said personal virtual assistant comprises: capture of at least one image by said at least one imaging device, comparison of said at least one captured image with preexisting user image data, and determination, by said central processing unit, of a customized mode of use of the personal virtual assistant, based on the result of said comparison.
4. Method according to claim 3, wherein said customized mode of use comprises: a default operation if the result of said comparison is an absence of recognition of the user or a recognition of a user for whom the central processing unit cannot access a language preference for said PVA system or for the language of wake words, absence of operation if the result of said comparison is recognition of a user who does ...

Publication date: 08-01-2015

Pictures using voice commands and automatic upload

Number: US20150009344A1
Author: Jeffrey C. Konicek
Assignee: CUTTING EDGE VISION LLC

A system and method is disclosed for enabling user friendly interaction with a camera system. Specifically, the inventive system and method has several aspects to improve the interaction with a camera system, including voice recognition, gaze tracking, touch sensitive inputs and others. The voice recognition unit is operable for, among other things, receiving multiple different voice commands, recognizing the vocal commands, associating the different voice commands to one camera command and controlling at least some aspect of the digital camera operation in response to these voice commands. The gaze tracking unit is operable for, among other things, determining the location on the viewfinder image that the user is gazing upon. One aspect of the touch sensitive inputs provides that the touch sensitive pad is mouse-like and is operable for, among other things, receiving user touch inputs to control at least some aspect of the camera operation. Another aspect of the disclosed invention provides for gesture recognition to be used to interface with and control the camera system.

Publication date: 02-01-2020

SYSTEMS AND METHODS FOR ASSOCIATING PLAYBACK DEVICES WITH VOICE ASSISTANT SERVICES

Number: US20200007987A1
Author: Tolomei John G., Woo Sein
Assignee:

Systems and methods for media playback via a media playback system include detecting a first wake word via a first network microphone device of a first playback device, detecting a second wake word via a second network microphone device of a second playback device, and forming a bonded zone that includes the first playback device and the second playback device. In response to detecting the first wake word, a first voice utterance following the first wake word is transmitted to a first voice assistant service. In response to detecting the second wake word, a second voice utterance following the second wake word is transmitted to a second voice assistant service. Requested media content received from the first and/or second voice assistant service is played back via the first playback device and the second playback device in synchrony with one another.
1. A method comprising: detecting a first wake word via a first network microphone device of a first playback device; detecting a second wake word via a second network microphone device of a second playback device; forming a bonded zone of a media playback system, the bonded zone comprising the first playback device and the second playback device; in response to detecting the first wake word via the first network microphone device: transmitting a first voice utterance requesting playback of first media content to one or more remote computing devices associated with a first voice assistant service; and playing back the first media content via the first and second playback devices of the bonded zone in synchrony with one another; and in response to detecting the second wake word via the second network ...: transmitting a second voice utterance requesting playback of second media content to one or more remote computing devices associated with a second voice assistant service; and playing back the second media content via the first and second playback devices of the bonded zone in synchrony with one another.

Publication date: 12-01-2017

SHOPPING FACILITY ASSISTANCE SYSTEMS, DEVICES AND METHODS TO ADDRESS GROUND AND WEATHER CONDITIONS

Number: US20170009417A1
Assignee:

Some embodiments provide methods, systems and apparatus to enhance safety. In some embodiments, a system comprises: a central computer system comprising: a transceiver; a control circuit; and a memory coupled to the control circuit and storing computer instructions that when executed by the control circuit cause the control circuit to perform the steps of: communicate positioning routing instructions to the plurality of motorized transport units directing the motorized transport units to one or more external areas of a shopping facility that are exposed to weather conditions; and communicate separate area routing instructions to each of the motorized transport units that when implemented cause the motorized transport units to cooperatively and in concert travel in accordance with the area routing instructions over at least predefined portions of one or more external areas to cause ground treatment systems to address ground level conditions.
1. A system providing enhanced safety, comprising: a central computer system that is separate and distinct from a plurality of self-propelled motorized transport units, wherein the central computer system comprises: a transceiver configured to communicate with the motorized transport units located at a shopping facility; a control circuit coupled with the transceiver; and a memory coupled to the control circuit and storing computer instructions that when executed by the control circuit cause the control circuit to perform the steps of: communicate positioning routing instructions to the plurality of motorized transport units directing the plurality of motorized transport units to one or more external areas of a shopping facility that are exposed to weather conditions; and communicate separate area routing instructions to each of the plurality of motorized transport units that when implemented cause the plurality of motorized transport units to cooperatively and in concert travel in accordance with the area routing ...

Publication date: 10-01-2019

Digital command prompting device for dementia patients using augmented reality

Number: US20190009049A1
Author: Candy Katrina Goff
Assignee:

The digital command prompting device for dementia patients using augmented reality is an aid to help all people, especially those who have special needs, particularly individuals who have diminished or diminishing brain function. The device is predominantly mobile but can also be stationary, and can be programmed by receiving and selecting pre-set commands to operate and assist a user with their daily living standards or needs, but is particularly adapted for use when a user is having a disorientation episode as to place and time. The device also has various other features, including an illuminated display panel, a GPS tracking capability, an alarm, an illumination element, solar panel and battery backup components, time and date clocks, and water-resistant covers and/or material, among other things. The device may be used within the home environment, outdoor environment or a restricted environment, e.g. aged care facility, hospital, pre-school or school.
1: A digital command prompting device using augmented reality for assisting and orienting a user during a disorientation episode comprising means for electronically communicating said device with an electronic appliance and wherein the device can be activated by the user and further wherein the device is provided with means which can send and receive electronic signals and further wherein the device is provided with means to produce a wireless signal which is transmittable to the mobile electronic appliance and further wherein said electronic appliance is provided with a computer processing application software which can process any signal received by the device and further wherein the computer processing application software can produce a visual display on the electronic appliance to inform or command the user to respond to a question for assisting in the orientation of the user during a disorientation episode.
2: The digital command prompting device of further provided with means for transmitting to
...

Publication date: 09-01-2020

Methods and systems for sleep management

Number: US20200009349A1
Assignee: Resmed Sensor Technologies Ltd

A processing system includes methods to promote sleep. The system may include a monitor such as a non-contact motion sensor from which sleep information may be determined. User sleep information, such as sleep stages, hypnograms, sleep scores, mind recharge scores and body scores, may be recorded, evaluated and/or displayed for a user. The system may further monitor ambient and/or environmental conditions corresponding to sleep sessions. Sleep advice may be generated based on the sleep information, user queries and/or environmental conditions from one or more sleep sessions. Communicated sleep advice may include content to promote good sleep habits and/or detect risky sleep conditions. In some versions of the system, any one or more of a bedside unit 3000 sensor module, a smart processing device, such as a smart phone or smart device 3002, and network servers may be implemented to perform the methodologies of the system.

Publication date: 27-01-2022

Recognition assistant

Number: US20220027625A1
Assignee: International Business Machines Corp

A method provides for assistance in recognition of an entity. A set of data and associated information corresponding to a plurality of entities known to an assisted user is received, such that an instance of the set of data and associated information includes identification of a respective entity of the plurality of entities known to an assisted user. Real-time data corresponding to a first entity is received from one or more devices capturing the real-time data. The real-time data is compared to the set of data and associated information corresponding to the plurality of entities known to the user to determine whether the first entity has a known relevance to the user, and in response to determining the first entity does have a known relevance to the user, the processor provides the identity and relevance of the first entity to the user.

Publication date: 12-01-2017

Shopping Facility Assistance System and Method to Retrieve In-Store Abandoned Mobile Item Containers

Number: US20170010609A1
Assignee: Wal Mart Stores Inc

A central computer system identifies a mobile item container in a retail shopping facility as being abandoned. The central computer system then directs a motorized transport unit through the retail shopping facility to the abandoned mobile item container and causes that motorized transport unit to physically attach to the abandoned mobile item container. The central computer system then directs that motorized transport unit through the retail shopping facility with the attached abandoned mobile item container to a specified destination within the retail shopping facility. Abandonment can be determined as a function, at least in part, of determining that the mobile item container is both stationary and unattended for at least a predetermined amount of time. By one approach the central computer system can use different predetermined amounts of time when assessing abandonment depending upon where in the retail shopping facility the mobile item containers are located.
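
The abandonment test described here (stationary and unattended for at least a location-dependent amount of time) could be sketched as follows. The zone names, threshold values, and field names are illustrative assumptions; the abstract states only that the predetermined time may differ by location within the facility.

```python
from dataclasses import dataclass

# Hypothetical per-zone thresholds in seconds; actual values are not given in the source.
THRESHOLDS_S: dict[str, float] = {"checkout": 120.0, "aisle": 600.0}
DEFAULT_THRESHOLD_S = 300.0


@dataclass
class ContainerState:
    zone: str            # where in the facility the container is located
    stationary_s: float  # seconds since the container last moved
    unattended_s: float  # seconds since a person was last detected with it


def is_abandoned(state: ContainerState) -> bool:
    """A container is abandoned if it is both stationary and unattended long enough."""
    threshold = THRESHOLDS_S.get(state.zone, DEFAULT_THRESHOLD_S)
    return state.stationary_s >= threshold and state.unattended_s >= threshold
```

Keeping the thresholds in a lookup table keyed by zone makes the location-dependent behavior a matter of configuration instead of code changes.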

Publication date: 27-01-2022

SYSTEMS AND METHODS FOR PROCESSING SPEECH DIALOGUES

Number: US20220028371A1
Author: Han Kun, Xu Haiyang

The present disclosure is related to systems and methods for processing speech dialogue. The method includes obtaining target speech dialogue data. The method includes obtaining a text vector representation sequence, a phonetic symbol vector representation sequence, and a role vector representation sequence by performing a vector transformation on the target speech dialogue data based on a text embedding model, a phonetic symbol embedding model, and a role embedding model, respectively. The method includes determining a representation vector corresponding to the target speech dialogue data by inputting the text vector representation sequence, the phonetic symbol vector representation sequence, and the role vector representation sequence into a trained speech dialogue coding model. The method includes determining a summary of the target speech dialogue data by inputting the representation vector into a classification model.

1. A method for processing speech dialogue implemented on a computing device having at least one processor and at least one storage device, the method comprising: obtaining target speech dialogue data; obtaining a text vector representation sequence, a phonetic symbol vector representation sequence, and a role vector representation sequence by performing a vector transformation on the target speech dialogue data based on a text embedding model, a phonetic symbol embedding model, and a role embedding model, respectively; determining a representation vector corresponding to the target speech dialogue data by inputting the text vector representation sequence, the phonetic symbol vector representation sequence, and the role vector representation sequence into a trained speech dialogue coding model; and determining a summary of the target speech dialogue data by inputting the representation vector into a classification model.
2. The method of claim 1, further comprising: obtaining a sentence text of the summary of the target speech dialogue data; ...

Publication date: 27-01-2022

VOICE RESPONSE SYSTEMS BASED ON PERSONALIZED VOCABULARY AND USER PROFILING - PERSONALIZED LINGUISTICS AI ENGINES

Number: US20220028374A1
Assignee:

A method, computer system, and a computer program product for personalized voice responses is provided. The present invention may include gathering a plurality of user data from an Internet of Things (IoT) connected sensor. The present invention may include identifying a personalized vocabulary based on the gathered plurality of user data. The present invention may include training a voice response system based on the gathered plurality of user data and the identified personalized vocabulary. The present invention may include receiving a verbal request. The present invention may include responding to the received verbal request using the trained voice response system.

1. A method for personalized voice responses, the method comprising: gathering a plurality of user data from an Internet of Things (IoT) connected sensor; identifying a personalized vocabulary based on the gathered plurality of user data; training a voice response system based on the gathered plurality of user data and the identified personalized vocabulary; receiving a verbal request; and responding to the received verbal request using the trained voice response system.
2. The method of claim 1, wherein the gathered plurality of user data is selected from the group consisting of user mobility patterns, user travel locations, user preferences, user requests, user responses, user identification information, and user activities.
3. The method of claim 1, wherein identifying the personalized vocabulary based on the gathered plurality of user data further comprises: identifying topical information of the gathered plurality of user data using contextual analysis via latent Dirichlet allocation (LDA); and determining a contextual significance of the gathered plurality of user data based on a social network contribution.
4. The method of claim 1, wherein training the voice response system based on the gathered plurality of user data and the identified ...

Publication date: 27-01-2022

ELECTRONIC DEVICE AND OPERATION METHOD THEREOF

Number: US20220028381A1
Assignee:

According to an embodiment, an electronic device includes at least one sensor, a display, a memory, and a processor operatively connected to the at least one sensor, the display, and the memory. The memory is configured to store instructions that, when executed, the processor is configured to recognize a context of a user by using the at least one sensor based on at least one of a speed of the electronic device, a location of the electronic device, a level of external noise, an external illuminance, personal information of the user, or a connection state between the electronic device and an external electronic device. The processor is also configured to control an execution environment of a voice assistant application based on the recognized context.

1. An electronic device comprising: at least one sensor; a display; a memory; and a processor operatively connected to the at least one sensor, the display, and the memory, wherein the memory stores instructions that, when executed, the processor is configured to: recognize a context of a user by using the at least one sensor based on at least one of a speed of the electronic device, a location of the electronic device, a level of external noise, an external illuminance, personal information of the user, or a connection state between the electronic device and an external electronic device; and control an execution environment of a voice assistant application based on the recognized context.
2. The electronic device of claim 1, wherein the execution environment includes at least one of: a reference value for recognizing a command of the user for executing the voice assistant application, a transparency of a layer displayed on the display, whether to release the layer displayed on the display, a size of a text displayed on the display, a brightness of the display, or a volume of a sound device included in the electronic device.
3. The electronic device of claim 2, wherein when the recognized context corresponds to a driving mode ...
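
As a rough illustration of the idea in this entry, the sketch below maps sensor readings to a context and then to settings for a voice assistant. Everything concrete here is an assumption made for the example: the context names, the numeric cut-offs, the preset values, and the names `SensorReading`, `recognize_context`, and `execution_environment` do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class SensorReading:
    speed_kmh: float        # speed of the device
    noise_db: float         # level of external noise
    illuminance_lux: float  # external illuminance


# Hypothetical presets: wake-word reference value, text size scale, speaker volume.
ENVIRONMENTS = {
    "driving": {"wake_threshold": 0.4, "text_scale": 1.5, "volume": 0.9},
    "noisy": {"wake_threshold": 0.7, "text_scale": 1.0, "volume": 0.8},
    "default": {"wake_threshold": 0.6, "text_scale": 1.0, "volume": 0.5},
}


def recognize_context(reading: SensorReading) -> str:
    """Classify the user's context from sensor values (assumed cut-offs)."""
    if reading.speed_kmh >= 20.0:
        return "driving"
    if reading.noise_db >= 70.0:
        return "noisy"
    return "default"


def execution_environment(reading: SensorReading) -> dict:
    """Pick the preset for the recognized context and adapt display brightness."""
    env = dict(ENVIRONMENTS[recognize_context(reading)])
    # Dim the display in dark surroundings (assumed 50 lux cut-off).
    env["brightness"] = 0.3 if reading.illuminance_lux < 50.0 else 1.0
    return env
```

A lookup table keeps the context recognition separate from the settings it selects, so either side can change without touching the other.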

Publication date: 27-01-2022

SYSTEMS AND METHODS FOR VOICE ASSISTANT FOR ELECTRONIC HEALTH RECORDS

Number: US20220028382A1
Assignee: Bola Technologies, Inc.

An electronic record voice assistant system can include one or more processors that receive audio data, apply a machine learning model to the audio data to generate speech data including at least one value, determine a state of an electronic record, and update one or more fields of the electronic record using the state and the at least one value.

1. A method of using voice commands to update electronic dental records, comprising: receiving, by one or more processors, audio data; applying, by the one or more processors, a speech model to the audio data to generate speech data including at least one value; determining, by the one or more processors, a state of a periodontal chart data object, the periodontal chart data object comprising a plurality of fields, each field associated with a tooth of a subject and at least one feature of the tooth, the state corresponding to a particular field of the plurality of fields; determining, by the one or more processors, a command based on at least one of the speech data or the at least one value; and assigning, by the one or more processors, the at least one value to the at least one feature of the tooth based on the command and the state.
2. The method of claim 1, wherein assigning, by the one or more processors, the at least one value comprises identifying the field with which the at least one feature is associated using the state.
3. The method of claim 1, further comprising updating, by the one or more processors, the state responsive to assigning the at least one value.
4. The method of claim 1, further comprising identifying, by the one or more processors, the particular field using the command.
5. The method of claim 1, wherein the one or more processors comprise a first processor operating on a client device and a second processor operating on a server device.
6. The method of claim 1, wherein generating the speech data comprises using, by the one or more ...
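
The state-driven update in this entry can be pictured with a small sketch. It assumes the speech model has already produced text tokens, and it is only an illustration: the `PerioChart` class, the three-tooth chart, the `pocket_depth` feature, and the `skip` and `back` commands are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class PerioChart:
    # Recorded values keyed by (tooth number, feature name).
    fields: dict = field(default_factory=dict)
    teeth: tuple = (1, 2, 3)       # hypothetical charting order
    feature: str = "pocket_depth"  # hypothetical feature being charted
    cursor: int = 0                # the "state": index of the current tooth

    def apply(self, speech_tokens):
        """Apply recognized tokens in order: numbers are values for the
        current field, 'skip' and 'back' are navigation commands."""
        for token in speech_tokens:
            if token == "skip":
                self.cursor += 1
            elif token == "back":
                self.cursor = max(0, self.cursor - 1)
            elif token.isdigit() and self.cursor < len(self.teeth):
                # Assign the value to the field selected by the state,
                # then advance the state to the next field.
                self.fields[(self.teeth[self.cursor], self.feature)] = int(token)
                self.cursor += 1
```

For example, applying `["3", "skip", "4"]` to a fresh chart records 3 for tooth 1, skips tooth 2, and records 4 for tooth 3.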

Publication date: 12-01-2017

SENTENCE SIMPLIFICATION FOR SPOKEN LANGUAGE UNDERSTANDING

Number: US20170011025A1
Assignee: Microsoft Technology Licensing, LLC

Sentence simplification may be provided. A spoken phrase may be received and converted to a text phrase. An intent associated with the text phrase may be identified. The text phrase may then be reformatted according to the identified intent and a task may be performed according to the reformatted text phrase.

1. A method for providing sentence simplification, the method comprising: receiving a spoken utterance; converting the spoken utterance to a text phrase; identifying a top level predicate associated with the text phrase; reformatting the text phrase according to the identified predicate; and performing a task according to the reformatted text phrase.
2. The method of claim 1, wherein identifying the top level predicate associated with the text phrase comprises performing a dependency parse on the text phrase.
3. The method of claim 2, wherein performing a dependency parse comprises: identifying a top level predicate; and excluding at least one auxiliary word in the text phrase.
4. The method of claim 3, wherein the at least one auxiliary word comprises a dependent of the top level predicate.
5. The method of claim 3, wherein the at least one auxiliary word comprises at least one predefined auxiliary keyword.
6. The method of claim 3, wherein identifying the top level predicate comprises evaluating a weighting criterion associated with each word of the text phrase.
7. The method of claim 1, wherein reformatting the text phrase according to the identified predicate comprises defining a domain associated with the task.
8. The method of claim 7, further comprising filling at least one semantic slot associated with the defined domain.
9. The method of claim 8, wherein the slot is filled with at least one word of the text phrase.
10. The method of claim 9, wherein the at least one word of the text phrase is not associated with the reformatted text phrase.
11. A computer-readable medium which stores a set of instructions which when executed performs a method for providing ...

Publication date: 14-01-2016

METHODS AND SYSTEMS FOR MANAGING SPEECH RECOGNITION IN A MULTI-SPEECH SYSTEM ENVIRONMENT

Number: US20160011853A1
Assignee: HONEYWELL INTERNATIONAL INC.

Methods and systems are provided for managing speech processing in an environment having at least two speech enabled systems. In one embodiment, a method includes: recording first user data that indicates an action of a user; determining, by a processor, a selection of a first speech enabled system based on the recorded user data; and generating, by the processor, a signal to at least one of activate and deactivate speech processing based on the first speech enabled system.

1. A method of managing speech processing in an environment having at least two speech enabled systems, comprising: recording first user data that indicates an action of a user; determining, by a processor, a selection of a first speech enabled system based on the recorded user data; and generating, by the processor, a signal to at least one of activate and deactivate speech processing based on the first speech enabled system.
2. The method of claim 1, wherein the action of the user includes a gesture of the user.
3. The method of claim 1, wherein the action of the user includes a gaze of the user.
4. The method of claim 1, wherein the action of the user includes a spoken command from the user.
5. The method of claim 1, wherein the signal activates speech processing by the first speech enabled system.
6. The method of claim 1, wherein the signal activates speech processing by a centralized speech processor using at least one of a vocabulary and a speech processing technique associated with the first speech enabled system.
7. The method of claim 1, further comprising recording second user data that indicates a second action of the user, and wherein the determining the selection of the first speech enabled system is based on the first recorded user data and the second recorded user data.
8. The method of claim 7, wherein the action of the user indicates at least one of a gesture of the user, a gaze of the user, and a spoken command from the user, and wherein the ...
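
One simple way to picture the selection step in this entry is a vote over recorded user actions. The sketch below is an assumption-laden illustration rather than the claimed method: the action dictionaries, the majority-vote rule, and the names `select_system` and `activation_signals` are hypothetical.

```python
from collections import Counter


def select_system(user_actions, systems):
    """Return the speech enabled system that most recorded actions
    (gaze, gesture, spoken command) point at, or None if there are none."""
    votes = Counter(
        action["target"] for action in user_actions if action["target"] in systems
    )
    if not votes:
        return None
    # On a tie, Counter.most_common keeps first-encountered order.
    return votes.most_common(1)[0][0]


def activation_signals(selected, systems):
    """Build one activate (True) or deactivate (False) signal per system."""
    return {name: name == selected for name in systems}
```

For instance, two actions targeting a hypothetical `navigation` system and one targeting `radio` select `navigation`, so `activation_signals` activates it and deactivates `radio`.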

Подробнее
14-01-2016 дата публикации

Voice recognition device and display method

Number: US20160011854A1
Assignee: Mitsubishi Electric Corp

When recognizing a voice uttered by the user, a voice recognition device in accordance with the present invention can change the position or the display form of a display item corresponding to a voice recognition result according to the degree of importance of the display area in which the display item is displayed. The device can thereby prevent the display item from blocking other information important to the user, and improve the user's convenience.
