Настройки

Укажите год
-

Небесная энциклопедия

Космические корабли и станции, автоматические КА и методы их проектирования, бортовые комплексы управления, системы и средства жизнеобеспечения, особенности технологии производства ракетно-космических систем

Подробнее
-

Мониторинг СМИ

Мониторинг СМИ и социальных сетей. Сканирование интернета, новостных сайтов, специализированных контентных площадок на базе мессенджеров. Гибкие настройки фильтров и первоначальных источников.

Подробнее

Форма поиска

Поддерживает ввод нескольких поисковых фраз (по одной на строку). При поиске обеспечивает поддержку морфологии русского и английского языка
Ведите корректный номера.
Ведите корректный номера.
Ведите корректный номера.
Ведите корректный номера.
Укажите год
Укажите год

Применить Всего найдено 8188. Отображено 200.
25-07-2019 дата публикации

Anwendungsprozessor mit Sprachtriggersystem mit niedriger Leistung und direktem Weg zum Unterbrechen, elektronisches Gerät mit demselben und Verfahren zum Betreiben desselben

Номер: DE102018128225A1
Автор: KIM SUN-KYU, Kim, Sun-Kyu
Принадлежит:

Ein Anwendungsprozessor kann einen Host-Prozessor, ein Sprachtriggersystem und ein Audio-Subsystem beinhalten, das elektrisch mit einem Systembus verbunden ist. Das Sprachtriggersystem kann konfiguriert werden, um einen Sprachtriggervorgang durchzuführen und ein Triggerereignis auszugeben. Das Audio-Subsystem kann konfiguriert werden, um einen Audio-Ausgangsstrom über eine Audio-Schnittstelle wiederzugeben. Ein direkter Bus kann konfiguriert werden, um einen Kommunikationsweg zwischen dem Sprachtriggersystem und dem Audio-Subsystem während einer Unterbrechungs-Bedingung bereitzustellen, in der der Sprachtriggervorgang und die Wiedergabe des Audio-Ausgangsstroms gemeinsam durchgeführt werden. Der Anwendungsprozessor kann konfiguriert werden, um kompensierte Triggerdaten zu erzeugen, indem er eine Echounterdrückung in Bezug auf von einem Mikrofon empfangene Mikrofondaten durchführt, und das Sprachtriggersystem kann konfiguriert werden, um den Sprachtriggervorgang während der Unterbrechungs-Bedingung ...

Подробнее
01-12-2016 дата публикации

Diskrete Notfallreaktion

Номер: DE102016109814A1
Принадлежит:

Ein Fahrzeugsystem enthält ein Mikrophon, das programmiert ist, Töne in einem Fahrgastraum eines Fahrzeugs zu erfassen und ein Tonsignal auszugeben. Ein Sprachanalysator analysiert das Tonsignal bezüglich beunruhigter Sprache und bezüglich Aktionswörtern. Der Sprachanalysator gibt ein Alarmsignal aus, falls beunruhigte Sprache und Aktionswörter detektiert werden. Eine Verarbeitungsvorrichtung empfängt das Alarmsignal und leitet in Reaktion auf das Alarmsignal einen Kontakt mit einem Notdienstanbieter ein.

Подробнее
24-05-2018 дата публикации

Fahr-Assistenzvorrichtung, Fahr-Assistenzserver und Fahr-Assistenzsystem

Номер: DE112015006585T5

Eine Fahr- bzw. Reise-Assistenzvorrichtung beinhaltet eine Sprachinformations-Detektionseinheit, die geäußertes Sprechen des Anwenders erfasst und erkennt und ein Erkennungsergebnis der Erkennung ausgibt; eine Informations-Verarbeitungseinheit, welche die Evaluierungsinformation aus dem Erkennungsergebnis erzeugt; eine Positionsinformations-Detektionseinheit, die eine Position detektiert, wo die Evaluierungsinformation erzeugt wird; eine Zuverlässigkeits-Bestimmungseinheit, die die Zuverlässigkeit der Evaluierungsinformation unter Verwendung von im Internet geposteter Posting-Information innerhalb eines vorbestimmten Distanzbereichs basierend auf der Position, an welcher die Evaluierungsinformation innerhalb einer vorbestimmten Periode erzeugt wird, basierend auf aktuellem Datum und Zeit, bestimmt, und bestimmt, ob die Evaluierungsinformation zu senden ist; eine Kommunikationseinheit, die an den Reise-Assistenzserver eine Übertragung der Evaluierungsinformation, von der bestimmt ist, dass ...

Подробнее
16-07-2020 дата публикации

AUTOMATISCHES BLOCKIEREN VON IN EINEM AUDIO-DATENSTROM ENTHALTENEN SENSIBLEN DATEN

Номер: DE112018005421T5

Vorliegend werden ein System, ein Verfahren und ein Computerprogrammprodukt zum automatischen Blockieren hörbarer, sensibler Daten offenbart. Eingehende Audiosignale werden in Text umgewandelt und auf sensible Informationen analysiert. Auf Grundlage der Erfassung sensibler Informationen aus einer Quelle wird ein Empfänger durch Blockieren am Empfangen der hörbaren sensiblen Informationen gehindert.

Подробнее
23-05-2019 дата публикации

Navigationsvorrichtung

Номер: DE112012006997B4

Navigationsvorrichtung, die dazu eingerichtet ist, eine Führung an einer Bewegungsroute bereitzustellen und Kartendaten an einem Anzeigebildschirm anzuzeigen, auf Grundlage von Kartendaten und einer Position eines Bewegungsobjekts, wobei die Navigationsvorrichtung umfasst:eine Tonerfassungseinheit, die dazu eingerichtet ist, ein Tonsignal zu erfassen;einen Äußerungsabschnittsdetektor, der dazu eingerichtet ist, einen Äußerungsabschnitt einer geäußerten Sprache eines Nutzers aus dem Tonsignal zu detektieren, das durch die Tonerfassungseinheit erfasst wird, um Sprachdaten zu extrahieren;eine Spracherkennung, die dazu eingerichtet ist, regulär die Sprachdaten zu erkennen, die durch den Äußerungsabschnittsdetektor extrahiert werden, um ein vorbestimmtes Schlagwort zu extrahieren, nachdem die Navigationsvorrichtung gestartet wurde;eine Evaluierungsinformations-Erfassungseinheit, die dazu eingerichtet ist, auf eine Evaluierungsinformations-Entsprechungstabelle zu verweisen, in der eine Vielzahl ...

Подробнее
18-12-2017 дата публикации

Intelligenter automatisierter Assistent

Номер: DE202017004558U1
Автор:
Принадлежит: APPLE INC, Apple Inc.

Elektronische Vorrichtung zum Betreiben eines automatisierten Assistenten, wobei die elektronische Vorrichtung einen Lautsprecher und ein Mikrofon umfasst, wobei die elektronische Vorrichtung Mittel umfasst zum: Bereitstellen, über den Lautsprecher der elektronischen Vorrichtung, einer Audioausgabe; während dem Bereitstellen der Audioausgabe über den Lautsprecher der elektronischen Vorrichtung, Empfangen, über das Mikrofon der elektronischen Vorrichtung, einer Eingabe in natürlicher Sprache; Ableiten einer Darstellung eine Benutzerabsicht, basierend auf der Eingabe in natürlicher Sprache und der Audioausgabe; Identifizieren einer Aufgabe, basierend auf der abgeleiteten Benutzerabsicht; und Durchführen der identifizierten Aufgabe.

Подробнее
09-05-2019 дата публикации

Sprachsteuerung für ein Fahrzeug

Номер: DE102017219616A1
Автор: DUSIK JAN, Dusik, Jan
Принадлежит:

Die Erfindung betrifft eine Mensch-Maschine-Schnittstelle (Human Machine Interface, HMI) für ein Fahrzeug mit einem Mikrofon zum Erfassen eines von einem Fahrer oder einem weiteren Insassen des Fahrzeugs gesprochenen Befehlsworts und einem Steuergerät, welches ein Spracherkennungsmodul zum Erkennen mehrerer bestimmter Befehlsworte aufweist.

Подробнее
22-02-2017 дата публикации

Method and system for providing alerts for radio communications

Номер: GB0002541562A
Принадлежит:

A method and system for providing alerts for radio communications are provided. One or more keywords are generated based on one or more contextual parameters associated with a radio device. An audio stream is received at the radio device from a radio transmitter. One or more of the one or more keywords are detected in the audio stream, and an alert for the audio stream is provided to a user of the radio device.

Подробнее
15-03-2017 дата публикации

Speech recognition

Номер: GB0002542268A
Принадлежит:

A speech recognition system (in eg. in a mobile phone) stores an input signal from a microphone (100 or 102) in a first buffer (110) and generates a noise-reduced signal (134) in a second block (146) before selecting (140) which of the signals to send to a speech recognition engine (132). Aspects of the invention include the detection (120) and validation (130) of a trigger phrase or pass phrase (eg. hello phone) which, when spoken by an authorised speaker (identified by speaker verification), wakes up the speech recognition engine, with a time delay being applied to either signal stored in buffers (110) and (146) in order to synchronise them. This multi-phase process allows computationally intensive speech recognition to be automatically powered down between uses.

Подробнее
05-04-2017 дата публикации

Customer service appraisal device, customer service appraisal system, and customer service appraisal method

Номер: GB0002542959A
Принадлежит:

A customer service appraisal device for appraising the service a person provides a customer comprises: a speech input unit (31) into which a person's speech is input as a speech signal; a keyword detector (32) for detecting one or more predetermined customer service keywords from the person's speech by acquiring the speech signal; a speech feature acquisition unit (34) for acquiring a speech feature amount for the customer service keywords detected by the keyword detector (32); and a customer service score calculation unit (36) for calculating an appraisal score as an index indicating whether the service a person provided a customer is good or bad on the basis of a detection amount for customer service keywords detected by the keyword detector (32) and the speech feature amount.

Подробнее
14-08-2013 дата публикации

Speech Recognition

Номер: GB0201311379D0
Автор:
Принадлежит:

Подробнее
31-12-2014 дата публикации

Speech Recognition

Номер: GB0002515527A
Принадлежит:

A speech recognition system (in eg. in a mobile phone) detects a sound signal in an input signal 116 and counts the ratio 128 of detected keywords 126 to sounds 124. Aspects of the invention include the subsequent validation 130 of a trigger word or pass phrase (eg. hello phone) which, when spoken by an authorised speaker (identified by speaker verification), wakes up the speech recognition engine (SRE 132). This multi-phase process (fig. 4) allows computationally intensive speech recognition to be automatically powered down between uses, and the count ratio may indicate an excessive number of false triggers which wake up the engine unnecessarily.

Подробнее
21-02-2018 дата публикации

Sensor input recognition

Номер: GB0002553040A
Принадлежит:

In eg. a speech recognition system (in eg. in a mobile phone), a sensor 208 detects an input (eg. sound) signal and determines whether it contains a selectable pattern (eg. trigger phrase such as hello phone) via analysers 206 coupled to an analysis signal memory and a controllable pattern detector 220 acting on eg. detected keywords (from 120, 5a) . Aspects of the invention include the subsequent validation 120 of the trigger word or pass phrase which, when spoken by an authorised speaker (identified by speaker verification), wakes up 212 the speech recognition engine (SRE 132). Noise reduction may be tuned for speech recognition 134 or communication 136 depending on whether it is applied before or after the validation.

Подробнее
05-06-2019 дата публикации

Natural Language processing for session establishment with service providers

Номер: GB0002568983A
Принадлежит:

Routing packetized actions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate an action data structure. The system can transmit the action data structure to a third party provider device. The system can receive an indication from the third party provider device that a communication session was established with the device.

Подробнее
30-09-2020 дата публикации

Multi-user personalization at a voice interface device

Номер: GB0002556656B
Принадлежит: GOOGLE LLC, Google LLC

Подробнее
25-04-2018 дата публикации

NO DETAILS

Номер: GB0201803881D0
Автор:
Принадлежит:

Подробнее
28-10-2020 дата публикации

Processing received speech data

Номер: GB0202014524D0
Автор:
Принадлежит:

Подробнее
10-03-2021 дата публикации

SENSOR DEVICE AND SYSTEM

Номер: GB202101042D0
Автор:
Принадлежит:

Подробнее
02-10-2019 дата публикации

Selective sensor polling

Номер: GB0002572316A
Принадлежит:

A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for the first field. The system can determine to obtain data in memory previously collected by the second sensor. The system can generate and transmit the action data structure with the data from the sensor and memory, and transmit the action data structure to a third party device.

Подробнее
12-06-2019 дата публикации

Audio generation system and method

Номер: GB0201905964D0
Автор:
Принадлежит:

Подробнее
28-09-2022 дата публикации

An electronics module for a wearable articel, a systemm, and a method of activation of an electronics module for a wearable article

Номер: GB0002605121A
Принадлежит:

A system 10 comprises a primary controller, a secondary controller coupled to the primary controller and a microphone coupled to the secondary controller. The secondary controller is configured to operate at a lower power than the primary controller. The secondary controller is further configured to: receive incoming voice data input via the microphone, detect the presence of a keyword from the incoming voice data, and in response to the detection of a keyword, transmit a command to the primary controller. The command may be to configure the primary controller to receive and process subsequent incoming voice data from the secondary controller. The secondary controller may be configured to receive the subsequent incoming voice data from the microphone, and to pass on the data to the primary controller after processing them. The system may comprise an electronics module 100 for a wearable article 200 and a user electronic device 300 communicatively coupled to the electronics module, wherein ...

Подробнее
15-08-2006 дата публикации

TELEPHONE NUMBER REGAINING SYSTEM AND PROCEDURE

Номер: AT0000333750T
Принадлежит:

Подробнее
04-07-2019 дата публикации

Intelligent automated assistant

Номер: AU2017330209C1
Принадлежит: FPA Patent Attorneys Pty Ltd

Systems and processes for operating an automated assistant are disclosed. In one example process, an electronic device provides an audio output via a speaker of the electronic device. While providing the audio output, the electronic device receives, via a microphone of the electronic device, a natural language speech input. The electronic device derives a representation of user intent based on the natural language speech input and the audio output, identifies a task based on the derived user intent; and performs the identified task.

Подробнее
30-05-2019 дата публикации

Sequence dependent operation processing of packet based data message transmissions

Номер: AU2017386093A1

Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine a thread based on the trigger keyword and the request that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.

Подробнее
16-09-2021 дата публикации

Intent-driven contact center

Номер: AU2020229706A1
Принадлежит:

The present disclosure relates generally to providing an intent-driven contact center. The contact center according to some embodiments analyzes intents to determine to which device or agent to route a communication. The analyzed intent information can also be used to formulate reports and analyze the accuracy of the identified intents with respect to the received communication.

Подробнее
15-10-2020 дата публикации

Dynamic thresholds for always listening speech trigger

Номер: AU2018241137B2
Принадлежит: FPA Patent Attorneys Pty Ltd

Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events.

Подробнее
05-02-2015 дата публикации

Image processing apparatus and control method thereof and image processing system

Номер: AU2013200307B2
Принадлежит:

An image processing apparatus including: image processor which processes broadcasting signal, to display image based on processed broadcasting signal; communication unit which is connected to a server; a voice input unit which receives a user's speech; a voice processor which processes a performance of a preset corresponding operation according to a voice command corresponding to the speech; and a controller which processes the voice command corresponding to the speech through one of the voice processor and the server if the speech is input through the voice input unit. If the voice command includes a keyword relating to a call sign of a broadcasting channel, the controller controls one of the voice processor and the server to select a recommended call sign corresponding to the keyword according to a predetermined selection condition, and performs a corresponding operation under the voice command with respect to the broadcasting channel of the recommended call sign. 4014037_1 (GHMatters ...

Подробнее
14-08-2014 дата публикации

Method for Voice Activation of a Software Agent from Standby Mode

Номер: AU2014200407A1
Принадлежит:

Method for Voice Activation of a Software Agent from Standby Mode A method for voice activation of a software agent from a standby mode. An audio recording (2) is buffered in an audio buffer (6) and at the same time, the audio recording is input to a secondary voice recognition process (7) which is economical in terms of energy and has an increased false positive rate. When a keyword is recognized, a primary voice recognition process (8) is activated from an inactive state, which converts the audio buffer to text and inputs it to a dialog system (9) which analyzes as to whether there is a relevant question made by the user. If this is the case, the user gets an acoustic reply (3), and if this is not the case, the dialog system and the primary voice recognition process immediately return to the inactive state and transfer the control to the secondary voice recognition process. Use figure 2. MC) coo C)4 4o C' C)Y '4-) c/ T 0 :4- Lo ...

Подробнее
08-03-2018 дата публикации

Zero latency digital assistant

Номер: AU2016320585A1
Принадлежит: FPA Patent Attorneys Pty Ltd

An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.

Подробнее
06-06-2019 дата публикации

METHODS AND SYSTEMS FOR RECOMMENDING CONTENT IN CONTEXT OF A CONVERSATION

Номер: CA0003081859A1
Принадлежит: SMART & BIGGAR LLP

A media guidance application may monitor a conversation among users, and identify keywords in the conversation, without the use of wakewords. The keywords are used to search for media content that is relevant to the on-going conversation. Accordingly, the media guidance application presents relevant content to the users, during the conversation, to more actively engage the users. A conversation monitoring window may be used to present conversation information as well as relevant content. A listening mode may be used to manage when the media guidance application processes speech from a conversation. The media guidance application may access user profiles for keywords, select content types, select content sources, and determine relevancy of media content, to provide content in context of a conversation.

Подробнее
13-03-2019 дата публикации

SYSTEMS AND METHODS FOR IDENTIFYING DRUNK REQUESTERS IN AN ONLINE TO OFFLINE SERVICE PLATFORM

Номер: CA0003028639A1
Принадлежит: BORDEN LADNER GERVAIS LLP

A method for detecting drunk requesters in an O2O service platform is provided. The method may include obtaining information related to a request of an O2O service initiated by a requester. The method may also include determining a probability that the requester has consumed alcohol using an alcohol consumption prediction model based on the information related to the request, and determining whether the probability is greater than a threshold. In response to a determination that the probability is greater than the threshold, the method may further include obtaining information related to the requester, and determining whether the requester has consumed alcohol based on the information related to the requester. In response to a determination that the requester has consumed alcohol, the method may further include transmitting a notification that the requester has consumed alcohol to a provider terminal corresponding to the request of the O2O service.

Подробнее
02-12-2019 дата публикации

DISPLAY DEVICE

Номер: CA0003063019A1
Принадлежит: AGENCE DE BREVETS FOURNIER

... [Problem to be solved] To provide a presentation system for effectively displaying a keyword for effecting the selection of the next slide during a presentation. [Solution] Provided is a display device comprising: a voice recognition means 53; a conversation-derived term extraction means 55; a search keyword storage means 57; a search keyword extraction means 59; a material storage unit 61; a relevant page information extraction means 63; a selection term extraction means 65; and a selection term display means 71.

Подробнее
06-03-2018 дата публикации

TREATMENT PROCESSING OF A PLURALITY OF STREAMING VOICE SIGNALS FOR DETERMINATION OF RESPONSIVE ACTION THERETO

Номер: CA0002665055C

Streaming voice signals, such as might be received at a contact center or similar operation, are analyzed to detect the occurrence of one or more unprompted, predetermined utterances. The predetermined utterances preferably constitute a vocabulary of words and/or phrases having particular meaning within the context in which they are uttered. Detection of one or more of the predetermined utterances during a call causes a determination of response-determinative significance of the detected utterance(s). Based on the response-determinative significance of the detected utterance(s), a responsive action may be further determined. Additionally, long term storage of the call corresponding to the detected utterance may also be initiated. Conversely, calls in which no predetermined utterances are detected may be deleted from short term storage. In this manner, the present invention simplifies the storage requirements for contact centers and provides the opportunity to improve caller experiences ...

Подробнее
05-08-2017 дата публикации

SYSTEMS AND METHODS FOR CONTACTING EMERGENCY PERSONNEL VIA VOICE RECOGNITION

Номер: CA0002954991A1
Принадлежит:

Systems and methods for contacting emergency personnel via voice recognition are provided. Some methods include detecting and identifying, via voice recognition, a first predetermined phrase, determining whether a second predetermined phrase is detected and identified, via voice recognition, within a predetermined period of time after detection and identification of the first predetermined phrase, and when the second predetermined phrase is detected and identified within the predetermined period of time, contacting emergency personnel.

Подробнее
06-09-2013 дата публикации

AUTOMATED VOICE-TO-REPORTING/ MANAGEMENT SYSTEM AND METHOD FOR VOICE CALL-INS OF EVENTS/CRIMES

Номер: CA0002865226A1
Принадлежит:

A system for institutions (particularly public safety) to accept spoken information such that users can report incident details as required to convey an official incident (e.g., crime) report and to aid in the investigation of the incident. Similarly, users can employ a combination of voice or text input, media submission, [GUI menu screens] and information download to aid in the immediate analysis of incident information to facilitate a rapid response for the incident. This method of interaction with institutions enables reporting parties to more quickly and efficiently convey incident information when direct interaction by an official is not possible or is subject to delay. This helps institutions, for example, police to provide better services while using less staff and resources.

Подробнее
25-05-1995 дата публикации

Soft Decision Speech Recognition

Номер: CA0002117932A1
Принадлежит:

Подробнее
18-01-2019 дата публикации

Method and device for speech recognition

Номер: CN0109243433A
Автор: SUN KAI, LIU KUN, DANG QIHE
Принадлежит:

Подробнее
12-02-2019 дата публикации

DETECTING A TRIGGER OF A DIGITAL ASSISTANT

Номер: CN0109328381A
Принадлежит:

Подробнее
22-06-2016 дата публикации

VOICE WAKEUP DETECTING DEVICE WITH DIGITAL MICROPHONE AND ASSOCIATED METHOD

Номер: CN0105704300A
Принадлежит:

Подробнее
30-11-2016 дата публикации

Hand-held lighting device

Номер: CN0106170224A
Автор: SCHWARZBICH JORG
Принадлежит:

Подробнее
21-08-2002 дата публикации

Voice recognition method and device

Номер: CN0001365487A
Автор: KIPP A, A. KIPP
Принадлежит:

Подробнее
10-11-2017 дата публикации

Handsfree device with continuous keyword recognition

Номер: CN0107340988A
Принадлежит:

Подробнее
05-04-2017 дата публикации

Mobile health intelligent medical guide system and method thereof

Номер: CN0106557653A
Принадлежит:

Подробнее
04-05-2016 дата публикации

A mobile terminal of the speech recognition method and its device

Номер: CN0103280217B
Автор:
Принадлежит:

Подробнее
19-01-2001 дата публикации

Chosen word replacement speech editing technique having audio signal statistical word model memory with trial input compared and where match found innocuous word replaced.

Номер: FR0002796486A1
Принадлежит:

Des procédés et dispositifs, permettant de traiter dynamiquement un signal vocal sont décrits. Ils comprennent la détection automatique, via des techniques de reconnaissance vocale à retouche des mots, du fait qu'une partie du signal vocal puisse comprendre un terme lexical associé à un ou plusieurs vocabulaires de termes lexicaux. La partie du signal vocal contenant le terme lexical est ensuite éditée automatiquement lorsqu'un tel terme lexical est détecté, d'une manière telle que le terme lexical détecté ne soit pas perceptible de façon audible. Selon un premier mode de réalisation, l'étape d'édition automatique comprend la substitution automatique de la partie du signal vocal contenant le terme lexical par une partie de signal vocal modifiée, ayant au moins un paramètre vocal (par exemple associé à une excitation de zone vocale, à un filtre de zone vocale, etc. ) qui constitue une modification d'au moins un paramètre vocal dans la partie du signal vocal contenant le terme lexical. Selon ...

Подробнее
23-08-2013 дата публикации

FACILITY FOR PROCESSING VERBAL FEEDBACK AND UPDATING DIGITAL VIDEO RECORDER(DVR) RECORDING PATTERNS

Номер: KR0101298823B1
Автор:
Принадлежит:

Подробнее
08-04-2020 дата публикации

MULTIMODAL TRANSMISSION OF PACKETIZED DATA

Номер: KR0102098847B1
Автор:
Принадлежит:

Подробнее
08-04-2020 дата публикации

Device selection for providing responses

Номер: KR0102098136B1
Автор:
Принадлежит:

Подробнее
02-05-2019 дата публикации

Номер: KR1020190045390A
Автор:
Принадлежит:

Подробнее
03-05-2005 дата публикации

METHOD AND AN APPARATUS FOR NOTIFYING PHONE NUMBERS, PARTICULARLY FOR SEPARATING ONLY A PART CORRESPONDING TO NUMBERS FROM A MECHANICAL SOUND, RECOGNIZING A CORRESPONDING PHONE NUMBER, STORING IT IN A MEMORY, AND DISPLAYING IT ON AN LCD

Номер: KR1020050040628A
Автор: KIM, CHAN WOO
Принадлежит:

PURPOSE: A method and apparatus for notifying phone numbers are provided to have the same effect of notifying a phone number to a user even when the user uses a VoIP type voice phone or a phone installed in an arbitrary home appliance. CONSTITUTION: When a phone number notification service center is connected, a responded notification voice signal is detected. A pattern of the responded voice signal is extracted according to a feature extracting method. Only a phone number notification part is stored according to a keyword spotting method with respect to the extracted pattern. Corresponding number data is recognized by using a number recognition function with respect to the stored phone number notification data. The detected number data is stored in a memory and displayed on a screen. © KIPO 2006 ...

Подробнее
26-12-2018 дата публикации

디지털화된 오디오 스트림을 분석하는 저전력 집적 회로

Номер: KR1020180137041A
Принадлежит:

... 예시들은 오디오 스트림을 수신하고 디지털화하기 위해 저전력 집적 회로를 개시한다. 또한, 예시들은 디지털화된 오디오 스트림을 키워드와 비교하고 디지털화된 오디오 스트림을 메모리에 저장하기 위해 저전력 집적 회로를 제공한다. 게다가, 예시들은 또한 디지털화된 오디오 스트림에서 키워드를 인식하면, 저전력 집적 회로는 전력을 증가시키기 위한 신호를 프로세서에 송신하고 디지털화된 오디오 스트림을 분석한다.

Подробнее
20-12-2012 дата публикации

SYSTEM AND METHOD OF SMART AUDIO LOGGING FOR MOBILE DEVICES

Номер: KR1020120137436A
Автор:
Принадлежит:

Подробнее
27-10-2022 дата публикации

자동 응답 서비스에서의 사용자 음성 응답의 식별 정확도를 향상시키기 위한 자동 응답 서비스 운영 서버 및 그 동작 방법

Номер: KR20220144498A
Автор: 안정우
Принадлежит:

... 자동 응답 서비스에서의 사용자 음성 응답의 식별 정확도를 향상시키기 위한 자동 응답 서비스 운영 서버 및 그 동작 방법이 개시된다. 본 발명에 따른 자동 응답 서비스 운영 서버 및 그 동작 방법은 음성 인식기가 자동 응답 서비스에 대한 사용자의 음성 응답을 잘못된 단어로 인식할 경우, 오인식된 단어를 적절한 응답 단어로 대체함으로써, 자동 응답 서비스에서의 사용자 음성 응답의 식별 정확도가 향상될 수 있도록 지원할 수 있다.

Подробнее
07-07-2015 дата публикации

Method and device for search of voice for stored content using discovered of unitermo

Номер: BR0PI0906385A2
Автор: MA CHANGXUE, CHANGXUE MA
Принадлежит:

Подробнее
16-03-2018 дата публикации

Device and method for activating a sensor apparatus

Номер: TW0201810253A
Принадлежит:

The invention relates to an activation device (1) for activating an external sensor apparatus (12), having a multiplicity of sound converter apparatuses (10-i; 10a; 10b) which have a respective resonant frequency and a respective sensitivity and which are designed to pick up a sound signal, convert the picked-up sound signal into an electrical signal and output the electrical signal, wherein the resonant frequencies and/or sensitivity levels of the sound converter apparatuses (10-i; 10a; 10b) differ at least partially from one another; and an evaluation apparatus (11) which is designed to detect a predefined sound pattern by evaluating at least one of the output electrical signals, and in this case to output an activation signal to the external sensor apparatus (12) in order to activate the external sensor apparatus (12).

Подробнее
16-12-2018 дата публикации

Electronic device having wake on voice function and operating method thereof

Номер: TW0201843582A
Принадлежит:

The present disclosure illustrates an electronic device having wake on voice function and an operating method thereof. The electronic device detects an analog form of voice signal by an increasing additional pre-signal detector. Then, when the analog form of voice signal meets a preset condition, the pre-signal detector can wake up the analog-to-digital converter and the human voice detector, so that the analog-to-digital converter and the human voice detector can also into a sleep mode when a host system of the electronic device already in the sleep mode.

Подробнее
21-07-2011 дата публикации

INTELLIGENT AND PARSIMONIOUS MESSAGE ENGINE

Номер: WO2011088049A2
Автор: MORAES, Ian, M.
Принадлежит:

A message engine for analyzing or examining a message and generating a textual description of the message. The message engine can provide a textual description of a voice message. The message engine does not present a speech to text conversion of the complete voice message (that is, it does not convert the entire message to text and present the textual version of the entire voice message to the user). Rather, the message engine presents only the conceptual key words that describe the essence of the voice message to the user. As such, the message engine is a more intelligent version of a speech-to-text converter. An exemplary message engine will only present in text the key conceptual words of the message rather than the entire speech to text translation of the whole message.

Подробнее
03-01-2014 дата публикации

SYSTEM AND METHOD TO PERFORM TEXTUAL QUERIES ON VOICE COMMUNICATIONS

Номер: WO2014001449A1
Принадлежит:

Methods and systems to perform textual queries on voice communications. The system comprises an index service for storing a plurality of audio content data sets for a plurality of voice communications, the plurality of audio content data sets comprising at least three audio content data sets for each voice communication, the at least three audio content data sets comprising a first audio content data set generated using a text-to-speech conversion technique, a second audio content data set generated using a phoneme lattice technique, and a third audio content data set generated using a keyword identification technique. The system also comprises a search engine configured to: receive search criteria from a user, the search criteria comprising at least one keyword; search each of the first, second and third audio content data sets for at least a portion of the plurality of voice communications to identify voice communications matching the search criteria; and combine the voice communications ...

Подробнее
04-09-2014 дата публикации

VOICE MESSAGE PROVIDING METHOD, AND APPARATUS AND SYSTEM FOR SAME

Номер: WO2014133225A1
Принадлежит:

The present invention relates to a voice message providing method. More particularly, the present invention relates to a voice message providing method which, when a voice message is received from one terminal through a messenger program, recognizes the voice message and converts the voice message into a text message according to a result of the recognition in a manner of first extracting a keyword satisfying a preset condition from the voice message and then generating and displaying the text message including the extracted keyword, and an apparatus and a system for the same. A terminal according to an embodiment of the present invention for the same may comprise: an output unit for outputting one or more pieces of information generated during execution of the messenger program; and a control unit for, when a voice message is applied during execution of the messenger program, generating a text message including a predetermined keyword through the applied voice message and controlling the ...

Подробнее
26-10-2000 дата публикации

BROADCAST SPEECH RECOGNITION SYSTEM FOR KEYWORD MONITORING

Номер: WO2000063879A1
Автор: WARREN, Robert, L.
Принадлежит:

L'invention concerne un système de surveillance automatique de messages audio (10)permettant de localiser des informations d'intérêt. Un processeur (12) est assorti d'une mémoire pour le stockage d'un vocabulaire (24) de mots d'intérêt. Un récepteur audio reçoit un message audio. Un système de reconnaissance vocale (12, 22) associé à ce récepteur et au processeur détecte un mot clé dans un segment de message audio reçu. Un générateur de rapport (14) est associé au processeur et réagissant en cas de détection d'un mot clé génère un rapport rendant compte en détail dudit mot clé et de son contexte. Ce système convient particulièrement pour la vérification de messages commerciaux et comme aide aux activités de relations publiques.

Подробнее
21-11-2019 дата публикации

VOICE AND CONVERSATION RECOGNITION SYSTEM

Номер: US20190355352A1
Принадлежит:

A conversation recognition system on board a vehicle includes an acoustic sensor component that detects sound in a cabin of the vehicle, a voice recognition component coupled to the acoustic sensor component that analyzes the sound detected by the acoustic sensor component and identifies a plurality of utterances, and a conversation threading unit coupled to the voice recognition component that analyzes the utterances identified by the voice recognition component and identifies a plurality of conversations between a plurality of occupants of the vehicle. The conversation recognition system enables multiple conversations in an environment to be recognized and distinguished from each other.

Подробнее
01-02-2018 дата публикации

FAR-FIELD AUDIO PROCESSING

Номер: US20180033428A1
Принадлежит:

An apparatus includes multiple microphones to generate audio signals based on sound of a far-field acoustic environment. The apparatus also includes a signal processing system to process the audio signals to generate at least one processed audio signal. The signal processing system is configured to update one or more processing parameters while operating in a first operational mode and is configured to use a static version of the one or more processing parameters while operating in the second operational mode. The apparatus further includes a keyword detection system to perform keyword detection based on the at least one processed audio signal to determine whether the sound includes an utterance corresponding to a keyword and, based on a result of the keyword detection, to send a control signal to the signal processing system to change an operational mode of the signal processing system.

Подробнее
09-06-2020 дата публикации

Device arbitration by multiple speech processing systems

Номер: US0010679629B2
Принадлежит: Amazon Technologies, Inc., AMAZON TECH INC

A device can perform device arbitration, even when the device is unable to communicate with a remote system over a wide area network (e.g., the Internet). Upon detecting a wakeword in an utterance, the device can wait a period of time for data to arrive at the device, which, if received, indicates to the device that another speech interface device in the environment detected an utterance. If the device receives data prior to the period of time lapsing, the device can determine the earliest-occurring wakeword based on multiple wakeword occurrence times, and may designate whichever device that detected the wakeword first as the designated device to perform an action with respect to the user speech. To account for differences in sound capture latency between speech interface devices, a pre-calculated time offset value can be applied to wakeword occurrence time(s) during device arbitration.

Подробнее
16-07-2019 дата публикации

Voice control of playback device using voice assistant service(s)

Номер: US0010354658B2
Принадлежит: Sonos, Inc., SONOS INC

Disclosed herein are example techniques to identify a voice service to process a voice input. An example implementation may involve a playback device capturing, via a microphone array, audio into one or more buffers. The playback device analyzes analyzing the captured audio using multiple wake-word detection algorithms. When a particular wake-word detection algorithm detects a wake-word corresponding to a particular voice assistant service, the playback device transmits the captured audio to the particular voice assistant service. The captured audio includes a voice input that includes a command to modify at least one playback setting of a media playback system. After transmitting the captured audio, the playback device receives, from the particular voice assistant service, instructions to modify the at least one playback setting according to the command, modifies the at least one playback setting, and with the at least one playback setting modified, plays back at least one audio track.

Подробнее
30-04-2015 дата публикации

Information Recognition Method and Apparatus

Номер: US20150120301A1
Принадлежит:

An information recognition method and apparatus are provided. The method includes receiving, by a terminal, voice information, extracting a voice feature from the voice information, performing matching calculation on the voice feature and a phoneme string corresponding to each candidate text in multiple candidate texts to obtain a recognition result, where the recognition result includes at least one command word and a label corresponding to the at least one command word, and recognizing, according to the label corresponding to the at least one command word, an operation instruction corresponding to the voice information. A terminal recognizes text information, which is corresponding to voice information input by a user, as an operation instruction.

Подробнее
27-04-2021 дата публикации

Image forming apparatus performing processing for resolving user's complaint against to printed result

Номер: US0010992829B2

An image forming apparatus includes: an image forming device; a recording paper conveyance device discharging, to an outside of the image forming apparatus, a print result obtained by forming an image on a recording paper by the image forming device; a microphone; and a control device functioning as a controller that controls the image forming device and the recording paper conveyance device in accordance with a print condition and as a voice recognition device that recognizes voice collected by the microphone during a predefined detection period starting after the print result is discharged to the outside of the image forming apparatus and detects, from the recognized voice, a word indicating a predefined complaint against appearance of the print result. Upon detecting the word indicating the predefined complaint by the voice recognition device, the controller performs predefined processing as processing of resolving a cause of the complaint indicated by the word.

Подробнее
21-01-2021 дата публикации

RADIO APPARATUS, RADIO COMMUNICATION SYSTEM, AND RADIO COMMUNICATION METHOD

Номер: US20210022199A1
Принадлежит: JVCKENWOOD CORPORATION

A radio apparatus includes a session control unit configured to maintain a half-duplex session with a radio apparatus at the other end during a session keeping time, a voice recognition unit configured to recognize a predetermined voice in the session, and an extension unit configured to extend the session keeping time when the predetermined voice is recognized.

Подробнее
22-07-2010 дата публикации

DEALING WITH SWITCH LATENCY IN SPEECH RECOGNITION

Номер: US20100185448A1
Принадлежит:

In embodiments of the present invention improved capabilities are described for interacting with a mobile communication facility comprising receiving a switch activation from a user to initiate a speech recognition recording session, wherein the speech recognition recording session comprises a voice command from the user followed by the speech to be recognized from the user; recording the speech recognition recording session using a mobile communication facility resident capture facility; recognizing at least a portion of the voice command as an indication that user speech for recognition will begin following the end of the at least a portion of the voice command; recognizing the recorded speech using a speech recognition facility to produce an external output; and using the selected output to perform a function on the mobile communication facility.

Подробнее
22-05-2018 дата публикации

Method and device for analyzing data from a microphone

Номер: US0009978372B2

A method for analyzing data from a microphone, the microphone being arranged to continuously record audio data, the method comprising: continuously storing the recorded audio data in a buffer, receiving a first triggering signal at a first point in time, the first triggering signal pertaining to a request for analyzing the audio data presently being recorded by the microphone, transferring a predefined amount of recorded audio data stored prior to the first point in time and audio data stored after the first point in time to a conversation analyzing unit. The method is advantageous as the information that was momentarily available during a conversation in the form of sound waves is now analyzed and thus may be converted to a format compatible with information available from other sources of relevance to the user.

Подробнее
23-02-2016 дата публикации

Voice controlled audio recording or transmission apparatus with keyword filtering

Номер: US0009269350B2
Принадлежит: Google Technology Holdings LLC

A method includes obtaining a plurality of audio channels using a plurality of microphone outputs having at least one audio control channel and at least one audio output channel. When a keyword is detected on the audio control channel using voice recognition, adaptive filtering is performed to attenuate the keyword from the audio output channel. An apparatus operative to perform the method includes a plurality of microphones that provide a plurality of audio channels with at least one audio output channel and at least one audio control channel. Voice command recognition logic is operatively coupled to the plurality of microphones to receive the at least one audio control channel. The voice command recognition logic detect keywords on the audio control channel and filter logic with at least one adaptive filter performs adaptive filtering to attenuate the keyword from the at least one audio output channel.

Подробнее
07-04-1998 дата публикации

Speech recognition employing a permissive recognition criterion for a repeated phrase utterance

Номер: US0005737724A
Автор:
Принадлежит:

The invention relates to a method and apparatus for speech recognition, the speech to be recognized including one or more words. Recognition is based on an analysis of a first and a second utterance. In accordance with the invention, the first utterance is compared to one or more models of speech to determine a similarity metric for each such comparison. The model of speech which most closely matches the first utterance is determined based on the one or more similarity metrics. The similarity metric corresponding to the most closely matching model of speech is analyzed to determine whether the similarity metric satisfies a first recognition criterion. The second utterance is compared to one or more models of speech associated with the most closely matching model (which may include the most closely matching model) to determine a second utterance similarity metric for each such comparison. The one or more second utterance similarity metrics are analyzed to determine whether the one or more ...

Подробнее
21-11-2019 дата публикации

System and Method for Monitoring and Visualizing Emotions in Call Center Dialogs at Call Centers

Номер: US2019356779A1
Принадлежит:

A method, system, and a computer program product are provided for monitoring an emotional state of a conversation by monitoring a communication between first and second persons to determine an emotional state of the communication, providing a visual representation of the emotional state of the communication for display to a person having access to the communication, and performing an intervention action to improve the emotional state of the communication in response to detecting the emotional state of the communication meets one or more specified intervention criteria.

Подробнее
30-01-2020 дата публикации

COMMUNICATION ROBOT AND METHOD FOR OPERATING THE SAME

Номер: US20200035237A1
Принадлежит: LG ELECTRONICS INC.

A communication robot for operating a massage providing apparatus by executing an artificial intelligence algorithm (AI) and/or machine learning algorithm in a 5G environment connected for the Internet of things and a method for operating the communication robot are provided. The method for operating the communication robot may include acquiring a spoken utterance of a user positioned within a prescribed range from the massage providing apparatus, tracking a spoken utterance direction of the user from the spoken utterance of the user, orienting the communication robot towards the spoken utterance direction of the user, acquiring a voice command included in the spoken utterance in relation to operation of the massage providing apparatus, and operating the massage providing apparatus according to the voice command.

Подробнее
06-09-2018 дата публикации

SYSTEM AND METHOD FOR ACTIVATION OF VOICE INTERACTIVE SERVICES BASED ON USER STATE

Номер: US20180254041A1
Принадлежит:

The present invention provides a system for activating personal assistance services. The system includes an audio data collector adapted to collect a sample of speech, a processing module, and a service activator couple to an output device. The processing module further includes an audio feature extractor that extracts a plurality of acoustic features from the sample of speech, and a classification unit that classifies a status of a user from the plurality of acoustic features. The Service activator activates a personal assistance service according to the status of the user classified by the classification unit.

Подробнее
10-10-2019 дата публикации

TECHNIQUES FOR WAKE-UP WORD RECOGNITION AND RELATED SYSTEMS AND METHODS

Номер: US20190311715A1
Принадлежит: Nuance Communications, Inc.

A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Подробнее
15-09-2016 дата публикации

METHOD AND SYSTEM FOR ORDER-FREE SPOKEN TERM DETECTION

Номер: US20160267907A1
Принадлежит:

A method for spoken term detection, comprising generating a time-marked word list, wherein the time-marked word list is an output of an automatic speech recognition system, generating an index from the time-marked word list, wherein generating the index comprises creating a word loop weighted finite state transducer for each utterance, i, receiving a plurality of keyword queries, and searching the index for a plurality of keyword hits.

Подробнее
24-01-2019 дата публикации

DIGITAL VOICE ASSISTANT OPERATION SYSTEM

Номер: US20190027139A1
Принадлежит:

A digital voice assistant operation system includes an access point, an electronic device, an audio device and a digital voice assistant module. The access point is in communication with a voice recognition module through network connection. The electronic device is in communication with the access point through network connection. After the user provides a voice command to the audio device, the voice command is transmitted to the voice recognition module through the audio device, the digital voice assistant module and the access point. After the voice recognition module recognizes the voice command, an operating signal corresponding to the voice command is outputted from the voice recognition module to the electronic device. Consequently, the electronic device performs an operation according to the operating signal.

Подробнее
07-12-2021 дата публикации

Accessory for a voice-controlled device

Номер: US0011195531B1
Принадлежит: Amazon Technologies, Inc., AMAZON TECH INC

This disclosure describes techniques and systems for encoding instructions in audio data that, when output on a speaker of a first device in an environment, cause a second device to output content in the environment. In some instances, the audio data has a frequency that is inaudible to users in the environment. Thus, the first device is able to cause the second device to output the content without users in the environment hearing the instructions. In some instances, the first device also outputs content, and the content output by the second device is played at an offset relative to a position of the content output by the first device.

Подробнее
08-12-2020 дата публикации

Utterance annotation user interface

Номер: US0010861440B2

A computing device includes a display configured to present a graphical user interface. The graphical user interface includes a transcript portion configured to display an unannotated transcript representing an ordered sequence of one or more dialogue events involving a client and a computerized assistant, at least one of the dialogue events taking the form of an example client utterance, and an annotation portion configured to display a hierarchical menu including a plurality of candidate utterance annotations. An utterance annotation machine is configured to receive one or more computer inputs selecting, for each of one or more response parameters in the example client utterance, utterance annotations from the hierarchical menu that collectively define a machine-readable interpretation of the example client utterance. An annotated utterance having a predetermined format usable to train the computerized assistant is output to a data store based on the example client utterance.

Подробнее
12-02-2019 дата публикации

Recording system for generating a transcript of a dialogue

Номер: US0010204641B2
Принадлежит: ECONIQ LIMITED, ECONIQ LTD

A recording system has a listener processor for automatically capturing events involving computer applications during a dialog involving the user of the computer. The system generates a visual transcript of events on a timeline. It automatically detects start of a dialog and proceeds to detect events and determines if they are configured as transcript events, before detecting end of the dialog. The system may associate dialog events with audio clips, using meta tags.

Подробнее
12-03-2020 дата публикации

VEHICLE LANGUAGE PROCESSING

Номер: US20200082817A1
Принадлежит: Ford Global Technologies, LLC

A computing system can be programmed to receive a spoken language command in response to emitting a spoken language cue and process the spoken language command with a generalized adversarial neural network (GAN) to determine a vehicle command. The computing system can be further programmed to operate a vehicle based on the vehicle command.

Подробнее
29-12-2020 дата публикации

Voice controlled keyword generation for automated test framework

Номер: US0010878804B2

Methods, systems and computer program products for voice controlled generation of keywords for test cases are provided. Aspects include receiving a verbal test plan for a computing system and analyzing the verbal test plan to identify a plurality of keywords. Aspects also include mapping each known keyword of the plurality of keywords to an existing grammar and obtaining, from a user, a definition for a new grammar for each unknown keyword of the plurality of keywords. Aspects further include analyzing a software being tested on the computing system and creating a test plan by respectively applying the new grammar and the existing grammar of the plurality of keywords to the software being tested.

Подробнее
09-04-2020 дата публикации

SPEECH RECOGNITION DEVICE AND SPEECH RECOGNITION METHOD

Номер: US20200111493A1
Принадлежит: Mitsubishi Electric Corporation

Included here are: a speech recognition unit for performing speech recognition on a speaker's speech; a keyword extraction unit for extracting a preset keyword from a result of the speech recognition; a conversation determination unit for referring to a keyword extraction result and determining whether or not the speaker's speech is a conversation; and an operation command extraction unit for extracting a command for operating an apparatus from the speech recognition result when the speech is determined not to be a conversation, and not extracting the command from the speech recognition result when the speech is determined to be a conversation.

Подробнее
28-02-2019 дата публикации

INFORMATION PROCESSING DEVICE, IN-VEHICLE DEVICE, AND STORAGE MEDIUM

Номер: US20190066678A1
Принадлежит: TOYOTA JIDOSHA KABUSHIKI KAISHA

An information processing device enables a user to register a wake-up-word for activating a predetermined function by voice recognition. The information processing device includes a receiving unit configured to receive, from a user, an input word for registering a wake-up-word, and a determination unit configured to determine whether the input word received by the receiving unit satisfies conditions for an accuracy of voice recognition.

Подробнее
29-01-2019 дата публикации

Method and apparatus for evaluating trigger phrase enrollment

Номер: US0010192548B2

An electronic device includes a microphone that receives an audio signal that includes a spoken trigger phrase, and a processor that is electrically coupled to the microphone. The processor measures characteristics of the audio signal, and determines, based on the measured characteristics, whether the spoken trigger phrase is acceptable for trigger phrase model training. If the spoken trigger phrase is determined not to be acceptable for trigger phrase model training, the processor rejects the trigger phrase for trigger phrase model training.

Подробнее
12-01-2021 дата публикации

Speech recognition

Номер: US0010891952B2

Methods and systems for speech recognition are disclosed. In one aspect, a method includes receiving an input signal representing spoken words, identifying individual words in the input signal, building a plurality of word sets based on the individual words, generating a partial result based on at least one of the plurality of word sets, receiving an additional input signal subsequent to the generation of the partial result, identifying an additional word from the additional input, detecting a semantic delimiter based on at least one of the plurality of word sets and the additional word, and in response to the detection, generating a response to the input signal based at least in part, on the partial result and the semantic delimiter.

Подробнее
12-01-2021 дата публикации

Uncertainty measure of a mixture-model based pattern classifer

Номер: US0010891942B2

There are provided mechanisms for determining an uncertainty measure of a mixture-model based parametric classifier. A method is performed by a classification device. The method includes obtaining a short-term frequency representation of a multimedia signal. The short-term frequency representation defines an input sequence. The method includes classifying the input sequence to belong to one class of at least two available classes using the parametric classifier. The parametric classifier has been trained with a training sequence. The method includes determining an uncertainty measure of the classified input sequence based on a relation between posterior probabilities of the input sequence and posterior probabilities of the training sequence.

Подробнее
28-05-2020 дата публикации

ELECTRONIC DEVICE AND CONTROL METHOD THEREOF

Номер: US20200168223A1
Принадлежит: Samsung Electronics Co., Ltd.

An electronic apparatus is provided. The electronic device includes: a storage configured to store recognition related information and misrecognition related information of a trigger word for entering a speech recognition mode; and a processor configured to identify whether or not the speech recognition mode is activated on the basis of characteristic information of a received uttered speech and the recognition related information, identify a similarity between text information of the received uttered speech and text information of the trigger word, and update at least one of the recognition related information or the misrecognition related information on the basis of whether or not the speech recognition mode is activated and the similarity.

Подробнее
14-06-2018 дата публикации

SYSTEM AND METHOD FOR AUTOMATED ON-DEMAND CREATION OF AND EXECUTION OF A CUSTOMIZED DATA INTEGRATION SOFTWARE APPLICATION

Номер: US20180167490A1
Принадлежит: Dell Products, LP

An information handling system operating a voice commanded integration application management system may include a memory for storing an established customized software integration application for a specific integration process for transforming data to enable electronic data exchange between data formats, and for storing one or more internal data records associating the established customized software integration application with an authorized user, and a run-time engine. The voice commanded integration application management system may also include a processor operatively connected to the memory for receiving a first integrated data keyword from a voice recognition software system requesting access to the specific integration process from the voice recognition software system on behalf of a user, and associating the first integrated data keyword with the established customized software integration application, and for receiving user authorization.

Подробнее
08-10-2020 дата публикации

OPERATING MODES THAT DESIGNATE AN INTERFACE MODALITY FOR INTERACTING WITH AN AUTOMATED ASSISTANT

Номер: US20200319849A1
Принадлежит:

Implementations described herein relate to transitioning a computing device between operating modes according to whether the computing device is suitably oriented for received non-audio related gestures. For instance, the user can attach a portable computing device to a docking station of a vehicle and, while in transit, wave their hand near the portable computing device in order to invoke the automated assistant. Such action by the user can be detected by a proximity sensor and/or any other device capable of determining a context of the portable computing device and/or an interest of the user in invoking the automated assistant. In some implementations location, orientation, and/or motion of the portable computing device can be detected and used in combination with an output of the proximity sensor to determine whether to invoke the automated assistant in response to an input gesture from the user.

Подробнее
09-05-2019 дата публикации

METHOD AND APPARATUS FOR WAKING UP VIA SPEECH

Номер: US20190139545A1
Автор: Bin YUAN, YUAN BIN, YUAN, Bin
Принадлежит:

The present disclosure provides a method and an apparatus for waking up via a speech. The method includes: obtaining a speech signal; decoding the speech signal according to a pre-generated searching space to obtain a speech recognition result, in which the searching space includes a path where an inversion model is located, the inversion model includes a first inversion model generated by training based on one or more word segmentation results of each of one or more wake-up phrases; when the first preset number of words of the speech recognition result is obtained, determining whether the preset number of words contains at least part of words in one of the one or more wake-up phrases; and determining cancellation of a wake-up operation directly if does not contain at least part of words in one of the one or more wake-up phrases, and ending the decoding of the speech signal.

Подробнее
11-07-2019 дата публикации

INFORMATION PROCESSING APPARATUS, SPEECH RECOGNITION SYSTEM, AND INFORMATION PROCESSING METHOD

Номер: US20190214015A1
Принадлежит:

An information processing apparatus includes: a speech obtainer which obtains speech of a user; a first controller which, when the first controller recognizes that the speech obtained by the speech obtainer is a first activation word, outputs a speech signal corresponding to a second activation word different from the first activation word; and a second controller which performs an activation process for starting transmission of the speech signal of the speech obtained by the speech obtainer to a VPA cloud server when the second controller recognizes that the speech signal output by the first controller indicates the second activation word.

Подробнее
20-08-2020 дата публикации

TRANSFERRING AN AUTOMATED ASSISTANT ROUTINE BETWEEN CLIENT DEVICES DURING EXECUTION OF THE ROUTINE

Номер: US20200265835A1
Принадлежит:

Transferring (e.g., automatically) an automated assistant routine between client devices during execution of the automated assistant routine. The automated assistant routine can correspond to a set of actions to be performed by one or more agents and/or one or more devices. While content, corresponding to an action of the routine, is being rendered at a particular device, the user may walk away from the particular device and toward a separate device. The automated assistant routine can be automatically transferred in response, and the separate device can continue to rendering the content for the user.

Подробнее
17-03-2020 дата публикации

Keyword generation apparatus and keyword generation method

Номер: US0010593323B2

A keyword generation apparatus, comprises a vocabulary acquisition unit that acquires a keyword uttered by a first user; a first positional information acquisition unit that acquires first positional information including information representing a location at which the first user has uttered the keyword; a storage unit that stores the first positional information and the keyword in association with each other; a second positional information acquisition unit that acquires second positional information including information representing a current position of a second user; and an extraction unit that extracts a keyword unique to a locality in which the second user is positioned from the storage unit based on the second positional information.

Подробнее
22-08-2019 дата публикации

ENHANCED VEHICLE KEY

Номер: US20190259383A1
Принадлежит:

A enhanced vehicle key and a method for voice activation, the method may include supplying power to a low power trigger sensor of a enhanced vehicle key, by a power source of the enhanced vehicle key; wherein the supplying of power is preceded by powering the enhanced vehicle key while a part of the enhanced vehicle key is positioned within an ignition switch of a vehicle; detecting, by the low power trigger sensor, a voice trigger; awakening, following the detection, a processor of the enhanced vehicle key; searching, by the processor, for a voice command; and when finding the voice command then executing the voice command.

Подробнее
16-08-2012 дата публикации

Method and apparatus for information extraction from interactions

Номер: US20120209606A1
Принадлежит: Nice Systems Ltd

Obtaining information from audio interactions associated with an organization. The information may comprise entities, relations or events. The method comprises: receiving a corpus comprising audio interactions; performing audio analysis on audio interactions of the corpus to obtain text documents; performing linguistic analysis of the text documents; matching the text documents with one or more rules to obtain one or more matches; and unifying or filtering the matches.

Подробнее
23-08-2012 дата публикации

Sound Recognition Operation Apparatus and Sound Recognition Operation Method

Номер: US20120215537A1
Автор: Yoshihiro Igarashi
Принадлежит: Individual

According to one embodiment, a sound recognition operation apparatus includes a sound detection module, a keyword detection module, an audio mute module, and a transmission module. The sound detection module is configured to detect sound. The keyword detection module is configured to detect a particular keyword using voice recognition when the sound detection module detects sound. The audio mute module is configured to transmit an operation signal for muting audio sound when the keyword detection module detects the keyword. The transmission module is configured to recognize the voice command after the keyword is detected by the keyword detection module, and transmit an operation signal corresponding to the voice command.

Подробнее
07-02-2013 дата публикации

Apparatus and method for recognizing voice

Номер: US20130035938A1
Автор: Ho Young JUNG

The present invention includes a hierarchical search process. The hierarchical search process includes three steps. In a first step, a word boundary is determined using a recognition method of determining a following word dependent on a preceding word, and a word boundary detector. In a second step, word unit based recognition is performed in each area by dividing an input voice into a plurality of areas based on the determined word boundary. Finally, in a third step, a language model is applied to induce an optimal sentence recognition result with respect to a candidate word that is determined for each area. The present invention may improve the voice recognition performance, and particularly, the sentence unit based consecutive voice recognition performance.

Подробнее
08-08-2013 дата публикации

Enhanced context awareness for speech recognition

Номер: US20130204622A1
Автор: Mika Grundstrom, Wenhui Lu
Принадлежит: Nokia Oyj

A method comprising establishing a call connection ( 200 ) between at least a first and a second terminal; monitoring ( 202 ), by at least the first terminal, a conversation during the call in order to detect ( 204 ) at least one predetermined context-related keyword repeated in at least the first and the second terminal; and in response to detecting ( 210 ) at least one repeated predetermined context-related keyword, providing an indication ( 212, 214, 216 ) about the detected context-related keyword to a user of at least the first terminal, said indication enabling opening an application linked to said context-related keyword.

Подробнее
31-10-2013 дата публикации

Negative Example (Anti-Word) Based Performance Improvement For Speech Recognition

Номер: US20130289987A1
Принадлежит: Interactive Intelligence Inc

A system and method are presented for negative example based performance improvements for speech recognition. The presently disclosed embodiments address identified false positives and the identification of negative examples of keywords in an Automatic Speech Recognition (ASR) system. Various methods may be used to identify negative examples of keywords. Such methods may include, for example, human listening and learning possible negative examples from a large domain specific text source. In at least one embodiment, negative examples of keywords may be used to improve the performance of an ASR system by reducing false positives.

Подробнее
06-01-2022 дата публикации

KEYWORD DETECTION APPARATUS, KEYWORD DETECTION METHOD, AND PROGRAM

Номер: US20220005466A1

A keyword is extracted robustly despite a voice recognition result including an error. A model storage unit stores a keyword extraction model that accepts word vector representations of a plurality of words as an input and extracts and outputs a word vector representation of a word to be extracted as a keyword. A speech detection unit detects a speech part from a voice signal. A voice recognition unit executes voice recognition on the speech part of the voice signal and outputs a confusion network which is a voice recognition result. A word vector representation generating unit generates a word vector representation including reliability of voice recognition with regard to each candidate word for each confusion set. A keyword extraction unit inputs the word vector representation of the candidate word to the keyword extraction model in descending order of the reliability and obtains the word vector representation of the keyword. 1. A keyword extraction device comprising:a model store configured to store a keyword extraction model that accepts word vector representations of a plurality of words as an input and extracts and outputs a word vector representation of a word to be extracted as a keyword;a voice recognize configured to execute voice recognition on an input voice signal and output a confusion network which is a voice recognition result;a word vector representation generator configured to generate a word vector representation including at least reliability of voice recognition with regard to each candidate word included in each confusion set included in the confusion network; anda keyword extractor configured to input the word vector representation of the candidate word included in each confusion set to the keyword extraction model in descending order of the reliability and obtain the word vector representation of the word to be extracted as the keyword.2. The keyword extraction device according to claim 1 , wherein the keyword extraction model outputs a ...

Подробнее
06-01-2022 дата публикации

MULTI-LOOK ENHANCEMENT MODELING AND APPLICATION FOR KEYWORD SPOTTING

Номер: US20220005468A1
Автор: Yu Dong, Yu Meng
Принадлежит: Tencent America LLC

A method, computer system, and computer readable medium are provided for activating speech recognition based on keyword spotting (KWS). Waveform data corresponding to one or more speakers is received. One or more direction features are extracted from the received waveform data. One or more keywords are determined from the received waveform data based on the one or more extracted features. Speech recognition is activated based on detecting the determined keyword. 1. A method of activating speech recognition , executable by a processor , comprising:receiving waveform data corresponding to one or more speakers;extracting one or more direction features from the received waveform data;determining one or more keywords from the received waveform data based on the one or more extracted features; andactivating speech recognition based on detecting the determined keywords.2. The method of claim 1 , wherein the received waveform data comprises one or more multi-channel input waveforms and one or more preset look directions.3. The method of claim 2 , further comprising mapping the multi-channel input waveforms to one or more spectrograms by a 1-D convolution layer.4. The method of claim 3 , further comprising extracting a single-channel spectral feature claim 3 , logarithm power spectrum data claim 3 , and one or more multi-channel spatial features based on the spectrograms.5. The method of claim 2 , wherein each extracted direction feature corresponds to one of the one or more preset look directions.6. The method of claim 5 , wherein a set of directions in the horizontal plane is sampled based on the direction features.7. The method of claim 6 , wherein the one or more direction feature vectors are derived from the set of directions in the horizontal plane.8. The method of claim 7 , wherein the directional feature computes the averaged cosine distance between a look direction steering vector between one or more pairs of microphones based on a phase of the look direction ...

Подробнее
05-01-2017 дата публикации

SMART HOME APPLIANCES, OPERATING METHOD OF THEREOF, AND VOICE RECOGNITION SYSTEM USING THE SMART HOME APPLIANCES

Номер: US20170004828A1
Принадлежит:

Provided is a smart home appliance. The smart home appliance includes: a voice input unit collecting a voice; a voice recognition unit recognizing a text corresponding to the voice collected through the voice input unit; a capturing unit collecting an image for detecting a user's visage or face; a memory unit mapping the text recognized by the voice recognition unit and a setting function and storing the mapped information; and a control unit determining whether to perform a voice recognition service on the basis of at least one information of image information collected by the capturing unit and voice information collected by the voice input unit. 1. A smart home appliance comprising:a voice input unit collecting a voice;a voice recognition unit recognizing a text corresponding to the voice collected through the voice input unit;a capturing unit collecting an image for detecting a user's visage;a memory unit mapping the text recognized by the voice recognition unit and a setting function and storing the mapped information; anda control unit determining whether to perform a voice recognition service on the basis of at least one information of image information collected by the capturing unit and voice information collected by the voice input unit.2. The smart home appliance according to claim 1 , wherein the control unit comprises a face detection unit recognizing that a user is in a staring state for voice input when image information on a user's visage is collected for more than a setting time through the capturing unit.3. The smart home appliance according to claim 2 , wherein the control unit determines that a voice recognition service standby state is entered when it is recognized that there is keyword information in a voice through the voice input unit and a user is in the staring state through the face detection unit.4. The smart home appliance according to claim 1 , further comprising:a filter unit removing a noise sound from the voice inputted through the ...

Подробнее
07-01-2021 дата публикации

KEYPHRASE EXTRACTION BEYOND LANGUAGE MODELING

Номер: US20210004439A1
Принадлежит: Microsoft Technology Licensing, LLC

A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset. 1. A system for extracting a key phrase from a document , the system comprising:a processor and a memory, wherein a program executing on the processor is configured to run a neural key phrase extraction model having:a first layer to extract a word sequence from the document;a second layer to represent each word in the word sequence by word embedding and visual features;a third layer to concatenate the word embedding and the visual features to produce hybrid word embeddings;a convolutional transformer to model the hybrid word embeddings to n-gram embeddings; anda feedforward layer to convert the n-gram embeddings into a probability distribution over a set of n-grams and calculate a key phrase score of each n-gram,wherein the neural key phrase extraction model was trained on annotated data based on a labeled loss function to compute loss of the key phrase score of each n-gram as compared with a label from the annotated dataset, andwherein the key phrase score of each n-gram is used in a computer-implemented application.2. The system of claim 1 , wherein the labeled loss function is ...

Подробнее
03-01-2019 дата публикации

METHOD AND DEVICE FOR SWITCHING INPUT MODES

Номер: US20190004676A1
Автор: DENG Yuanyuan
Принадлежит:

An input mode method includes providing a first input mode of an application program which receives a first type of information from a user and detecting whether the first type of information comprises a pre-configured condition for switching the first input mode of the application program to a second input mode of the application program. The second input mode of the application program receives a second type of information from the user. The method further includes, if the first type of information comprises the pre-configured condition, switching the first input mode of the application program to the second input mode of the application program. 1. An input mode method comprising:providing a first input mode of an application program which receives a first type of information from a user;detecting whether the first type of information comprises a pre-configured condition for switching the first input mode of the application program to a second input mode of the application program, wherein the second input mode of the application program receives a second type of information from the user; andif so, switching the first input mode of the application program to the second input mode of the application program.2. The method according to claim 1 , wherein:providing the first input mode includes providing an audio input mode for the application program and displaying an input interface of the audio input mode on a display interface of a mobile terminal; andswitching to the second input mode includes switching to an image input mode for the application program and displaying an input interface of the image input mode on the display interface.3. The method according to claim 2 , wherein detecting whether the first type of information comprises a pre-configured condition includes:detecting, within a preset period of time, whether the first type of input information is not audio information.4. The method according to claim 2 , wherein:the first type of information ...

Подробнее
13-01-2022 дата публикации

TRANSFERRING AN AUTOMATED ASSISTANT ROUTINE BETWEEN CLIENT DEVICES DURING EXECUTION OF THE ROUTINE

Номер: US20220013121A1
Автор: Ni Yuzhao
Принадлежит:

Transferring (e.g., automatically) an automated assistant routine between client devices during execution of the automated assistant routine. The automated assistant routine can correspond to a set of actions to be performed by one or more agents and/or one or more devices. While content, corresponding to an action of the routine, is being rendered at a particular device, the user may walk away from the particular device and toward a separate device. The automated assistant routine can be automatically transferred in response, and the separate device can continue to rendering the content for the user. 1. A method implemented by one or more processors , the method comprising:receiving, at a remote server device, data transmitted from a first client device;determining, at the remote server device, that the data corresponds to a request for initialization of an automated assistant routine that corresponds to a set of automated assistant actions;in response to determining that the data corresponds to the request, generating, at the remote server device, content for an action of the set of automated assistant actions; 'transmitting the content for the action to the first client device to cause the first client device to render the content for the action;', 'in response to the data that corresponds to the request for initialization of the automated assistant routine being received from the first client devicedetermining, at the remote server device during rendering of the content for the action by the first client device, that a user has directly or indirectly indicated an interest in the automated assistant routine being continued at a second client device; and 'rendering, at the second client device, additional data that is in furtherance of the automated assistant routine.', 'in response to determining that the user has indicated the interest in the automated assistant routine being continued at the second client device2. The method of claim 1 , wherein in response to ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR INTERVIEW TRAINING WITH TIME-MATCHED FEEDBACK

Номер: US20210004768A1
Принадлежит:

The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation. 115-. (canceled)16. A computer-implemented method , comprising:recording data comprising speech by a user;analyzing the recorded data to calculate a metric related to the speech;determining that the calculated metric exceeds a predefined threshold; an interactive representation of the recorded data for providing a playback of the recorded data;', 'the calculated metric; and', 'an indication of whether the calculated metric exceeds the predefined threshold., 'rendering a user interface comprising17. The method of claim 16 , further comprising: rendering an overall analysis score claim 16 , wherein the overall analysis score is at least partially based on the calculated metric.18. The method of claim 16 , wherein the recorded data comprises audio data claim 16 , image data claim 16 , video data claim 16 , or any combination thereof.19. The method of claim 16 , wherein the metric related to the speech comprises a number of words per minute.20. The method of claim 16 , wherein the indication of whether the calculated metric exceeds the predefined threshold comprises an indication of whether the speech is of an appropriate talking speed.21. The method of claim 16 , wherein the indication of whether the calculated ...

Подробнее
07-01-2021 дата публикации

System and method for automated agent assistance within a cloud-based contact center

Номер: US20210004817A1
Принадлежит: Talkdesk Inc

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR SMS AND EMAIL ENABLED AUTOMATED AGENT ASSISTANCE WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004819A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a SMS text communication from a customer;performing inference processing on the SMS text to determine a customer intent;automatically analyzing the SMS text to determine a subject of the SMS text communication and key terms associated with the subject;automatically parsing a knowledgebase using the key terms for at least one responsive answer associated with the subject; andproviding the solution to an agent in a unified interface during the communication with the customer.21. The method of , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.3. The method of claim 1 , further comprising:querying a database of customer-agent transcripts using the key terms; anddisplaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.4. The method of claim 1 , wherein the method is performed in real-time as the customer communication progresses with the agent.5. The method of claim 1 , further comprising concurrently displaying to the agent the text in a first field of the unified interface and the solution in a second field of the unified interface.6. The method of claim 1 , further comprising authenticating the customer claim 1 , predicting a customer gender claim 1 , predicting a customer age category claim 1 , predicting a customer accent claim 1 , and predict ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AUTOMATED AGENT ASSISTANCE NEXT ACTIONS WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004820A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;automatically analyzing the communication to determine a subject for the customer's communication;automatically parsing a knowledgebase for at least one responsive answer to the subject associated with the customer's communication;providing the solution to an agent as a clickable link within a unified interface during the communication with the customer; andreceiving selection of the clickable link to perform a subsequent action in response to the selection.2. The method of claim 1 , wherein the communication is in textual form claim 1 , the method further comprising:displaying text input by the customer in a first field of a unified interface;parsing the text input by the customer for key terms;querying the knowledgebase using the key terms; anddisplaying responsive results from the knowledgebase as the solution in a second field in the unified interface.3. The method of claim 2 , wherein the method is performed in real-time.42. The method of claim 2 , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.5. The method of claim 2 , further comprising:querying a database of customer-agent transcripts using the key terms; anddisplaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.6. The method of claim 1 , further comprising ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AUTOMATED AGENT ASSISTANCE NEXT ACTIONS WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004821A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;automatically analyzing the communication to determine a subject for the customer's communication;automatically parsing a knowledgebase for at least one responsive answer to the subject associated with the customer's communication;providing the solution to an agent as a clickable link within a unified interface during the communication with the customer; andreceiving selection of the clickable link to perform a subsequent action in response to the selection.2. The method of claim 1 , wherein the communication is in textual form claim 1 , the method further comprising:displaying text input by the customer in a first field of a unified interface;parsing the text input by the customer for key terms;querying the knowledgebase using the key terms; anddisplaying responsive results from the knowledgebase as the solution in a second field in the unified interface.3. The method of claim 2 , wherein the method is performed in real-time.42. The method of claim 2 , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AUTOMATED SMART NOTES USING AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004822A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;displaying text associated with the communication in a first field of a unified user interface;automatically analyzing the text to determine key terms associated the communication; andautomatically populating the key terms into a notes field of a call notes user interface.2. The method of claim 1 , further comprising:querying one of a knowledgebase, a customer relationship management (CRM) platform/a customer service management (CSM) platform, and a database of customer-agent transcripts using the key terms; anddisplaying responsive results to the querying in a second field in the unified interface.3. The method of claim 2 , further comprising automatically populating query results into the notes field of the call notes user interface.4. The method of claim 3 , further comprising providing an editor in the call notes user interface to receive edits from the agent of the automatically populated key terms and query results.5. The method of claim 3 , wherein the populated key terms and query results form a call summary.6. The method of claim 1 , further comprising highlighting the key terms in the unified interface.7. The method of claim 1 , further comprising:receiving the communication as speech;converting the speech to text; andparsing the text for key terms.8. The method of claim 1 , wherein the communication is a multi-channel communication and received as one of an SMS text claim 1 , voice call claim 1 , e-mail claim 1 , chat claim 1 , interactive voice response (IVR ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AUTOMATED SMART NOTES USING AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004823A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;displaying text associated with the communication in a first field of a unified user interface;automatically analyzing the text to determine key terms associated the communication; andautomatically populating the key terms into a notes field of a call notes user interface.2. The method of claim 1 , further comprising:querying one of a knowledgebase, a customer relationship management (CRM) platform/a customer service management (CSM) platform, and a database of customer-agent transcripts using the key terms; anddisplaying responsive results to the querying in a second field in the unified interface.3. The method of claim 2 , further comprising automatically populating query results into the notes field of the call notes user interface.4. The method of claim 3 , further comprising providing an editor in the call notes user interface to receive edits from the agent of the automatically populated key terms and query results.5. The method of claim 3 , wherein the populated key terms and query results form a call summary.6. The method of claim 1 , further comprising highlighting the key terms in the unified interface.7. ...

Подробнее
07-01-2021 дата публикации

System and method for automated scheduling using agent assist within a cloud-based contact center

Номер: US20210004824A1
Принадлежит: Talkdesk Inc

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AUTOMATED SCHEDULING USING AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004825A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;displaying text associated with the communication in a first field of a unified user interface;automatically analyzing the text to determine key terms associated the communication; andautomatically using the key terms to schedule a subsequent interaction with the customer.2. The method of claim 1 , further comprising:querying one of a knowledgebase, a customer relationship management (CRM) platform/a customer service management (CSM) platform, and a database of customer-agent transcripts using the key terms; anddisplaying responsive results to the querying in a second field in the unified interface.3. The method of claim 2 , further comprising automatically using query results to schedule the subsequent interaction.4. The method of claim 1 , further comprising:receiving the communication as speech;converting the speech to text; andparsing the text for key terms.5. The method of claim 1 , wherein the communication is a multi-channel communication and received as one of an SMS text claim 1 , voice call claim 1 , e-mail claim 1 , chat claim 1 , interactive voice response (IVR)/intelligent virtual agent (IVA) systems ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AN AGENT ASSIST UNIFIED INTERFACE WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004826A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;presenting text associated with the communication in a unified user interface;automatically analyzing the communication to determine a subject of the customer's communication;automatically querying an information source for at least one response to the subject; andpresenting the response the unified user interface.2. The method of claim 1 , further comprising:parsing the communication for key terms; andautomatically highlighting the key terms in the unified user interface.3. The method of claim 2 , the analyzing further comprising:inferring an intent of the customer using an intent inference module; anddetermining the keywords in accordance with the intent.4. The method of claim 1 , further comprising scrolling the unified user interface to receive subsequent communication from the customer and to present subsequent responses from the information source.5. The method of claim 1 , further comprising:querying a plurality of information sources for responses to the subject; andpresenting the responses from the plurality of information sources in the unified user interface.6. The method of claim 1 , further comprising displaying an indication in the unified user interface that the communications being analyzed.7. The method of claim 1 , further comprising presenting communication controls in the unified user interface.8. The method of claim 1 , further comprising presenting information about the customer.9. The method of claim 8 , wherein the information comprises a customer ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AN AGENT ASSIST UNIFIED INTERFACE WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004827A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;presenting text associated with the communication in a unified user interface;automatically analyzing the communication to determine a subject of the customer's communication;automatically querying an information source for at least one response to the subject; andpresenting the response the unified user interface.2. The method of claim 1 , further comprising:parsing the communication for key terms; andautomatically highlighting the key terms in the unified user interface.3. The method of claim 2 , the analyzing further comprising:inferring an intent of the customer using an intent inference module; anddetermining the keywords in accordance with the intent.4. The method of claim 1 , further comprising scrolling the unified user interface to receive subsequent communication from the customer and to present subsequent responses from the information source.5. The method of claim 1 , further comprising:querying a plurality of information sources for responses to the subject; andpresenting the responses from the plurality of information sources in the unified user interface.6. The method of claim 1 , further comprising ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR REAL-TIME AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004828A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;automatically analyzing the communication in real-time to determine a subject of the customer's communication;automatically parsing a knowledgebase in real-time for at least one responsive answer to a question associated with the subject of the customer's communication; andproviding the at least one responsive answer in real-time to an agent during the communication with the customer.2. The method of claim 1 , wherein the communication is in textual form claim 1 , the method further comprising:displaying text input by the customer in a first field of a unified interface;parsing the text input by the customer for key terms;querying the knowledgebase using the key terms; anddisplaying responsive results from the knowledgebase as the solution in a second field in the unified interface.3. The method of claim 2 , further comprising highlighting the key terms.42. The method of claim 2 , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.5. The method of claim 2 , further comprising:querying a database of customer-agent transcripts using the key terms; anddisplaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.6. The method of claim 1 , further comprising:receiving the communication as speech;converting the speech to text; ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR REAL-TIME AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004829A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;automatically analyzing the communication in real-time to determine a subject of the customer's communication;automatically parsing a knowledgebase in real-time for at least one responsive answer to a question associated with the subject of the customer's communication; andproviding the at least one responsive answer in real-time to an agent during the communication with the customer.2. The method of claim 1 , wherein the communication is in textual form claim 1 , the method further comprising:displaying text input by the customer in a first field of a unified interface;parsing the text input by the customer for key terms;querying the knowledgebase using the key terms; anddisplaying responsive results from the knowledgebase as the solution in a second field in the unified interface.3. The method of claim 2 , further comprising highlighting the key terms.42. The method of claim 2 , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.5. ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AGENT ASSIST ERROR CORRECTION AND DETECTION WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004830A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;routing the communication to an agent;creating a transcript of the communication;analyzing to detect predetermined groupings of words or variations of the groupings of words; andalerting a supervisor upon detecting the predetermined groupings of the words or the variations of the groupings of the words.2. The method of claim 1 , further comprising performing the analyzing to determine agent compliance.3. The method of claim 1 , further comprising determining an emotional state of the agent.4. The method of claim 1 , further comprising determining if the agent is talking too fast claim 1 , too slow or if there is a delay between words.5. The method of claim 1 , further comprising:receiving the communication as speech;converting the speech to text; anddetermining intent from the text.6. The method of claim 1 , wherein the communication is a multi-channel communication and received as one of an SMS text claim 1 , voice call claim 1 , e-mail claim 1 , chat claim 1 , interactive voice response (IVR)/intelligent virtual agent (IVA) systems claim 1 , and social media.7. The method of claim 1 , wherein the analyzing is performed in real time during the communication.8. The method of claim 1 , further comprising:populating a notes interface with a transcript of the communication; andproviding the transcript of the indication to the supervisor.9. The method of claim 8 , further comprising highlighting the predetermined groupings of the words or the variations of the groupings of ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR AGENT ASSIST ERROR CORRECTION AND DETECTION WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004831A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; and receiving a communication from a customer;', 'routing the communication to an agent;', 'creating a transcript of the communication;', 'analyzing to detect predetermined groupings of words or variations of the groupings of words; and', 'alerting a supervisor upon detecting the predetermined groupings of the words or the variations of the groupings of the words., 'executing an agent assist functionality within the automation infrastructure that performs operations comprising2. The method of claim 1 , further comprising performing the analyzing to determine agent compliance.3. The method of claim 1 , further comprising determining an emotional state of the agent.4. The method of claim 1 , further comprising determining if the agent is talking too fast claim 1 , too slow or if there is a delay between words.5. The method of claim 1 , further comprising:receiving the communication as speech;converting the speech to text; anddetermining intent from the text.6. The method of claim 1 , wherein the communication is a multi-channel communication and received as one of an SMS text claim 1 , voice call claim 1 , e-mail claim 1 , chat claim 1 , interactive voice response (IVR)/intelligent virtual agent (IVA) systems claim 1 , and social media.7. The method of claim 1 , wherein the ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR ESCALATION USING AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004832A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;displaying text associated with the communication in a first field of a unified user interface;automatically analyzing the text to determine key terms associated the communication;automatically populating the key terms into a notes field of a call notes user interface; andescalating the communication to a supervisor for further action.2. The method of claim 1 , further comprising:querying one of a knowledgebase, a customer relationship management (CRM) platform/a customer service management (CSM) platform, and a database of customer-agent transcripts using the key terms; anddisplaying responsive results to the querying in a second field in the unified interface.3. The method of claim 2 , further comprising automatically populating query results into the notes field of the call notes user interface.4. The method of claim 3 , further comprising providing an editor in the call notes user interface to receive edits from the agent of the automatically populated key terms and query results.5. The method of claim 3 , wherein the populated key terms and query results form a call summary for use by the supervisor to understand ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR QUERYING MULTIPLE INFORMATION SOURCES USING AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004833A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;presenting text associated with the communication in a unified user interface;automatically analyzing the communication to determine a subject of the customer's communication;automatically querying an in a plurality of information sources in real time formation source for at least one response to the subject; andpresenting the at least one response from the plurality of information sources the unified user interface.2. The method of claim 1 , further comprising:parsing the communication for key terms; andautomatically highlighting the key terms in the unified user interface.3. The method of claim 2 , the analyzing further comprising:inferring an intent of the customer using an intent inference module; anddetermining the keywords in accordance with the intent.4. The method of claim 1 , further comprising scrolling the unified user interface to receive subsequent communication from the customer and to present subsequent responses from the information source.5. The method of claim 1 , further comprising ranking a plurality of responses from the plurality of information sources in accordance with a relevance of each of the plurality of responses.6. The method of claim 1 , further comprising displaying an indication in the unified user interface that the communications being analyzed.7. The method of claim 1 , further comprising presenting communication controls in the unified user interface.8. The method of claim 1 , further comprising presenting information about the ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR COMMUNICATION ANALYSIS FOR USE WITH AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004834A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;automatically analyzing the communication to determine a subject of the customer's communication;automatically querying a database of communications between other customers and other agents related to the subject of the customer's communication;determining at least one responsive answer to the subject from the database; andproviding the at least one responsive answer to an agent during the communication with the customer.2. The method of claim 1 , further comprising populating the database with information about the communications claim 1 , wherein information includes at least one of a time of the other communication claim 1 , a duration of the origination claim 1 , a topic of the other communication claim 1 , a frequency of words in the other communication.3. The method of claim 1 , wherein the communication is in textual form claim 1 , the method further comprising:displaying text input by the customer in a first field of a unified interface;parsing the text input by the customer for key terms;querying the knowledgebase using the key terms; anddisplaying responsive results from the knowledgebase as the solution in a second field in the unified interface.41. The method of claim 1 , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.5. The method of claim 1 , further comprising: ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR COMMUNICATION ANALYSIS FOR USE WITH AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004835A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a communication from a customer;automatically analyzing the communication to determine a subject of the customer's communication;automatically querying a database of communications between other customers and other agents related to the subject of the customer's communication;determining at least one responsive answer to the subject from the database; andproviding the at least one responsive answer to an agent during the communication with the customer.2. The method of claim 1 , further comprising populating the database with information about the communications claim 1 , wherein information includes at least one of a time of the other communication claim 1 , a duration of the origination claim 1 , a topic of the other communication claim 1 , a frequency of words in the other communication.3. The method of claim 1 , wherein the communication is in textual form claim 1 , the method further comprising:displaying text input by the customer in a first field of a unified interface;parsing the text input by the customer for key terms;querying the knowledgebase using the key terms; anddisplaying responsive results from the knowledgebase as the solution in ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR PRE-POPULATING FORMS USING AGENT ASSIST WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210004836A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:receiving a communication from a customer;displaying text associated with the communication in a first field of a unified user interface;automatically analyzing the text to determine key terms associated the communication; andautomatically populating the key terms into a web-based form.2. The method of claim 1 , further comprising:querying one of a knowledgebase, a customer relationship management (CRM) platform/a customer service management (CSM) platform, and a database of customer-agent transcripts using the key terms; anddisplaying responsive results to the querying in a second field in the unified interface.3. The method of claim 2 , further comprising automatically populating query results into the web form.4. The method of claim 3 , further comprising providing an editor in the call notes user interface to receive edits from the agent of the automatically populated key terms and query results.5. The method of claim 3 , wherein the populated key terms and query results are responsive inputs to the web form.6. The method of claim 1 , further comprising highlighting the key terms in the unified interface.7. The method of claim 1 , further comprising:receiving the communication as speech;converting the speech to text; andparsing the text for key terms.8. The method of claim 1 , wherein the communication is a multi-channel communication and received as one of an SMS text claim 1 , voice call claim 1 , e-mail claim 1 , chat claim 1 , interactive voice response (IVR)/intelligent virtual agent (IVA) systems claim ...

Подробнее
03-01-2019 дата публикации

TOPIC SHIFT DETECTOR

Номер: US20190005123A1
Принадлежит:

Aspects detect or recognize shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues. An initial topic is mapped to one or more keywords extracted from a first spoken query within a user keyword ontology mapping. A query spoken subsequent in time to the first query is identified and distinguished by recognizing one or more non-verbal cues associated with the audio data input that include a time elapsed between the queries, and in some aspects a user's facial expression or motion activity. Aspects determine whether the second spoken query is directed to the initial topic or to a new topic that is different from the initial topic, as a function of mappings of the keyword(s) extracted from the first query to one or more keywords extracted from the second query and to the non-verbal cue(s) within the user ontology mapping. 1. A computer-implemented method for detecting shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues , the method comprising executing on a computer processor:identifying an initial topic of a first spoken query within an audio data input from a user that is mapped to at least one keyword extracted from the first spoken query within a user keyword ontology mapping;identifying a second spoken query within the audio data input that is subsequent in time to the first spoken query and is distinguished from the first query by recognizing at least one non-verbal cue associated with the audio data input, wherein the at least one non-verbal cue comprises a time elapsed between the first spoken query and the second spoken query, and a user's motion activity relative to a programmable device comprising the processor during a time between the first spoken query and the second spoken query;determining whether the second spoken query is directed to the initial topic or to a new topic that is different from the initial topic, as a ...

Подробнее
07-01-2021 дата публикации

System and Method for Audio Event Detection in Surveillance Systems

Номер: US20210005067A1
Принадлежит:

A method and system for detecting and localizing a target audio event in an audio clip is disclosed. The method and system use utilizes a hierarchical approach in which a dilated convolutional neural network to detect the presence of the target audio event anywhere in an audio clip based on high level audio features. If the target audio event is detected somewhere in the audio clip, the method and system further utilizes a robust audio vector representation that encodes the inherent state of the audio as well as a learned relationship between state of the audio and the particular target audio event that was detected in the audio clip. A bi-directional long short term memory classifier is used to model long term dependencies and determine the boundaries in time of the target audio event within the audio clip based on the audio vector representations. 1. A method for detecting and localizing a target audio event in an audio clip , the method comprising:receiving, with a processor, an audio clip;determining, with the processor, a plurality of audio features based on the audio clip;determining, with the processor, whether the target audio event is present in the audio clip using a first neural network based on the plurality of audio features;determining, with the processor, in response to determining that the target audio event is present in the audio clip, a plurality of vectors based on (i) the plurality of audio features and (ii) the target audio event, the vectors in the plurality of vectors indicating a correlation between audio features in the plurality of audio features and the target audio event; anddetermining, with the processor, a position in time of the target audio event within the audio clip using a second neural network based on the plurality of vectors.2. The method of claim 1 , the determining the plurality of audio features further comprising:segmenting the audio clip into a plurality of audio segments with a first predetermined length; anddetermining, ...

Подробнее
07-01-2021 дата публикации

Abuse Alert System by Analyzing Sound

Номер: US20210005069A1
Автор: Mangal Aryan
Принадлежит:

A method, including recognizing an indicia pattern on an audio signal recognizing its characteristics including but not limited to intensity, pitch, presence of abuse words, utilizing a portable device based sound controller system, prompting one of acceptance and disapproval of the indicia pattern, detecting a present of abuse intent and if intent is present, start taking an action e.g. recording, emailing or flashing an LED. 1. A method , comprising:recognizing an indicia pattern of abuse detection through analysing various characteristics of sound;prompting one of acceptance and disapproval of the indicia pattern determining abusal intent;start recording a sound clip from a sound input if the abuse detection indicia pattern is accepted;generate alert and send communication in any desirable format as configured; andprompting one of confirmation and refusal of the pairing.2. The method of claim 1 , further comprising detecting voice intensity by measuring any or all of amplitude claim 1 , pitch to at least one of accept the indicia pattern and disapprove the indicia pattern.3. The method of claim 1 , further comprising detecting voice intensity can be tuned based on person claim 1 , environment claim 1 , surrounding noise level and other factors that can impact sound detection.4. The method of claim 1 , further comprising detecting voice intensity duration to at least one of confirm the abuse and refuse the abuse.5. The method of claim 1 , further comprising detecting presence of abusive words by converting voice to text to at least one of confirm the abuse and refuse the abuse.6. In addition to claim 4 , ability to choose and personalize abusive or derogatory words is also claimed.7. The method of claim 1 , further comprising machine learning models that can be miniaturized in size and can be fit onto portable devices such as microcontrollers claim 1 , mobile phone claim 1 , tablets with the sole purpose of abuse detection.8. Adding to claim 7 , a feedback loop ...

Подробнее
07-01-2021 дата публикации

AUDIBLE KEYWORD DETECTION AND METHOD

Номер: US20210005181A1
Принадлежит:

The disclosure describes keyword detection in an audio processor and methods therefor including a low-power keyword detection engine (LKDE) and a high-power keyword detection engine (HKDE). In one implementation, the LKDE detects a keyword in data from a single audio source while buffering data from multiple audio sources and, upon detection of a keyword, the HKDE is awakened to verify the previously detected keyword by processing the buffered audio data from the multiple sources. 1. A digital processor for processing audio data , the processor comprising:an audio data interface;a buffer coupled to the interface and configured to buffer data received at the interface;a low-power keyword detection engine (LKDE) configured to determine likely presence of a keyword in data received at the interface while the data is buffered in the buffer; anda high-power keyword detection engine (HKDE) configured to wakeup from a low-power sleep mode if the LKDE determines likely presence of a keyword, and after awakening, verify the likely presence of the keyword detected by the LKDE by processing data in the buffer,wherein the HKDE is configured to detect keywords with higher certainty than the LKDE.2. The processor of claim 1 ,wherein the LKDE is configured to determine likely presence of a keyword with a true positive rate (TPR) above a first threshold and a false acceptance rate (FAR) below a second threshold, wherein the first and second thresholds are constrained by a maximum acceptable power consumption associated with a duty cycle with which the HKDE is awakened, andwherein the HKDE is configured to detect likely presence of a keyword with a lower FAR than the LKDE.3. The processor of claim 2 , wherein the LKDE is configured to determine likely presence of a keyword based on whether a confidence level associated with detection of the keyword satisfies a condition.4. The processor of claim 2 ,the interface is a multi-source interface and the buffer is configured to buffer data ...

Подробнее
07-01-2021 дата публикации

ORTHOGONALLY CONSTRAINED MULTI-HEAD ATTENTION FOR SPEECH TASKS

Номер: US20210005183A1
Принадлежит:

A method for operating a neural network includes receiving an input sequence at an encoder. The input sequence is encoded to produce a set of hidden representations. Attention-heads of the neural network calculate attention weights based on the hidden representations. A context vector is calculated for each attention-head based on the attention weights and the hidden representations. Each of the context vectors correspond to a portion of the input sequence. An inference is output based on the context vectors. 1. A method for operating a neural network , the method comprising:receiving an input sequence at an encoder;encoding the input sequence to produce hidden representations;calculating attention weights in attention-heads of the neural network based on the hidden representations;calculating a context vector for each attention-head based on the attention weights and the hidden representations, each context vector corresponding to a portion of the input sequence; andoutputting an inference based on the context vectors.2. The method of claim 1 , in which the input sequence comprises an acoustic feature and the inference includes an indication of whether a keyword is included in a corresponding portion of the input sequence.3. The method of claim 1 , in which each of the context vectors are orthogonal to other context vectors of other attention-heads.4. The method of claim 1 , in which the attention weight is calculated based on a score function for focus determination.5. The method of claim 1 , further comprising selectively regularizing the neural network based on orthogonality constraints of the attention weights and context vectors.6. The method of claim 5 , in which the selectively regularizing further comprises calculating regularization terms only for positive samples in the input sequence.7. An apparatus for operating a neural network claim 5 , the apparatus comprising:a memory; andat least one processor coupled to the memory, the at least one processor being ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR TEXT-ENABLED AUTOMATED AGENT ASSISTANCE WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210005192A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method , comprising:executing an automation infrastructure within a cloud-based contact center that includes a communication manager, speech-to-text converter, a natural language processor, and an inference processor exposed by application programming interfaces; andexecuting an agent assist functionality within the automation infrastructure that performs operations comprising:receiving a text communication from a customer;performing inference processing on the text to determine a customer intent;automatically analyzing the text to determine a subject of the text communication and key terms associated with the subject;automatically parsing a knowledgebase using the key terms for at least one responsive answer associated with the subject; andproviding the solution to an agent in a unified interface during the communication with the customer.21. The method of , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.3. The method of claim 1 , further comprising:querying a database of customer-agent transcripts using the key terms; anddisplaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.4. The method of claim 1 , wherein the method is performed in real-time as the customer communication progresses with the agent.5. The method of claim 1 , further comprising concurrently displaying to ...

Подробнее
07-01-2021 дата публикации

System and method for speech-enabled automated agent assistance within a cloud-based contact center

Номер: US20210005206A1
Принадлежит: Talkdesk Inc

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.

Подробнее
04-01-2018 дата публикации

DEVICE INCLUDING SPEECH RECOGNITION FUNCTION AND METHOD OF RECOGNIZING SPEECH

Номер: US20180005627A1
Принадлежит:

A device including a speech recognition function which recognizes speech from a user, includes: a loudspeaker which outputs speech to a space; a microphone which collects speech in the space; a first speech recognition unit which recognizes the speech collected by the microphone; a command control unit which issues a command for controlling the device, based on the speech recognized by the first speech recognition unit; and a control unit which prohibits the command issuance unit from issuing the command, based on the speech to be output from the loudspeaker. 17-. (canceled)8. A device including a speech recognition function which recognizes user speech which is speech from a user , the device comprising:a loudspeaker which outputs speech to a space;a microphone which collects speech in the space;a speech output unit configured to output a speech signal;a first speech recognition unit configured to recognize the speech collected by the microphone;a first control unit configured to generate a command for controlling the speech output unit, based on the speech recognized by the first speech recognition unit; anda second control unit configured to permit or prohibit issuance of the command to the speech output unit, based on the speech signal,wherein the second control unit includes a second speech recognition unit configured to analyze the speech signal to recognize the speech to be output from the loudspeaker, andthe second control unit is configured to determine whether or not the speech recognized by the second speech recognition unit matches a predetermined keyword, and when the speech recognized by the second speech recognition unit matches the predetermined keyword, prohibit the issuance of the command and when the speech recognized by the second speech recognition unit does not match the predetermined keyword, permit issuance of the command.9. A method of recognizing user speech using a device including a speech recognition function , the user speech being ...

Подробнее
04-01-2018 дата публикации

PERFORMING TASKS AND RETURING AUDIO AND VISUAL ANSWERS BASED ON VOICE COMMAND

Номер: US20180005631A1
Принадлежит: KT CORPORATION

An artificial intelligence voice interactive system may provide various services to a user in response to a voice command by providing an interface between the system and a legacy system to enable providing various types of existing services in response to user speech without modifying systems for the existing services. Such system includes a central server, and the central server may perform operations of registering a plurality of service servers at the central server and storing registration information of each service server, analyzing voice command data from the user device and determining at least one task and corresponding service servers based on the analysis results, generating an instruction message based on the voice command data, the determined at least one task, and the registration information of the selected service servers, and transmitting the generated instruction message to the selected service servers, and receiving task results including audio and video data from the selected service servers and outputting the task results through at least one device associated with the user device. 1. A method for providing voice interactive services to a user through an artificial intelligence voice interactive system including a central server and a plurality of service servers each connected to the central server through a communication network , the method of the central server comprising:performing a registration operation upon generation of a predetermined event for registering the service servers and storing registration information of each service server;receiving voice command data, determining at least one task to perform based on the voice command data, and selecting at least one of the service servers based on the determined task; andgenerating an instruction message based on the voice command data, the determined at least one task, and the registration information of the selected service server, and transmitting the generated instruction message to ...

Подробнее
02-01-2020 дата публикации

METHOD AND APPARATUS FOR RECOGNIZING A VOICE

Номер: US20200005768A1
Автор: Chae Jong Hoon
Принадлежит: LG ELECTRONICS INC.

Disclosed are a speech recognition method and a speech recognition device, in which speech recognition is performed by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm provided therein. According to an embodiment of the present disclosure, the speech recognition method includes buffering an inputted spoken utterance, determining whether a preset wake-up word is present in the spoken utterance by comparing the buffered spoken utterance to the preset wake-up word, and in response to the preset wake-up word in the spoken utterance, activating a speech recognition function and isolating, from the spoken utterance, a spoken sentence as a voice command without the wake-up word, and processing the spoken sentence and outputting a processing result. 1. A speech recognition method by a speech recognition device , the speech recognition method comprising:buffering an inputted spoken utterance;comparing the buffered spoken utterance to a preset wake-up word to determine whether the preset wake-up word is present in the spoken utterance;in response to the preset wake-up word present in the spoken utterance, activating a speech recognition function and isolating, from the spoken utterance, a spoken sentence as a voice command without the wake-up word; andprocessing the spoken sentence and outputting a processing result.2. The speech recognition method of claim 1 , wherein the buffering further includes buffering by varying a buffering volume or time in accordance with an environment condition under which the spoken utterance is inputted.3. The speech recognition method of claim 1 , wherein the isolating includes isolating claim 1 , from the spoken utterance claim 1 , the spoken sentence preceding the wake-up word.4. The speech recognition method of claim 1 , wherein the isolating includes isolating claim 1 , from the spoken utterance claim 1 , a first spoken segment and a second spoken segment without the wake-up word claim 1 , wherein the ...

Подробнее
02-01-2020 дата публикации

Method for Term-Dependent Output of Information Based on a Voice Input to a Specific Group, and System

Номер: US20200005785A1
Принадлежит:

A method for term-dependent output of information based on a voice input to a specific group includes the steps of: capturing the voice input; analyzing the captured voice input for the presence of a group-specific key term, associated with the specific group; and on detection of the group-specific key term in the analyzed voice input, outputting the information based on the voice input to the specific group. 1. A method for term-dependent output of information based on a voice input to a specific group , the method comprising the steps of:a) capturing the voice input;b) analyzing the captured voice input for a presence of a group-specific key term, associated with the specific group; andc) on detection of the group-specific key term in the analyzed voice input, outputting the information based on the voice input to the specific group.2. The method according to claim 1 , wherein the specific group is one or more of:a garden and/or forestry worker group, and the group-specific key term is a garden and/or forestry term,a hauler group, and the group-specific key term is a hauler term,a forestry and/or work management group, and the group-specific key term is a forestry and/or work management term, anda rescue group, and the group-specific key term is an emergency term and/or a rescue term.3. The method according to claim 1 , further comprising the step of:ascertaining a direction from an output location to a capture location, andwherein the information has the ascertained direction.4. The method according to claim 1 , further comprising the steps of:ascertaining a distance between a capture location and an output location; andif the ascertained distance is below a distance limit value, outputting the information in group-aspecific fashion.5. The method according to claim 4 , whereinthe distance is ascertained by a time-of-flight method.6. The method according to claim 1 , further comprising the step of:wirelessly transmitting the captured voice input and/or the ...

Подробнее
02-01-2020 дата публикации

GUIDE ROBOT AND METHOD FOR OPERATING THE SAME

Номер: US20200005787A1
Автор: MAENG Jichan, SHIN Wonho
Принадлежит: LG ELECTRONICS INC.

The present disclosure relates to a guide robot and a method of operating the same. A guide robot according to the present disclosure includes a voice receiving unit to receive a voice, a controller to determine whether the received voice includes a preset wake-up word, and a wireless communication unit to perform communication with an artificial intelligence (AI) server set to be activated by the preset wake-up word. At this time, the control unit transmits the received voice to the artificial intelligence server, receives result information from the artificial intelligence server, and outputs the received result information, when the received voice includes the preset wake-up word. And, the control unit outputs a response voice selected according to a predetermined reference when the received voice does not include the preset wake-up word. 1. A guide robot , comprising:a voice receiving unit configured to receive a voice;a control unit configured to determine whether the received voice includes a preset wake-up word; anda wireless communication unit configured to perform communication with an artificial intelligence (AI) server set to be activated by the preset wake-up word,wherein the control unit transmits the received voice to the artificial intelligence server, receives result information from the artificial intelligence server, and output the received result information, when the received voice includes the preset wake-up word, andoutputs a response voice selected according to a predetermined reference when the received voice does not include the preset wake-up word.2. The guide robot of claim 1 , wherein the control unit performs a greeting recognition operation when the received voice does not include the preset wake-up word claim 1 , and determines whether the received voice is recognized as a greeting based on a sensing signal received from at least one sensor in the greeting recognition operation.3. The guide robot of claim 2 , wherein the control unit ...

Подробнее
02-01-2020 дата публикации

METHOD AND APPARATUS FOR RECOGNIZING A VOICE

Номер: US20200005789A1
Автор: Chae Jong Hoon
Принадлежит: LG ELECTRONICS INC.

Disclosed are a speech recognition device and a speech recognition method, which perform speech recognition by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm provided therein, and which can communicate with other electronic devices and an external server in a 5G communication environment. According to an embodiment, the speech recognition method includes setting an additional wake-up word target capable of activating a speech recognition function in addition to a preset wake-up word, generating a plurality of additional wake-up word utterances formed on the basis of the additional wake-up word target being uttered under various conditions, learning a wake-up word recognition algorithm by using each of the spoken utterances of the additional wake-up word to generate an additional wake-up word recognition algorithm, and executing the additional wake-up word recognition algorithm upon receiving a select word uttered by a user to determine whether to activate the speech recognition function. 1. A speech recognition method by a speech recognition device , comprising:setting an additional wake-up word target in addition to a preset wake-up word, the additional wake-up word target being capable of activating a speech recognition function;generating a plurality of utterance data items with respect to the additional wake-up word target based on the additional wake-up word target being uttered under various conditions;generating an additional wake-up word recognition algorithm by training a wake-up word recognition algorithm by using each of the plurality of utterance data items as training input data; anddetermining whether to activate the speech recognition function by executing the additional wake-up recognition algorithm in response to a select word received and uttered by a user.2. The speech recognition method of claim 1 , the setting of the additional wake-up word target includes:receiving an additional wake-up word setting menu ...

Подробнее
02-01-2020 дата публикации

DESIRED SIGNAL SPOTTING IN NOISY, FLAWED ENVIRONMENTS

Номер: US20200005813A1
Принадлежит:

There are disclosed devices, system and methods for desired signal spotting in noisy, flawed environments by identifying a signal to be spotted, identifying a target confidence level, and then passing a pool of cabined arrays through a comparator to detect the identified signal, wherein the cabined arrays are derived from respective distinct environments. The arrays may include plural converted samples, each converted sample include a product of a conversion of a respective original sample, the conversion including filtering noise and transforming the original sample from a first form to a second form. Detecting may include measuring a confidence of the presence of the identified signal in each of plural converted samples using correlation of the identified signal to bodies of known matching samples. If the confidence for a given converted sample satisfies the target confidence level, the given sample is flagged. 1. A method of desired signal spotting in noisy , flawed environments comprising:identifying a signal to be spotted, wherein the signal is one or more spoken terms;identifying a target confidence level;passing a pool of call samples through a comparator to spot the identified signal, wherein the call samples are derived from respective distinct environments, wherein spotting the identified signal comprises measuring a confidence of a presence of the identified signal in each of the call samples using correlation of the identified signal to bodies of known matching samples that are machine created reference signals, and wherein the correlation includes the comparator performing a weighting analysis comparison of the identified signal to the bodies of known matching samples that are selected by the comparator during measuring of each of the call samples;if the confidence for a given call sample satisfies the target confidence level, flagging the given call sample as having the identified signal; andusing the flagged sample as a signal of the bodies of known ...

Подробнее
03-01-2019 дата публикации

METHOD AND DEVICE FOR MANAGING DIALOGUE BASED ON ARTIFICIAL INTELLIGENCE

Номер: US20190005948A1
Принадлежит:

Embodiments of the present disclosure provide a method and a device for managing a dialogue based on artificial intelligence. The method includes the followings. An optimum system action is determined from at least one candidate system action according to a current dialogue status feature, a candidate system action feature and surrounding feedback information of the at least one candidate system action and based on a decision model. Since the current dialogue status corresponding to the current dialogue status feature includes uncertain results of natural language understanding, the at least one candidate system action acquired according to the current dialogue status also includes the uncertain results of natural language understanding. 1. A method for managing a dialogue based on artificial intelligence , comprising:receiving current dialogue information;determining a user intention of the current dialogue information;determining query dimension distribution information and current single-round slot distribution information of the current dialogue information under the user intention;generating current multi-round slot distribution information according to the current single-round slot distribution information of the current dialogue information and historical multi-round slot distribution information of historical dialogue information;generating a current dialogue status according to the user intention, the query dimension distribution information and the current multi-round slot distribution information of the current dialogue information;performing a first feature extraction on the current dialogue status to obtain a current dialogue status feature;determining at least one candidate system action according to the current dialogue status and a pre-configured rule;performing a second feature extraction on the at least one candidate system action to obtain a candidate system action feature of each of the at least one candidate system; andinputting the current ...

Подробнее
03-01-2019 дата публикации

Hands free always on near field wakeword solution

Номер: US20190005953A1
Принадлежит: Amazon Technologies Inc

Apparatuses and systems for conserving power for a portable electronic device that monitors local audio for a wakeword are described herein. In a non-limiting embodiment, a portable electronic device may have two-phases. The first phase may be a first circuit that stores an audio input while determining whether human speech is present in the audio input. The second phase may be a second circuit that activates when the first circuit determines that human speech is present in the audio input. The second circuit may receive the audio input from the first circuit, store the audio input, and determine whether a wakeword is present within the audio input.

Подробнее
03-01-2019 дата публикации

WAKE-ON-VOICE METHOD, TERMINAL AND STORAGE MEDIUM

Номер: US20190005954A1
Принадлежит:

The present disclosure provides a wake-on-voice method, a terminal and a storage medium. The method includes: acquiring a wake-up voice configured to wake up a smart terminal; performing an analysis on an acoustic feature of the wake-up voice by using a preset acoustic model and a preset wake-up word recognition network of the smart terminal, so as to acquire a confidence coefficient of the acoustic feature of the wake-up voice with respect to an acoustic feature of a preset wake-up word; determining whether the confidence coefficient falls in a preset range of moderate confidence coefficients, if yes, uploading the wake-up voice to a remote server; and determining whether a linguistic feature obtained by analyzing the wake-up voice using a linguistic model matches to a linguistic feature of the preset wake-up word, if yes, receiving an instruction to wake up the smart terminal generated by the remote server. 1. A wake-on-voice method , comprising:acquiring a wake-up voice configured to wake up a smart terminal;performing an analysis on an acoustic feature of the wake-up voice by using a preset acoustic model and a preset wake-up word recognition network of the smart terminal, so as to acquire a confidence coefficient of the acoustic feature of the wake-up voice with respect to an acoustic feature of a preset wake-up word;determining whether the confidence coefficient falls in a preset range of moderate confidence coefficients, and if yes, uploading the wake-up voice to a remote server; anddetermining whether a linguistic feature obtained by analyzing the wake-up voice using a linguistic model in the remote server matches to a linguistic feature of the preset wake-up word, and if yes, receiving an instruction to wake up the smart terminal generated by the remote server.2. The wake-on-voice method according to claim 1 , wherein after it is determined that the linguistic feature obtained by analyzing the wake-up voice using the linguistic model in the remote server ...

Подробнее
03-01-2019 дата публикации

METHODS, SYSTEMS, AND MEDIA FOR VOICE-BASED CALL OPERATIONS

Номер: US20190005955A1
Принадлежит:

Methods, systems, and media for voice-based call operations are provided. In some embodiments, a method comprises: receiving, at a first user device, a communication; detecting a voice command, using the first user device, that includes a keyword; and in response to detecting the voice command, causing the communication to be transferred to a second user device that is associated with the keyword. 1. A method comprising:receiving, at a first user device, a communication;detecting a voice command, using the first user device, that includes a keyword; andin response to detecting the voice command, causing the communication to be transferred to a second user device that is associated with the keyword.2. The method of claim 1 , wherein the communication is an invitation to join a telecommunication channel.3. The method of claim 2 , wherein the telecommunication channel is a telephone channel.4. The method of claim 2 , wherein the telecommunication channel is an Internet videotelephony channel.5. The method of claim 2 , further comprising:in response to receiving the invitation to join the telecommunication channel, presenting a notification of the invitation during a period of time, wherein the period of time is terminated upon acceptance of the invitation, and wherein the voice command is detected during the period of time.6. The method of claim 2 , further comprising causing the second user device to join the telecommunication channel.7. The method of claim 1 , wherein causing the communication to be transferred to the second user device is further in response to detecting a second keyword in the voice command claim 1 , wherein the second keyword corresponds to a communication transfer operation.8. The method of claim 1 , wherein the communication is an SMS message or an MMS message.9. The method of claim 1 , further comprising:receiving an indication of the keyword; anddetermining that the keyword is to be associated with the second user device.10. A system ...

Подробнее
05-01-2017 дата публикации

ELECTRONIC DEVICE AND METHOD FOR CONTROLLING THE SAME

Номер: US20170006329A1
Принадлежит: LG ELECTRONICS INC.

A digital television can include a display; an interface to receive a control signal from a remote controller; a communication unit for internet access; a broadcast receiver; and a controller to display broadcast content received through the broadcast receiver, display a screen corresponding to a voice recognition mode in response to entering the voice recognition mode, the screen includes a symbol and text indicating the voice recognition mode, receive a voice command signal corresponding to a voice command through the remote controller in the voice recognition mode while the broadcast content is displayed, in which the voice command is input via a microphone provided in the remote controller, and the input voice command includes a search source and a search keyword, and display a web page corresponding to the search source included in the voice command, which includes a search window including the search keyword and a search result. 1. A digital television comprising:a display;an interface configured to receive a control signal for controlling an operation of the digital television from a remote controller;a communication unit configured to support internet access;a broadcast receiver configured to receive broadcast signals or broadcast associated information from an external broadcast management server via a broadcast channel; and display broadcast content received through the broadcast receiver on the display,', 'display a screen corresponding to a voice recognition mode on the display in response to entering the voice recognition mode, wherein the screen includes a symbol and text indicating the voice recognition mode,', 'receive a voice command signal corresponding to a voice command through the remote controller in the voice recognition mode while the broadcast content is displayed, wherein the voice command is input via a microphone provided in the remote controller, wherein the input voice command includes a search source and a search keyword, and', ' ...

Подробнее
07-01-2021 дата публикации

METHOD FOR CONTACT-CENTER MOBILE VIRTUAL NETWORK OPERATION

Номер: US20210006654A1
Автор: Gramacho Rui, Paiva Tiago
Принадлежит:

Methods for providing a cloud-based contact center solution configured as a mobile virtual network operator (MVNO) that can provide mobile subscription and call services and applications to contact center agents over a third-party communication system (e.g., without the contact center having its own spectrum allocation. Advanced or complex applications and services may thus be maintained/executed in the cloud-based contact center with minimal configuration to the respective mobile device that would use such applications and services. 1. A method is disclosed of configuring a mobile device for use by an agent of a cloud-based contact center to natively operate with the cloud-based contact center , the method comprising:installing in the mobile device a subscriber identification module (SIM) associated with operation with a mobile network operator comprising a cloud-based contact center;authenticating via the mobile device the subscriber identification module with the mobile network operator comprising the cloud-based contact center; andnatively routing from a network infrastructure at least a portion of a data transmission from an initiated data exchange between the mobile device and the mobile network operator comprising the cloud-based contact center, wherein the mobile network operator is configured to analyze the data transmission in one or more intelligent contact center applications.2. The method of claim 1 , further comprising:converting, at the mobile network operator comprising the cloud-based contact center, a voice portion of the routed portion of data transmission to text data; andanalyzing, at the mobile network operator comprising the cloud-based contact center, the text data to provide agent assist information in the intelligent contact center applications.3. The method of claim 2 , wherein at least one of the intelligent contact center applications is configured to:extract smart notes from the text data real-time; andtransmit the smart notes to the ...

Подробнее
07-01-2021 дата публикации

CONTACT-CENTER MOBILE VIRTUAL NETWORK OPERATOR

Номер: US20210006655A1
Автор: Gramacho Rui, Paiva Tiago
Принадлежит:

Disclosed herein are computer readable medium with instructions for a cloud-based contact center, and corresponding network devices, configured as a mobile virtual network operator (MVNO) that can provide mobile subscription and call services and applications to contact center agents over a third-party communication system. 1. A non-transitory computer readable medium comprising instructions stored thereon , wherein execution of the instructions by a processor cause the processor to:authenticate a mobile device comprising subscriber identification module, the subscriber identification module being configured with an identifier associated with a mobile network operator comprising a cloud-based contact center; andnatively route at least a portion of data transmission from an initiated data exchange between the mobile device and the mobile network operator comprising the cloud-based contact center, wherein the mobile network operator is configured to analyze the data transmission for one or more intelligent contact center applications.2. The computer readable medium of claim 1 , wherein the mobile network operator comprising the cloud-based contact center is configured to i) process a voice portion of the routed portion of data transmission and convert the voice portion to text data and ii) analyze the text data to provide agent assist information in the intelligent contact center applications.3. The computer readable medium of claim 2 , wherein the intelligent contact center applications comprise a first application configured to extract smart notes from the text data real-time and transmit the smart notes to the mobile device to be presented at a graphical user interface of the mobile device.4. The computer readable medium of claim 2 , wherein the intelligent contact center applications comprise a second application configured to search a database with the text data and transmit retrieved information from the search to the apparatus to be presented at a graphical ...

Подробнее
07-01-2021 дата публикации

SYSTEM AND METHOD FOR SPEECH-ENABLED AUTOMATED AGENT ASSISTANCE WITHIN A CLOUD-BASED CONTACT CENTER

Номер: US20210006660A1
Принадлежит:

Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing. 1. A method for automated services in a geographically-distributed cloud-based contact center , the method comprising:receiving, by the contact center, a speech communication from a customer;converting, by the contact center, the speech communication to text to perform inference processing on the text to determine a customer intent;automatically analyzing, by the contact center, the text to determine a subject of the speech communication and key terms associated with the subject;automatically parsing, by the contact center, a knowledgebase using the key terms for at least one responsive answer associated with the subject; andproviding, by the contact center, the solution to an agent in a unified interface during the communication with the customer.21. The method of , further comprising:querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; anddisplaying responsive results from the CRM/CSM in the second field in the unified interface.3. The method of claim 1 , further comprising:querying a database of customer-agent transcripts using the key terms; anddisplaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.4. The method of claim 1 , wherein the method is performed in real-time as the customer communication progresses with the agent.5. The method of claim 1 , further comprising concurrently displaying to the agent the text in a first field of the unified interface and the solution in a second field of ...

Подробнее
04-01-2018 дата публикации

CUSTOMER CARE DATABASE CREATION SYSTEM AND METHOD

Номер: US20180007205A1
Принадлежит:

A live service agent is associated to a first service session, where the first service session facilitates a communication exchange associated with a user inquiry between user equipment and a first service resource. The first service session remains uninterrupted by the associating of the live service agent. A verbal exchange of the communication exchange between the user equipment and the live service agent is recorded in response to the associating of the live service agent, and a number of keywords are identified. To the extent that the verbal exchange yields a satisfactory response to the user inquiry and that the recording is dissimilar to other recordings of a solution database, the recording is stored in the solution database. Other embodiments are disclosed. 1. A method , comprising:joining, by a system comprising a processing system including a processor, a live, human customer service agent to a first interactive customer service session, wherein the first interactive customer service session facilitates a communication exchange between customer equipment and a first customer service resource associated with a customer inquiry, wherein the first interactive customer service session remains uninterrupted by the joining of the live, human customer service agent;recording, by the processing system, the communication exchange responsive to the joining of the live, human customer service agent to obtain a recording, wherein the communication exchange comprises a verbal exchange between the customer equipment and the live, human customer service agent;identifying, by the processing system, a plurality of keywords within the verbal exchange;determining, by the processing system, that the verbal exchange yielded a satisfactory response to the customer inquiry;determining, by the processing system, that the recording is dissimilar to other recordings of a solution database comprising a plurality of recordings of other verbal exchanges; andstoring, by the processing ...

Подробнее
02-01-2020 дата публикации

METHOD AND APPARATUS FOR SENDING INFORMATION

Номер: US20200007656A1
Автор: Zhu Ziqiang
Принадлежит:

A method and apparatus for sending information are provided. An embodiment of the method comprises: determining, in response to acquiring a user speech audio through a target application, a user command instructed by the user speech audio, and determining whether the user command satisfies a preset trigger condition for plug-in downloading; and sending, in response to the user command satisfying the trigger condition for plug-in downloading, a request for downloading a target plug-in to a target server, the target plug-in being a plug-in in a preset plug-in set for the target application, and the plug-in being not locally installed. According to the embodiment, a terminal device may be triggered to download the plug-in based on the content instructed by the user speech audio, to implement the functional upgrading. Therefore, the self-learning capability and the self-upgrading capability of the terminal device are improved, which makes the response to the user command more accurate and more pertinent. 1. A method for sending information , comprising:determining, in response to acquiring a user speech audio through a target application, a user command instructed by the user speech audio, and determining whether the user command satisfies a preset trigger condition for plug-in downloading; andsending, in response to the user command satisfying the trigger condition for plug-in downloading, a request for downloading a target plug-in to a target server, the target plug-in being a plug-in in a preset plug-in set for the target application, and the plug-in being not locally installed.2. The method according to claim 1 , wherein the user command instructs to execute an operation claim 1 , the trigger condition for plug-in downloading includes that the operation the user command instructs to execute is inexecutable claim 1 , and the target plug-in supports the operation.3. The method according to claim 1 , wherein the trigger condition for plug-in downloading includes that ...

Подробнее
20-01-2022 дата публикации

Wireless audio testing

Номер: US20220020370A1
Автор: Jonathan D. Hurwitz
Принадлежит: Google LLC

A method includes outputting, by a computing device, to a remote computing device, test audio data; determining, by the computing device, whether audio data detected by an audio input device includes the test audio data; and responsive to determining that the test audio data was not detected by the audio input device, temporarily refraining, by the computing device, from outputting advisory audio data indicating the audio input device is ready to receive a spoken audio command.

Подробнее
02-01-2020 дата публикации

METHOD AND SYSTEM FOR ACCURATE AUTOMATIC CALL TRACKING AND ANALYSIS

Номер: US20200007683A1
Принадлежит:

There is disclosed a method in a data processing system for automatically and accurately determining an outcome of a phone call by using signifiers or audibles as a way to increase accuracy without altering the flow of the conversation. The disclosed method can be used in any call center, such as a high volume call center used in the financial (banking), insurance, rental (hotel or car), ticket sales, and the like. The method comprises receiving voice data of a phone call; transmitting a response communication based on the voice data, wherein the response communication includes at least one audible or signifier; identifying, by at least one processor, the at least one audible or signifier in the response communication; and automatically determining the outcome of the phone call based on the audible or signifier in the response communication. A data processing system and a non-transitory computer-readable medium for storing instructions consistent with the described method are also disclosed. 1. A method in a data processing system for automatically determining an outcome of a phone call , the method comprising:receiving voice data of a phone call;transmitting a response communication based on the voice data, wherein the response communication includes at least one audible or signifier;identifying, by at least one processor, the at least one audible or signifier in the response communication; andautomatically determining the outcome of the phone call based on the audible or signifier in the response communication.2. The method of claim 1 , wherein the response communication is generated by a call operator.3. The method of claim 1 , wherein the at least one audible or signifier includes at least one word that is not commonly used in conversation.4. The method of claim 1 , wherein the at least one audible or signifier includes a plurality of words.5. The method of claim 1 , wherein identifying the at least one audible or signifier comprises:automatically transcribing ...

Подробнее
20-01-2022 дата публикации

METHOD, DEVICE, AND PROGRAM FOR CUSTOMIZING AND ACTIVATING A PERSONAL VIRTUAL ASSISTANT SYSTEM FOR MOTOR VEHICLES

Номер: US20220020374A1
Автор: BEN ABDELAZIZ Omar
Принадлежит:

A method for customizing and activating a personal virtual assistant (PVA) in a motor vehicle. The method includes: activating a PVA management system, determining a customized mode of use of the personal virtual assistant, and activating the customized mode of use of the personal virtual assistant. A device for carrying out the method is also included. 1. A method for customizing and activating a personal virtual assistant for a motor vehicle , comprising:at least one seat suitable for accommodating a user,at least one imaging device having a field of view configured to include said user,at least one central processing unit communicating at least with said at least one imaging device, activating a PVA management system, from said central processing unit,', 'determining a customized mode of use of the personal virtual assistant, and', 'activating said customized mode of use of the personal virtual assistant., 'the method comprising at least2. Method according to claim 1 , wherein said PVA management system is activated by performing:activation of said imaging device, anddetection of at least one event indicating the presence of said user.3. Method according to claim 2 , wherein said determination of a customized mode of use of said personal virtual assistant comprises:capture of at least one image by said at least imaging device,comparison of said at least one captured image with preexisting user image data, anddetermination, by said central processing unit, of a customized mode of use of the personal virtual assistant, based on the result of said comparison.4. Method according to claim 3 , wherein said customized mode of use comprises:a default operation if the result of said comparison is an absence of recognition of the user or a recognition of a user for whom the central processing unit cannot access a language preference for said PVA system or for the language of wake words,absence of operation if the result of said comparison is recognition of a user who does ...

Подробнее
20-01-2022 дата публикации

INTERRUPT FOR NOISE-CANCELLING AUDIO DEVICES

Номер: US20220020387A1
Принадлежит:

Implementations of the subject technology provide systems and methods for determining whether to interrupt a user of an audio device that is operating in a noise-cancelling mode of operation. For example, the user may desire to be interrupted by one or more pre-designated contacts that are identified at an associated electronic device as interrupt-authorized contacts, or by a person who speaks a designated keyword to the user. 1. A device of a first user , the device comprising:secure memory storing a plurality of contacts including contacts designated at the device as interrupt-authorized contacts for a peripheral device; and provide audio content to the peripheral device, the audio content to be played by the peripheral device in a first mode of operation of the peripheral device or to be played combined with noise cancelling content by the peripheral device in a second mode of operation of the peripheral device;', 'receive, from the peripheral device, information associated with a voice input received by the peripheral device from a person other than the first user during operation of the peripheral device in the second mode of operation;', 'determine, at least in part based on the information received from the peripheral device, whether the person is one of the interrupt-authorized contacts;', 'transmit an instruction to the peripheral device to switch from the second mode of operation to the first mode of operation if it is determined that the person is one of the interrupt-authorized contacts; and', 'transmit an instruction to the peripheral device to continue operation in the second mode of operation if it is determined that the person is not one of the interrupt-authorized contact., 'one or more processors configured to2. The device of claim 1 , wherein the interrupt-authorized contacts are not authorized users of the device or the peripheral device claim 1 , and wherein determining that the person is one of the interrupt-authorized contacts does not provide ...

Подробнее
12-01-2017 дата публикации

DEVICE AND METHOD FOR UNDERSTANDING USER INTENT

Номер: US20170011742A1
Принадлежит: Mitsubishi Electric Corporation

A voice recognizer generates plural voice recognition results from one input speech . For each of the voice recognition results, an intent understanding processor estimates an intent to thereby output one or more candidates of intent understanding results and scores of them. A weight calculator calculates standby weights using setting information of a control target apparatus. An intent understanding corrector corrects the scores of the candidates of intent understanding result, using the standby weights, to thereby calculate their final scores, and then selects one from among the candidates of intent understanding result, as an intent understanding result , on the basis of the final scores. 110-. (canceled)11. An intent understanding device , comprising:a voice recognizer that recognizes one speech spoken in a natural language by a user, to thereby generate plural voice recognition results of highly ranked recognition scores;a morphological analyzer that converts the respective voice recognition results into morpheme strings;an intent understanding processor that estimates an intent about the speech by the user on the basis of each of the morpheme strings, to thereby output from each one of the morpheme strings, one or more candidates of intent understanding result and scores indicative of degrees of likelihood of the candidates and generate the candidates of intent understanding result in descending order of likelihoods of the plural voice recognition results;a weight calculator that calculates respective weights for the candidates of intent understanding result; andan intent understanding corrector that corrects the scores of the candidates of intent understanding result, using the weights, to thereby calculate their final scores, and then selects the candidate of intent understanding result with the final score that satisfies a preset condition first, as the intent understanding result.12. The intent understanding device of claim 11 , wherein the weight ...

Подробнее
12-01-2017 дата публикации

In-Vehicle Device, Server Device, Information System, and Content Start Method

Номер: US20170011743A1
Принадлежит:

An in-vehicle device according to the present invention includes a voice input receiving unit that receives input information with a voice, an output control unit that transmits the input information to a predetermined external device, and then outputs a plurality of received content candidates according to order of a priority degree of the content candidates, and a content execution unit that executes the content candidate upon receipt of specification of any of the output content candidates. 1. An in-vehicle device comprising:a voice input receiving unit configured to receive input information with a voice;an output control unit configured to transmit the input information to a predetermined external device, and then output a plurality of received content candidates according to order of a priority degree of the content candidates; anda content execution unit configured to execute, upon receipt of specification of any of the output content candidates, the content candidate.2. The in-vehicle device according to claim 1 , whereinthe output control unit displays a display area of the content candidate larger as the order of the priority degree is higher.3. The in-vehicle device according to claim 1 , further comprising:a biological state information acquisition unit configured to acquire biological state information indicating a state of a biological body from an external biological information detection device, whereinthe output control unit transmits the biological state information to the external device.4. The in-vehicle device according to claim 1 , further comprising:an environment information acquisition unit configured to acquire environment information, whereinthe output control unit transmits the environment information to the external device.5. A server device comprising:a keyword storage unit configured to store a word in association with a word and content related to the word;a related word search unit configured to recognize, upon receipt of information ...

Подробнее
10-01-2019 дата публикации

SPEECH CONVERTER

Номер: US20190012313A1
Автор: Chen Yong
Принадлежит:

Disclosed is a speech converter for inter-translation between Chinese and the official language of the other party including a matching card for forming unidirectional matches of a word, a phrase a word group, a short sentence, a common expression and, a sentence expressing the same meaning. The matching cards include a word matching card, a phrase matching card, a word group matching card, a short sentence matching card, a common expression matching card and a sentence matching card. The present invention enables a user to translate what he/she wants to say into a language that a listener can understand immediately, so that the listener can hear and understand these words and answer immediately, and the answer is sent back in mandarin. 1. A speech converter for inter-translation between Chinese and the official language of the other party , comprising a matching card for forming unidirectional matches of a word , a phrase a word group , a short sentence , a common expression and a sentence expressing the same meaning; wherein the matching card comprise a word matching card , a phrase matching card , a word group matching card , a short sentence matching card , a common expression matching card and a sentence matching card;unidirectionality refers to the translation from Chinese to the official language of the other party or the translation from the official language of the other party to Chinese.2. The speech converter according to claim 1 , wherein the matching card is further used for generating corresponding words and phrases in the other party's official language expressing the same meaning in the Chinese words; the matching card is further used for generating corresponding word groups claim 1 , short sentences claim 1 , common expressions claim 1 , sentences in the other party's official language expressing the same meaning in the Chinese word groups claim 1 , short sentences claim 1 , common expressions claim 1 , sentences; the matching card is the ...

Подробнее
11-01-2018 дата публикации

Enhanced speech endpointing

Номер: US20180012591A1
Принадлежит: Google LLC

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.

Подробнее
11-01-2018 дата публикации

CALL MANAGEMENT SYSTEM AND ITS SPEECH RECOGNITION CONTROL METHOD

Номер: US20180012600A1
Автор: ARIKAWA Hidehisa
Принадлежит:

A speech recognition server has a speech recognition engine, and a mode control table to hold a speech recognition mode for each call. The speech recognition engine has a mode management unit to designate a speech recognition mode for a decoder, and an output analysis unit to analyze recognition result data speech-to-text converted by speech recognition. The output analysis unit designates the speech recognition mode for the mode management unit in accordance with result of analysis of the recognition result data speech-to-text converted by the speech recognition. The mode management unit designates the speech recognition mode for the decoder in accordance with the designation with the output analysis unit. Upon speech recognition of call data, it is possible to suppress hardware resource consumption while improve users' satisfaction. 1. A call management system for recording a call from an external line , performing speech-recognition , speech-to-text converting the call , and displaying text , comprising:a speech recognition server that performs speech recognition on recording data of call data, and outputs text data,wherein the speech recognition server has:a speech recognition engine that performs speech recognition on audio data and speech-to-text converts the data; anda mode control table that holds a speech recognition mode for each call,and wherein the speech recognition engine has:a mode management unit that designates a speech recognition mode for a decoder; andan output analysis unit that analyzes recognition result data, speech-to-text converted by the speech recognition,further wherein the output analysis unit designates the speech recognition mode for the mode management unit, based on the result of analysis of the recognition result data speech-to-text converted by the speech recognition,further wherein the mode management unit rewrites the speech recognition mode held in the mode control table based on the designation with the output analysis unit, ...

Подробнее
11-01-2018 дата публикации

SPEAKER VERIFICATION USING CO-LOCATION INFORMATION

Номер: US20180012604A1
Принадлежит:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score. 1. (canceled)2. A system comprising: receiving, while the first computing device is in a locked state and is co-located with a second computing device that is configured to respond to voice commands that are preceded by a predefined hotword, audio data that corresponds to an utterance of a voice command that is preceded by the particular, predefined hotword;', 'while the first computing device is in the locked state, transmitting a message that includes speaker verification data to a server that receives speaker verification data from multiple co-located devices, including the first computing device and the second computing device, and uses the received data to determine a speaker of a corresponding one of the multiple co-located devices that likely spoke the voice command; and', 'after transmitting the message, determining to remain in the locked state and not respond to the voice command despite receiving the audio data that corresponds to the utterance of the voice command that is preceded by the particular, predefined hotword., 'a first computing device that is configured to respond to voice commands while in a locked state upon receipt of a ...

Подробнее
11-01-2018 дата публикации

RECORDING SYSTEM FOR GENERATING A TRANSCRIPT OF A DIALOGUE

Номер: US20180012619A1
Автор: CALLAN James, RYAN Stephen
Принадлежит: ECONIQ LIMITED

A recording system has a listener processor for automatically capturing events involving computer applications during a dialogue involving the user of the computer. The system generates a visual transcript of events on a timeline. It automatically detects start of a dialogue and proceeds to detect events and determines if they are configured as transcript events, before detecting end of the dialogue. The system may associate dialogue events with audio clips, using meta tags. 152.-. (canceled)53. A recording system comprising:a listener processor configured to automatically monitor and capture events in a user computer as applications are used during a dialogue between the user and another party, anda server configured to automatically save events captured by the listener processor, wherein, said events including user computer screen transitions, data inputted by the user, event times, dialogue start events, and dialogue end events, andwherein the system comprises a transcript generator configured to apply each captured event to a timeline, and to generate a visual transcript representing the events on the timeline; detect an event and to then automatically determine if it is in a category of events to capture; and to', monitoring the computer operating system to detect low level mouse and keyboard events generated by the operating system when a user interacts with it,', 'interfacing with application programming language of the user computer to detect higher level mouse and keyboard events coming from the computer applications, and', 'optical character recognition to extract data from a computer application;, 'extract data from a user computer by performing one or more of], 'wherein the listener processor is configured towherein the listener processor or the server are configured to automatically determine a meaning of each captured event by performing a look-up to a table associating event identifiers and associated meanings.55. The recording system as claimed in ...

Подробнее
14-01-2021 дата публикации

CONTEXTUAL INFORMATION INSERTION IN CONTEXT WITH CONTENT

Номер: US20210012390A1
Принадлежит:

Context associated with content may be received. An additional content may be generated for insertion into the content. The additional content can be created to be within the context associated with the content and based on a likely responsiveness of the user to the additional content, the additional content referring to an item. The additional content preserves continuity and/or semantics in the context of the content. 1. A computer-implemented method comprising:receiving context associated with content for presenting to a user;creating an additional content to insert into the content, the additional content created to be inserted within the context associated with the content and based on a likely responsiveness of the user to the additional content, the additional content referring to an item,wherein the additional content preserves continuity in the context of the content.2. The computer-implemented method of claim 1 , wherein the item includes at least a product.3. The computer-implemented method of claim 1 , wherein the item includes at least a service.4. The computer-implemented method of claim 1 , wherein the additional content is created based at least on the context associated with the content and a characteristic associated with the user.5. The computer-implemented method of claim 1 , wherein the content includes at least an audio content claim 1 , and the method further comprises causing a change in cadence of reading the audio content during a period of time the additional content is read.6. The computer-implemented method of claim 1 , wherein the content includes at least a visual content claim 1 , and the method further comprises causing the additional content to be highlighted from rest of the content in presenting the visual content.7. The computer-implemented method of claim 1 , further comprising identifying a location in the content to insert the additional content claim 1 , the location identified based at least on the likely responsiveness of ...

Подробнее
10-01-2019 дата публикации

SPEAKER COMMAND AND KEY PHRASE MANAGEMENT FOR MULI -VIRTUAL ASSISTANT SYSTEMS

Номер: US20190013019A1
Автор: Lawrence Sean J.
Принадлежит:

Systems, apparatuses and methods are described for automatically managing a plurality of virtual assistants that may be simultaneously available on the same device and wherein each assistant may be preferred for a particular task. Selected assistants may be activated by substituting their key phrase when another was actually uttered. 1. An apparatus , comprising:a smart device;a microphone communicatively connected to the smart device to listen for utterances;at least a first virtual assistant and a second virtual assistant accessible by the smart device, the first virtual assistant to respond to an utterance of a first key phrase and the second virtual assistant to respond to a second key phrase, where the first key phrase is different from the second key phrase; andan abstraction layer circuit responsive to an utterance of a third key phrase, the abstraction layer circuit to replace the third key phrase with one of the first key phrase or the second key phrase and to communicate it to the first virtual assistant and the second virtual assistant.2. The apparatus as recited in claim 1 , further comprising:a natural language processing circuit to analyze utterances for intent; anda rules circuit to store rules to select one of the first virtual assistant or second virtual assistant based on the intent.3. The apparatus as recited in claim 2 , further comprising:a user preference circuit where a user defines rules.4. The apparatus as recited in claim 2 , wherein the intent comprises one of a task to be carried out locally or to be carried out via a cloud connection.5. The apparatus as recited in claim 1 , wherein an utterance containing the first key phrase or the second key phrase is unchanged by the abstraction layer.6. The apparatus as recited in claim 1 , wherein the abstraction layer further comprises:a database including key phrase utterances for all available virtual assistants.7. A method claim 1 , comprising:providing at least a first virtual assistant and a ...

Подробнее
10-01-2019 дата публикации

Voice wakeup method, apparatus and system, cloud server and readable medium

Номер: US20190013021A1

The present disclosure provides a voice wakeup method, apparatus and system, a cloud server and a readable medium. The method comprises: receiving a first request input by the user in a voice manner after a smart device enters a voice recognition phase; judging whether the first request after semantic parsing is consistent with pre-stored wakeup words; determining that the received first request is used to re-wake up the smart device if the first request is consistent with the pre-stored wakeup words. According to the above technical solutions, after the smart device enters the voice recognition phase, if the received first request is the wakeup words, the smart device does not perform service satisfaction processing for the first request. Capability of recognizing that the first request is re-wakeup does not cause a command recognition error and thereby can satisfy the user's expectation. The voice wakeup solution according to the present disclosure can effectively process the user's request and thereby can effectively improve the user's experience in use.

Подробнее
09-01-2020 дата публикации

Image annotating method and electronic device

Номер: US20200012888A1

An image annotating method includes: acquiring an image collected at a terminal; acquiring voice information associated with the image; annotating the image according to the voice information; and storing an annotated result of the image.

Подробнее
10-01-2019 дата публикации

ANALOG VOICE ACTIVITY DETECTION

Номер: US20190013039A1
Принадлежит:

According to some embodiments, an analog processing portion may receive an audio signal from a microphone. The analog processing portion may then convert the audio signal into sub-band signals and estimate an energy statistic value, such as a Signal-to-Noise Ratio (“SNR”) value, for each sub-band signal. A classification element may classify the estimated energy statistic values with analog processing such that a wakeup signal is generated when voice activity is detected. The wakeup signal may be associated with, for example, a battery-powered, always-listening audio application. 1. A voice activity detection device , comprising:an analog processing portion to receive an audio signal from a microphone, convert the audio signal into sub-band signals, and estimate an energy statistic value for each sub-band signal; anda classification element to classify the estimated energy statistic values with analog processing such that a wakeup signal is generated when voice activity is detected, wherein lookback information is provided when the wakeup signal is generated.2. The device of claim 1 , wherein the lookback information is associated with at least one of: (i) extracted noise estimates prior to voice activity claim 1 , (ii) an audio spectrum when the wakeup signal is generated claim 1 , (iii) a free-running buffer claim 1 , (iv) a threshold detector to trigger a buffer claim 1 , (v) an acoustic activity detector to trigger a buffer claim 1 , and (vi) a periodic sampling of a subset of voice activity.3. The device of claim 1 , wherein the estimated energy statistic value is a Signal-to-Noise Ratio (“SNR”) value.4. The device of claim 1 , wherein the wakeup signal is provided to an audio processor which detects a key phrase in the audio signal and claim 1 , as a result claim 1 , wakes up an application processor.5. The device of claim 1 , wherein the wakeup signal is provided to a beamforming microphone array and digital processor to initiate high-performance audio ...

Подробнее
14-01-2021 дата публикации

VOICE-OPERATED SYSTEM, CONTROLLER, COMPUTER-READABLE RECORDING MEDIUM, AND PROCESSING DEVICE

Номер: US20210012771A1
Автор: Tajima Hiroki
Принадлежит: KONICA MINOLTA, INC.

A voice-operated system includes a processing device, and a controller that communicates with the processing device. The processing device includes a first processor to perform: displaying, on an operating panel, an operation screen for instructing a process for execution by the processing device, to receive user's instruction; and executing a process corresponding to a command received from the controller. The controller includes a second processor to perform: generating the command for the processing device based on an input voice; and transmitting the command to the processing device. The generation of the command includes, when the voice instructs the processing device to execute a first process, generating a first command for instructing the operating panel to display the operation screen for instructing execution of the first process; and when the voice instructs the processing device to execute a second process, generating a second command for instructing execution of the second process. 1. A voice-operated system comprising:a processing device; anda controller that communicates with the processing device, displaying, on an operating panel, an operation screen for instructing a process to be executed by the processing device, and receiving an instruction from a user; and', 'executing a process corresponding to a command received from the controller,, 'the processing device including a first processor to perform generating the command for the processing device based on an input voice; and', 'transmitting the command to the processing device, wherein, 'the controller including a second processor to perform when the voice instructs the processing device to execute a first process, generating a first command for instructing the operating panel to display the operation screen for instructing execution of the first process, and', 'when the voice instructs the processing device to execute a second process, generating a second command for instructing execution of the ...

Подробнее
14-01-2021 дата публикации

ELECTRONIC DEVICE AND VOICE RECOGNITION CONTROL METHOD OF ELECTRONIC DEVICE

Номер: US20210012775A1
Принадлежит:

According to one embodiment of the present invention, a server comprises at least one communication interface, at least one processor operatively connected to the communication interface, and at least one memory operatively connected to the processor, wherein the memory store instructions configured to, when executed, cause the processor: receives, from a first electronic device, first input voice data including a first request for conducting a first task by using a second electronic device by user's utterance; determines or receives a state of the first electronic device; and provides a first external electronic device with a first response related to control of the state of the first electronic device. Various other embodiments are possible. 1. A server , comprising:at least one communication interface;at least one processor operatively connected to the communication interface; andat least one memory operatively connected to the processor,wherein the memory stores instructions configured to, when executed, cause the processor to:receive, from a first electronic device, first input voice data including a first request for performing a first task by using a second electronic device by user utterance;determine or receive a state of the first electronic device; andprovide the first electronic device with a first response related to adjustment of the state of the first electronic device.2. The server of claim 1 , wherein the instructions are configured to cause the processor to:analyze the first request included in the first input voice data;based at least in part on the analysis result, determine whether there is a relation between the state of the first electronic device and an operation of performing the first task to be performed based on the first request, or receive a determination result of the relation from the first electronic device;if it is determined that there is the relation between the state of the first electronic device and the operation of performing ...

Подробнее
14-01-2021 дата публикации

Multi-User Personalization at a Voice Interface Device

Номер: US20210012779A1
Принадлежит:

A method at an electronic device with one or more microphones and a speaker includes receiving a first voice input; comparing the first voice input to one or more voice models; based on the comparing, determining whether the first voice input corresponds to any of a plurality of occupants, and according to the determination, authenticating an occupant and presenting a response, or restricting functionality of the electronic device. 1. A method , comprising: receiving a first voice input associated with a request for a service;', 'in response to receiving the first voice input, accessing one or more voice models each trained to a different one of the plurality of occupants;', 'comparing the first voice input to the one or more voice models;', 'based on the comparing, determining whether the first voice input corresponds to any of the plurality of occupants;', 'in accordance with a determination that the first voice input corresponds to a first occupant of the plurality of occupants, presenting at the electronic device a personalized response in accordance with a user account associated with the service; and', 'in accordance with a determination that the first voice input corresponds to none of the plurality of occupants: restricting functionality of the electronic device., 'at an electronic device with one or more microphones, a speaker, one or more processors, and memory storing one or more programs for execution by the one or more processors, the electronic device being located at a structure associated with a plurality of occupants and being configured to present information to the plurality of occupants2. The method of claim 1 , wherein the request is a request for a music service claim 1 , the personalized response is in accordance with a user account associated with a subscription to the music service claim 1 , and restricting functionality of the electronic device comprises presenting a generic response that does not involve a user account associated with a ...

Подробнее
09-01-2020 дата публикации

SPEECH WAKEUP METHOD, APPARATUS, AND ELECTRONIC DEVICE

Номер: US20200013390A1
Принадлежит:

A speech wakeup method, apparatus, and electronic device are disclosed in embodiments of this specification. The method includes: inputting speech data to a speech wakeup model trained with general speech data; and outputting, by the speech wakeup model, a result for determining whether to execute speech wakeup, wherein the speech wakeup model includes a Deep Neural Network (DNN) and a Connectionist Temporal Classifier (CTC). 1. A speech wakeup method , comprising:inputting speech data to a speech wakeup model trained with general speech data; andoutputting, by the speech wakeup model, a result for determining whether to execute speech wakeup, wherein the speech wakeup model includes a Deep Neural Network (DNN) and a Connectionist Temporal Classifier (CTC).2. The method of claim 1 , wherein the general speech data comprises a Large Vocabulary Continuous Speech Recognition (LVCSR) corpus.3. The method of claim 1 , further comprising:training the speech wakeup model with the general speech data, wherein the training includes:iteratively optimizing parameters in the speech wakeup model with the general speech data by means of an asynchronous stochastic gradient descent method until the training converges.4. The method of claim 3 , further comprising:acquiring keyword-specific speech data; andtraining the speech wakeup model with the keyword-specific speech data, wherein a learning rate used in the training is less than that used in the training of the speech wakeup model with the general speech data.5. The method of claim 3 , further comprising:cross-verifying the speech wakeup model with a verification data set in the training to determine whether the training converges.6. The method of claim 1 , wherein the outputting claim 1 , by the speech wakeup model claim 1 , a result for determining whether to execute speech wakeup comprises:extracting acoustic features from the input speech data;inputting the acoustic features to the DNN included in the speech wakeup model for ...

Подробнее
09-01-2020 дата публикации

METHOD AND APPARATUS FOR GENERATING INFORMATION

Номер: US20200013399A1
Принадлежит:

A method and an apparatus for generating information are provided. The method includes: determining, in response to receiving a first user sentence, whether a keyword of a preset first category is included in the first user sentence, the first category including at least one subcategory; determining, in response to determining the first category keyword being included in the first user sentence, the first category keyword included in the first user sentence as a first keyword, and determining a subcategory to which the first keyword belongs, to generate a first keyword set and a subcategory set; and selecting, based on the first keyword set and the subcategory set, a song list from a pre-generated song list set as a to-be-played song list, to generate a to-be-played song list set, the song list including at least one piece of audio and song list category information. 1. A method for generating information , comprising:determining, in response to receiving a first user sentence, whether a keyword of a preset first category is included in the first user sentence, the first category including at least one subcategory;determining, in response to determining the first category keyword being included in the first user sentence, the first category keyword included in the first user sentence as a first keyword, and determining a subcategory to which the first keyword belongs, to generate a first keyword set and a subcategory set; andselecting, based on the first keyword set and the subcategory set, a song list from a pre generated song list set as a to-be-played song list, to generate a to-be-played song list set, the song list including at least one piece of audio and song list category information.2. The method according to claim 1 , wherein the selecting claim 1 , based on the first keyword set and the subcategory set claim 1 , a song list from a pre-generated song list set as a to-be-played song list claim 1 , to generate a to-be-played song list set comprises: ...

Подробнее
09-01-2020 дата публикации

INTERACTION METHOD AND APPARATUS

Номер: US20200013400A1
Принадлежит:

Embodiments of the present disclosure disclose an interaction method and apparatus. A specific embodiment of the method includes: generating, in response to determining that a request input by a user satisfies a guiding condition, guiding information, and feeding back the guiding information to the user, the guiding condition including one of the following: associating with a plurality of query intents, or associating with no query intent; and generating, based on the request and a feedback input by the user corresponding to the guiding information, an intent-clear request, and feeding back push information bound with the intent-clear request to the user. Realizing that in the process of interacting with the user, for conditions such as the request input by the user is associated with a plurality of query intents or incompleteness, an intent-clear request associated with an explicit query intent is determined through the interaction with the user. 1. An interaction method , the method comprising:generating, in response to determining that a request input by a user satisfies a guiding condition, guiding information, and feeding back the guiding information to the user, the guiding condition comprising one of: associating with a plurality of query intents, or associating with no query intent; andgenerating, based on the request and a feedback input by the user corresponding to the guiding information, an intent-clear request, and feeding back push information bound with the intent-clear request to the user.2. The method according to claim 1 , wherein the request comprises one of: an information query request claim 1 , a multimedia content request claim 1 , or a call request.3. The method according to claim 2 , wherein the guiding condition comprises: the associating with the plurality of query intents claim 2 , and the request comprises: an entity keyword claim 2 , and the guiding information is used to guide the user to select at least one resource type from resource ...

Подробнее
09-01-2020 дата публикации

Method and apparatus for recognizing a voice

Номер: US20200013407A1
Автор: Jong Hoon CHAE
Принадлежит: LG ELECTRONICS INC

Disclosed are a speech recognition method and a speech recognition device, in which speech recognition is performed by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm provided therein. According to one embodiment, the speech recognition method includes buffering a spoken utterance, extracting a standby wake-up word corresponding to a preset wake-up word from the spoken utterance by comparing the buffered spoken utterance to the preset wake-up word, analyzing the role of the standby wake-up word in the spoken utterance, determining the speech intent in uttering the standby wake-up word by using results of analyzing the role of the standby wake-up word, and determining whether to execute a spoken sentence as a voice command in the spoken utterance and processing the spoken sentence accordingly.

Подробнее
09-01-2020 дата публикации

Speaker Verification Using Co-Location Information

Номер: US20200013412A1
Принадлежит: Google LLC

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score. 1. A method comprising: receiving, at data processing hardware of a server in communication with the user device, one or more sample utterances from the corresponding user of the plurality of different users during an enrollment process; and', 'generating, by the data processing hardware, corresponding speaker verification data for the corresponding user of the plurality of different users of the user device based on the one or more sample utterances received from the corresponding user of the plurality of different users of the user device;, 'for each user of a plurality of different users of a user devicereceiving, at the data processing hardware, audio data corresponding to an utterance captured by the user device having the plurality of different users;determining, by the data processing hardware, a speaker of the utterance from one of the plurality of different users of the user device based on a comparison between the received audio data and the corresponding speaker verification data generated for each user of the plurality of different users of the user device; andprocessing, by the data processing hardware, the audio data corresponding to ...

Подробнее
09-01-2020 дата публикации

RETROACTIVE SOUND IDENTIFICATION SYSTEM

Номер: US20200013427A1
Принадлежит:

A method for identifying at least one characteristic of a sound-producing object includes storing, in a memory, audio data acquired from an auditory environment via at least one microphone; receiving an input indicating a user request to identify a characteristic of a sound-producing object included in the auditory environment; determining, via a processor and based on a portion of the audio data acquired from the auditory environment prior to the user request, the characteristic of the sound-producing object; and causing information corresponding to the characteristic of the sound-producing object to be output via at least one output device. 1. A method for identifying a sound-producing object , the method comprising:storing, in a memory, audio data acquired from an auditory environment via at least one microphone;receiving an input indicating a user request to identify the sound-producing object included in the auditory environment;generating, via a processor and based on a portion of the audio data acquired from the auditory environment prior to the user request, a first determination that identifies the sound-producing object, wherein the first determination has a first confidence measure; andcausing, when the first confidence measure is greater than a confidence threshold, information corresponding to a characteristic of the sound-producing object to be output via at least one output device.2. The method of claim 1 , further comprising discarding claim 1 , from the memory claim 1 , audio data acquired from the auditory environment that is older than a threshold duration of time.3. The method of claim 2 , wherein the memory comprises a circular buffer.4. The method of claim 3 , wherein storing claim 3 , in the memory claim 3 , audio data acquired from the auditory environment comprises storing a first portion of the audio data in a first portion of the circular buffer and a second portion of the audio data in a second portion of the circular buffer.5. (canceled) ...

Подробнее
10-01-2019 дата публикации

LAST MILE EQUALIZATION

Номер: US20190013788A1
Принадлежит:

An audio playback system performs a method including identifying a first type of audio included in a first audio stream, tagging the first audio stream with a first digital tag corresponding to the first type of audio, identifying a second type of audio included in a second audio stream, tagging the second audio stream with a second digital tag corresponding to the second type of audio, rendering the first audio stream with a first equalization profile applied thereto, the first equalization profile selected responsive to the audio playback system detecting the first digital tag in the first audio stream, and rendering the second audio stream with a second equalization profile different than the first equalization profile applied thereto, the second equalization profile selected responsive to the audio playback system detecting the second digital tag in the second audio stream. 1. An audio playback system including a processor and associated programming , the programming , when executed on the processor , causing the audio playback system to perform a method comprising:identifying a first type of audio included in a first audio stream;tagging the first audio stream with a first digital tag corresponding to the first type of audio;identifying a second type of audio included in a second audio stream;tagging the second audio stream with a second digital tag corresponding to the second type of audio;identifying an audio chime included in a third audio stream; andtagging the third audio stream with a third digital tag corresponding to the audio chime;wherein the audio playback system includes a master streaming audio player and at least one slave streaming audio player, the at least one slave streaming audio player configured to render the first audio stream with a first equalization profile applied thereto, the first equalization profile selected responsive to the at least one slave streaming audio player detecting the first digital tag in the first audio stream, to ...

Подробнее
14-01-2016 дата публикации

Telecommunication call management and monitoring system with voiceprint verification

Номер: US20160014270A1
Автор: Stephen Lee Hodge
Принадлежит: Global Tel Link Corp

A secure telephone call management system is provided for authenticating users of a telephone system in an institutional facility. Authentication of the users of the telephone call management system is accomplished by using a personal identification number, biometric means, and/or radio frequency means. The secure telephone call management system includes accounting software capable of limiting access to the system based on funds in a user's account, and includes management software capable of implementing widespread or local changes to the system. The system monitors a conversation in the telephone call to detect a presence of a first characteristic in audio of the conversation, and terminates the telephone call if the first characteristic does not match a second characteristic of biometric information of a user or a called party.

Подробнее
11-01-2018 дата публикации

Selective Sound Field Environment Processing System and Method

Номер: US20180014107A1
Принадлежит: BRAGI GmbH

A method for selective sound field environment processing for one or more wireless earpieces, each of the one or more wireless earpieces including comprising a plurality of microphones is provided. The method includes receiving audio input of a sound field environment from the plurality of microphones of each of the one or more wireless earpieces, identifying background noise in the sound field environment, identifying audio from at least one sound source within the sound field environment, processing the audio input using one or more processors of at least one of the wireless earpieces to reduce the background noise and processing the audio input using one or more processors of the at least one of the wireless earpieces to enhance the audio from the at least one sound source within the sound field environment to provide processed audio, and transducing the processed audio at one or more speakers of the one or more wireless earpieces. 1. A method for selective sound field environment processing for one or more wireless earpieces , each of the one or more wireless earpieces comprising a plurality of microphones , the method comprising:receiving audio input of a sound field environment from the plurality of microphones of each of the one or more wireless earpieces;identifying background noise in the sound field environment;identifying audio from at least one sound source within the sound field environment;processing the audio input using one or more processors of at least one of the wireless earpieces to reduce the background noise and processing the audio input using one or more processors of the at least one of the wireless earpieces to enhance the audio from the at least one sound source within the sound field environment to provide processed audio;transducing the processed audio at one or more speakers of the one or more wireless earpieces.2. The method of wherein the processing the audio input using one or more processors of at least one of the wireless earpieces ...

Подробнее
10-01-2019 дата публикации

METHOD AND APPARATUS FOR SPONSORED MESSAGING

Номер: US20190014369A1
Автор: Maria Arturo
Принадлежит: AT&T MOBILITY II LLC

Aspects of the subject disclosure may include, for example, a method including monitoring a communication network managed by a service provider to detect transmission of content to a communication device through the communication network, analyzing content information associated with the content that is transmitted to the communication device to identify a feature of the content, capturing a user-generated comment that is transmitted from the communication device through the communication network, analyzing the user-generated comment that is captured to detect an occurrence of a keyword that is associated with the feature of the content, determining whether the keyword in the user-generated comment was temporally associated with the feature of the content transmitted to the communication device to identify an occurrence of a trigger event associating the keyword with the feature of the content, and responsive to identification of the occurrence of the trigger event, transmitting a message associated with the trigger event to the communication device. Other embodiments are disclosed. 1. A machine-readable storage medium , comprising executable instructions that , when executed by a processing system including a processor , facilitate performance of operations , comprising:identifying a feature of content that is streamed to a communication device through a communication network;accessing a user-generated comment transmitted from the communication device through the communication network;determining whether the user-generated comment includes a keyword;determining whether the keyword in the user-generated comment was temporally associated with the feature of the content streamed to the communication device to identify an occurrence of a trigger event; andtransmitting a message associated with the trigger event to the communication device responsive to identifying the occurrence of the trigger event.2. The machine-readable storage medium of claim 1 , wherein the ...

Подробнее
10-01-2019 дата публикации

PROVIDING INTERACTIVE MULTIMEDIA SERVICES

Номер: US20190014382A1
Принадлежит:

A method for presenting interactive multimedia programs includes selecting a multimedia program having a plurality of segments, classifying the multimedia program and the plurality of segments according to content, and generating a plurality of additional content items based on the classification, each of which is associated with at least one of the plurality of segments. The method further includes generating a plurality of pre-defined queries for each of the plurality of segments based on a content of the segment and at least one speech recognition grammar based on the plurality of pre-defined queries. The plurality of additional content items are retrievable responsive to the plurality of pre-defined queries. The method further includes generating a data stream including the at least one speech recognition grammar and all of the plurality of additional content items and the plurality of pre-defined queries arranged according to an order of the plurality of segments. 1. A method comprising: selecting a multimedia program for presentation, the multimedia program having a plurality of segments;', 'classifying each segment of the plurality of segments according to associated content;', 'populating a data entry database via respective data entry templates retrieved based on the classification of each segment of the plurality of segments;', 'generating a data stream via the data entry database, wherein a timing of the data stream is configured to coincide with a timing of the plurality of segments upon presentation by a multimedia system;', 'generating at least one query grammar, wherein the at least one query grammar is configured to interpret commands to access content in the data stream;', 'simultaneously transmitting, to the multimedia system, the content associated with at least one of the plurality of segments, one or more of the at least one query grammar, and the data stream;', 'determining whether a detected input query is valid according to the at least one ...

Подробнее
03-02-2022 дата публикации

SYSTEMS AND METHODS FOR LEVERAGING METADATA FOR CROSS PRODUCT PLAYLIST ADDITION VIA VOICE CONTROL

Номер: US20220035859A1
Принадлежит:

Systems and methods for generating a playlist of audio content for a vehicle are disclosed. An audio input is received at a vehicle entertainment system of the vehicle while an audio content item is currently played by the vehicle entertainment system. The audio input includes an audio command trigger and an audio playlist command. In response to detecting the audio command trigger in the audio input, the audio input is parsed to determine the audio playlist command. A metadata associated with the audio content item is determined. In response to determining the audio playlist command, the audio content item is caused to be added to an audio content playlist of a third-party service based on the metadata of the audio content item. 1. A method of generating a playlist of audio content for a vehicle comprising:receiving an audio input at a vehicle entertainment system of the vehicle while an audio content item is currently played by the vehicle entertainment system, wherein the audio input includes an audio command trigger and an audio playlist command;in response to detecting the audio command trigger in the audio input, parsing the audio input to determine the audio playlist command;determining a metadata associated with the audio content item of the audio content; andin response to determining the audio playlist command, causing the audio content to be added to an audio content playlist of a third-party service based on the metadata of the audio content item.2. The method of claim 1 , wherein the audio content is internet-sourced or terrestrially sourced.3. The method of claim 1 , wherein the third-party service is identified in the audio playlist command and wherein the metadata comprises an audio content identifier that identifies the audio content to the vehicle entertainment system claim 1 , the method further comprising translating the audio content identifier in the metadata to a third-party audio content identifier based on a protocol claim 1 , wherein the ...

Подробнее
21-01-2021 дата публикации

ROBOT AND METHOD FOR RECOGNIZING WAKE-UP WORD THEREOF

Номер: US20210016431A1
Автор: KIM Namgeon
Принадлежит: LG ELECTRONICS INC.

Provided is a robot including a microphone configured to acquire a sound signal corresponding to a sound generated near the robot, a camera, an output interface including at least one of a display configured to output a wake-up screen or a speaker configured to output a wake-up sound when the robot wakes up, and a processor configured to recognize whether the acquired sound includes a voice of a person, activate the camera when the sound includes a voice of a person, recognize whether a person is present in an image acquired by the activated camera, set a wake-up word recognition sensitivity based on a recognition result as to whether a person is present, and recognize whether a wake-up word is included voice data of a user acquired through the microphone based on the set wake-up word recognition sensitivity. 1. A robot comprising:a microphone configured to acquire a sound signal corresponding to a sound generated near the robot;a camera;an output interface including at least one of a display configured to output a wake-up screen or a speaker configured to output a wake-up sound when the robot wakes up; anda processor configured to:recognize whether the acquired sound includes a voice of a person,activate the camera when the sound includes a voice of a person,recognize whether a person is present in an image acquired by the activated camera,set a wake-up word recognition sensitivity based on a recognition result as to whether a person is present, andrecognize whether a wake-up word is included voice data of a user acquired through the microphone based on the set wake-up word recognition sensitivity.2. The robot of claim 1 , wherein the processor is configured to set the wake-up word recognition sensitivity to a first level when a person is recognized from the acquired image.3. The robot of claim 2 , wherein the processor is configured to set the wake-up word recognition sensitivity to be higher than the first level when people equal to or larger than a reference ...

Подробнее
03-02-2022 дата публикации

In-Call Experience Enhancement for Assistant Systems

Номер: US20220036013A1
Принадлежит:

In one embodiment, a method includes establishing a video call between a plurality of client systems, wherein access to an assistant system is persistently maintained during the video call, receiving, from a first client system of the plurality of client systems, a request by a first user to be performed by the assistant system during the video call, wherein the request references one or more activities associated with one or more users associated with the plurality of client systems, analyzing, by a context engine of the assistant system, images of a scene of the video call to identify the one or more activities within the scene, instructing the assistant system to execute the request based on the identified one or more activities, and sending, to one or more of the plurality of client systems, a response to the request while maintaining the video call between the plurality of client systems. 1. A method comprising , by one or more computing systems:establishing a video call between a plurality of client systems, wherein access to an assistant system is persistently maintained during the video call;receiving, from a first client system of the plurality of client systems, a request by a first user to be performed by the assistant system during the video call, wherein the request references one or more activities associated with one or more users associated with the plurality of client systems;analyzing, by a context engine of the assistant system, images of a scene of the video call to identify the one or more activities within the scene;instructing the assistant system to execute the request based on the identified one or more activities; andsending, to one or more of the plurality of client systems, a response to the request while maintaining the video call between the plurality of client systems.2. The method of claim 1 , wherein the request by the first user further references an instruction to perform a virtual activity with respect to one or more of the ...

Подробнее
03-02-2022 дата публикации

SYSTEM AND METHOD FOR INTERVIEW TRAINING WITH TIME-MATCHED FEEDBACK

Номер: US20220036315A1
Принадлежит:

The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation. 1. A method comprising: ["receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts;", 'receiving, via the one or more input devices, a user selection of one or more filters from a plurality of predefined filters; and', 'automatically removing, from the media data, one or more portions of the media data corresponding to the selected one or more filters to generate a filtered version of the media data., 'at an electronic device that is in communication with a display and one or more input devices2. The method of claim 1 , wherein the plurality of filters comprises a filter of one or more predefined words.3. The method of claim 2 , wherein the one or more predefined words include: words to avoid claim 2 , clichés claim 2 , business clichés claim 2 , controversial words claim 2 , profanity claim 2 , personal words claim 2 , hesitation or stalling claim 2 , job-related words claim 2 , words that are overused claim 2 , positive things to say claim 2 , technical jargon words claim 2 , military jargon words claim 2 , academic jargon words claim 2 , or a combination thereof.4. The method of claim 1 , wherein the plurality of filters comprises a filter of ...

Подробнее
19-01-2017 дата публикации

INTEREST NOTIFICATION APPARATUS AND METHOD

Номер: US20170018272A1
Принадлежит: SAMSUNG ELECTRONICS CO., LTD.

An apparatus for notification of speech of interest to a user includes a voice analyzer configured to recognize speech, evaluate a relevance between a result of the speech recognition and a determined user's topic of interest, and determine whether to provide a notification; and an outputter configured to, in response to the voice analyzer determining to provide the notification, generate and output a notification message. 1. A speech notification apparatus , the apparatus comprising:a voice analyzer configured to recognize speech, evaluate a relevance between a result of the speech recognition and a determined user's topic of interest, and determine whether to provide a notification; andan outputter configured to, in response to the voice analyzer determining to provide the notification, generate and output a notification message.2. The apparatus of claim 1 , further comprising a topic-of-interest setter configured to determine a user's topic of interest.3. The apparatus of claim 2 , wherein the topic-of-interest setter is further configured to determine the user's topic of interest based on a user input or automatically determine the user's topic of interest by analyzing user's contextual information.4. The apparatus of claim 3 , wherein the contextual information comprises at least one of user's schedule stored in the apparatus or location data determined by the apparatus.5. The apparatus of claim 2 , wherein claim 2 , the topic-of-interest setter is further configured to learn the determined user's topic of interest based on the evaluated relevance.6. The apparatus of claim 2 , wherein the voice analyzer comprises a text generator configured to generate text in the recognizing of the speech and a text analyzer configured to evaluate a relevance between the generated text and the user's topic of interest.7. The apparatus of claim 6 , wherein the text analyzer is configured to apply at least one of support vector machine (SVM) classification or neural network ...

Подробнее
19-01-2017 дата публикации

Voice Control User Interface with Progressive Command Engagement

Номер: US20170018275A1
Принадлежит: Google Technology Holdings LLC

A method include placing a first processor in a sleep operating mode and running a second processor that is operative to wake the first processor from the sleep operating mode in response to a speech command phrase. The method includes identifying, by the second processor, a speech command phrase segment and performing a control operation in response to detecting the segment in detected speech. The control operation is performed while the first processor is maintained in the sleep operating mode.

Подробнее
19-01-2017 дата публикации

Voice Control User Interface with Progressive Command Engagement

Номер: US20170018276A1
Принадлежит:

A method include placing a first processor in a sleep operating mode and running a second processor that is operative to wake the first processor from the sleep operating mode in response to a speech command phrase. The method includes identifying, by the second processor, a speech command phrase segment and performing a control operation in response to detecting the segment in detected speech. The control operation is performed while the first processor is maintained in the sleep operating mode. 1. (canceled)2. A computer-implemented method comprising:determining, by a mobile device that is operating in a low power mode, that a user has likely uttered a particular voice command more than once within a predetermined period of time; andproviding, without exiting the low-power mode, a graphical user interface based on determining that the user has likely uttered the particular voice command more than once within the predetermined period of time.3. The method of claim 2 , wherein in the graphical user interface includes a recommendation to the user to speak subsequent voice commands louder than the likely particular voice command.4. The method of claim 2 , wherein the graphical user interface includes text of a full voice command associated with the particular voice command.5. The method of claim 2 , wherein the low-power mode maintains applications running on the mobile device in an inactive state.6. The method of claim 2 , comprising:determining that an additional predetermined period of time has elapsed; andbased on determining that the additional predetermined period of time has elapsed, deactivating, without exiting the low-power mode, a display of the mobile device.7. The method of claim 2 , comprising:determining, by the mobile device that is operating in the low power mode and while a display of the mobile device is in an off state, that the user has likely uttered an additional particular voice command more than once and less than a predetermined number of ...

Подробнее
03-02-2022 дата публикации

System architecture and embedded circuit to locate a lost portable device using voice command

Номер: US20220036881A1
Принадлежит: Aondevices Inc

Systems and methods are presented herein that include utilizing a specialized circuit architecture for the purpose of locating lost items through voice command recognition. The system can be customized with minimal, low power components to continuously monitor for a locating phrase and produce an audible alert when the locating phrase is detected. The system can be included in a stand-alone portable device that can be attached to electronic and non-electronic portable objects, or the system can be embedded with other circuitry into a portable electronic device. The system can be powered by battery such that it is practical for the device to function as a locating device, continuously monitoring for the locating phrase. Voice recognition can be performed completely within the system, providing the benefit of privacy.

Подробнее
03-02-2022 дата публикации

ELECTRONIC APPARATUS, SYSTEM AND METHOD FOR USING SPEECH RECOGNITION SERVICE

Номер: US20220036882A1
Принадлежит:

According to an embodiment, an electronic apparatus includes a plurality of service engines corresponding to a plurality of speech recognition services, respectively; a communication module connected via wireless communication to a plurality of service servers for providing the plurality of speech recognition services, respectively; a mike configured to receive speech of a user; and a service controller configured to store at least one wakeup word corresponding to each of the plurality of speech recognition services in advance, determine a speech recognition service corresponding to a wakeup word received by the mike from among the plurality of speech recognition services based on the plurality of wakeup words stored in advance, send a user command received by the mike to a service engine corresponding to the determined speech recognition service among the plurality of service engines. 1. An electronic apparatus comprising:a plurality of service engines corresponding to a plurality of speech recognition services, respectively;a communication module connected via wireless communication to a plurality of service servers for providing the plurality of speech recognition services, respectively;a mike configured to receive speech of a user; and store at least one wakeup word corresponding to each of the plurality of speech recognition services in advance,', 'determine a speech recognition service corresponding to a wakeup word received by the mike from among the plurality of speech recognition services based on the plurality of wakeup words stored in advance, and', 'send a user command received by the mike to a service engine corresponding to the determined speech recognition service among the plurality of service engines., 'a service controller configured to2. The electronic apparatus of claim 1 , wherein the plurality of service engines are configured to run a plurality of applications claim 1 , respectively claim 1 , to use the plurality of speech recognition services ...

Подробнее
03-02-2022 дата публикации

HYBRID VOICE COMMAND TECHNIQUE UTILIZING BOTH ON-DEVICE AND CLOUD RESOURCES

Номер: US20220036896A1
Принадлежит:

Systems and methods are presented for recognizing and responding to voice commands at a local system and selectively streaming audio to a network-based computing system to recognize voice commands when the user provides a specific voice command to stream to the network-based computing system and/or when the user provides a voice command that is not recognizable by the local system. 1. An electronic device , comprising:a non-transitory computer readable storage medium having information stored thereon, the information comprising a plurality of command phrases,wherein each of the plurality of command phrases is associated with a respective command instruction;a network transceiver; and receive an audio signal,', 'monitor the audio signal to determine a presence of at least one of the command phrases, and', 'provide the respective command instruction associated with the detected command phrase,', 'wherein a first command phrase of the plurality of command phrases is associated with a respective command instruction to cause the network transceiver to transmit the audio signal, and', 'wherein a second command phrase of the plurality of command phrases is associated with a respective command instruction configured to executed by the portable electronic device without transmission of the audio signal., 'an electrical circuit in communication with the storage medium and the network transceiver, the electrical circuit configured to'}2. (canceled)3. The electronic device of claim 1 , wherein the portable electronic device is an “always-listening” device.4. (canceled)5. (canceled)6. The electronic device of claim 1 ,wherein the information further comprises a wake word, andwherein the electrical circuit is further configured to monitor the audio signal to determine a presence of the wake word.7. The electronic device of claim 1 , wherein a third command phrase of the plurality of command phrases is associated with a respective command instruction to cause the network ...

Подробнее
18-01-2018 дата публикации

Language Merge

Номер: US20180018959A1
Принадлежит:

Systems and methods are described for processing and interpreting audible commands spoken in one or more languages. Speech recognition systems disclosed herein may be used as a stand-alone speech recognition system or comprise a portion of another content consumption system. A requesting user may provide audio input (e.g., command data) to the speech recognition system via a computing device to request an entertainment system to perform one or more operational commands. The speech recognition system may analyze the audio input across a variety of linguistic models, and may parse the audio input to identify a plurality of phrases and corresponding action classifiers. In some embodiments, the speech recognition system may utilize the action classifiers and other information to determine the one or more identified phrases that appropriately match the desired intent and operational command associated with the user's spoken command. 1. A method comprising:receiving, from an input device, command data corresponding to a spoken user command;determining, by a processor, a plurality of acoustic models to process the command data, wherein at least a first acoustic model of the plurality of acoustic models corresponds to a first language, and a second acoustic model of the plurality of acoustic models corresponds to a second language;generating, by the processor and based on the command data, a first acoustic transcript of the spoken user command using the first acoustic model, and a second acoustic transcript of the spoken user command using the second acoustic model;generating, from the first acoustic transcript, a first plurality of phrases;generating, from the second acoustic transcript, a second plurality of phrases;assigning a phrase classification for each phrase of the first plurality of phrases and the second plurality of phrases;determining, by the processor and based at least in part on the assigned phrase classifications, one or more match phrases, wherein each ...

Подробнее
18-01-2018 дата публикации

AUDIO SLICER AND TRANSCRIPTION GENERATOR

Номер: US20180018961A1
Принадлежит:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for combining audio data and a transcription of the audio data into a data structure are disclosed. In one aspect, a method includes the actions of receiving audio data that corresponds to an utterance. The actions include generating a transcription of the utterance. The actions include classifying a first portion of the transcription as a trigger term and a second portion as an object of the trigger term. The actions include determining that the trigger term matches trigger term for which a result of processing is to include both a transcription of an object and audio data of the object in a generated data structure. The actions include isolating the audio data of the object. The actions include generating a data structure that includes the transcription of the object and the audio data of the object. 1. A computer-implemented method comprising:receiving audio data that corresponds to an utterance;generating a transcription of the utterance;classifying a first portion of the transcription as a voice command trigger term and a second portion of the transcription as an object of the voice command trigger term;determining that the voice command trigger term matches a voice command trigger term for which a result of processing is to include both a transcription of an object of the voice command trigger term and audio data of the object of the voice command trigger term in a generated data structure;extracting, from the audio that corresponds to the utterance, audio data that corresponds to the second portion of the transcription classified as the object of the voice command trigger term; andgenerating a data structure that includes the second portion of the transcription classified as the object of the voice command trigger term and the extracted audio data that corresponds to the second portion of the transcription classified as the object of the voice command trigger ...

Подробнее
18-01-2018 дата публикации

Call Forwarding to Unavailable Party Based on Artificial Intelligence

Номер: US20180018969A1
Принадлежит:

A called party indicates that he or she is unavailable to receive a call. However, by way of a combination or any one of determining aspects of the who the caller is, where the caller is located, what he is speaking about, or the like as well as comparing this to prior calls, the call might be sent to a called party to be on the call. This can be by way of speech recognition of the call and creating a transcript and by receiving feedback from a called party about prior calls. 1. A method of conditionally forwarding a received phone call to a bidirectional transceiver associated with a called party , comprising the steps of:receiving said phone call at a network node, said phone call directed towards a called party;determining an identity of a calling party based on at least one of call identification information, voice recognition, and speech recognition;determining that said called party is unavailable;detecting urgency in a voice of said calling party based on content, as determined by speech recognition, of said phone call originating from said calling party;2. The method of claim 1 , comprising an additional step of forwarding said call to said bidirectional transceiver associated with said called party based on said detecting of urgency.3. The method of claim 2 , further comprising a step of transcribing into text said audio within said phone call originating from said calling party; andwherein said step of detecting urgency is based on a keyword within said text which has been pre-designated as a keyword which indicates said urgency.4. The method of claim 2 , wherein said step of detecting urgency is further based on a combination of tone and speed of speech above a pre-defined threshold indicating said urgency.5. The method of claim 1 , wherein:urgency is detected in said voice of said calling party and said call;a request from said calling party for the call to be sent to said calling party is denied based on said call identification information matching pre ...

Подробнее
18-01-2018 дата публикации

INTEGRATING THIRD-PARTY PROGRAMS WITH MESSAGING SYSTEMS

Номер: US20180018972A1
Принадлежит: Alcatel-Lucent USA Inc.

Systems and methods for invoking a third-party program as an additional participant of a chat session include monitoring events of a chat session of a messaging platform. At least one of the events of the chat session is determined to satisfy a triggering condition associated with a third-party program. In response to determining that the at least one of the events of the chat session satisfies the triggering condition associated with the third-party program, the third-party program is invoked as an additional participant of the chat session. 1. A method for invoking a third-party program as an additional participant of a chat session , comprising:monitoring events of a chat session of a messaging platform;determining that at least one of the events of the chat session satisfies a triggering condition associated with a third-party program; and invoking the third-party program as an additional participant of the chat session; and', 'sending data associated with the at least one of the events of the chat session to the third-party program for performing a function in the chat session., 'in response to determining that the at least one of the events of the chat session satisfies the triggering condition associated with the third-party program2. The method as recited in claim 1 , further comprising:registering the third-party program with the triggering condition.3. The method as recited in claim 2 , wherein: receiving the triggering condition from an entity associated with the third-party program; and', 'storing the association between the triggering condition and the third-party program in a registry; and, 'registering the third-party program with the triggering condition comprises retrieving the association between the triggering condition and the third-party program from the registry; and', 'comparing the at least one of the events with the triggering condition to determine that the at least one of the events of the chat session satisfies the triggering condition ...

Подробнее
21-01-2016 дата публикации

Telecommunication call management and monitoring system with voiceprint verification

Номер: US20160021243A1
Автор: Stephen Lee Hodge
Принадлежит: Global Tel Link Corp

A secure telephone call management system is provided for authenticating users of a telephone system in an institutional facility. Authentication of the users of the telephone call management system is accomplished by using a personal identification number, biometric means, and/or radio frequency means. The secure telephone call management system includes accounting software capable of limiting access to the system based on funds in a user's account, and includes management software capable of implementing widespread or local changes to the system. The system monitors a conversation in the telephone call to detect a presence of a first characteristic in audio of the conversation, and terminates the telephone call if the first characteristic does not match a second characteristic of biometric information of a user or a called party.

Подробнее