Настройки

Укажите год
-

Небесная энциклопедия

Космические корабли и станции, автоматические КА и методы их проектирования, бортовые комплексы управления, системы и средства жизнеобеспечения, особенности технологии производства ракетно-космических систем

Подробнее
-

Мониторинг СМИ

Мониторинг СМИ и социальных сетей. Сканирование интернета, новостных сайтов, специализированных контентных площадок на базе мессенджеров. Гибкие настройки фильтров и первоначальных источников.

Подробнее

Форма поиска

Поддерживает ввод нескольких поисковых фраз (по одной на строку). При поиске обеспечивает поддержку морфологии русского и английского языка
Ведите корректный номера.
Ведите корректный номера.
Ведите корректный номера.
Ведите корректный номера.
Укажите год
Укажите год

Применить Всего найдено 4420. Отображено 196.
13-09-2018 дата публикации

СПОСОБ И УСТРОЙСТВО КАТЕГОРИЗАЦИИ ВИДЕО

Номер: RU2667027C2
Принадлежит: СЯОМИ ИНК. (CN)

Изобретение относится к средствам категоризации видео. Технический результат заключается в улучшении точности категоризации видео. Получают ключевой кадр, содержащий лицо на видео. Получают черты лица в ключевом кадре. Получают одно или нескользкого черт лица, соответствующих одной или нескольким категориям изображений; в соответствии с чертой лица в ключевом кадре и одной или несколькими чертами лица, соответствующими одной или нескольким категориям изображений. Определяют категорию изображений, к которой принадлежит данное видео. Присваивают видео категории изображения, к которой принадлежит видео. 3 н. и 8 з.п. ф-лы, 9 ил.

Подробнее
03-09-2014 дата публикации

A method of video analysis

Номер: GB0201412846D0
Автор:
Принадлежит:

Подробнее
04-08-2021 дата публикации

A method of video analysis

Номер: GB2528330B
Принадлежит: UNIFAI HOLDINGS LTD, Unifai Holdings Limited

Подробнее
04-06-2020 дата публикации

SYSTEM AND METHOD FOR PROCESS SHAPING

Номер: CA3114140A1
Принадлежит:

A system for process shaping in a retail store environment comprises a video generation and processing component, a data source integration and aggregation component for aggregating and integrate information received from various sources, a process sensing component for generating one or more continuous processes, a process aggregator and weighing component for aggregating the one or more continuous processes into a merged weighted process, a proof of problem and value component for determining one or more process variations, a ripple effect analyser for sending one or more nudging messages to the retail store environment, and a gamified feedback algorithm component for communicating a nudging action corresponding to a nudging message, to one or more entities in the retail store environment.

Подробнее
25-12-2014 дата публикации

AUTOMATIC FACE DISCOVERY AND RECOGNITION FOR VIDEO CONTENT ANALYSIS

Номер: US20140375886A1
Принадлежит:

Systems and methods are provided for automatically classifying videos based on faces discovered in the videos, wherein the discovered faces are not known to be associated with a particular category of videos. The detected face is compared to a set of unknown faces to generate a cluster of unknown faces that each match with the detected face. A set of categorized videos is identified based on the cluster of unknown faces. One or more categories are assigned to the video based on categories from the set of categorized videos so that the video can be automatically classified based on the detected face even though the detected face is not associated with a known person.

Подробнее
24-06-2014 дата публикации

Generation and provision of media metadata

Номер: US0008763068B2

Various embodiments related to the generation and provision of media metadata are disclosed. For example, one disclosed embodiment provides a computing device having a logic subsystem configured to execute instructions, and a data holding subsystem comprising instructions stored thereon that are executable by the processor to receive an input of a video and/or audio content item, and to compare the content item to one or more object descriptors each representing an object for locating within the content item to locate instances of one or more of the objects in the content item. The instructions are further executable to generate metadata for each object located in the video content item, and to receive a validating user input related to whether the metadata generated for a selected object is correct.

Подробнее
12-01-2017 дата публикации

VIDEO PROCESSING SYSTEM

Номер: US20170013230A1
Принадлежит: NEC Corporation

A video processing system includes: an object movement information acquiring means for detecting a moving object moving in a plurality of segment regions from video data obtained by shooting a monitoring target area, and acquiring movement segment region information as object movement information, the movement segment region information representing segment regions where the detected moving object has moved; an object movement information and video data storing means for storing the object movement information in association with the video data corresponding to the object movement information; a retrieval condition inputting means for inputting a sequence of the segment regions as a retrieval condition; and a video data retrieving means for retrieving the object movement information in accordance with the retrieval condition and outputting video data stored in association with the retrieved object movement information, the object movement information being stored by the object movement information and video data storing means.

Подробнее
12-10-2017 дата публикации

METHOD FOR VIDEO INVESTIGATION

Номер: US20170294213A1
Принадлежит:

The invention provides a method for processing and analyzing forensic video data using a computer program, the method comprising the steps of recording the forensic video data; providing supplementary data related to the recorded video data, wherein the supplementary data may be provided from or input by a source external of the computer program, in particular a human, or wherein the supplementary data may be extracted from the forensic video data by the computer program in an initial analyzing step; analyzing the forensic video data by the computer program using the supplementary data; and displaying a part of the forensic video data, the displayed part being based on a result of the analyzing step.

Подробнее
27-08-2003 дата публикации

Method and apparatus for reviewing video

Номер: GB0000317317D0
Автор:
Принадлежит:

Подробнее
14-01-2015 дата публикации

Object detection metadata

Номер: GB0002513218B
Принадлежит: APPLE INC, APPLE INC.

Подробнее
26-01-2005 дата публикации

Method and apparatus for reviewing video

Номер: GB0002404299A
Принадлежит:

Video reviewing method including the steps of deriving one or more video segments from unedited video footage, displaying at least two video segments substantially concurrently and identifying video data of interest corresponding to at least one of said video segments. A spatial relationship between the display of at least two video segments may represent a temporal relationship between the content of the video segments. The invention aims to aide a video editing process for video review or summarisation by allowing interesting footage to be selected for the edit process.

Подробнее
23-02-2004 дата публикации

AUTOMATIC SOCCER VIDEO ANALYSIS AND SUMMARIZATION

Номер: AU2003265318A1
Принадлежит:

Подробнее
23-08-2012 дата публикации

FACIAL DETECTION, RECOGNITION AND BOOKMARKING IN VIDEOS

Номер: CA0002827611A1
Принадлежит:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for facial bookmarking in videos. In one aspect, methods include receiving a digital video comprising video data, processing the video data to detect features indicative of a human face in the digital video, determining, from the video data, a first frame, in which the features indicative of the human face are detected in the digital video, determining first timestamp data corresponding to the first frame, determining, from the video data, a second frame, in which the features indicative of the human face are detected in the digital video, determining second timestamp data corresponding to the second frame, generating an identifier corresponding to the human face, generating a data set including the identifier, the first timestamp data and the second timestamp data, and appending the data set to the video data to provide annotated video data.

Подробнее
25-05-2021 дата публикации

Emotive recognition and feedback system

Номер: US0011017239B2
Принадлежит: POSITIVE IQ, LLC, POSITIVE IQ LLC

This disclosure relates to an emotive recognition and feedback system that identifies emotive states of users within a video chat session and determines communication recommendations based on the interaction of the participants' emotive states. For instance, the emotive recognition and feedback system analyzes a video data to determine the participants' individual emotive states. Based on mapping the participants' individual emotive states, the emotive recognition and feedback system can determine a communication recommendation for the video chat. Additionally, the emotive response system provides graphical user interfaces that presents the communication recommendation to the video chat participants. The emotive recognition and feedback system can perform real-time analysis and feedback or delayed analytics of previously performed video chats.

Подробнее
23-08-2012 дата публикации

FACIAL DETECTION, RECOGNITION AND BOOKMARKING IN VIDEOS

Номер: US20120213490A1
Принадлежит: GOOGLE INC.

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for facial bookmarking in videos. In one aspect, methods include receiving a digital video comprising video data, processing the video data to detect features indicative of a human face in the digital video, determining, from the video data, a first frame, in which the features indicative of the human face are detected in the digital video, determining first timestamp data corresponding to the first frame, determining, from the video data, a second frame, in which the features indicative of the human face are detected in the digital video, determining second timestamp data corresponding to the second frame, generating an identifier corresponding to the human face, generating a data set including the identifier, the first timestamp data and the second timestamp data, and appending the data set to the video data to provide annotated video data.

Подробнее
29-08-2019 дата публикации

SYSTEM AND METHOD FOR GENERATING PROBABILISTIC PLAY ANALYSES FROM SPORTS VIDEOS

Номер: US20190267041A1
Принадлежит:

A computer-implemented method may include receiving at least three video clips of a sporting event, where each of the video clips may (i) be simultaneously captured over at least a portion of time, and (ii) include at least one common player wearing an indicia on a jersey that is distinguishing from indicia on other players. Tracking locations of the at least one common player captured in the at least three video clips may be generated by triangulating distances of the common player(s) in the video clips. Statistical information of the common player(s) may be generated from the tracking locations. The common player(s) may be represented on a graphical display. The common player(s) may be controlled by applying at least one of the tracking locations and statistical information of the common player(s).

Подробнее
31-01-2017 дата публикации

Method and apparatus for automated analysis and identification of a person in image and video content

Номер: US0009558397B2

A method, apparatus, and computer readable medium for identifying a person in an image includes an image analyzer. The image analyzer determines the content of an image such as a person, location, and object shown in the image. A person in the image may be identified based on the content and event data stored in a database. Event data includes information concerning events and related people, locations, and objects determined from other images and information. Identification metadata is generated and linked to each analyzed image and comprises information determined during image analysis. Tags for images are generated based on identification metadata. The event database can be queried to identify particular people, locations, objects, and events depending on a user's request.

Подробнее
28-11-2002 дата публикации

Surveillance recording device and method

Номер: US2002175997A1
Автор:
Принадлежит:

A surveillance recording device using cameras extracts facial images and whole body images of a person from images shot by the cameras. A height is calculated from the whole body images. Retrieval information, including a facial image (best shot), is associated with images in a recording medium and recorded into a database. The recorded data are utilized as an index for later retrieval from the recording medium. Facial images are displayed in a list of thumbnails to make it easy to retrieve a target person on a thumbnail screen. The images are displayed together with a moving image of a target person.

Подробнее
16-03-2017 дата публикации

INFORMATION PROCESSING APPARATUS, METHOD OF CONTROLLING THE SAME, AND STORAGE MEDIUM

Номер: US20170075993A1
Принадлежит:

This invention prevents a collation database from becoming bulky and shortening the delay time needed from person detection to registration in the collation database. An information processing apparatus comprises an acquiring unit which acquires a video, a detecting unit which detects a whole body or a part of a person from at least one frame of the acquired video, a tracking unit which tracks the whole body or the part of the person detected, and a registering unit which registers, in a database, a feature amount extracted from the whole body or the part of the person tracked during a first period from a timing of a start of tracking of the whole body or the part of the person by the tracking unit to a timing before an end of the tracking by the tracking unit.

Подробнее
12-07-2007 дата публикации

Image filing method, digital camera, image filing program and video recording player

Номер: US2007159533A1
Автор: AYAKI KENICHIRO
Принадлежит:

The user of a digital camera inputs an event title before shooting, so images shot under the same event title are stored in a group of image files. A face image extractor extracts face images from the respective image files, and characteristic values of the face images are calculated. In each image file group, the characteristic values of one face image are compared with other's, to judge those face images having similar characteristic values to be the same person's. Among the face images extracted from the image files of the same group, one of the most frequently appearing person's face images is determined to be a representative image. Data of the representative image is stored in association with the corresponding image file group, so the representative image and the event title may be displayed as an index to that image file group.

Подробнее
02-10-2013 дата публикации

Facial detection, recognition and bookmarking in videos

Номер: GB0201314656D0
Автор:
Принадлежит:

Подробнее
27-04-2022 дата публикации

Cognitive video and audio search aggregation

Номер: GB0002600281A
Принадлежит:

A method, computer program product, and a system where a processor(s) obtains a video from a user, via a client, and segments the video into temporal shots that comprise a timeline of the video. The processor(s) cognitively analyze the video, by applying an image recognition algorithm to identify image entities in each temporal shot of the video and by applying a data structure comprising a user profile of the user to the temporal shots, to identity personal entities in each temporal shot of the video. The program code generates a search index for the video, utilizing the user entities (image entities and personal entities), where each entry of the search index is a given user entity and a linkage to a given temporal shot and the linkage indicates a location of the given user entity in the timeline of the video.

Подробнее
17-09-2015 дата публикации

Facial detection, recognition and bookmarking in videos

Номер: AU2012217935B2
Принадлежит:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for facial bookmarking in videos. In one aspect, methods include receiving a digital video comprising video data, processing the video data to detect features indicative of a human face in the digital video, determining, from the video data, a first frame, in which the features indicative of the human face are detected in the digital video, determining first timestamp data corresponding to the first frame, determining, from the video data, a second frame, in which the features indicative of the human face are detected in the digital video, determining second timestamp data corresponding to the second frame, generating an identifier corresponding to the human face, generating a data set including the identifier, the first timestamp data and the second timestamp data, and appending the data set to the video data to provide annotated video data.

Подробнее
02-05-2013 дата публикации

Facial detection, recognition and bookmarking in videos

Номер: AU2012217935A1
Принадлежит:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for facial bookmarking in videos. In one aspect, methods include receiving a digital video comprising video data, processing the video data to detect features indicative of a human face in the digital video, determining, from the video data, a first frame, in which the features indicative of the human face are detected in the digital video, determining first timestamp data corresponding to the first frame, determining, from the video data, a second frame, in which the features indicative of the human face are detected in the digital video, determining second timestamp data corresponding to the second frame, generating an identifier corresponding to the human face, generating a data set including the identifier, the first timestamp data and the second timestamp data, and appending the data set to the video data to provide annotated video data.

Подробнее
09-09-2021 дата публикации

Identification of individuals in a digital file using media analysis techniques

Номер: AU2018324122B2
Принадлежит:

This description describes a system for identifying individuals within a digital file. The system accesses a digital file describing the movement of unidentified individuals and detects a face for an unidentified individual at a plurality of locations in the video. The system divides the digital file into a set of segments and detects a face of an unidentified individual by applying a detection algorithm to each segment. For each detected face, the system applies a recognition algorithm to extract feature vectors representative of the identity of the detected faces which are stored in computer memory. The system applies a recognition algorithm to query the extracted feature vectors for target individuals by matching unidentified individuals to target individuals, determining a confidence level describing the likelihood that the match is correct, and generating a report to be presented to a user of the system.

Подробнее
18-03-2021 дата публикации

SYSTEMS AND METHODS FOR SHARING DATA ASSETS VIA A COMPUTER-IMPLEMENTED DATA TRUST

Номер: CA3066816A1
Принадлежит:

Systems and methods for sharing data assets via a computer-implemented data trust are provided herein. The method includes creating, in response to a user input, a data trust domain. Creating the domain includes instantiating a private network. The network includes a plurality of domain nodes. The domain nodes include a data producer node and a data consumer node. The data asset is provided by the data producer node. The method also includes defining access rights for the data asset as between the data consumer node and the data producer node. The method also includes creating a data pathway object. The data pathway object specifies the access rights for the data asset. The flow of data within the data trust domain is controlled according to the data pathway object.

Подробнее
19-09-2017 дата публикации

Positional locating system and method

Номер: US9767351B2
Принадлежит: AVIDASPORTS LLC, AvidaSports, LLC

A method and system are disclosed for locating or otherwise generating positional information for an object, such as but not limited generating positional coordinates for an object attached to an athlete engaging in an athletic event. The positional coordinates may be processed with other telemetry and biometrical information to provide real-time performance metrics while the athlete engages in the athletic event.

Подробнее
23-01-2020 дата публикации

Focalized Behavioral Measurements in a Video Stream

Номер: US20200026926A1
Принадлежит: Ricoh Company, Ltd.

A system and method for analyzing behavior in a video is described. The method includes extracting a plurality of salient fragments of a video; generating a focalized visualization, based on a time anchor, from one or more of the plurality of salient fragments of the video; tagging a human subject in the focalized visualization with a unique identifier; and analyzing behavior of the human subject, using the focalized visualization, to generate a behavior score associated with the unique identifier and the time anchor. 1. A computer-implemented method comprising:extracting a plurality of salient fragments of a video;generating a focalized visualization, based on a time anchor, from one or more of the plurality of salient fragments of the video;tagging a human subject in the focalized visualization with a unique identifier; andanalyzing behavior of the human subject, using the focalized visualization, to generate a behavior score associated with the unique identifier and the time anchor.2. The computer-implemented method of claim 1 , further comprising:storing the behavior score as a record in a database using the unique identifier and the time anchor as attributes.3. The computer-implemented method of claim 2 , wherein the unique identifier and the time anchor form a tuple to be used as a database key.4. The computer-implemented method of claim 2 , further comprising:performing a query on the database based on selection criteria that is selected from the group consisting of date of record, unique identifier, time anchor, behavior score, minimum behavior score, maximum behavior score, and average behavior score.5. The computer-implemented method of claim 1 , wherein tagging the human subject comprises:detecting a face in a frame of the focalized visualization;identifying the face using a template of a known human subject, wherein the template is associated with the unique identifier; andassociating the face with the unique identifier.6. The computer-implemented method ...

Подробнее
25-10-2018 дата публикации

RECOGNITION, REIDENTIFICATION AND SECURITY ENHANCEMENTS USING AUTONOMOUS MACHINES

Номер: US20180307899A1
Принадлежит: Intel Corproation

A mechanism is described for facilitating recognition, reidentification, and security in machine learning at autonomous machines. A method of embodiments, as described herein, includes facilitating a camera to detect one or more objects within a physical vicinity, the one or more objects including a person, and the physical vicinity including a house, where detecting includes capturing one or more images of one or more portions of a body of the person. The method may further include extracting body features based on the one or more portions of the body, comparing the extracted body features with feature vectors stored at a database, and building a classification model based on the extracted body features over a period of time to facilitate recognition or reidentification of the person independent of facial recognition of the person.

Подробнее
23-06-2016 дата публикации

SYSTEMS AND METHODS FOR MANIPULATING ELECTRONIC CONTENT BASED ON SPEECH RECOGNITION

Номер: US20160182957A1
Принадлежит:

Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.

Подробнее
16-12-2014 дата публикации

Acoustic signal corrector and acoustic signal correcting method

Номер: US0008913834B2
Автор: Ken Hayase, HAYASE KEN

According to one embodiment, an electronic apparatus comprises (i) an image extraction module configured to extract representative images from a plurality of frames which constitute video content data, and to output time stamp information indicative of time points at which the extracted representative images appear, and (ii) an image list display process module configured to display a list of the extracted representative images on a two-dimensional display area. The area includes image display areas which are divided by columns, a plurality of time zones, and the image list display process module is configured to display, based on the time stamp information corresponding to each of the extracted representative images, the representative images, which belong to the time zone allocated to each column.

Подробнее
30-08-2012 дата публикации

METHOD AND APPARATUS FOR ANNOTATING MULTIMEDIA DATA IN A COMPUTER-AIDED MANNER

Номер: US20120219223A1
Принадлежит: Siemens Aktiengesellschaft

Annotation of a sequence of digitized images in multimedia data is aided by a computer analyzing the multimedia data to identify one or more objects and assigning each object to a respective role. The role assignment is determined by processing context information representing a model of the multimedia data.

Подробнее
01-09-2005 дата публикации

Apparatus and method for determining anchor shots

Номер: US2005190965A1
Принадлежит:

A method of and apparatus for determining anchor shots which can be used in indexing, summarizing, and browsing contents of video data. The method includes: extracting a plurality of basic shots from the video data according to a predetermined standard; selecting a plurality of anchor model candidate shots from the plurality of basic shots by applying a first standard to the plurality of basic shots; determining at least one anchor model shot by applying a second standard to the plurality of anchor model candidate shots; and determining at least one anchor shot by comparing the amount of similarity of the anchor model shots and the plurality of basic shots.

Подробнее
28-05-2019 дата публикации

Optimization processes for compressing media content

Номер: US0010303925B2
Принадлежит: Google LLC, GOOGLE LLC

Various embodiments relate generally to a system, a device and a method for optimizing processes for compressing media content. An uncompressed content item is received in a media content management system. One or more parameters associated with the uncompressed content item are determined. A plurality of variants of the uncompressed content item is generated using the one or more parameters, the plurality of variants including one or more compressed content items. A candidate set comprising at least one of the one or more compressed content items is determined from the plurality of variants based on one or more filtering factors. A validated compressed content item is selected from the candidate set based on one or more validation criteria, and the validated compressed content item is stored in a database in the media content management system.

Подробнее
21-10-2021 дата публикации

SEARCH METHOD AND DEVICE, AND STORAGE MEDIUM

Номер: US20210326383A1

A search method, a search device, a storage medium and a computer program. The search method includes: determining a first similarity between text and at least one video, the text being used for representing a search condition; determining a first character interaction graph of the text and a second character interaction graph of the at least one video; determining a second similarity between the first character interaction graph and the second character interaction graph; and according to the first similarity and the second similarity, determining a video matching the search condition from the at least one video.

Подробнее
24-06-2014 дата публикации

Storage apparatus and method, program, and playback apparatus and method

Номер: US0008762659B2

A storage apparatus and method, a program, and a playback apparatus and method, capable of quickly reading a specific part of data among metadata including metadata associated with faces. A storage controller controls storing face metadata in a storage medium, wherein the face metadata includes a content data set added for each content, content data storage location information indicating the storage location of the content data set, a detected face data set associated with each of face images detected from a content, and detected face data storage location information indicating the storage location of the detected face data set, and wherein the face metadata is configured such that the content data storage location information and face block storage location information indicating the storage location of the detected face data storage location information are described in a single data set. The present invention is applicable to a digital camera.

Подробнее
10-01-2013 дата публикации

APPARATUS AND SOFTWARE SYSTEM FOR AND METHOD OF PERFORMING A VISUAL-RELEVANCE-RANK SUBSEQUENT SEARCH

Номер: US20130014016A1
Принадлежит:

A method analyzes the visual content of media such as videos for collecting together visually-similar appearances in their constituent images (e.g. same scenes, same objects, faces of the same people.) As a result, the most relevant and salient (of clearest and largest presence) visual appearances depicted in the videos are presented to the user, both for the sake of summarizing the video content for the users to see before they watch (that is, judge by the depicted video content in a filmstrip-like summary whether they want to mouse-click on the video and actually spend time watching it), as well as for allowing to users to further refine their video search result set according to the most relevant and salient video content returned (e.g. largest screen-time faces).

Подробнее
29-11-2019 дата публикации

Номер: RU2019129646A3
Автор:
Принадлежит:

Подробнее
25-12-2019 дата публикации

СИСТЕМА И СПОСОБ ДЛЯ ОБРАБОТКИ ВИДЕОДАННЫХ ИЗ АРХИВА

Номер: RU2710308C1

Изобретение относится к вычислительной технике. Технический результат − повышение эффективности поиска интересующего объекта при минимальных начальных данных Система для обработки данных из архива содержит: видеокамеры; память для хранения архива видеоданных от видеокамер системы; базу данных для хранения метаданных; графический пользовательский интерфейс (ГПИ), содержащий: блок выбора видеокамер; блок задания промежутка времени; блок выбора режима поиска; блок характеристик поиска; блок отображения, для отображения результатов поиска; а также устройство обработки данных для выполнения: декомпрессии и анализа видеоданных для формирования метаданных, характеризующих данные обо всех объектах в видео, при этом упомянутые метаданные записываются в базу данных системы; обработки архивных видеоданных и осуществления поиска по метаданным; вывода результата поиска посредством блока отображения. 3 н. и 30 з.п. ф-лы, 2 ил.

Подробнее
29-03-2018 дата публикации

Bereitstellen von relevanten Videoszenen in Reaktion auf eine Videosuchabfrage

Номер: DE102017005963A1
Принадлежит:

Die vorliegende Offenbarung betrifft Verfahren und Systeme zur Bereitstellung von relevanten Videoszenen in Reaktion auf eine Videosuchabfrage. Die Systeme und Verfahren identifizieren eine Mehrzahl von Keyframes eines Medienobjektes und detektieren ein oder mehrere in der Mehrzahl von Keyframes dargestellte Contentmerkmale. Auf Grundlage des einen oder der mehreren detektierten Contentmerkmale verknüpfen die Systeme und Verfahren die detektierten Contentmerkmale angebende Tags mit der Mehrzahl von Keyframes des Medienobjektes. Die Systeme und Verfahren vergleichen in Reaktion auf ein Empfangen einer Suchbegriffe beinhaltenden Suchabfrage die Suchbegriffe mit den Tags der ausgewählten Keyframes, identifizieren einen ausgewählten Keyframe, der wenigstens ein Contentmerkmal im Zusammenhang mit den Suchbegriffen abbildet, und stellen ein Vorschaubild des Medienobjektes, das das wenigstens eine Contentmerkmal abbildet, bereit.

Подробнее
20-11-2013 дата публикации

Facial detection, recognition and bookmarking in videos

Номер: GB0002502221A
Принадлежит:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for facial bookmarking in videos. In one aspect, methods include receiving a digital video comprising video data, processing the video data to detect features indicative of a human face in the digital video, determining, from the video data, a first frame, in which the features indicative of the human face are detected in the digital video, determining first timestamp data corresponding to the first frame, determining, from the video data, a second frame, in which the features indicative of the human face are detected in the digital video, determining second timestamp data corresponding to the second frame, generating an identifier corresponding to the human face, generating a data set including the identifier, the first timestamp data and the second timestamp data, and appending the data set to the video data to provide annotated video data.

Подробнее
21-12-2011 дата публикации

Generating, transmitting and receiving object detection metadata

Номер: GB0002481298A
Принадлежит:

A perimeter (105) around a detected object, such as face (103), in a video frame 1309 is generated in a first coordinate system. The perimeter is converted from the first coordinate system into a second coordinate system having the same aspect ratio as the first coordinate system. A first metadata entry includes dimensions of the image data in the second coordinate system. A second metadata entry provides a location and dimensions of the converted perimeter in the second coordinate system. Additional metadata can indicate matching objects between frames, position of an object relative to other objects in a frame, a probability that an object is correctly detected, and a total number of objects detected across multiple frames of image data. Claims are also included for the transmitting and receiving of the generated metadata.

Подробнее
15-11-2016 дата публикации

FACIAL DETECTION, RECOGNITION AND BOOKMARKING IN VIDEOS

Номер: CA0002827611C
Принадлежит: GOOGLE INC., GOOGLE INC

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for facial bookmarking in videos. In one aspect, methods include receiving a digital video comprising video data, processing the video data to detect features indicative of a human face in the digital video, determining, from the video data, a first frame, in which the features indicative of the human face are detected in the digital video, determining first timestamp data corresponding to the first frame, determining, from the video data, a second frame, in which the features indicative of the human face are detected in the digital video, determining second timestamp data corresponding to the second frame, generating an identifier corresponding to the human face, generating a data set including the identifier, the first timestamp data and the second timestamp data, and appending the data set to the video data to provide annotated video data.

Подробнее
25-05-2018 дата публикации

User attribute extraction method and device, and electronic equipment

Номер: CN0108076128A
Принадлежит:

Подробнее
23-09-2021 дата публикации

OBTAINING ARTIST IMAGERY FROM VIDEO CONTENT USING FACIAL RECOGNITION

Номер: US20210295023A1
Принадлежит:

An example method may include receiving, at a computing device, a digital image associated with a particular media content program, the digital image containing one or more faces of particular people associated with the particular media content program. A computer-implemented face recognition program together with a set of computational models associated with the particular media content program may be applied to the digital image to recognize one or more of the particular people in the digital image, together with respective geometric coordinates for each of the one or more detected faces. At least a subset of the set of the computational models may be associated with a respective one of the particular people. The digital image together may be stored in non-transitory computer-readable memory, together with information assigning respective identities of the recognized particular people, and associating with each respective assigned identity geometric coordinates in the digital image.

Подробнее
20-11-2012 дата публикации

Object recognition and database population for video indexing

Номер: US0008315430B2

A method for processing digital media is described. The method, in one example embodiment, includes identification of objects in a video stream by detecting, for each video frame, an object in the video frame and selectively associating the object with an object cluster. The method may further include comparing the object in the object cluster to a reference object and selectively associating object data of the reference object with all objects within the object cluster based on the comparing. The method may further include manually associating the object data of the reference object with all objects within the object cluster having no associated reference object and populating a reference database with the reference object for the object cluster.

Подробнее
30-06-2009 дата публикации

Method and system for segmenting videos using face detection

Номер: US0007555149B2

A method generates a summary of a video. Faces are detected in a plurality of frames of the video. The frames are classified according to a number of faces detected in each frame and the video is partitioned into segments according to the classifications to produce a summary of the video. For each frame classified as having a single detected face, one or more characteristics of the face is determined. The frames are labeled according to the characteristics to produce labeled clusters and the segments are partitioned into sub-segments according to the labeled clusters.

Подробнее
07-05-2013 дата публикации

Systems, methods and articles for video analysis reporting

Номер: US0008438175B2

A macro video analysis system including a first connection to a first persistent database of a first video analysis system monitoring a first remote location, a second connection to a second persistent database of a second video analysis system monitoring a second remote location, a macro persistent database archiving a respective event generated by each the first remote video analysis system monitoring the first remote location and the second video analysis system monitoring the second remote location and wherein each respective event is respectively transmitted through the first connection and the second connection, and a database query module to evaluate the events.

Подробнее
12-11-2019 дата публикации

Video surveillance system based on larger pose face frontalization

Номер: US0010474882B2

A video surveillance system is provided. The system includes a device configured to capture an input image of a subject located in an area. The system further includes a processor. The processor estimates, using a three-dimensional Morphable Model (3DMM) conditioned Generative Adversarial Network, 3DMM coefficients for the subject of the input image. The subject varies from an ideal front pose. The processor produces, using an image generator, a synthetic frontal face image of the subject of the input image based on the input image and coefficients. An area spanning the frontal face of the subject is made larger in the synthetic than in the input image. The processor provides, using a discriminator, a decision of whether the subject of the synthetic image is an actual person. The processor provides, using a face recognition engine, an identity of the subject in the input image based on the synthetic and input images.

Подробнее
17-04-2014 дата публикации

INTERACTIVE PHOTOGRAPHY SYSTEM AND METHOD EMPLOYING FACIAL RECOGNITION

Номер: US20140105466A1
Принадлежит: Ocean Images UK Ltd.

A system and method for providing social photography services at locations such as Cruise Ship and Theme Parks with the purpose of giving a customer a unique identifier, associating the unique identifier with a reference imagine of the customer, and matching reference images to images shot at the locale using facial recognition techniques. A photograph gallery application is also provided, allowing customers and employees to access the facially recognized images for viewing, purchasing, and printing. This system and method employs computer based technology and facial recognition to provide fast and accurate photography production, processing, selection, printing, and sale.

Подробнее
24-10-2017 дата публикации

Security system operator efficiency

Номер: US0009798803B2

Systems and methods for increasing an efficiency of an operator of a security system are discussed generally herein. A system can include a memory including ontology data saved thereon, the ontology data can define interrelationships between a scanner associated with access to a room of an area under surveillance, a camera with a field of view at least partially overlapping a footprint of the room, an identifier configured to be scanned by the scanner and associated with a person, and a security policy including one or more predefined conditions, which when satisfied, indicate when a security threat exists, the security policy includes a response an operator can perform if the conditions are satisfied, and the system can include a query module configured to receive a query and search the ontology data and temporal and spatial data associated with the area under surveillance in response to receiving the query.

Подробнее
25-10-2018 дата публикации

METHODS, SYSTEMS, AND MEDIA FOR GENERATING SEARCH RESULTS BASED ON CONTEXTUAL INFORMATION

Номер: US20180307752A1
Принадлежит:

Methods, systems, and media for generating search results based on contextual information are provided. In some implementations, a method for presenting search results is provided, the method comprising: receiving, using a hardware processor, a query related to media that is currently being presented; identifying a program that is currently being presented; identifying a plurality of keywords associated with the identified program; determining one or more of the plurality of keywords that are contextually relevant to the query; obtaining a plurality of search results based on the query and the one or more contextually relevant keywords; and causing at least one of the plurality of search results to be presented to the user.

Подробнее
17-01-2019 дата публикации

SYSTEMS AND METHODS FOR DETERMINING USERS ASSOCIATED WITH DEVICES BASED ON FACIAL RECOGNITION OF IMAGES

Номер: US20190019012A1
Принадлежит: Facebook Inc

Systems, methods, and non-transitory computer readable media can identify a user associated with a device based on a subset of media content items on the device based at least in part on analysis of the subset of media content items. A relationship between the user and one or more other users depicted in the media content items can be determined. A recommendation relating to sending at least one media content item on the device to at least of the one or more other users can be generated based on the determined relationship.

Подробнее
23-07-2009 дата публикации

VIDEO SURVEILLANCE SYSTEM AND METHOD USING IP-BASED NETWORKS

Номер: US2009185784A1
Принадлежит:

A technique capable of preventing an increase in the load of a system and a IP-based network that perform both a process of transmitting information in real time and a process of storing and searching history information and of improving the scalability of the system. A video surveillance system includes: plural cameras that capture images; a monitoring apparatus that displays the images captured by the cameras; an information generating apparatus that generates information for searching the images captured by the cameras; and a search apparatus that searches the images captured by the cameras. The information generating apparatus acquires the images captured by the cameras, generates retrieval data from the acquired images, and transmits the generated retrieval data to the search apparatus. The search apparatus stores the retrieval data received from the information generating apparatus in a storage device, and notifies the monitoring apparatus that the retrieval data has been stored.

Подробнее
10-02-2005 дата публикации

Method and apparatus for reviewing video

Номер: US2005031296A1
Автор:
Принадлежит:

One embodiment is a method for reviewing videos, comprising: deriving at least two video segments from unedited video footage based upon a previously determined unique saliency, each saliency associated with a corresponding one of the video segments; and displaying a display window for each of the derived video segments substantially concurrently.

Подробнее
13-06-2019 дата публикации

APPARATUS AND METHOD FOR RECOGNIZING PERSON

Номер: US20190179960A1
Принадлежит:

An apparatus for recognizing a person includes a content separator configured to receive contents and separate the contents into video content and audio content; a video processor configured to recognize a face from an image in the video content received from the content separator and obtain information on a face recognition section by analyzing the video content; an audio processor configured to recognize a speaker from voice data in the audio content received from the content separator and obtain information on a speaker recognition section by analyzing the audio content; and a person recognized section information provider configured to provide information on a section of the contents in which a person appears based on the information on the face recognition section and the information on the speaker recognition section.

Подробнее
26-11-2013 дата публикации

Image information output method

Номер: US0008593486B2

Provided is a video image data generation system including a database for storing a plurality of image data photographed in various directions in various locations, correlating the directions and the locations with the stored image data, and correlating and storing a photographed sub-region when the image data is acquired, a route view point specifying device which specifies various locations and eye level directions arranged on a view point route, an image search engine which searches an image of an eye level direction specified from a location of a view point route specified by the route view point specifying device and outputs video data, wherein the image search engine searches image data stored in a database and the image data including a sub-region located in an eye level direction in each of a plurality of locations on a view point route by referencing photography direction data correlated with the sub-region.

Подробнее
13-02-2013 дата публикации

Method of searching for a target within video data

Номер: GB0002493580A
Принадлежит:

A method, and processor, for searching for a target within video data comprises the steps of: receiving a target selected from within video data; identifying a current selection of target matches for the selected target within further video data; ranking the current selection of target matches; receiving a signal confirming or rejecting one or more of the ranked target matches; identifying a further selection of target matches for the confirmed target matches from the further video data; and, indicating portions of the further video data containing the further selection of target matches. The signal could be received from a user, and the target could be a person. The method may also include space-time profiling.

Подробнее
27-06-2019 дата публикации

System and Method for Algorithmic Editing of Video Content

Номер: AU2018271424A1
Принадлежит: Adams Pluck

A computer implemented method for algorithmically editing digital video content. The method comprises processing a video 5 file containing source video to extract metadata representative of at least one of: video content within the source video; and audio content within the source video. Label taxonomies are applied to the extracted metadata and stored in a metadata store in association with the video file. The labelled metadata is 10 subsequently processed to identify higher-level labels for the labelled metadata. Any identified higher-level labels are also stored in the metadata store as additional metadata associated with the video file. A clip generating algorithm is implemented that applies the stored metadata for selectively editing the 15 source video to thereby generate a plurality of different candidate video clips therefrom. Responsive to determining a clip presentation trigger on a viewer device, a clip selection algorithm is implemented that applies engagement data and metadata ...

Подробнее
27-12-2019 дата публикации

TRACKING METHOD AND SYSTEM USING A DATABASE OF A PERSON'S FACES

Номер: KR1020190142553A
Автор:
Принадлежит:

Подробнее
05-12-2019 дата публикации

IMAGE INVENTORY PRODUCTION

Номер: US2019370286A1
Принадлежит:

There are disclosed methods and apparatus for manufacture of image inventories from frames of a digital work including audio which corresponds to objects in still images of the digital video work. Objects are detected in each frame's image, and the objects are recognized. Metadata is assigned to the objects, the object metadata linking audio from the digital video work to the corresponding object in the frame's image which produces the audio. For each frame, at least one cryptographic hash of the object metadata is generated, and the hash is written to a node of a transaction processing network.

Подробнее
06-10-2015 дата публикации

Content-based video segmentation

Номер: US0009154761B2
Принадлежит: Google Inc., GOOGLE INC, GOOGLE INC.

In general, video segmentation techniques are described. According to various examples, the video segmentation techniques may be based on video content. An example method includes determining one or more segments into which to divide video content, dividing the video content into the determined number of segments identifying a boundary frame associated with each of the segments, and adjusting the respective boundary frame associated with a first segment of the segments to generate an adjusted boundary frame associated with the first segment, wherein the adjusting is based on an one or more entity representations associated with the adjusted boundary frame.

Подробнее
22-05-2019 дата публикации

УСТРОЙСТВО И СПОСОБ ДЛЯ АНАЛИЗА ИМПОРТИРОВАННОГО ВИДЕО

Номер: RU2688757C1
Принадлежит: ООО "Ай Ти Ви групп" (RU)

Изобретение относится к вычислительной технике. Технический результат − повышение скорости поиска необходимого события или объекта в импортированном видео, полученном от стороннего устройства. Устройство для анализа импортированного видео содержит: память, базу данных для хранения метаданных, графический пользовательский интерфейс и устройство обработки данных, причем устройство обработки данных сконфигурировано для загрузки видео в общедоступном формате в память и импорта загруженного видео в программное обеспечение (ПО) устройства для анализа импортированного видео, причем ПО позволяет выполнять декомпрессию и анализ импортированного видео для формирования метаданных, характеризующих данные обо всех объектах в видео, и для записи упомянутых метаданных в базу данных. 3 н. и 30 з.п. ф-лы, 2 ил.

Подробнее
16-03-2018 дата публикации

Номер: RU2016136707A3
Автор:
Принадлежит:

Подробнее
10-09-2014 дата публикации

ПОЛУЧЕНИЕ КЛЮЧЕВЫХ СЛОВ ДЛЯ ПОИСКА

Номер: RU2013108254A
Принадлежит:

... 1. Устройство (100) воспроизведения для воспроизведения изображений в программе, содержащее контроллер (110), сконфигурированный для выполнения:загрузки данных изображений объектов на изображениях программы на основе предварительной информации о программе (305);распознавания объекта на воспроизводимом изображении (320) на основе загруженных данных изображений;получение ключевого слова (410), связанного с распознанным объектом (340); ипоиска информации на основе ключевого слова (370).2. Устройство воспроизведения по п. 1, в котором контроллер дополнительно сконфигурирован для:получения множества ключевых слов; ипредоставления пользователю возможности выбора одного из этих ключевых слов для поиска (345).3. Устройство воспроизведения по п. 2, в котором контроллер дополнительно сконфигурирован для:распознавания множества объектов на воспроизводимом изображении; иполучения множества ключевых слов посредством получения ключевого слова, связанного с каждым из распознанных объектов.4. Устройство ...

Подробнее
11-06-2014 дата публикации

Object detection metadata

Номер: GB0002481298B
Принадлежит: APPLE INC, APPLE INC.

Подробнее
11-11-2003 дата публикации

SCALABLE VIDEO SUMMARIZATION AND NAVIGATION SYSTEM AND METHOD

Номер: AU2003230369A1
Принадлежит:

Подробнее
11-11-2003 дата публикации

Scalable video summarization

Номер: AU2003230362A8
Принадлежит:

Подробнее
05-10-2019 дата публикации

METHODS, APPARATUS, AND SYSTEMS FOR AI-ASSISTED OR AUTOMATIC VIDEO PRODUCTION

Номер: CA0003038767A1
Принадлежит: RIDOUT & MAYBEE LLP

Methods, apparatus, and systems for automatically producing a video program in accordance with a script are provided. Various media assets are recorded and or stored in a content database, together with metadata relating to each of the media assets. Each media asset is tagged with a unique content ID, the unique content ID associating the metadata with the media asset. The media assets are then indexed. Text from a script is then analyzed using natural language processing to locate one or more relevant indexed media assets. The located one or more media assets are assembled into a video program in accordance with the script.

Подробнее
04-10-2012 дата публикации

FACE RECOGNITION BASED ON SPATIAL AND TEMPORAL PROXIMITY

Номер: CA0002829079A1
Принадлежит:

In one embodiment, a social networking system determines one or more individuals matching one or more faces in an image file of a still image or a video sequence, associated with a first user based on the one or more individuals' spatial and temporal proximity to the image file, and presents the matched individuals to the first user.

Подробнее
25-04-2019 дата публикации

RESTAURANT FRAUD DETECTION APPARATUS AND METHOD

Номер: US20190122248A1
Принадлежит:

A restaurant fraud detection apparatus includes one or more cameras, point-of-sale (POS) terminals, and a backend server. The cameras are disposed capture one or more images of a patron within a retail establishment. The POS terminals are coupled to the cameras, where a physical payment method is provided by the patron and entered into one of the POS terminals, and where one or more of the POS terminals receives the images and transmits a request over a network for recognition of the patron. The backend server is configured to receive the request, to access loyalty program data comprising a stored payment method that corresponds to the physical payment method, and to transmit an alert to the POS terminals that indicates the patron is not recognized and that use of the physical payment method may be fraudulent because the stored payment method is associated with a different patron.

Подробнее
22-11-2018 дата публикации

AUTOMATIC AND INTELLIGENT VIDEO SORTING

Номер: US20180336931A1
Принадлежит:

Systems and methods disclosed herein provide automatic and intelligent video sorting in the context of creating video compositions. A computing device sorts a media bin of videos in the user's work area based on similarity to the videos included in the video composition being created. When a user selects or includes a particular video on the composition's timeline, the video is compared against the entire video collection to change the display of videos in the media bin. In one example, videos that have similar tags to a selected video are prioritized at the top. Only a subset of frames of each of the videos are used to use to identify video tags. Intelligently selecting tags using a subset of frames from each video rather than using all frames enables more efficient and accurate tagging of videos, which facilitates quicker and more accurate comparison of video similarities.

Подробнее
28-07-2016 дата публикации

POSITIONAL LOCATING SYSTEM AND METHOD

Номер: US20160217324A1
Принадлежит:

A method and system are disclosed for locating or otherwise generating positional information for an object, such as but not limited generating positional coordinates for an object attached to an athlete engaging in an athletic event. The positional coordinates may be processed with other telemetry and biometrical information to provide real-time performance metrics while the athlete engages in the athletic event.

Подробнее
05-01-2021 дата публикации

Multi-restaurant facial recognition system

Номер: US0010885542B2
Принадлежит: Toast, Inc., TOAST INC, TOAST, INC.

A multi-restaurant facial recognition includes one or more cameras, first point-of-sale (POS) terminals, and a backend server. The cameras are disposed within a first retail establishment and are configured to capture one or more images of a patron within the first retail establishment. The POS terminals are also within the first retail establishment and one or more of the first POS terminals receives the one or more images and transmits a request over a network for enrollment of the patron in a loyalty program. The backend server receives the request, enrolls the patron in the loyalty program, and stores loyalty program data, where the backend server may recognize and provide the loyalty program data in response to subsequent requests for recognition of the patron from any one of a plurality of second POS terminals that are in other retail establishments that are related to the first retail establishment.

Подробнее
28-05-2019 дата публикации

Biometric notification system

Номер: US0010303934B2
Принадлежит: FACEFIRST, INC

The present invention provides a biometric notification system for selectively sending messages to interested recipients. In various embodiments, message trigger criteria, interested recipients, and message content may vary depending upon, among other things, the service being provided.

Подробнее
07-02-2019 дата публикации

PERFORMING MULTIPLE QUERIES WITHIN A ROBUST VIDEO SEARCH AND RETRIEVAL MECHANISM

Номер: US20190042584A1
Принадлежит:

The disclosed herein relates to a method, a system, and a computer program product. The method, the system, and the computer program product can include selecting a video segment within a video and extracting a feature set from the video segment. The method, the system, and the computer program product can further include retrieving data information that matches the feature set from a database; determining a degree of similarity between each instance of the data information and the feature set; and presenting a ranked result set based on the degree of similarity.

Подробнее
13-06-2019 дата публикации

System and Method for Algorithmic Editing of Video Content

Номер: US20190182565A1
Принадлежит: Playable Pty Ltd

A computer implemented method for algorithmically editing digital video content is disclosed. A video file containing source video is processed to extract metadata. Label taxonomies are applied to extracted metadata. The labelled metadata is processed to identify higher-level labels. Identified higher-level labels are stored as additional metadata associated with the video file. A clip generating algorithm applies the stored metadata for selectively editing the source video to generate a plurality of different candidate video clips. Responsive to determining a clip presentation trigger on a viewer device, a clip selection algorithm is implemented that applies engagement data and metadata for the candidate video clips to select one of the stored candidate video clips. The engagement data is representative of one or more engagement metrics recorded for at least one of the stored candidate video clips. The selected video clip is presented to one or more viewers via corresponding viewer devices ...

Подробнее
28-09-2017 дата публикации

Method and system for modeling image of interest to users

Номер: US20170277785A1
Принадлежит:

A system and method for modeling and distributing image data of interest to users is disclosed. Users on user devices such as mobile phones send request messages for image data captured by surveillance cameras of the system. The request messages include information for selecting the image data, such as camera number and time of recording of the image data, in examples. In response, an application server of the system collects the image data from the surveillance cameras, and supplies image data to the users based on a model that the application server creates and updates for each of the users. The model ranks image data of potential interest for each of the users, where the model is based on the information for selecting the image data provided by the users. Preferably, a machine learning application of the application server creates the model for each of the users.

Подробнее
30-05-2023 дата публикации

Video processing system

Номер: US0011665311B2
Автор: Yasufumi Hirakawa
Принадлежит: NEC CORPORATION, NEC Corporation

A video processing system includes: an object movement information acquiring means for detecting a moving object moving in a plurality of segment regions from video data obtained by shooting a monitoring target area, and acquiring movement segment region information as object movement information, the movement segment region information representing segment regions where the detected moving object has moved; an object movement information and video data storing means for storing the object movement information in association with the video data corresponding to the object movement information; a retrieval condition inputting means for inputting a sequence of the segment regions as a retrieval condition; and a video data retrieving means for retrieving the object movement information in accordance with the retrieval condition and outputting video data stored in association with the retrieved object movement information, the object movement information being stored by the object movement ...

Подробнее
12-03-2014 дата публикации

Object detection metadata

Номер: GB0201401271D0
Автор:
Принадлежит:

Подробнее
16-06-2016 дата публикации

Methods, systems, and media for generating search results based on contextual information

Номер: AU2014374036A1
Принадлежит:

Methods, systems, and media for generating search results based on contextual information are provided. In some implementations, a method for presenting search results is provided, the method comprising: receiving, using a hardware processor, a query related to media that is currently being presented; identifying a program that is currently being presented; identifying a plurality of keywords associated with the identified program; determining one or more of the plurality of keywords that are contextually relevant to the query; obtaining a plurality of search results based on the query and the one or more contextually relevant keywords; and causing at least one of the plurality of search results to be presented to the user.

Подробнее
12-05-2020 дата публикации

Spark-frame-based massive face image retrieval system and retrieval method

Номер: CN0106777167B
Автор:
Принадлежит:

Подробнее
15-05-2003 дата публикации

Method and system for information alerts

Номер: US2003093580A1
Автор:
Принадлежит:

An information alert system and method are provided. Content from various sources, such as television, radio and/or Internet, are analyzed for the purpose of determining whether the content matches a predefined alert profile, which is manually or automatically created. An alert is then automatically created to permit access to the information in audio, video and/or textual form.

Подробнее
28-09-2017 дата публикации

System and method for retail customer tracking in surveillance camera network

Номер: US20170278137A1
Принадлежит:

A retail customer tracking system and method is disclosed. The retail system preferably includes at least one surveillance camera for generating image data of customer interactions with products and at least one point of sale camera for generating image data of customers at a point of sale area. An analytics system determines product interactions from the image data of the customer interactions with the products and stores facial image information and the product interactions for each of the customers. When a customer arrives at a point of sale area, facial image information of the customer determined from the image data of the point of sale camera is matched to previously stored facial information for the customer, and associated product interactions for the customer are provided to a management system. The management system then provides sale cues based on the product interactions to the customer at the point of sale area.

Подробнее
03-04-2012 дата публикации

Electronic apparatus and image display control method of the electronic apparatus

Номер: US0008150168B2
Автор: Takuya Koda, KODA TAKUYA

According to one embodiment, an electronic apparatus extracts face images of persons from video content data and outputs timestamp information indicating time points at which each extracted face image appears in the video content data, and displays face images in each column of a plurality of face image display areas arranged in a matrix based on the time stamp information. The apparatus detects presence or absence of a face area in each frame consisting of the video content data and decides a cutout range of the detected face area. And, the apparatus adjusts a case in which the cutout range of the decided face area protrudes outside the frame.

Подробнее
10-08-2023 дата публикации

SYSTEM AND METHOD FOR AUTOMATICALLY IDENTIFYING KEY DIALOGUES IN A MEDIA

Номер: US20230254552A1
Принадлежит:

A system and a method for automatically identifying key dialogues in media is disclosed herein. In the method disclosed herein, the key dialogues engine receives the media asset and extract transcript data and supplementary data. The key dialogues engine processes the transcript data into a plurality of transcript data elements and associate the transcript data elements with respective data elements selected from the supplementary data. The key dialogues engine identifies one or more key dialogues from the associated transcript data elements based on configurable criteria, in operable communication with one or more of a plurality of data sources, wherein the configurable criteria comprises one or more of repetitive keywords, rhyming words, audio signal levels, matching keywords, text-based sentiments, dialogue similarity, repetitive dialogues, signature dialogues, entry dialogues recited by actors comprising protagonists and antagonists, faces of the actors, celebrity detection, image labels ...

Подробнее
23-01-2024 дата публикации

Synthetic visual content creation and modification using textual input

Номер: US0011880917B2
Автор: Yair Adato, Gal Jacobi
Принадлежит: BRIA ARTIFICIAL INTELLIGENCE LTD.

Systems, methods and non-transitory computer readable media for generating and modifying synthetic visual content using textual input are provided. One or more keywords may be received from a user. The one or more keywords may be used to generate a plurality of textual descriptions. Each generated textual description may correspond to a possible visual content. The generated plurality of textual descriptions may be presented to the user through a user interface that enables the user to modify the presented textual descriptions. A modification to at least one of the plurality of textual descriptions may be received from the user, therefore obtaining a modified plurality of textual descriptions. A selection of one textual description of the modified plurality of textual descriptions may be received from the user. A plurality of visual contents corresponding to the selected textual description may be presented to the user.

Подробнее
02-06-2022 дата публикации

METHOD FOR PROCESSING VIDEO, DEVICE AND STORAGE MEDIUM

Номер: US20220174369A1
Принадлежит:

The present disclosure provides examples of a method and apparatus for processing a video, a device and a storage medium. The method may include: acquiring a target video and a target comment of the target video; recognizing a picture in the target video to obtain text information of the picture; determining a target comment matching a content of the text information; and inserting, in response to displaying the picture in the target video, the target comment matching the content in a form of a bullet screen. 1. A method for processing a video , the method comprising:acquiring a target video and a first target comment of the target video;recognizing a picture in the target video to obtain text information of the picture;determining a second target comment matching a content of the text information from the first target comment; andinserting, in response to displaying the picture in the target video, the second target comment matching the content in a form of a bullet screen.2. The method according to claim 1 , wherein acquiring the target video comprises:acquiring original news;searching for an original video related to the original news;extracting a summary of the original news to obtain a commentary of the original news;generating, based on the commentary, a video voice, and generating, based on the original news and the original video, a video picture corresponding to the video voice; andsynthesizing the video picture and the video voice to obtain the target video.3. The method according to claim 2 , wherein searching for the original video related to the original news claim 2 , comprises:acquiring an original comment of the original news; andsearching for, based on the original news and/or a content of the original comment, the original video.4. The method according to claim 2 , wherein acquiring the first target comment of the target video comprises:acquiring an original comment of the original news; andselecting an original comment matching a content of the ...

Подробнее
02-04-2024 дата публикации

System and method for automatically identifying key dialogues in a media

Номер: US0011949971B2
Принадлежит: PRIME FOCUS TECHNOLOGIES LIMITED

A system and a method for automatically identifying key dialogues in media is disclosed herein. In the method disclosed herein, the key dialogues engine receives the media asset and extract transcript data and supplementary data. The key dialogues engine processes the transcript data into a plurality of transcript data elements and associate the transcript data elements with respective data elements selected from the supplementary data. The key dialogues engine identifies one or more key dialogues from the associated transcript data elements based on configurable criteria, in operable communication with one or more of a plurality of data sources, wherein the configurable criteria comprises one or more of repetitive keywords, rhyming words, audio signal levels, matching keywords, text-based sentiments, dialogue similarity, repetitive dialogues, signature dialogues, entry dialogues recited by actors comprising protagonists and antagonists, faces of the actors, celebrity detection, image labels ...

Подробнее
27-07-2011 дата публикации

Object detection metadata

Номер: GB0201109873D0
Автор:
Принадлежит:

Подробнее
06-09-2017 дата публикации

Providing relevant video scenes in response to a video search query

Номер: GB0201711692D0
Автор:
Принадлежит:

Подробнее
13-01-2016 дата публикации

Facial detection, recognition and bookmarking in videos

Номер: GB0002502221B
Принадлежит: GOOGLE INC, GOOGLE INC.

Подробнее
12-01-2012 дата публикации

Systems And Methods For Identifying And Notifying Users of Electronic Content Based on Biometric Recognition

Номер: US20120011085A1
Принадлежит: AOL Inc

Systems and methods are disclosed for manipulating electronic multimedia content to a user. One method includes generating a plurality of biometric models, each biometric model corresponding to one of a plurality of people; receiving electronic media content over a network; extracting image or audio data from the electronic media content; detecting biometric information in the image or audio data; and calculating a probability of the electronic media content involving one of the plurality of people, based on the biometric information and the plurality of biometric models.

Подробнее
29-03-2012 дата публикации

Content summarizing apparatus and content summarizing displaying apparatus

Номер: US20120078977A1
Автор: Tomoyuki Shibata
Принадлежит: Toshiba Corp

According to one embodiment, a content summarizing apparatus includes a selection unit, a record unit, and a storage unit. The selection unit selects at least one image from input content in accordance with at least one selection criterion and at least one parameter corresponding to the at least one selection criterion, and to produce a summary. The record unit cause the storage unit to store a summary record information item that includes the at least one selection criterion and the at least one parameter used by the selection unit. The storage unit stores the summary record information item whenever the summary of the input content is produced. The selection unit acquires past summary record information items from the storage unit, and produces the summary using the at least one selection criterion and the at least one parameter that fails to be included in the past summary record information items.

Подробнее
05-04-2012 дата публикации

Smart Real-time Content Delivery

Номер: US20120084435A1
Принадлежит: International Business Machines Corp

A method, a computer program product, and a system is provided to monitor several continuously transmitted content channels, each providing video information and audio information, or a combination thereof with text information; simultaneously convert the monitored content from each channel to provide data stream feeds for each of these channels; simultaneously analyze each channel feed to determine if a user provided topic is included in any of these feeds; and send an alert to the user when a channel feed includes a user provided topic. In a further embodiment, the method includes delivery of content from the channel feed to a user device, upon request, in a user designated transmission format (such as video, audio or text).

Подробнее
12-04-2012 дата публикации

Video Signature Based on Image Hashing and Shot Detection

Номер: US20120087583A1
Принадлежит: FutureWei Technologies Inc

In accordance with an embodiment, A method of comparing a first group of frames to a second group of frames includes electronically receiving the first group of frames, selecting a group of frames from the first group of frames as a first key frame set, calculating a hash distance between an image hash for each frame in the first key frame set to an image hash of each frame of a second key frame set taken from second group of frames, and choosing frames in the first group of frames with a minimum hash distances to respective reference frames to form a series of minimum hash distances.

Подробнее
11-10-2012 дата публикации

Electronic device and facial image display apparatus

Номер: US20120257801A1
Автор: Kouetsu Wada
Принадлежит: Toshiba Corp

According to one embodiment, an electronic apparatus includes a storage device which stores face thumbnail indexing information including face images and time stamp information, extracting module configure to assign time zones to a video content data and to extract face images belonging to each time zone based on a time stamp information, classifying module configure to classify facial images of the same person from the extracted facial images, calculating module configure to calculate a frequency of appearance of each classified facial image, and facial image indication module configure to display a list of the facial images included in the facial image indexing information in a facial image indication in a two-dimensional display area, the facial image indication having time-zone-specific display areas in columns corresponding to the time zones, each facial image displayed in each time-zone-specific display area being displayed in a size based on the frequency of appearance.

Подробнее
28-02-2013 дата публикации

Systems and Methods of Detecting Significant Faces in Video Streams

Номер: US20130051756A1
Принадлежит: CyberLink Corp

Systems and methods of processing video streams are described. A face is detected in a video stream. The face is tracked to determine a video clip associated with one of a plurality of individuals. The video segment is assigned to a group of video clips based on the associated individual. A significant face is detected in the group of video clips when the detected face meets one or more significance criteria. The significance criteria describes a face-frame characteristic. A representation of the significant face is displayed in association with a representation of the group of video clips. The order of the significance criteria is adjusted through a user interface.

Подробнее
11-04-2013 дата публикации

Method and Apparatus for Navigating a Video Via a Transcript of Spoken Dialog

Номер: US20130091299A1
Принадлежит: HULU LLC

A method and apparatus for navigating a media program via a searchable transcript of the dialog of the media program is disclosed. In one embodiment, a textural transcript of the dialog is generated, wherein the textural transcript comprising a plurality of portions wherein each portion is associated with a segment of the media program, a command is accepted to display the transcript and in response to that command, user interface data is transmitted to the client computer for presentation in a user interface, wherein the user interface comprising a concurrently presented media program player and the textural transcript.

Подробнее
30-05-2013 дата публикации

Character-based automated shot summarization

Номер: US20130138435A1
Автор: Frank Elmo Weber
Принадлежит: Individual

Methods, devices, systems and tools are presented that allow the summarization of text, audio, and audiovisual presentations, such as movies, into less lengthy forms. High-content media files are shortened in a manner that preserves important details, by splitting the files into segments, rating the segments, and reassembling preferred segments into a final abridged piece. Summarization of media can be customized by user selection of criteria, and opens new possibilities for delivering entertainment, news, and information in the form of dense, information-rich content that can be viewed by means of broadcast or cable distribution, “on-demand” distribution, internet and cell phone digital video streaming, or can be downloaded onto an iPod™ and other portable video playback devices.

Подробнее
04-07-2013 дата публикации

Image processing

Номер: US20130173420A1
Принадлежит: Sony Europe Ltd

An image processing method includes partitioning an image under test to form a plurality of contiguous image segments having similar image properties, deriving feature data from a subset including one or more of the image segments, and comparing the feature data from the subset of image segments with feature data derived from respective image segments of one or more other images so as to detect a similarity between the image under test and the one or more other images.

Подробнее
18-07-2013 дата публикации

Feeling-expressing-word processing device, feeling-expressing-word processing method, and feeling-expressing-word processing program

Номер: US20130182907A1
Принадлежит: NEC Corp

The present approach enables an impression of the atmosphere of a scene or an object present in the scene at the time of photography to be pictured in a person's mind as though the person were actually at the photographed scene. A feeling-expressing-word processing device has: a feeling information calculating unit 11 for analyzing a photographed image, and calculating feeling information which indicates a temporal change in a scene shown in the photographed image or a movement of an object present in the scene; and a feeling-expressing-word extracting unit 12 for extracting, from among feeling-expressing words which express feelings and are stored in a feeling-expressing-word database 21 in association with the feeling information, a feeling-expressing word which corresponds to the feeling information calculated by the feeling information calculating unit 11.

Подробнее
12-09-2013 дата публикации

Digital Image Processing Using Face Detection and Skin Tone Information

Номер: US20130236052A1
Принадлежит: DigitalOptics Corp Europe Ltd

A technique for processing a digital image uses face detection to achieve one or more desired image processing parameters. A group of pixels is identified that corresponds to a face image within the digital image. A skin tone is detected for the face image by determining one or more default color or tonal values, or combinations thereof, for the group of pixels. Values of one or more parameters are adjusted for the group of pixels that correspond to the face image based on the detected skin tone.

Подробнее
20-02-2014 дата публикации

Image information output method

Номер: US20140049617A1
Принадлежит: KODAIRA ASSOCIATES Inc

Provided is a video image data generation system including a database for storing a plurality of image data photographed in various directions in various locations, correlating the directions and the locations with the stored image data, and correlating and storing a photographed sub-region when the image data is acquired, a route view point specifying device which specifies various locations and eye level directions arranged on a view point route, an image search engine which searches an image of an eye level direction specified from a location of a view point route specified by the route view point specifying device and outputs video data, wherein the image search engine searches image data stored in a database and the image data including a sub-region located in an eye level direction in each of a plurality of locations on a view point route by referencing photography direction data correlated with the sub-region.

Подробнее
20-02-2014 дата публикации

Media Fingerprinting and Identification System

Номер: US20140052737A1
Принадлежит: Zeitera LLC

The overall architecture and details of a scalable video fingerprinting and identification system that is robust with respect to many classes of video distortions is described. In this system, a fingerprint for a piece of multimedia content is composed of a number of compact signatures, along with traversal hash signatures and associated metadata. Numerical descriptors are generated for features found in a multimedia clip, signatures are generated from these descriptors, and a reference signature database is constructed from these signatures. Query signatures are also generated for a query multimedia clip. These query signatures are searched against the reference database using a fast similarity search procedure, to produce a candidate list of matching signatures. This candidate list is further analyzed to find the most likely reference matches. Signature correlation is performed between the likely reference matches and the query clip to improve detection accuracy.

Подробнее
06-03-2014 дата публикации

Displaying additional data about outputted media data by a display device for a speech search command

Номер: US20140067402A1
Автор: Yongsin Kim
Принадлежит: LG ELECTRONICS INC

A speech search method performed by a display device, the method including outputting media data including audio data, receiving a speech search command for additional data about the outputted media data from a user, the speech search command including at least one query word, determining whether the at least one query word matches a query term that is full and searchable, when the at least one query word matches the query term that is full and searchable, performing a search for the additional data using the query term, and when the at least one query word does not match the query term that is full and searchable, determining the query term from a predetermined amount of the audio data prior to receiving the speech search command and performing the search for the additional data using the query term.

Подробнее
27-03-2014 дата публикации

Display apparatus and control method thereof

Номер: US20140085187A1
Принадлежит: SAMSUNG ELECTRONICS CO LTD

Disclosed are a display apparatus and a control method thereof. The display apparatus includes: an image processor which processes an image of a content including a plurality of scenes in order to display the image; a display which displays an image of the content thereon; a voice input which inputs a user's voice; and a controller which displays a first scene of the plurality of scenes of the content, and displays a second scene falling under a next scene of the first scene, out of the plurality of scenes of the content, in response to a determination that the user's voice, which has been input while the first scene is displayed, corresponds to the first scene.

Подробнее
02-01-2020 дата публикации

Electronic apparatus, document displaying method thereof and non-transitory computer readable recording medium

Номер: US20200004493A1
Принадлежит: SAMSUNG ELECTRONICS CO LTD

The disclosure relates to an artificial intelligence (AI) system using a machine learning algorithm such as deep learning, and an application thereof. In particular, an electronic apparatus, a document displaying method thereof, and a non-transitory computer readable recording medium are provided. An electronic apparatus according to an embodiment of the disclosure includes a display unit displaying a document, a microphone receiving a user voice, and a processor configured to acquire at least one topic from contents included in a plurality of pages constituting the document, recognize a voice input through the microphone, match the recognized voice with one of the acquired at least one topic, and control the display unit to display a page including the matched topic.

Подробнее
05-01-2017 дата публикации

User created textbook

Номер: US20170004859A1
Автор: Jaran Charumilind
Принадлежит: Coursera Inc

In one general aspect, a method for generating a digital textbook can include receiving, by a computing device, a time-based transcript of a video of an online lecture, receiving a time-based thumbnail image subset of images included in the video of the online lecture, and displaying at least a portion of the transcript including a particular word. The method can further include receiving a selection of the particular word, determining a first thumbnail image and a second thumbnail image associated with the particular word, displaying the first thumbnail image and the second thumbnail image, receiving a selection of the first thumbnail image, and modifying, based on the selection of the first thumbnail image, the time-based transcript by including the first thumbnail image in the time-based transcript. The method can further include storing the modified time-based transcript as the digital textbook.

Подробнее
07-01-2021 дата публикации

METHOD AND APPARATUS FOR STORING MEDIA FILES AND FOR RETRIEVING MEDIA FILES

Номер: US20210004406A1
Автор: Chen Xi, Hu Yichen, Tian Hao
Принадлежит:

Embodiments of the present disclosure disclose a method and apparatus for storing a media file and for searching a media file. A specific embodiment of the method includes: acquiring a semantic vector for characterizing semantics of a context of the media file, the context being a context of the media file in a webpage presenting the media file; and storing the semantic vector and the media file in association. Based on the corresponding relationship established by this embodiment, the semantic vector corresponding to the media file may be used to match the media file to ensure the semantic matching of the media file. 1. A method for storing a media file , the method comprising:acquiring a semantic vector for characterizing semantics of a context of the media file, the context being a context of the media file in a webpage presenting the media file; andstoring the semantic vector and the media file in association.2. The method according to claim 1 , wherein the acquiring a semantic vector for characterizing semantics of a context of the media file claim 1 , comprises:acquiring the semantic vector for characterizing the semantics of the context of the media file, in response to receiving a request for requesting to store the media file presented by the webpage.3. The method according to claim 1 , wherein the semantic vector is obtained by:generating the semantic vector for characterizing the semantics of the context of the media file using a pre-trained semantic model, wherein the semantic model is used to generate a semantic vector for characterizing semantics of a text.4. The method according to claim 3 , wherein the semantic model is obtained by training based on a knowledge-enhanced semantic representation model ERNIE.5. The method according to claim 1 , wherein the method further comprises:adding an index to the semantic vector based on an HNSW algorithm.6. The method according to claim 1 , wherein the storing the semantic vector and the media file in ...

Подробнее
02-01-2020 дата публикации

Method and Apparatus for Multi-Dimensional Content Search and Video Identification

Номер: US20200004779A1
Принадлежит:

A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures. 1. A computer-implemented method for storing information associated with videos in a reference database using hash values as traversal indexes , the computer-implemented method comprising: obtaining, by a processor, data associated with the video sequence,', 'determining, by the processor, a multi-dimensional vector signature of a region of a frame of the video sequence,', 'determining, by the processor, a hash value based on the multi-dimensional vector signature, and', 'storing the data associated with the video sequence at a leaf node of a plurality of leaf nodes, wherein the leaf node is addressable by the hash value., 'for each of multiple video sequences2. The computer-implemented method of claim 1 , wherein the region comprises multiple sectors claim 1 , and wherein the multi-dimensional vector signature represents each sector.3. The computer-implemented method of claim 2 , wherein determining the multi-dimensional vector signature comprises comparing features within each sector to a threshold value to generate a value for the sector.4. The computer-implemented method of claim 2 , wherein the region is a rectangular ...

Подробнее
02-01-2020 дата публикации

Method and Apparatus for Multi-Dimensional Content Search and Video Identification

Номер: US20200004780A1
Принадлежит:

A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures. 1. A computer-implemented method for storing information associated with a video sequence in a reference database , the computer-implemented method comprising:obtaining, by a processor, data associated with the video sequence; and determining, by the processor, respective global features of a global region of interest,', 'determining, by the processor, respective local features of a respective keypoint within the global region of interest, wherein, for multiple frames of the set of frames of the video sequence, the respective keypoints correspond to different respective locations within the global region of interest,', 'generating, by the processor, a respective signature using both the global features for the frame and the local features for the frame,', 'determining, by the processor, a respective hash value for the frame based on the signature for the frame, and', 'storing, by the processor, the data associated with the video sequence in the reference database in association with the hash value for the frame., 'for each frame of a set of frames of the video sequence2. The computer-implemented method of claim 1 , wherein ...

Подробнее
02-01-2020 дата публикации

Method and Apparatus for Multi-Dimensional Content Search and Video Identification

Номер: US20200004781A1
Принадлежит:

A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures. 1. A computer-implemented method comprising:obtaining, by a processor, a first query index and a second query index that are derived from different respective features of a frame of a query video;determining, by the processor, that a distance measure between the first query index and a candidate database index of a reference database satisfies a threshold condition, wherein the candidate database index corresponds to a frame of an original video;determining, by the processor, a correlation score for the frame of the query video and the frame of the original video based on a comparison of the second query index and an additional candidate database index corresponding to the frame of the original video;based at least on the correlation score, determining, by the processor, a video sequence likelihood indicative of a confidence of match between the query video and the original video; andbased on the video sequence likelihood, providing, by the processor, a results list that includes a name of the original video.2. The computer-implemented method of claim 1 , wherein the second query index corresponds to a texture signature of a ...

Подробнее
02-01-2020 дата публикации

Method and Apparatus for Multi-Dimensional Content Search and Video Identification

Номер: US20200004782A1
Принадлежит: Gracenote Inc

A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures.

Подробнее
07-01-2021 дата публикации

Method and System for Retrieving Video Temporal Segments

Номер: US20210004605A1
Автор: Ho Chiuman, Hsiao Jenhao
Принадлежит:

A method and a system for retrieving video temporal segments are provided. In the method, a video is analyzed to obtain frame feature information of the video; the frame feature information is input into an encoder to output first data relating to temporal information of the video; the first data and a retrieval description for retrieving video temporal segments of the video are input into a decoder to output second data; attention computation training is conducted according to the first data and the second data; video temporal segments of the video corresponding to the retrieval description are determined according to the attention computation training. 1. A method for retrieving video temporal segments , comprising:analyzing a video to obtain frame feature information of the video;inputting the frame feature information into an encoder to output first data relating to temporal information of the video;inputting the first data and a retrieval description for retrieving video temporal segments of the video into a decoder to output second data;conducting attention computation training according to the first data and the second data; anddetermining video temporal segments of the video corresponding to the retrieval description according to the attention computation training.2. The method of claim 1 , wherein conducting the attention computation training according to the first data and the second data comprises:inputting the first data and the second data into an attention layer coupled with the encoder and the decoder;obtaining, at the attention layer, temporal attention weight data for each video temporal segment of the video based on correlation of each video temporal segment with each description term contained in the retrieval description;obtaining, at the attention layer, weighted average data, based on the first information and the temporal attention weight data, and outputting the weighted average data to a fully connected (FC) layer coupled with the attention ...

Подробнее
13-01-2022 дата публикации

METHOD FOR VIDEO INTERACTION AND ELECTRONIC DEVICE

Номер: US20220013026A1
Автор: DI Yifeng
Принадлежит:

A method for video interaction includes: displaying an information input interface in response to an information input instruction from a first user account for a target video in a playing state; acquiring, via the information input interface, target information input by the first user account, and generating corresponding interaction information based on the target information and a target video frame picture corresponding to the target information; and sending the interaction information to a second user account. 1. A method for video interaction , comprising:displaying an information input interface in response to an information input instruction from a first user account for a target video in a playing state;acquiring, via the information input interface, target information input by the first user account, and generating corresponding interaction information based on the target information and a target video frame picture corresponding to the target information; andsending the interaction information to a second user account.2. The method according to claim 1 , further comprising:acquiring, from the target video, a video frame picture corresponding to a time point at which the information input instruction is received, and determining the video frame picture as the target video frame picture.3. The method according to claim 1 , further comprising:acquiring target playing progress information of the target video, and reading, from the target video, the target video frame picture corresponding to the target playing progress information.4. The method according to claim 1 , wherein said acquiring claim 1 , via the information input interface claim 1 , the target information input by the first user account comprises:receiving input content via the information input interface;acquiring and presenting interaction association information matching the input content; anddetermining, in response to a received selection instruction, the corresponding target information from ...

Подробнее
14-01-2021 дата публикации

VISUAL SEARCH METHOD, COMPUTER DEVICE, AND STORAGE MEDIUM

Номер: US20210012511A1
Принадлежит:

A visual search method, a computer device, and a non-transitory computer readable storage medium are provided. An iimage frame is received. The location and the classification of the subject in the iimage frame are extracted. A detection block corresponding to the subject is generated. In subsequent image frames of the iimage frame, the subject is tracked on the basis of the location of the subject in the iimage frame. The detection block is adjusted on the basis of the tracking result. 1. A method for video searching , comprising:{'sup': 'th', 'receiving an iimage frame, i being a positive integer;'}{'sup': 'th', 'extracting a location and a classification of a subject in the iimage frame, and generating a detection block corresponding to the subject; and'}{'sup': th', 'th, 'in subsequent image frames of the iimage frame, tracking the subject according to the location of the subject in the iimage frame, and adjusting the detection block according to a tracking result.'}2. The method according to claim 1 , further comprising:{'sup': 'th', 'receiving an (i+M)image frame, M being a positive integer;'}{'sup': th', 'th, 'determining whether a subject in the (i+M)image frame changes relative to the subject in the iimage frame; and'}{'sup': th', 'th, 'in response to changing, regenerating a detection block according to the subject detected in the (i+M)image frame, and re-tracking the subject in the (i+M)image frame.'}3. The method according to claim 1 , wherein in the subsequent image frames of the iimage frame claim 1 , tracking the subject according to the location of the subject in the iimage frame comprises:{'sup': th', 't, 'obtaining an (i+n)image frame after the iimage frame, n being a positive integer; and'}{'sup': 'th', 'tracking the subject according to the location of the subject in the (i+n)image frame.'}4. The method according to claim 3 , further comprising:{'sup': th', 'th, 'obtaining image frames between the (i+1)image frame and an (i+n−1)image frame as ...

Подробнее
14-01-2021 дата публикации

VIDEO PROCESSING SYSTEM

Номер: US20210014451A1
Автор: HIRAKAWA Yasufumi
Принадлежит: NEC Corporation

A video processing system includes: an object movement information acquiring means for detecting a moving object moving in a plurality of segment regions from video data obtained by shooting a monitoring target area, and acquiring movement segment region information as object movement information, the movement segment region information representing segment regions where the detected moving object has moved; an object movement information and video data storing means for storing the object movement information in association with the video data corresponding to the object movement information; a retrieval condition inputting means for inputting a sequence of the segment regions as a retrieval condition; and a video data retrieving means for retrieving the object movement information in accordance with the retrieval condition and outputting video data stored in association with the retrieved object movement information, the object movement information being stored by the object movement information and video data storing means. 1. A video processing system comprising:at least one memory configured to store instructions; and detecting a trajectory of a target object in a monitoring area from video data;', 'determining that the trajectory is a predetermined state; and', 'storing the trajectory in association with the predetermined state., 'at least one processor configured to execute the instructions to perform2. The video processing system according to claim 1 , wherein the at least one processor is configured to perform:determining that the trajectory is the predetermined state based on a trajectory shape.3. The video processing system according to claim 1 , wherein the at least one processor is configured to perform:determining that the trajectory is the predetermined state based on a staying time of the target object.4. The video processing system according to claim 1 , wherein the at least one processor is configured to perform:determining that the trajectory is ...

Подробнее
09-01-2020 дата публикации

Systems and methods for recording media assets

Номер: US20200014973A1
Автор: Paul Stathacopoulos
Принадлежит: Rovi Guides Inc

Systems and methods are provided to record portions of media assets. User request is received to record a media asset together with a criterion for recording portions of that media asset. A content recognition algorithm is executed against segments of the media asset to determine a set of keywords associated with those segments. Separately a set of keywords associated with the criterion is generated. Sets of keywords are compared and segments that match the criterion are discovered. If it is determined that a first segment and third segment each match the criterion and a second segment does not, a delete indicator is added to the second segment and the third and first segments are compared. If those segments match the delete indicator is removed from the second segment.

Подробнее
03-02-2022 дата публикации

TWO-WAY INTERCEPT USING COORDINATE TRACKING AND VIDEO CLASSIFICATION

Номер: US20220038661A1
Принадлежит:

A system comprising a coordinate tracking engine and a video classification engine communicably coupled to a notification engine. The coordinate tracking engine detects that geographical coordinates of a mobile device indicate that an account holder is within a threshold distance of a physical branch of an institution. The notification engine retrieves account information for the account holder. The coordinate tracking engine further detects that the account holder has arrived at the physical branch. The video classification engine captures video frames of an entrance to the physical branch and identifies the account holder. The notification engine further presents account information for the account holder on a display. 1. A two-way intercept system comprising:a coordinate tracking engine and a video classification engine communicably coupled to a notification engine;the coordinate tracking engine comprising a memory coupled to one or more processors, the memory comprising instructions that, when executed by the one or more processors, are operable to detect that geographical coordinates of a mobile device of an institutional account holder indicate that the account holder is within a threshold distance of a physical branch of the institution;the notification engine comprising a display and a memory coupled to one or more processors, the memory comprising instructions that, when executed by the one or more processors, are operable to in response to the coordinate tracking engine detecting that the geographical coordinates of the mobile device is within the threshold distance, retrieve account information for the account holder; 'detect that geographical coordinates of the mobile device of the account holder indicate that the account holder has arrived at the physical branch of the institution;', 'the coordinate tracking engine further operable to capture video frames of an entrance to the physical branch of the institution where the account holder has arrived;', ' ...

Подробнее
03-02-2022 дата публикации

SYSTEMS AND METHODS FOR CONTROLLING DISPLAY OF SUPPLEMENTARY DATA FOR VIDEO CONTENT

Номер: US20220038792A1
Принадлежит: The Toronto-Dominion Bank

A processor-implemented method is disclosed. The method includes: obtaining metadata associated with a video; identifying one or more tradeable objects associated with video content of the video based on performing textual comparison between text of the metadata and a defined list of tradeable objects; determining one or more segments of the video corresponding to the one or more identified tradeable objects, the one or more video segments having respective playback start timestamps; receiving, via a client device during playback of the video, a user selection of a first one of the video segments; in response to receiving the user selection: generating supplementary display data associated with a first tradeable object corresponding to the first video segment; and sending, to the client device, the supplementary display data. 1. A computer system , comprising:a processor;a communications module coupled to the processor; and obtain metadata associated with a video;', 'identify one or more tradeable objects associated with video content of the video based on performing textual comparison between text of the metadata and a defined list of tradeable objects;', 'determine one or more segments of the video corresponding to the one or more identified tradeable objects, the one or more video segments having respective playback start timestamps;', 'receive, via a client device during playback of the video, a user selection of a first one of the video segments;', generate supplementary display data associated with a first tradeable object corresponding to the first video segment; and', 'send, to the client device, the supplementary display data., 'in response to receiving the user selection], 'a memory coupled to the processor, the memory storing instructions that, when executed, configure the processor to2. The computer system of claim 1 , wherein the supplementary display data comprises graphical user interface data associated with the first tradeable object for displaying ...

Подробнее
16-01-2020 дата публикации

Content Analysis to Enhance Voice Search

Номер: US20200019564A1
Принадлежит:

Methods and apparatus for improving speech recognition accuracy in media content searches are described. An advertisement for a media content item is analyzed to identify keywords that may describe the media content item. The identified keywords are associated with the media content item for use during a voice search to locate the media content item. A user may speak the one or more of the keywords as a search input and be provided with the media content item as a result of the search. 1. A method comprising:determining, by a computing device, one or more keywords corresponding to an advertisement that is associated with a media content item;associating the one or more keywords with the media content item such that a plurality of keywords, associated with the media content item, comprises the one or more keywords;receiving a search request indicative of a search keyword;determining, based on a search indicating an association between the search keyword and at least one of the plurality of keywords associated with the media content item, the media content item; andcausing, based on the search, output of an indication associated with the media content item.2. The method of claim 1 , wherein the receiving the search request further comprises receiving a speech input indicative of an audio signal corresponding to the keyword of the plurality of keywords.3. The method of claim 1 , further comprising:determining one or more second keywords based on user voice input associated with the media content item; andassociating the one or more second keywords with the media content item such that the plurality of keywords associated with the media content item comprises the one or more second keywords.4. The method of claim 1 , wherein the associating further comprises forming at least one n-gram from at least two keywords of the plurality of keywords.5. The method of claim 1 , wherein the search request is associated with at least one of: an audio signal or a user speech input.6. ...

Подробнее
28-01-2021 дата публикации

RETRIEVAL DEVICE, TRAINING DEVICE, RETRIEVAL SYSTEM, AND RECORDING MEDIUM

Номер: US20210026887A1
Принадлежит: TOYOTA JIDOSHA KABUSHIKI KAISHA

The retrieval device extracts a feature corresponding to search text by inputting the search text into a pre-trained text feature extraction model. The retrieval device then, for plural combinations stored in a database associating a text description including plural sentences, with a vehicle-view video, and with vehicle behavior data representing temporal vehicle behavior, computes a text distance represented by a difference between a feature extracted from each sentence of the text description associated with the video and vehicle behavior data, and the feature corresponding to the search text. The retrieval device outputs as the search result a prescribed number of pairs of video and vehicle behavior data pairs in sequence from the smallest text distance according to the text distances. 1. A retrieval device , comprising:a memory, anda processor coupled to the memory, the processor being configured to:acquire a search text,extract a feature corresponding to the search text by inputting the search text to a text feature extraction model configured to extract features from input sentences, the text feature extraction model being pre-trained so as to reduce a loss represented by a difference between a feature extracted from a sentence and a feature extracted from a correctly matched vehicle-view video, and also being pre-trained so as to reduce a loss represented by a difference between a feature extracted from the sentence and a feature extracted from correctly matched vehicle behavior data representing temporal vehicle behavior,compute a text distance for each of a plurality of combinations stored in the memory, each combination associating a text description, including a plurality of sentences, with a vehicle-view video and with vehicle behavior data representing temporal vehicle behavior, the text distance being represented by a difference between a feature extracted from each sentence of the text description associated with the video and the vehicle behavior ...

Подробнее
23-01-2020 дата публикации

SYSTEMS AND METHODS FOR ALERT SERVICES

Номер: US20200028701A1
Автор: LIU Yishi, WAN Ching Leong
Принадлежит:

Embodiments relate to systems, processes and devices for an information delivery platform or data hub with an alert processor that can be configured to receive a request to generate an alert configuration at the data hub, the request indicating a target unit; generate and store an alert rule corresponding to the alert configuration, the alert rule having a trigger and an action; detect an event at the data hub based on a set of data of the data stored at the data hub, the event having event data; convert the event data to an alert trigger at the data hub based on the trigger of the alert rule; generate an alert notification for the alert trigger based on the action of the alert rule; and transmit the alert notification to the target unit. 1. A system for generating alert notifications , comprising at least a processor and a non-transient data memory storage , the data memory storage containing machine-readable instructions for execution by the processor , the machine-readable instructions configured to , when executed by the processor , provide an alert service configured to:load and store data from a plurality of source systems at a data hub implemented by a non-transient data store;receive a request for an alert subscription at the data hub, the request indicating a subscriber and an alert configurations;generate and store an alert rule corresponding to the alert configuration, the alert rule having a trigger and an action;detect an event at the data hub based on correlated event data of the data stored at the data hub, the correlated event data corresponding to the trigger of the alert rule, the correlated event data based on in-memory event correlation and computing for real-time detection;generate an alert notification for the alert trigger based on the action of the alert rule; andtransmit the alert notification to the subscriber based on the subscription.2. The system of claim 1 , wherein the subscriber is linked to an alert format and the alert service is ...

Подробнее
02-02-2017 дата публикации

Systems and methods for addressing a media database using distance associative hashing

Номер: US20170032033A1
Автор: Brian Reed, Zeev Neumeier
Принадлежит: Vizio Inscape Technologies LLC

A system, method and computer program utilize a distance associative hashing algorithmic means to provide a highly efficient means to rapidly address a large database. The indexing means can be readily subdivided into a plurality of independently-addressable segments where each such segment can address a portion of related data of the database where the subdivided indexes of said portions reside entirely in the main memory of each of a multiplicity of server means. The resulting cluster of server means, each hosting an addressable sector of a larger database of searchable audio or video information, provides a significant improvement in the latency and scalability of an Automatic Content Recognition system, among other uses.

Подробнее
04-02-2021 дата публикации

ON-DEMAND INDEXING

Номер: US20210034655A1
Принадлежит:

A method for indexing objects in a computerized system having an index, comprising identifying in the computerized system an at least one indexed object that meets an at least one criterion related to contents of the at least one indexed object, detecting an at least one non-indexed object having a property similar to an at least one property of the at least one indexed object that was identified, and indexing the at least one non-indexed object in the index, wherein the method is performed by the computerized system, and an apparatus for performing the same. 1. A method for indexing objects in a computerized system having an index , comprising:based on the index, identifying in the computerized system an at least one indexed object that meets an at least one criterion related to contents of the at least one indexed object;based on the identified indexed object looking for and detecting an at least one non-indexed object having a property similar to an at least one property of the at least one indexed object that was identified; andindexing the at least one non-indexed object in the index,wherein the method is performed by the computerized system.2. The method according to claim 1 , wherein the indexing is performed responsive to detecting the at least one non-indexed object.3. The method according to claim 1 , wherein the indexing is scheduled to be performed subsequently to detecting the at least one non-indexed object.4. The method according to claim 1 , wherein the at least one indexed object comprises a plurality of indexed object.5. The method according to claim 1 , wherein the at least one criterion comprises a plurality of criteria.6. The method according to claim 1 , wherein the at least one property comprises a plurality of properties.7. An apparatus for indexing objects in a computerized system claim 1 , comprising:an at least one computer;an at least one storage device constructed with an index for objects of the computerized system,wherein the at least ...

Подробнее
04-02-2021 дата публикации

ENTERING OF HUMAN FACE INFORMATION INTO DATABASE

Номер: US20210034898A1
Принадлежит: NEXTVPU (SHANGHAI) CO., LTD.

A processor chip circuit is provided, which is used for entering human face information into a database and includes a circuit unit configured to perform the steps of: videoing one or more videoed persons and extracting human face information of the one or more videoed persons from one or more video frames during the videoing; recording a voice of at least one of the one or more videoed persons during the videoing; performing semantic analysis on the recorded voice so as to extract respective information therefrom; and associating the extracted information with the human face information of the videoed person who has spoken the extracted information, and entering the associated information into the database. 1. A processor chip circuit for entering human face information into a database , comprising:a circuit unit coupled with an auxiliary wearable device configured for being worn by a visually impaired person, the circuit unit being configured to perform, from the auxiliary wearable device, the steps of:videoing one or more videoed persons and extracting human face information of the one or more videoed persons from one or more video frames during the videoing;recording a voice of at least one of the one or more videoed persons during the videoing, wherein the voice of the at least one videoed person comprises identity information of a speaker that is spoken by the speaker;performing semantic analysis on the recorded voice so as to extract respective information therefrom, wherein the extracted respective information comprises the identity information of the speaker; andassociating the extracted information with the human face information of the videoed person who has spoken the extracted information, and entering the associated information into the database,wherein the circuit unit is further configured to perform, from the auxiliary wearable device, the step of:accessing, during a conversation participated by the visually impaired person and at least one of the ...

Подробнее
24-02-2022 дата публикации

Methods and systems for providing searchable media content and for searching within media content

Номер: US20220058216A1
Автор: Kevin Yao
Принадлежит: DISH Network LLC

A method for providing searchable media content includes generating a text file that is representative of an instance of media content. The instance of media content comprises a first scene and a second scene. A first portion of the text file is representative of the first scene and a second portion of the text file is representative of the second scene. The method further includes indexing the first portion with the first scene and indexing the second portion with the second scene.

Подробнее
24-02-2022 дата публикации

DISPLAY DEVICE AND METHOD FOR CONTROLLING SAME

Номер: US20220060792A1
Принадлежит: LG ELECTRONICS INC.

The invention relates to a display device and method for controlling the same, the method mainly comprising: capturing a screen in which content is reproduced; extracting a first keyword from an image of the captured screen, generating reliability corresponding to the first keyword, when an input of selecting the first keyword is received from an external remote controller, transmitting the first keyword, feedback information and the reliability to an external server, receiving a second keyword, corrected reliability, and feedback information from the external server, and when an input of selecting the second keyword is received, displaying a screen corresponding to the second keyword. 120-. (canceled)21. A display device comprising:a tuner configured to receive a broadcast signal;a communication module configured to perform communication with at least one of an external server or an external remote controller;a display configured to display a content included in the received broadcast signal, wherein the content is received from the external server or stored in an internal memory; anda controller configured to control at least one of the tuner, the communication module, or the display,wherein the controller is further configured to:capture a screen image in which the content is displayed;extract a first keyword from the captured screen image and electronic program guide (EPG) information included in the broadcast signal;generate a confidence value corresponding to the first keyword;cause the display to display a first menu including the extracted first keyword;transmit at least one of the first keyword, first feedback information corresponding to a selection of the first keyword, or the confidence value of the first keyword to the external server based on receiving an input selecting the first keyword from the external remote controller;receive a second keyword, a corrected confidence value, and second feedback information from the external server;cause the display ...

Подробнее
22-02-2018 дата публикации

Monitoring Individual Viewing of Television Events Using Tracking Pixels and Cookies

Номер: US20180054658A1
Принадлежит: Inscape Data Inc

A real-time content identification and tracking system enabling monitoring of television programming consumption specific to an individual television or other viewing device. Metrics collected may include data regarding viewing of specific broadcast media, commercial messages, interactive on-screen information or other programming, as well as locally cached, time-shifted programming. Information about media consumption by such specific television sets or other viewing means may be returned to a commercial client of the system through a trusted third-party intermediary service and, in certain embodiments, encoded tokens may be used to manage the display of certain events as well as to enable robust auditing of each involved party's contractual performance.

Подробнее
13-02-2020 дата публикации

Systems and Methods for Automated Extraction of Closed Captions in Real Time or Near Real-Time and Tagging of Streaming Data for Advertisements

Номер: US20200053409A1
Автор: Abed Samir
Принадлежит: Crossbar Media Group, Inc

System and methods for finding and analyzing targeted content from audio and video content sources, including means and methods for extracting captions from audio and video content sources; searching the captions for a mention of at least one target; extracting audio and video segments relating to the at least one target; delivering extracted audio and video segments to a user device; harvesting social media data relevant to the at least one target; analyzing the search results in correlation with the social media data for target content. 1. A system for targeted content analysis , comprising:a server platform constructed and configured for network communication with at least one device;wherein the at least one device is operable to receive a live broadcast and/or stream audio or video content;wherein the at least one device and/or the server platform is operable to extract captions of the live broadcast and/or the audio or video content in real time or near real time; andwherein the server platform is operable to search the extracted captions for at least one keyword relating to targeted content, thereby creating search result data, and calculate an impact of the targeted content by correlating the search result data with data obtained from the Internet.2. The system of claim 1 , wherein the data obtained from the Internet includes social media data.3. The system of claim 1 , wherein the data obtained from the Internet includes web site traffic.4. The system of claim 1 , wherein the extracted captions are operable to be recorded on a blockchain.5. The system of claim 1 , wherein the server platform includes a peer-to-peer platform.6. The system of claim 1 , wherein the server platform and/or the at least one device is operable to extract segments of the live broadcast and/or the audio or video content based on the at least one keyword relating to targeted content.7. The system of claim 1 , further comprising a summarizer operable to provide a summary of the live ...

Подробнее
10-03-2022 дата публикации

VOICE SEARCHING METADATA THROUGH MEDIA CONTENT

Номер: US20220075829A1
Принадлежит:

Techniques for searching metadata through media content. User input identifying a search criteria is received from a user device. Metadata associated with media content files is searched to identify a subset of the media content files. Search results identifying the subset of the media content files are provided to the user device. The metadata is generated by an originator of each media content file and describes each scene. 1. A computer-implemented method , comprising:receiving, from a user device, user input identifying a search criteria;searching, by operation of one or more computer processors, metadata associated with a plurality of media content files to identify a subset of the plurality of media content files, the subset of the plurality of media content files comprising one or more media content files of the plurality of media content files that includes one or more scenes that match the search criteria; andproviding, to the user device, search results identifying the subset of the plurality of media content files,wherein the metadata is generated by a respective originator of each media content file of the plurality of media content files and describes each scene of the plurality of media content files.2. The computer-implemented method of claim 1 , wherein the metadata comprises visual metadata describing an actor claim 1 , an actress claim 1 , a character claim 1 , an object claim 1 , a location claim 1 , an emotion claim 1 , an action claim 1 , a theme claim 1 , or a plot point associated with each scene of the plurality of media content files.3. The computer-implemented method of claim 1 , wherein the metadata comprises audio metadata describing a dialog or a song associated with each scene of the plurality of media content files.4. The computer-implemented method of claim 1 , wherein the metadata comprises subtitle metadata describing a subtitle associated with each scene of the plurality of media content files.5. The computer-implemented method of ...

Подробнее
04-03-2021 дата публикации

Identifying Video Content via Fingerprint Matching

Номер: US20210064654A1
Автор: Harron Wilson
Принадлежит:

Methods and systems to identify video content based on video fingerprint matching are described. In some example embodiments, the methods and systems generate a query fingerprint of a frame of video content captured at a client device, query a database of reference fingerprints, determine the query fingerprint of the frame of captured video content matches a reference fingerprint, and identify the video content based on the match of fingerprints. 1. A method comprising:selecting, by one or more processors of a client device, a plurality of patches of at least one video frame of video content received by the client device;calculating, by one or more processors of the client device, for each respective patch of the plurality of patches, a respective value by subtracting a middle region of the respective patch from outer regions of the respective patch, to obtain a plurality of values;generating, by one or more processors of the client device, a query fingerprint based on the plurality of values;comparing, by one or more processors of the client device, the query fingerprint to a reference fingerprint;determining, by one or more processors of the client device, that the query fingerprint matches the reference fingerprint; andidentifying, by one or more processors of the client device, the video content based on the determining that the query fingerprint matches the reference fingerprint.2. The method of claim 1 , wherein the plurality of patches are selected from a single video frame.3. The method of claim 1 , wherein selecting the plurality of patches comprises:dividing the at least one video frame into a plurality of regions; andfor each respective region of the plurality of regions, selecting a left-right patch corresponding to left-right features of the respective region and a top-down patch corresponding to top-down features of the respective region.4. The method of claim 3 , wherein the plurality of regions comprises regions arranged in a grid pattern.5. The ...

Подробнее
17-03-2022 дата публикации

METHOD FOR ALIGNING TEXT WITH MEDIA MATERIAL, APPARATUS AND STORAGE MEDIUM

Номер: US20220083741A1
Автор: Chen Xi, Hu Yichen, Tian Hao
Принадлежит:

A method for aligning a text with a media material, an apparatus, and a storage medium are provided. The method includes: determining a set of anchor points in the text; performing following operations i) to v) repeatedly until all anchor points are removed from the set of anchor points or all media materials are removed from a set of media materials: i) ranking the anchor points in the set of anchor points, ii) selecting a target anchor point from the set of anchor points based on the ranked anchor points in the set, iii) determining, from the set of media materials, a media material matching a text segment starting from the target anchor point, iv) removing the target anchor point, and v) removing the media material matching the text segment starting from the target anchor point; and aligning the text segments with respective media matching materials. 1. A method for aligning a text with a media material , the method comprising:determining a set of anchor points in the text based on a grammatical structure of the text, each of the anchor points being a starting position of a text segment of the text;{'claim-text': ['i) ranking the anchor points in the set of anchor points based on text segments starting from the anchor points,', 'ii) selecting a target anchor point from the set of anchor points based on the ranked anchor points in the set,', 'iii) determining, from the set of media materials, a media material matching a text segment starting from the target anchor point,', 'iv) removing the target anchor point from the set of anchor points, and', 'v) removing, from the set of media materials, the media material matching the text segment starting from the target anchor point; and'], '#text': 'performing following operations i) to v) repeatedly until all anchor points are removed from the set of anchor points or all media materials are removed from a set of media materials:'}aligning the text segments with respective media materials.2. The method according to claim ...

Подробнее
27-02-2020 дата публикации

SYSTEMS AND METHODS FOR VIDEO ARCHIVE AND DATA EXTRACTION

Номер: US20200065329A1
Принадлежит:

Systems and methods for full motion video search are provided. In one aspect, a method includes receiving one or more search terms. The search terms include one or more of a characterization of the amount of man-made features in a video image and a characterization of the amount of natural features in the video image. The method further includes searching a full motion video database based on the one or more search terms. 1. A method of adding a video entity to a full motion video database supporting search capabilities , the method comprising:determining a starting cell in the full motion video database; and splitting the starting cell to create two new child cells, and', 'adding the video entity to one of the two new child cells., 'determining if the starting cell should be split, and if the starting cell should be split2. The method of claim 1 , wherein splitting the starting cell comprises splitting the starting cell along a split axis.3. The method of claim 2 , wherein the split axis is along a line of latitude.4. The method of claim 2 , wherein the split axis is along a line of longitude.5. The method of claim 2 , wherein the two new child cells are created along the split axis.6. The method of claim 1 , further comprising moving one or more video entities stored in the starting cell from the starting cell to the new child cells.7. The method of claim 6 , further comprising adding pointers to the starting cell pointing to the child cells.8. The method of claim 1 , wherein determining if the starting cell should be split comprises determining if the starting cell is full.9. The method of claim 8 , further comprising adding the video entity to the starting cell if the starting cell not full.10. The method of claim 1 , further comprising:determining a child cell from a plurality of child cells of the starting cell if the starting cell contains a plurality of child cells; andadding the video entity to one of the plurality of child cells.11. The method of claim 10 ...

Подробнее
29-05-2014 дата публикации

Displaying a text-based description of digital content in a sub-frame

Номер: US20140149597A1
Принадлежит: Adobe Systems Inc

In some example embodiments, a system and method is shown that includes receiving a text request that includes an identifier value that identifies a text-based description associated with a portion of digital content that is part of a larger portion of digital content. Further, the method includes responsive to the text request, retrieving the text-based description associated with the portion of digital content from a data store, the retrieving using the identifier value to identify the text-based description. Additionally, the method includes communicating the text-based description to a user.

Подробнее
17-03-2016 дата публикации

SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR SEARCHING WITHIN MOVIES (SWiM)

Номер: US20160078043A1
Автор: Simon D. Byers
Принадлежит: AT&T Intellectual Property II LP

Systems, methods and computer-readable media process a series of media files into a searchable format. The method includes generating a media database by processing each of a plurality of programs. The steps of the method include extracting a subtitle track from each of the programs, retrieving at least one frame associated with the subtitle track, adding metadata to the extracted subtitle track and at least one frame, processing the subtitle track, program statistics and at least one frame in a media database. Another aspect includes receiving a user query associated with dialog in a program, searching the media database and presenting a listing of results, receiving a user selection or program from their listing and transmitting at least one frame, a portion of associated subtitle track and prompts for ordering the program.

Подробнее
15-03-2018 дата публикации

Method and system for generating a summary of the digital content

Номер: US20180075139A1
Принадлежит: YANDEX EUROPE AG

There is provided a method and a system for generating a summary of digital content. The method comprises: executing a syntax analysis of a textual representation of the digital content; segmenting the digital content into an ordered set of fragments (i.e. a first fragment and a second fragment); executing a semantic analysis of each fragment of the textual representation; determining a utility parameter for each fragment of the set of fragments; determining a linkage between each pair of fragments of the set of fragments; in response to the utility parameter of the second fragment exceeding a pre-determined threshold value, including the second fragment in a subset of fragments for inclusion in the summary of the digital content; in response to the linkage having been determined between the second fragment and the first fragment, including the first fragment in the subset of fragments; and generating the summary of the digital content.

Подробнее
16-03-2017 дата публикации

Method and apparatus for clustering product media files

Номер: US20170075977A1
Принадлежит: Adobe Systems Inc

A method for clustering product media files is provided. The method includes dividing each media file corresponding to one or more products into a plurality of tiles. The media file include one of an image or a video. Feature vectors are computed for each tile of each media file. One or more patch clusters are generated using the plurality of tiles. Each patch cluster includes tiles having feature vectors similar to each other. The feature vectors of each media file are compared with feature vectors of each patch cluster. Based on comparison, product groups are then generated. All media files having comparison output similar to each other are grouped into one product group. Each product group includes one or more media files for one product. Apparatus for substantially performing the method as described herein is also provided.

Подробнее
05-03-2020 дата публикации

VIDEO COOKIES

Номер: US20200073877A1
Автор: Winter Christian
Принадлежит:

A computer-implemented method of re-identifying a physical object before an image background, the method comprising: providing image data comprising an image object representing a physical object before at least one image background of a set of image backgrounds pre-stored in a database; extracting identification data for the image object as well as image background data from the image data; determining if the identification data matches identification data stored in the database, and if no match is found, creating a temporary data object linking the extracted identification data and the extracted image background data and storing the temporary data object and the extracted identification data in the database, else determining if a temporary data object linking the extracted image background data and the matched identification data has already been stored in the database, and if no temporary data object is found, creating a temporary data object linking the extracted identification data and the extracted image background data and storing the temporary data object in the database, else determining if the temporary data object fulfills at least one predetermined condition, and if no predetermined condition is fulfilled, executing a default action with respect to the temporary data object, else executing a specific action to call attention of an external user. 1. A computer-implemented method of re-identifying a physical object before an image background , the method executed by one or more processing devices and comprising:providing image data comprising an image object representing a physical object before at least one image background of a set of image backgrounds pre-stored in a database;extracting identification data for the image object as well as image background data from the image data; anddetermining if the identification data matches identification data stored in the database, and if no match is found, creating a temporary data object linking the extracted ...

Подробнее
05-03-2020 дата публикации

IMAGE DISPLAY APPARATUS AND OPERATION METHOD OF THE SAME

Номер: US20200073885A1
Принадлежит: SAMSUNG ELECTRONICS CO., LTD.

Method and apparatus for obtaining audio corresponding to a plurality of images, based on semantic information and the emotion information of the plurality of images. 1. An image display apparatus comprising:a display configured to display a plurality of images comprising a first image and a second image;a memory storing one or more instructions; anda processor configured to execute the one or more instructions stored in the memory to obtain semantic information comprising first semantic information corresponding to the first image and second semantic information corresponding to the second image by using a first neural network, obtain emotion information comprising first emotion information corresponding to the first image and second emotion information corresponding to the second image by using a second neural network, determine at least one piece of audio corresponding to the first image and the second image, based on the first semantic information, the second semantic information, the first emotion information, and the second emotion information, and output the at least one piece of audio.2. The image display apparatus of claim 1 , wherein the processor is further configured to determine the at least one piece of audio corresponding to the first semantic information claim 1 , the second semantic information claim 1 , the first emotion information claim 1 , and the second emotion information by using a third neural network.3. The image display apparatus of claim 1 , wherein the processor is further configured to obtain audio information corresponding to the first image and the second image claim 1 , based on the first semantic information and the second semantic information claim 1 , and determine the at least one piece of audio claim 1 , based on the audio information.4. The image display apparatus of claim 1 , wherein the processor is further configured to:determine first audio, based on the first semantic information and the first emotion information, and ...

Подробнее
05-03-2020 дата публикации

EMOTION DETECTION ENABLED VIDEO REDACTION

Номер: US20200074156A1
Принадлежит:

In some examples, a computer system may receive video from one or more video sources. The computer system may detect a plurality of faces in a first video portion of the received video. Further, the computer system may determine that a first face of the plurality of faces has features indicative of an emotion of interest. Based on determining that the first face has the features indicative of the emotion of interest, the computer system may redact other faces of the plurality of faces while leaving the first face unredacted in the first video portion. The computer system may send the first video portion with the first face unredacted and the other faces redacted to at least one computing device. 1. A system comprising:one or more processors; and receiving a video from a video source;', 'detecting a plurality of faces in a first video portion of the received video;', 'determining that a first face of the plurality of faces has features indicative of an emotion of interest;', 'based on determining the first face has the features indicative of the emotion of interest, redacting other faces of the plurality of faces while leaving the first face unredacted in the first video portion; and', 'sending the first video portion with the first face unredacted and the other faces redacted to at least one computing device., 'one or more non-transitory computer-readable media maintaining executable instructions, which, when executed by the one or more processors, configure the one or more processors to perform operations comprising2. The system as recited in claim 1 , the operations further comprising:matching the first face with an image of a previously stored face, wherein the previously stored face is associated with a second video portion received from a different video source than the first video portion; andstoring an indication of the emotion of interest and identifying information for the first video portion in association with the previously stored face.3. The system as ...

Подробнее
05-03-2020 дата публикации

Imported Video Analysis Device and Method

Номер: US20200074183A1
Автор: Altuev Murat K.
Принадлежит:

The invention relates to the area of computer vision video data analysis, in particular to the technologies aimed to search the required objects or events in the analyzed video originally received from a third-party device. An imported video analysis device consists of memory, database for metadata storage, a graphical user interface, and a data processing module. The data processing module is configured to upload a video in any available format into the memory and to import the uploaded video into software of the imported video analysis device. Software decompresses and analyzes the imported video to generate metadata characterizing the data in all objects in the video and to save the metadata in database. The search speed for the required event or object in the imported video received from a third-party device is increased. 1. A video processing device comprising:a memory module configured to store data;a database module configured to store metadata;a graphical user interface module; and uploading a video into the memory module; and', 'providing the video to processing software,', processing the video to generate metadata for all objects in the video;', 'recording the metadata into the database module;', 'receiving a request from a user through the graphical user interface or through an application programming interface (API) with at least one search parameter or criterion to search for at least one event or at least one object in the video; and', 'searching for the at least one event or the at least one object in the video using the metadata and the at least one search parameter or criterion., 'wherein the processing software is configured for], 'a processing module configured for2. The device of claim 1 , wherein the video is captured with a video camera or a network video recorder (NVR).3. The device of claim 1 , wherein the processing software is further configured for:assigning a time value to the video based on a time tag associated with the video.4. The ...

Подробнее
05-03-2020 дата публикации

Generating training data for natural language processing

Номер: US20200074229A1
Автор: Waseem Alshikh
Принадлежит: Qordoba Inc

A training data system enables the generation of training data based on video content received from one or more outside video sources. For example, the generated training data can include a transcript of a word or phrase alongside emotion, language style, and brand perception data associated with that word or phrase. To generate the training data from a video, the subtitles, video frame, metadata, and audio levels of the video can be analyzed by the training data system. The generated training data (potentially from a plurality of videos) can then be grouped into a set of training data and used to train machine learning modules for Natural Language Processing (NLP) techniques.

Подробнее
18-03-2021 дата публикации

COMPUTERIZED SYSTEM AND METHOD FOR ADAPTIVE STRANGER DETECTION

Номер: US20210081657A1
Принадлежит:

Disclosed are systems and methods for improving interactions with and between computers in computerized security and content monitoring, hosting and providing devices, systems and/or platforms. The disclosed systems and methods provide a novel framework that adaptively distinguishes between known people versus unknown people based on a dynamically applied, anonymous facial recognition methodology. The disclosed framework provides such functionality by recognizing faces within captured images without storing any information or annotations regarding or revealing the captured person's identity. The framework is configured to adaptively learn to distinguish between faces seen for the first time and faces it has previously seen by locally processing a captured image and only sending face embeddings to a network location for future comparisons of subsequently, anonymously captured images. 1. A method comprising the steps of:identifying, via a computing device, an image comprising content depicting a person at a location;analyzing, via the computing device, said image, and based on said analysis, determining information associated with a face of the person depicted by said content, said face information comprising data indicating characteristics of traits of said face;comparing, via the computing device, the face information to each face embedding stored in a gallery hosted by storage, each stored face embedding comprising face information for previous person depictions captured at said location, the stored face embeddings being ordered in said gallery according to how recent a respective person depiction was observed at said location, said comparison comprising determining a similarity value for each stored face embedding indicating how similar each stored face embedding is to said face information;identifying, via the computing device, a stored face embedding having a highest similarity value;comparing, via the computing device, said highest similarity value to a ...

Подробнее
31-03-2022 дата публикации

Method and system for manufacturing operations workflow monitoring using structural similarity index based activity detection

Номер: US20220101010A1
Принадлежит: Wipro Ltd

The present invention discloses a method and a system for monitoring manufacturing operation workflow using Structural Similarity (SSIM) index based activity detection. The method comprising receiving video data corresponding to a manufacturing operation activity, extracting a plurality of video frames from the video data, measuring SSIM index for each video frame of the plurality of video frames with respect to next consecutive video frame of the plurality of video frames, comparing the SSIM index of the each video frame with the SSIM index of next consecutive video frame of the plurality of video frames to identify one or more local maxima, and determining at least one manufacturing operation activity based on the one or more local maxima using machine learning technique.

Подробнее
12-03-2020 дата публикации

SYSTEM AND METHOD FOR IMPROVING SPEED OF SIMILARITY BASED SEARCHES

Номер: US20200082212A1
Принадлежит: Avigilon Corpoation

A method and system for processing images for a search is provided, including: receiving a plurality of images selected from search results; for each image in the plurality of images, retrieving a feature vector associated with the image; selecting a subset of the feature vectors based on similarity of feature vectors associated with the images in the plurality of images; and performing a search for feature vectors in a database similar to the feature vectors in the subset of feature vectors. 1. A method of processing images for a search , comprising:receiving a plurality of images selected from search results;for each image in the plurality of images, retrieving a respective feature vector associated therewith;selecting a subset of the feature vectors based on similarity of the feature vectors; andperforming a search for feature vectors in a database similar to feature vectors in the subset of feature vectors.2. The method of wherein the received plurality of images are selected based on similarity to a reference image.3. The method of wherein the plurality of images are selected by a user.4. The method of wherein the plurality of images were generated from a search for images similar to a reference image.5. The method of wherein selecting the subset of feature vectors comprises clustering the feature vectors associated with the images into a plurality of clusters.6. The method of wherein selecting the subset of images comprises filtering the feature vectors based on the clustering.7. The method of wherein the clustering is k-mediod clustering.8. The method of wherein the filtering is selecting a feature vector from each of the clusters.9. The method of wherein selecting a feature vector from each of the clusters comprises selecting a feature vector in a cluster associated with an image showing the face of a person.10. The method of wherein the number of images in the plurality of images exceeds a threshold.11. The method of wherein the number of feature vectors in ...

Подробнее
25-03-2021 дата публикации

SYSTEMS AND METHODS FOR DISPLAYING SUBJECTS OF A PORTION OF CONTENT AND DISPLAYING AUTOCOMPLETE SUGGESTIONS FOR A SEARCH RELATED TO A SUBJECT OF THE CONTENT

Номер: US20210089577A1
Принадлежит:

Systems and methods are described herein for displaying subjects of a portion of content and searching for content related to a particular subject in the content. Media data of content is analyzed during playback, and a number of signatures are identified. Each signature is associated with a particular subject within the content. The signature is stored, along with a timestamp corresponding to a playback position at which the signature begins, in association with an identifier of the particular subject. Upon receiving a command, icons representing each of a number of signatures at or near the current playback position are displayed. Upon receiving user selection of an icon corresponding to a particular signature, a portion of the content corresponding to the signature is played back or a search interface is displayed including autocomplete suggestions related to the subject of the signature. 1. A method for searching for content related to a subject of a portion of video of content , the method comprising:identifying, during playback of the content, a subject signature corresponding to each of a plurality of subjects in the video of the content; storing a timestamp at which the respective subject signature begins, and an identifier of the respective subject signature; and', 'identifying a portion of a frame of video of the content that depicts a subject of the respective subject signature;, 'for each subject signaturereceiving an input command;generating for display, in an overlay over the video content, a plurality of icons, each icon of the plurality of icons representing a respective subject signature and comprising the identified portion of the frame of video of the content that depicts the subject of the respective subject signature;receiving a selection of an icon of the plurality of icons;determining an identifier of the subject of the subject signature represented by the selected icon;retrieving a plurality of search strings, each search string of the ...

Подробнее
25-03-2021 дата публикации

System and Method for Processing Video Data from Archive

Номер: US20210089784A1
Принадлежит:

The invention pertains to the field of video data analysis and processing, and more specifically to technologies aimed at finding information about objects of interest by the minimum known initial data. The system for processing the data from the archive comprises video cameras, memory, a database, a data processing device, and a graphical user interface. The graphical user interface comprises a video camera selection unit, a time period setting unit, a search mode selection unit, a search features unit, and a display. The data processing device is configured with the ability to perform the video data decompression and analysis, process the archive video data, perform the search, and display the search results through the display. The method for processing the data from the archive comprises providing a choice of specific video cameras of the system, setting a specific video time period, providing a choice of search mode from the search modes, setting the known object features, processing the archive video data, and searching for the corresponding metadata, wherein the metadata is generated by video data decompression and analysis and stored in the system database. 1. A system for processing the data from the archive comprising:at least two video cameras;memory made with the ability to store video data archive coming from all video cameras of the system;database for storing the metadata;a graphical user interface (GUI) containing at least the following:a video camera selection unit enabling the user to select specific cameras, the data will from which be processed,a time period setting unit enabling the user to set a specific video time period for the selected video camera,search mode selection unit enabling the user selecting one of three possible search modes: face search mode, vehicle license plate number search mode, or object search mode,search features unit configured for setting the known features of the object to perform search by objects, anddisplay ...

Подробнее
30-03-2017 дата публикации

Navigation method and device

Номер: US20170089714A1
Автор: Guoming LIU
Принадлежит: Xiaomi Inc

A navigation method and device are provided. The method includes: receiving start point information and end point information sent by a target device; acquiring a target path navigation video from a start point to an end point based on the start point information and the end point information; and sending the target path navigation video to the target device. Accordingly, the target path navigation video is broadcasted in real time, thereby the user may determine whether a deviation occurs between the target path and the actual route in real time, and an accuracy of navigation is improved.

Подробнее
19-06-2014 дата публикации

System and method for generating a second screen experience using video subtitle data

Номер: US20140173647A1
Автор: Emil Hansson
Принадлежит: SONY MOBILE COMMUNICATIONS AB

Second screen content for a display of an electronic device is generated and displayed in coordination with video content displayed on a display of another electronic device. The generation of the second screen content includes identifying keywords from subtitle data for the video content and identifying links to media content that relate to one or more of the keywords.

Подробнее
19-03-2020 дата публикации

System and method for contextually enriching a concept database

Номер: US20200089660A1
Принадлежит: CORTICA LTD.

A system and method for contextually enriching a concept database. The method includes determining, based on at least one signature of a first multimedia data element (MMDE) and signatures of a plurality of third concepts stored in the concept database, at least one matching first concept among the plurality of third concepts; generating a reduced representation of the first MMDE; comparing the reduced representation of the first MMDE to signatures representing a plurality of second MMDEs to determine a plurality of matching MMDEs among the plurality of second MMDEs; generating, based on the reduced representation of the first MMDE and the signatures representing the plurality of matching MMDEs, a second concept; and generating at least one context based on the second concept and the plurality of third concepts, wherein each context includes at least one common pattern among the second concept and at least one of the plurality of third concepts.

Подробнее
01-04-2021 дата публикации

OBTAINING ARTIST IMAGERY FROM VIDEO CONTENT USING FACIAL RECOGNITION

Номер: US20210097263A1
Принадлежит:

An example method may include applying an automated face detection program implemented on a computing device to a plurality of training digital images associated with a particular TV program to identify a sub-plurality of the training digital images, each containing a single face of a particular person associated with the particular TV program. A set of feature vectors determined for the sub-plurality may be used to train a computational model of a face recognition program for recognizing the particular person in any given digital image. The face recognition program and the computational model may be applied to a runtime digital image associated with the particular TV program to recognize the particular person in the runtime digital image, together with geometric coordinates. The runtime digital image may be stored together with information identifying the particular person and corresponding geometric coordinates of the particular person in the runtime digital image. 1. A method comprising:applying an automated face detection program implemented on a computing device to a first plurality of training digital images associated with a particular TV program to identify a first sub-plurality of the training digital images, each of which contains a single face of a first particular person associated with the particular TV program;based on a first set of feature vectors determined for the first sub-plurality of training digital images, training a first computational model of a computer-implemented face recognition program for recognizing the first particular person in any given digital image;applying the face recognition program together with the first computational model to a runtime digital image associated with the particular TV program to recognize the first particular person in the runtime digital image from among one or more faces detected, together with respective geometric coordinates, in the runtime digital image; andstoring, in non-transitory computer-readable ...

Подробнее
01-04-2021 дата публикации

METHOD AND SYSTEM FOR GENERATING VIDEO

Номер: US20210097288A1
Принадлежит: SAMSUNG ELECTRONICS CO., LTD.

A method and an apparatus for generating a video are provided. The method may include performing, by a server, semantic analysis on an original video according to a timing characteristic of the original video, and segmenting the original video to obtain video segments with semantic information; obtaining, by the server, a video generation sequence model with a timing characteristic based on at least one previously configured video generation sequence model with a timing characteristic according to preference video information obtained from a client; and reorganizing, by the server, the video segments with the semantic information according to the video generation sequence model with the timing characteristic to obtain a target video of the client. 1. A method for generating a video , the method comprising:performing, by a server, semantic analysis on an original video according to a timing characteristic of the original video, and segmenting the original video to obtain video segments with semantic information;obtaining, by the server, a video generation sequence model with a timing characteristic based on at least one previously configured video generation sequence model with a timing characteristic according to preference video information obtained from a client; andreorganizing, by the server, the video segments with the semantic information according to the video generation sequence model with the timing characteristic to obtain a target video of the client.2. The method of claim 1 , wherein the performing the semantic analysis comprises:performing the semantic analysis on the original video through at least one of a video capture mode, a voice recognition mode, and an image recognition mode.3. The method of claim 1 , further comprising:performing a smoothing process on the reorganized video segments so that the reorganized video segments are normalized, before the target video is obtained.4. The method of claim 1 , further comprising:updating, in real time, the ...

Подробнее
01-04-2021 дата публикации

Frictionless Authentication and Monitoring

Номер: US20210097299A1
Принадлежит:

An identity of a customer within an establishment is authenticated using a variety of captured biometric features obtained from sensors and/or video. Video capturing movements/interactions of the customer is analyzed in real time to identify the customer's behavior and actions. Any staff of the establishment who interact with the customer are identified from the video. Transaction data and other data retained for the customer by the establishment are aggregated and linked with the video and the customer identity. The linked data is analyzed in combination with the customer behavior and actions to determine responses within the establishment to customer-initiated transactions. In an embodiment, the customer is authorized to perform at least one transaction within the establishment based on the authenticated identity and linked data without a presentation by the customer of an identification card, a Personal Identification Number (PIN), a password and/or verification by a staff member. 1. A method , comprising:authenticating an individual within an establishment to a customer identity;aggregating data associated with the customer identity from a plurality of sources as aggregated customer data;capturing video of the individual within the establishment;identifying actions and behaviors of the individual from the video;detecting a transaction request associated with a transaction being performed by the individual within the establishment; anddetermining whether to intervene in the transaction before the transaction completes based on the customer identity, the actions, the behaviors, and the aggregated customer data.2. The method of claim 1 , wherein authenticating further includes receiving biometric features from sensors and biometrically authenticating the individual using the biometric features to the customer identity.3. The method of claim 2 , wherein receiving further includes deriving at least some of the biometric features from the video and from the behaviors ...

Подробнее
12-05-2022 дата публикации

METHOD AND SYSTEM FOR CHARACTERISTIC-BASED VIDEO PROCESSING

Номер: US20220147567A1
Принадлежит:

A method and apparatus for characteristic-based video processing include: in response to receiving a region of a picture of a video sequence, determining a characteristic in the region, the region being independent of other regions of the picture for video coding; determining a class associated with the region based on the characteristic, the class being selected from a plurality of classes; and encoding the region using a parameter set associated with the class, the parameter set being selected from a plurality of parameter sets for video coding at different quality levels. 1. A non-transitory computer-readablestorage medium storing a set of instructions that are executable by one or more processors of a device to cause the device to perform a method comprising:decoding a compressed video sequence to obtain a parameter set associated with a region of a picture in the compressed video sequence, wherein the parameter set represents a quality level for video coding and is associated with a class; anddecoding the region using the parameter set, wherein the decoded region comprises a characteristic, the characteristic comprises a content characteristic associated with video contents in the region, and the content characteristic comprises at least one of a scene representing an incident in the region or an environmental event in the region.2. The non-transitory computer-readable storage medium of claim 1 , wherein the region is one of a slice of the picture or a tile of the picture.3. The non-transitory computer-readable storage medium of claim 1 , wherein the content characteristic further comprises an object in the region.4. The non-transitory computer-readable storage medium of claim 1 , wherein the class is associated with an interest level of content characteristics claim 1 , and the interest level is determined based on an application scenario of the compressed video sequence.5. The non-transitory computer-readable storage medium of claim 1 , wherein the ...

Подробнее
12-05-2022 дата публикации

Method for processing audio and video information, electronic device and storage medium

Номер: US20220148313A1
Принадлежит: Shenzhen Sensetime Technology Co Ltd

A method for processing audio and video information includes: audio information and video information of an audio and video file are acquired; feature fusion is performed on a spectrum feature of the audio information and a video feature of the video information based on time information of the audio information and time information of the video information to obtain at least one fused feature; it is determined, based on the at least one fused feature, whether the audio information and the video information are synchronous.

Подробнее
26-03-2020 дата публикации

Using video of navigation through a user interface as context in translating the user interface

Номер: US20200097603A1
Принадлежит: CA Inc

A processing device obtains a video of navigation through a user interface of an application. The video is divided into a plurality of frames that are based on time units. Each frame of the plurality of frames including a plurality of strings comprising text. A string of the plurality of strings that is in a first frame is determined. A time value that is associated with the first frame is determined. A location of the string that is in the first frame is determined. An untranslated resource bundle is generated and includes a mapping of the string of the plurality of strings to the time value of the first frame of the plurality of frames and to the location of the string of the plurality of strings. The video and the untranslated resource bundle are transmitted to a remote device via a communication interface communicatively coupled to the processing device.

Подробнее
08-04-2021 дата публикации

SHORT-TERM AND LONG-TERM MEMORY ON AN EDGE DEVICE

Номер: US20210103616A1
Принадлежит: NETRADYNE, INC.

Systems and methods are provided for distributed video storage and search over edge computing devices having a short-term memory and a long-term memory. The method may comprise caching a first portion of data on a first device. The method may further comprise determining, at a second device, whether the first device has the first portion of data. The determining may be based on whether the first piece of data satisfies a specified criterion. The method may further comprise sending the data, or a portion of the data, and/or a representation of the data from the first device to a third device. 1. An apparatus comprising:at least one memory unit; and receive visual data at a device, wherein the visual data is captured at a camera that is proximate to the device, wherein the camera is affixed to a first vehicle, and wherein a second vehicle is visible in the visual data;', 'produce, based on the visual data, an observations data, wherein the observations data comprises a descriptor of the second vehicle;', 'determine a priority of the visual data, based at least in part on the descriptor of the second vehicle in the observations data; and', 'store the visual data in the at least one memory unit at a first resolution, at a second resolution, or at both a first resolution and a second resolution based at least in part on the priority., 'at least one processor coupled to the at least one memory unit, in which the at least one processor is configured to2. The apparatus of claim 1 , wherein the descriptor of the second vehicle comprises an indication that the second vehicle was involved in a rare traffic event.3. The apparatus of claim 2 , wherein the rare traffic event is a traffic accident involving the second vehicle.4. The apparatus of claim 2 , wherein claim 2 , in response to the indication that the second vehicle was involved in the rare traffic event claim 2 , the at least one processor is further configured to extend a duration of storage of the visual data in the ...

Подробнее
08-04-2021 дата публикации

PROCESSING CONTENT BASED ON NATURAL LANGUAGE QUERIES

Номер: US20210103734A1
Принадлежит:

Disclosed are systems and methods for summarizing content or preparing missed portions of content based on natural language queries. A natural language query can be received. One or more portions of summarized or missed content can be determined based on the natural language query, and transmitted to a user device. 1. An apparatus comprising:one or more processors; and receive a natural language query associated with output of content by a computing device, wherein the natural language query indicates a request referencing a character or an event associated with the content;', 'determine, based on the request referencing the character or the event, a start time of a portion of the content responsive to the request;', 'determine, based on the start time of the portion of the content, a summary of the portion of the content; and', 'cause output of the summary of the portion of the content., 'memory storing processor executable instructions that, when executed by the one or more processors, cause the apparatus to2. The apparatus of claim 1 , wherein the processor executable instructions claim 1 , when executed by the one or more processors claim 1 , further cause the apparatus to determine one or more keywords associated with the event.3. The apparatus of claim 1 , wherein the processor executable instructions claim 1 , when executed by the one or more processors claim 1 , further cause the apparatus to determine claim 1 , based on the one or more keywords and metadata associated with the content claim 1 , the portion of the content.4. The apparatus of claim 1 , wherein the metadata comprises a plurality of time codes claim 1 , and wherein each time code of the plurality of time codes corresponds to a respective one or more attributes.5. The apparatus of claim 4 , wherein the processor executable instructions that claim 4 , when executed by the one or more processors claim 4 , cause the apparatus to determine the summary of the portion of the content cause the ...

Подробнее
19-04-2018 дата публикации

Using content identification as context for search

Номер: US20180107748A1
Автор: Zbigniew Skolicki
Принадлежит: Google LLC

Techniques for using contextual information relating to content presented by a television as part of a search query for an information search are presented. A search management component, at a given moment in time during presentation of television-related content by a communication device in or associated with a television, identifies contextual information associated with a section of the television-related content and generates a content identifier timestamp associated with the contextual information and the section of television-related content. A search component augments a search query using the contextual information to facilitate customization of a subset of search results based on the contextual information. The contextual information in the search query can facilitate disambiguating the search query or promoting a search result over another search result in the subset of search results, based on the contextual information, to facilitate customization of the subset of search results.

Подробнее
29-04-2021 дата публикации

Multi-time search analytics

Номер: US20210125641A1
Автор: Akif EKIN
Принадлежит: Akif EKIN

Multi-time search analytics for smart video indexing based on active search and video database, which is configured to be operated without any need for a second broadcast with a camera, a second large data index, additional servers, a third additional software for indexing and analytics, a third additional face recognition system, a license recognition system, an object recognition system, or an additional net bandwidth for a new hardware. The method of operating multi-time search analytics for the additional net bandwidth for new hardware includes following steps: recognizing objects, activating software components and/or incoming alarms, collecting the alarms in an alarm database pool, extracting a first video with a same length in the alarm database pool, performing a motion recognition with H264/H265, recording detected motions by a video content recorder, transferring the detected motions to a timeline adjuster and changing the detected motions in accordance with an Alpha Time Zone.

Подробнее
02-04-2020 дата публикации

SYSTEM AND METHOD FOR BLENDED CONTENT SEARCHES ACROSS DISPARATE SOURCES

Номер: US20200107077A1
Принадлежит:

Methods for blended content searches across disparate sources and display of results in a user interface are performed by systems, devices, and apparatuses. Blended searches and results provide for a search and result delivery mechanism to enable users to find content they are looking for across multiple and/or disparate sources. Content that is searched is located spread across different, unrelated services/sources that have exclusive access for searching the content, or live television, on different multimedia devices, as well as on local devices in the form of recordings and locally stored libraries. The blended content searches performed allow for results obtained across disparate sources to be returned to a user from a single search and provided to a single user interface in a common format. Search results are displayed based on rankings and categories associated with the content and content sources. 1. A method performed by a search host , the method comprising:receiving a search request, at a search host, that specifies content;initiating at least two searches for the content over at least two different types of content sources respectively;receiving and normalizing respective search results for the at least two searches;ranking the respective search results in categories according the at least two different types of content sources to generate ranked results; andproviding the ranked results from the search host to a high-definition multimedia interface (HDMI) switch or a user device.2. The method of claim 1 , wherein the at least two different types of content sources includes a first content source of a first type that is associated with a source device connected to the switch via a local connection and includes a second content source of a second type that is accessed via a network that is remote to the switch.3. The method of claim 2 , wherein the first content source is a live television source claim 2 , a streaming content source claim 2 , a set top box ...

Подробнее
09-04-2020 дата публикации

Importing State Data From A Video Stream Into A Gaming Session

Номер: US20200108310A1
Принадлежит:

Methods, systems, and apparatuses are described for identifying computer programs/video games that are compatible with a particular content stream and obtaining metadata indicating various different states of the content stream or event depicted in the content stream. The metadata may be supplied as input state information to the computer programs/video games to begin a simulation of the event depicted in the content stream or to initiate a video game session corresponding to the state of the content stream. 1. A method comprising:determining a type of a content item being output;determining, based the type of the content item, a gaming content associated with the content item, wherein the gaming content is executable by a computing device;determining event state information indicating a current event state of the content item; andcausing, based on the event state information, a session of the gaming content to begin using the current event state of the content item.2. The method of claim 1 , wherein the current event state of the content item comprises a scenario in a live event and wherein the event state information comprises statistics associated with the scenario in the live event.3. The method of claim 1 , further comprising:receiving a user input indicating a request for the session of the gaming content to begin, wherein the determining the gaming content is based on the request.4. The method of claim 1 , further comprising:determining the gaming content associated with the content item is not in a list of a plurality of potential gaming contents associated with the content item; andsending a request to cause display of a request to obtain the gaming content.5. The method of claim 4 , further comprising:sending, based on a request to obtain the gaming content, a request to the computing device to download the gaming content from a gaming content repository; andadding the gaming content to the list of the plurality of potential gaming contents available to ...

Подробнее
18-04-2019 дата публикации

ACTIONABLE CONTENT DISPLAYED ON A TOUCH SCREEN

Номер: US20190114072A1
Автор: Bai Peng, Du Jun, Huo Qiang, Sun Lei
Принадлежит:

Some implementations may present a media file that includes video on a touchscreen display. A user gesture performed on the touchscreen display may be detected. The user gesture may include one of a tap gesture, a swipe gesture, or a tap and hold and drag while holding gesture. Text selected by the user gesture may be determined. One or more follow-up actions may be performed automatically based at least partly on the text selected by the user gesture. 1. (canceled)2: A system comprising:one or more processors; and displaying one or more portions of a first image on a touchscreen display;', 'receiving, by the touchscreen display, input comprising a user gesture;', 'identifying selected text in the first image based on the user gesture;', 'determining that the selected text comprises a plurality of words included in the first image and a second image; and', 'automatically performing at least one follow-up action based at least partly on the selected text., 'a memory storing instructions that are executable by the one or more processors to perform operations comprising3: The system of claim 2 , further comprising: a tap and hold gesture; and', 'a drag while holding gesture., 'determining that the user gesture comprises at least one of4: The system of claim 3 , further comprising:receiving, by the touchscreen display, a second input comprising a tap gesture; anddetermining that the selected text in the first image comprises a word.5: The system of claim 3 , further comprising:receiving, by the touchscreen display, a third input comprising a swipe gesture; anddetermining that the selected text comprises two or more words included in the first image.6: The system of claim 3 , wherein:the tap and hold gesture causes the first image to be selected; andthe drag while holding gesture causes selection of text in the first image and the second image to create the selected text.7: The system of claim 2 , further comprising:translating the selected text from a source language to ...

Подробнее
09-04-2020 дата публикации

SYSTEM AND METHOD FOR MACHINE-ASSISTED SEGMENTATION OF VIDEO COLLECTIONS

Номер: US20200110943A1
Автор: Gunawardena Ananda
Принадлежит: THE TRUSTEES OF PRINCETON UNIVERSITY

According to various embodiments, a system for accessing video content is disclosed. The system includes one or processors on a video hosting platform for hosting the video content, where the processors are configured to generate an automated transcription of the video content and apply text clustering modules based on a trained neural network to segment the video content. 1. A method for accessing video content , comprising:generating an automated transcription of the video content; andapplying text clustering modules based on a trained neural network to segment the video content.2. The method of claim 1 , further comprising hosting the video content on a platform.3. The method of claim 1 , further comprising generating a search index based on the segmented video content.4. The method of claim 1 , wherein the video content is related at least one of education claim 1 , training claim 1 , and customer service.5. The method of claim 1 , further comprising displaying the video content with a generated search index based on the segmented video content.6. The method of claim 1 , further comprising displaying the video content with a synchronized transcript.7. The method of claim 1 , wherein the segmented video content is based on at least one of an identified content or tone change.8. The method of claim 1 , further comprising punctuating the automated transcription of the video content.9. A system for accessing video content claim 1 , comprising:one or processors on a video hosting platform for hosting the video content, the one or more processors configured to:generate an automated transcription of the video content; andapply text clustering modules based on a trained neural network to segment the video content.10. The system of claim 9 , wherein the processors are further configured to generate a search index based on the segmented video content.11. The system of claim 9 , wherein the video content is related at least one of education claim 9 , training claim 9 , and ...

Подробнее
13-05-2021 дата публикации

Video retrieval method, and method and apparatus for generating video retrieval mapping relationship

Номер: US20210142069A1
Принадлежит: Cambricon Technologies Corp Ltd

The present disclosure relates to a video retrieval method, system and device for generating a video retrieval mapping relationship, and a storage medium. The video retrieval method comprises: acquiring a retrieval instruction, wherein the retrieval instruction carries retrieval information for retrieving a target frame picture; and obtaining the target frame picture according to the retrieval information and a preset mapping relationship. The method for generating a video retrieval mapping relationship comprises: performing a feature extraction operation on each frame picture in a video stream by using a feature extraction model so as to obtain a key feature sequence corresponding to each frame picture; inputting the key feature sequence corresponding to each frame picture into a text sequence extraction model for processing so as to obtain a text description sequence corresponding to each frame picture; and constructing a mapping relationship according to the text description sequence corresponding to each frame picture.

Подробнее
13-05-2021 дата публикации

MANAGED NOTIFICATION SYSTEM

Номер: US20210142084A1
Принадлежит:

A managed notification system compares image(s) and/or indicia relating to the image(s) and where there is a match selectively provides a notification of the same. 1. A notification system comprising:plural primary content cases with respective primary indicia stored in a digital library;each primary content case associated with a case group selected from multiple case groups;plural secondary content cases with respective secondary indicia stored in the digital library;captured content including one or more digital video camera frames with primary captured content and secondary captured content;a first comparison wherein primary captured content is compared with primary indicia;a first match exists if primary indicia of a primary content case and primary captured content contain a matched face;if a first match exists, a second match compares secondary captured content and secondary indicia of a secondary content case; and,if a second match exists, a notification is sent to one or more persons indicated by the case group associated with the matched primary content case.2. The notification system of wherein the case groups indicate persons to be notified via links between the case groups and user groups such that one or more persons in one or more of the user groups receive the notification.3. The notification system of wherein the secondary match requires that the secondary indicia of the secondary content case and the secondary captured content contain a matched face.4. The notification system of wherein the secondary match requires that the secondary indicia of the secondary content case and the secondary captured content contain a matched inanimate object.5. The notification system of wherein the secondary match requires that the secondary indicia of the secondary content case and the secondary captured content contain a matched weapon.6. The notification system of wherein the secondary match requires that the secondary indicia of the secondary content case and ...

Подробнее
13-05-2021 дата публикации

DETECTING SCENES IN INSTRUCTIONAL VIDEO

Номер: US20210142188A1
Принадлежит:

Detecting a scene in an instructional video is presented. One example includes analyzing the visual and/or audio content of the instructional video to identify instances of indicative behavior of the instructor, an instance of indicative behavior being identified based on the presence of at least one of a set of predetermined behavioral patterns of the instructor in the visual and/or audio content of the instructional video. A scene in the instructional video is then detected based on the identified instances of indicative behavior of the instructor. 1. A computer-implemented method for detecting scenes in instructional video comprising instructional content conveyed by an instructor , the method comprising:analyzing a visual and/or audio content of an instructional video to identify instances of indicative behavior of the instructor, an instance of indicative behavior being identified based on a presence of at least one of a set of predetermined behavioral patterns of the instructor in the visual and/or audio content of the instructional video; anddetecting a scene in the instructional video based on the identified instances of indicative behavior of the instructor.2. The method of claim 1 , further comprising:processing a sample video comprising instructional content conveyed by the instructor with a machine learning algorithm to identify a behavioral pattern of the instructor in the visual and/or audio content of the instructional video, the identified behavioral pattern being indicative of a beginning or an end of a section of the instructional content; andincluding the identified behavioral pattern in the set of predetermined behavioral patterns.3. The method of claim 2 , wherein the instructional video comprises the sample video.4. The method of claim 1 , wherein a predetermined behavioral pattern of the set of predetermined behavioral patterns comprises at least one of:a word or sequence of words spoken by the instructor;a movement of the instructor;a pose or ...

Подробнее
09-04-2020 дата публикации

HYBRID MEDIA VIEWING APPLICATION INCLUDING A REGION OF INTEREST WITHIN A WIDE FIELD OF VIEW

Номер: US20200112702A1
Принадлежит: IMMERSIVE MEDIA COMPANY

A content delivery and display solution includes a viewing application for displaying immersive images with a region of interest, in addition to conventional fixed-aspect-ratio media. The display can include the layered display of metadata, multiple windows, and images or hotspots embedded into the immersive image. The viewing application can be used for the display of either live or prerecorded images, from local or online sources. 1. A system for the display of media comprising:machine-readable instructions that, when executed by a computing platform, cause the system to provide:a data management service for receiving data comprising requested immersive image media;an application display service for presenting, on a display, a region of interest of the requested immersive image media.2. A system according to wherein the machine-readable instructions claim 1 , when executed by the computing platform claim 1 , cause the system to provide:a playback control service for controlling a speed and a direction of movement of the region of interest within the requested immersive image media presented on the display by the application display service according to user input, the movement of the region of interest comprising spatial movement of the requested region of interest from a first user-selected spatial portion of the requested immersive image media to a second user-selected spatial portion of the requested immersive image media.3. A system according to wherein the machine-readable instructions claim 1 , when executed by the computing platform claim 1 , cause the system to provide: an automatic-control mode wherein a speed and direction of movement of the region of interest within the immersive media image presented on the display by the application display service is pre-determined; and', 'a user-control mode wherein the speed and the direction of movement of the region of interest within the requested immersive image media presented on the display by the application ...

Подробнее
13-05-2021 дата публикации

VIDEO PROCESSING AND MODIFICATION

Номер: US20210144449A1
Принадлежит:

Methods and systems to process video data and modify at least the image portion of the video data real time to alter the image content. In one method, providing a multimedia signal to a processing module; identifying a replaceable area of an image taken from the multimedia signal with a context information; matching, by a client server, a user profile with the context information of the replaceable area; fetching, from an advertising database, a selected advertising content, based on the user profile and the context information; placing the selected advertising content on the replaceable area of the image; and providing a customized multimedia signal to a selected channel, wherein the customized multimedia signal comprises the selected advertising content overlapping, at least in part, an original content of the replaceable area of the image. 1. A method comprising:receiving a first multimedia input signal to a processing module wherein the first multimedia input signal comprises a series of images captured at a physical venue; 'wherein said determining comprises identifying a context information of the replaceable area;', 'determining, by the processing module, a first image of the input signal that has a replaceable area and determining the replaceable area of the first image;'}providing the first image, the replaceable area, and the context information of the replaceable area to a client server;matching the first image, by the client server, to a user profile previously uploaded to the client server;sending a request from the processing module to an advertisement database to select advertising content, based on the matched user profile and the context information;fetching, from the advertisement database, the selected advertising content; 'identifying additional images in the series that have replaceable areas and replacing the replaceable areas of each of the additional images with the selected advertising content so as to generate additional new images;', 'by ...

Подробнее
03-05-2018 дата публикации

Television key phrase detection

Номер: US20180124162A1
Принадлежит: Twitter Inc

Images of key phrases or hashtags appear on televised feeds. Image processing techniques, such as feature locating algorithms or character recognition algorithms, can be used to locate the images of key phrases in the images. Then, character recognition algorithms can be used to generate a list of candidate key phrases for the key phrase in image format. However, identification of the key phrase in image format is not completely accurate with conventional methods. Social media content items associated with the televised feed are used to filter the list of candidate key phrases. Using known information about the televised feed as well as about key phrases in text format in the social media content items, candidate key phrases in the list of candidate key phrases can be scored and, thus, a final candidate key phrase selected based on the scores.

Подробнее
25-04-2019 дата публикации

Multi-restaurant facial recognition system

Номер: US20190122033A1
Автор: Toshit Panigrahi
Принадлежит: Toast Inc

A multi-restaurant facial recognition includes one or more cameras, first point-of-sale (POS) terminals, and a backend server. The cameras are disposed within a first retail establishment and are configured to capture one or more images of a patron within the first retail establishment. The POS terminals are also within the first retail establishment and one or more of the first POS terminals receives the one or more images and transmits a request over a network for enrollment of the patron in a loyalty program. The backend server receives the request, enrolls the patron in the loyalty program, and stores loyalty program data, where the backend server may recognize and provide the loyalty program data in response to subsequent requests for recognition of the patron from any one of a plurality of second POS terminals that are in other retail establishments that are related to the first retail establishment.

Подробнее
25-04-2019 дата публикации

SYSTEMS AND METHODS FOR GENERATING BOOKMARK VIDEO FINGERPRINTS

Номер: US20190122049A1
Принадлежит:

Systems and methods for replacing original media bookmarks of at least a portion of a digital media file with replacement bookmarks is described. A media fingerprint engine detects the location of the original fingerprints associated with the portion of the digital media file and a region analysis algorithm characterizes regions of media file spanning the location of the original bookmarks by data class types. The replacement bookmarks are associated with the data class types and are overwritten or otherwise are substituted for the original bookmarks. The replacement bookmarks then are subjected to a fingerprint matching algorithm that incorporates media timeline and media related metadata. 1. A method for generating video fingerprints comprising:sharing timeline and metadata of an original digital media file including at least one of a video digital file and an audio digital file;identifying a region within the original media file;bookmarking the identified region with a fingerprinting algorithm;detecting a bookmarked region of a duplicate of the original media file using the fingerprinting algorithm; andcomparing the bookmarked region of the original media file to the bookmarked region of the duplicate.2. The method of claim 1 , wherein identifying a region of the original digital media file includes determining data types within the digital media file.3. The method of claim 2 , wherein determining data types includes at least one of a pixel luminescence value claim 2 , a region of pixel luminescence values claim 2 , an indicator of object motion claim 2 , a change in sound volume claim 2 , and a change in sound types.4. The method of claim 1 , wherein detecting a bookmarked region includes selecting a frame group claim 1 , examining for the presence of matchable characteristics in the frame group claim 1 , applying a region algorithm to the frame group claim 1 , removing repetitive occurrences of the matchable characteristics claim 1 , and defining a path of the ...

Подробнее
16-04-2020 дата публикации

SYSTEMS AND METHODS FOR GENERATING BOOKMARK VIDEO FINGERPRINT

Номер: US20200117911A1
Принадлежит:

Systems and methods for replacing original media bookmarks of at least a portion of a digital media file with replacement bookmarks is described. A media fingerprint engine detects the location of the original fingerprints associated with the portion of the digital media file and a region analysis algorithm characterizes regions of media file spanning the location of the original bookmarks by data class types. The replacement bookmarks are associated with the data class types and are overwritten or otherwise are substituted for the original bookmarks. The replacement bookmarks then are subjected to a fingerprint matching algorithm that incorporates media timeline and media related metadata. 1. A method for generating video fingerprints comprising:sharing timeline and metadata of an original digital media file including at least one of a video digital file and an audio digital file;identifying a region within the original media file;bookmarking the identified region with a fingerprinting algorithm;detecting a bookmarked region of a duplicate of the original media file using the fingerprinting algorithm; andcomparing the bookmarked region of the original media file to the bookmarked region of the duplicate.2. The method of claim 1 , wherein identifying a region of the original digital media file includes determining data types within the digital media file.3. The method of claim 2 , wherein determining data types includes at least one of a pixel luminescence value claim 2 , a region of pixel luminescence values claim 2 , an indicator of object motion claim 2 , a change in sound volume claim 2 , and a change in sound types.4. The method of claim 1 , wherein detecting a bookmarked region includes selecting a frame group claim 1 , examining for the presence of matchable characteristics in the frame group claim 1 , applying a region algorithm to the frame group claim 1 , removing repetitive occurrences of the matchable characteristics claim 1 , and defining a path of the ...

Подробнее
27-05-2021 дата публикации

Facial recognition tool

Номер: US20210158025A1
Автор: Alper Özhan
Принадлежит: Karya Property Management LLC

Disclosed embodiments include systems and methods employing a neural network to identify a tenant as the tenant walks into a management office by using facial recognition. The neural network provides a database of information related to each tenant and coupled to a given tenant's facial features. A method comprises of identifying facial features within an image of a tenant via a computer system, wherein the image is taken as the tenant enters into an office; cropping an image of a tenant; transmitting the cropped image to a convolutional neural network as an input; categorizing the input as an output; and displaying information regarding the tenant via an I/O interface of the computer system.

Подробнее
27-05-2021 дата публикации

Frictionless and Autonomous Activity and Behavioral Monitoring

Номер: US20210158055A1
Принадлежит:

Individuals, business transactions, and business activities are monitored for actions and behaviors of the individuals during performance of establishment processes through video feeds captured by cameras, sensor data captured by sensors, and information captured by transaction systems. Transaction/activity information associated with transactions/activities being processed by a transaction system of the establishment or performed by individuals within the establishment are monitored. The actions, behaviors, transaction information, activity information, and establishment processes are correlated to process controls, policies, and procedures of the establishment and logged. Non-compliant actions, behaviors, transaction information, activity information, and/or transaction/activity thresholds generate real-time remedial actions, such as audits, training, and notifications. In an embodiment, an interface is provided for mining, correlating, searching, and reporting the logged data with respect to specific types of transactions/actions. In an embodiment, the logged data is automatically formatted in a target format and provided to an automated system for consumption. 1. A method , comprising:identifying an activity that is initiated within an establishment;obtaining control data for the activity;monitoring, through video feeds and activity data provided for the activity, performance of or progression of the activity based on the control data;recording monitored data for the activity;associating the monitored data with the activity data and video fees as activity audit data; andprocessing at least one automated action based on at least one threshold exceeded as determined from the activity audit data.2. The method of claim 1 , wherein identifying further includes receiving an activity identifier as a transaction identifier from a transaction system indicating that a transaction was initiated within the establishment.3. The method of claim 1 , wherein identifying further ...

Подробнее
23-04-2020 дата публикации

AUTOMATIC CREATION OF METADATA FOR VIDEO CONTENTS BY IN COOPERATING VIDEO AND SCRIPT DATA

Номер: US20200125600A1
Автор: Jo Geun Sik
Принадлежит:

Approaches presented herein enable automatic creation of metadata for contents of a video. More specifically, a video and a script corresponding to the video are obtained. A location corresponding to an object in at least one shot of the video is extracted. This at least one shot includes a series of adjacent frames. The extracted location is saved as an annotation area in an annotation knowledge base. An element of a plot of the video is extracted from the script. This element of the plot is derived from content of the video in combination with content of the script. The extracted element of the plot is saved in a narrative knowledge base. 1. A method for automatically creating metadata for contents of a video , the method comprising:obtaining a video and a script corresponding to the video;extracting a location corresponding to an object in at least one shot of the video, the shot comprising an adjacent series of frames;saving the extracted location as an annotation area in an annotation knowledge base;extracting an element of a plot of the video from the script, the element of the plot being derived from content of the video in combination with content of the script; andsaving the extracted element of the plot in a narrative knowledge base.2. The method of claim 1 , the method further comprising preprocessing the video by:defining a plurality of frames of the video as standard frames;identifying points in the video at which a similarity between a frame and a closest standard frame have a similarity below a threshold; anddesignating the points as boundaries between a plurality of individual shots of the video.3. The method of claim 2 , the method further comprising preprocessing the script by:comparing a dataset of various types of script forms against a form of the obtained script;identifying a script form in the script form dataset as representative of the form of the obtained script;structuring the obtained script according to the identified script form in a ...

Подробнее
11-05-2017 дата публикации

Monitoring individual viewing of television events using tracking pixels and cookies

Номер: US20170134770A9
Принадлежит: Vizio Inscape Technologies LLC

A real-time content identification and tracking system enabling monitoring of television programming consumption specific to an individual television or other viewing device. Metrics collected may include data regarding viewing of specific broadcast media, commercial messages, interactive on-screen information or other programming, as well as locally cached, time-shifted programming. Information about media consumption by such specific television sets or other viewing means may be returned to a commercial client of the system through a trusted third-party intermediary service and, in certain embodiments, encoded tokens may be used to manage the display of certain events as well as to enable robust auditing of each involved party's contractual performance.

Подробнее
07-08-2014 дата публикации

Systems and methods for generating bookmark video fingerprints

Номер: US20140219496A1
Принадлежит: Garrick Barr, Nils Bjorn Lahr

Systems and methods for replacing original media bookmarks of at least a portion of a digital media file with replacement bookmarks is described. A media fingerprint engine detects the location of the original fingerprints associated with the portion of the digital media file and a region analysis algorithm characterizes regions of media file spanning the location of the original bookmarks by data class types. The replacement bookmarks are associated with the data class types and are overwritten or otherwise are substituted for the original bookmarks. The replacement bookmarks then are subjected to a fingerprint matching algorithm that incorporates media timeline and media related metadata.

Подробнее
23-04-2020 дата публикации

Systems and Methods that Match Search Queries to Television Subtitles

Номер: US20200128285A1
Принадлежит:

A process identifies a search query spike from queries submitted by users during a first span of time, which is less than a predefined duration. The spike corresponds to a set of queries identified as equivalent. The frequency of submitting queries from the set during the first time span exceeds the frequency of submitting queries from the set during an average span of time. The process correlates the spike to a video program by matching terms from the set of search queries to subtitle terms appearing in the video program at a first location. The first location in the video program was presented within a predefined time before the first span of time. The process receives notification from a user device indicating user interest in the video program. The process transmits to the user device search results corresponding to some search queries from the set of search queries. 1. A method for providing video program information , comprising: identifying a search query spike from a set of search queries submitted by a first plurality of users during a first span of time;', 'correlating the search query spike to a video program presented during the first span of time by matching a plurality of terms from the set of search queries to a plurality of subtitle terms appearing in the video program at a first location, wherein the first location in the video program was presented within a predefined time before the first span of time;', 'receiving notification from a user device during a second span of time subsequent to the first span of time indicating user interest in the video program; and', 'in direct response to receiving the notification indicating user interest in the video program, transmitting to the user device search results corresponding to one or more search queries from the set of search queries, such that the search results are presented at the user device at the same time as the first location in the video program when the video program is presented during the ...

Подробнее
09-05-2019 дата публикации

Processing Content Based on Natural Language Queries

Номер: US20190138809A1
Принадлежит: COMCAST CABLE COMMUNICATIONS LLC

Disclosed are systems and methods for summarizing content or preparing missed portions of content based on natural language queries. A natural language query can be received. One or more portions of summarized or missed content can be determined based on the natural language query, and transmitted to a user device.

Подробнее
09-05-2019 дата публикации

METHODS AND DEVICES FOR CLARIFYING AUDIBLE VIDEO CONTENT

Номер: US20190141413A1
Принадлежит:

The various implementations described herein include methods, devices, and systems for clarifying media content. In one aspect, a method is performed at a client device that includes a microphone, memory, and one or more processors. The method includes: (1) receiving, via the microphone, audio content of a media content item playing on a second client device in proximity to the client device; (2) receiving, via the microphone, a verbal query from a user of the client device to clarify a portion of the media content item; (3) sending a request to a remote server system; (4) receiving from the remote server system information responsive to the verbal user query for the portion of the media content item; and (5) presenting the information to the user. 1. A method , comprising: receiving, via the microphone, audio content of a media content item playing on a second client device in proximity to the client device;', 'receiving, via the microphone, a verbal query from a user of the client device to clarify a portion of the media content item;', 'sending a request to a remote server system, the request including at least a portion of the audio content and the user query;', 'in response to sending the request, receiving from the remote server system information responsive to the verbal user query for the portion of the media content item; and', 'presenting the information to the user., 'at a client device having a microphone, one or more processors, and memory2. The method of claim 1 , wherein the information comprises song lyrics.3. The method of claim 1 , wherein the information comprises a transcription of speech.4. The method of claim 1 , wherein the portion of the media content item corresponds to a plurality of persons speaking; andwherein the user query comprises a request to clarify the speech of a particular person of the plurality of persons.5. The method of claim 1 , wherein the information comprises subtitle data associated with the media content item.6. The ...

Подробнее
24-05-2018 дата публикации

Retrieval device, retrieval method, and computer program product

Номер: US20180144479A1
Принадлежит: Toshiba Corp

A retrieval device includes one or more processors. The processors acquire trajectory information indicating a movement trajectory of a target in time-series images. The processors acquire situation information indicating a peripheral situation of the target in the time-series images. The processors acquire a retrieval query containing a movement trajectory and a peripheral situation. The processors retrieve an image matching with the retrieval query among images contained in the time-series images based on the trajectory information and the situation information.

Подробнее
14-08-2014 дата публикации

Peer-to-peer picture sharing using custom based rules for minimal power consumption and better user experience

Номер: US20140229538A1
Принадлежит: Qualcomm Inc

The disclosure is directed to content sharing. An aspect defines a filter having at least one parameter for receiving content and detects a content device. The content device is a peer device with sharable content. The aspect further queries the content device for desired content from the sharable content and receives the desired content from the content device. The desired content matches the at least one parameter.

Подробнее
25-05-2017 дата публикации

Content Analysis to Enhance Voice search

Номер: US20170147576A1
Принадлежит: COMCAST CABLE COMMUNICATIONS LLC

Methods and apparatus for improving speech recognition accuracy in media content searches are described. An advertisement for a media content item is analyzed to identify keywords that may describe the media content item. The identified keywords are associated with the media content item for use during a voice search to locate the media content item. A user may speak the one or more of the keywords as a search input and be provided with the media content item as a result of the search.

Подробнее
07-05-2020 дата публикации

SYSTEM AND METHOD FOR USING A WEBSITE CONTAINING VIDEO PLAYLISTS AS INPUT TO A DOWNLOAD MANAGER

Номер: US20200143170A1
Принадлежит:

Systems and methods for enabling the download of a set of media files with a specific order and specific contents and, more particularly, to enabling a download manager to automatically receive the information it requires to retrieve those elements required to replicate a streaming edit through local playback after the downloads complete. 120-. (canceled)21. A method for downloading files from a website comprising:presenting via at least one processing device a website viewable by a user, the website displaying selectable first and second numerical values, the first numerical value indicating a number of occurrences of events of a first type occurring within a predetermined period of time and a number of media files characterizing the first-type events, the second numerical value indicating a number of occurrences of events of a second type occurring within the predetermined period of time and a number of media files characterizing the second-type events;receiving with the at least one processing device from the user a selection of said first numerical value; andin response to receiving the selection of the first numerical value, displaying via the at least one processing device the one or more of the media files characterizing the first-type events.22. The method of claim 21 , further comprising presenting a website viewable by a user including a webpage having a plug-in application executable by the user's local machine.23. The method of claim 22 , wherein presenting a website viewable by the user includes a webpage having a list of downloadable files.24. The method of claim 23 , further comprising:in response to receiving the selection of the first numerical value, utilizing via the at least one processing device a download manager to obtain an embedded player configured to present one or more of the media files characterizing the first-type events, wherein utilizing a download manager includes downloading the plug-in application executable by the local computer. ...

Подробнее
17-06-2021 дата публикации

BIOMETRIC NOTIFICATION SYSTEM

Номер: US20210182541A1
Принадлежит:

The present invention provides a biometric notification system for selectively sending messages to interested recipients. In various embodiments, message trigger criteria, interested recipients, and message content may vary depending upon, among other things, the service being provided. 1. A biometric notification system (“BNS”) with facial recognition and related services , the system comprising:{'b': '0', 'a person of interest and a facial feature set FFS of the person of interest;'}{'b': 1', '1, 'plural digital video cameras (C . . . Cn) for monitoring monitor respective zones (Z . . . Zn) and a network that interconnects the video cameras with a data processing system including a processing facility and a database facility;'}{'b': 1', '1', '2', '2, 'CZ and CZ acquiring video frames, each video frame associated with a date and time stamp and saved in the database facility;'}{'b': 1', '1', '1', '1, 'a group G of CZ video frames containing images from which a common facial feature set FFS is derived;'}{'b': 2', '2', '2', '2, 'a group G of CZ video frames containing images from which a common facial feature set FFS is derived; and,'}{'b': 0', '1', '2', '1', '2, 'where there is a three-way match between FFS, FFS, and FFS, the frames of G and G are played back in chronological order.'}2. The biometric notification system (“BNS”) of further comprising:{'b': 1', '2', '1', '2, 'an overlap of coverage by camera C and camera C such that a video frame in G has the same date and time stamp as a video frame in G;'}wherein one of the video frames that coincides in date and time is not included in the playback.3. The biometric notification system (“BNS”) of further comprising:{'b': 1', '2', '1', '2, 'an overlap of coverage by camera C and camera C such that at the same date and time a video frame in G and a video frame in G include the person of interest;'}wherein one of the video frames that coincides in date and time is not included in the playback.4. A biometric notification ...

Подробнее