Video analytics system

05-12-2013 дата публикации
Номер:
AU2011367785A1
Принадлежит: Solink Corp
Контакты:
Номер заявки: 77-36-201185
Дата заявки: 12-05-2011

[1]

Background of the Invention An automated teller machine (ATM), also known as an automated banking machine (ABM) or Cash Machine is a computerized telecommunications device that provides the clients of a financial institution with access to financial transactions in a public space without the need for a cashier, human clerk or bank teller.

[2]

On most modern ATMs, the customer identifies himself by inserting a plastic ATM card with a magnetic stripe or a plastic smart card with a chip, which contains a unique card number and some security information. Authentication is achieved by the customer entering a personal identification number (PIN).

[3]

ATMs are placed not only near or inside the premises of banks, but also in locations such as shopping centers/malls, airports, grocery stores, petrol/gas stations, restaurants, or any place large numbers of people may gather. These represent two types of ATM installations: on and off premise. On-premise ATMs are typically more advanced, multi-function machines that complement an actual bank branch's capabilities and thus more expensive. Off-premise machines are deployed by financial institutions and also ISOs (or Independent Sales Organizations) where there is usually only a need for cash, so they typically are the cheaper mono-function devices. In North America, banks often have drive-thru lanes providing access ATMs.

[4]

An ATM typically includes a CPU (to control the user interface and transaction devices), a magnetic and/or Chip card reader (to identify the customer), a PIN Pad often manufactured as part of a secure enclosure, a Secure crypto-processor, generally within a secure enclosure, a Display (used by the customer for performing the transaction), Function key buttons (usually close to the display) or a Touchscreen (used to select the various aspects of the transaction), a Record Printer (to provide the customer with a record of their transaction), a Vault (to store the parts of the machinery requiring restricted access), and a Housing (for aesthetics and to attach signage to).

[5]

Encryption of personal information, required by law in many jurisdictions, is used to prevent fraud.. Sensitive data in ATM transactions are usually encrypted with DES, but transaction processors now usually require the use of Triple DES. Remote Key Loading techniques may be used to ensure the secrecy of the initialization of the encryption keys in the ATM. Message Authentication Code (MAC) or Partial MAC may also be used to ensure messages have not been tampered with while in transit between the ATM and the financial network.

[6]

There are various methods by which criminals attempt to defraud the system. Card skimming or card cloning involves the installation of a magnetic card reader over the real ATM's card slot, which is not easily detectable. The devices used are smaller than a deck of cards, and are used in association with a wireless surveillance camera or a digital camera that is hidden to observe the user's PIN. Card data is then cloned onto a second card and the criminal attempts a standard cash withdrawal. The availability of low-cost commodity wireless cameras and card readers has made it a relatively simple form of fraud, with comparatively low risk to the fraudsters. Criminals tend to attach skimming devices either late at night or early in the morning, and during periods of low traffic. Skimming devices are usually attached for a few hours only because of battery life in the camera. It is estimated that globally, financial institutions are losing over a billion dollars annually to card skimming.

[7]

Customers count on ATM security, but with ATM skimming on the rise, customer confidence is threatened. 67% of U.S. adults who use banking ATMs would be likely to switch institutions after an instance of ATM fraud or data breach. It is essential that financial institutions take corrective measures to ensure banking security.

[8]

Rules are usually set by the government or ATM operating body that dictate what happens when integrity systems fail. Depending on the jurisdiction, a bank may or may not be liable when an attempt is made to dispense a customer’s money from an ATM and the money either gets outside of the ATM's vault, or was exposed in a non-secure fashion, or they are unable to determine the state of the money after a failed transaction.

[9]

In an attempt to stop these practices, countermeasures against card cloning have been developed by the banking industry, in particular by the use of smart cards which cannot easily be copied or spoofed by unauthenticated devices, and by attempting to make the outside of their ATMs tamper evident. Older chip-card security systems include the French Carte Bleue, Visa Cash, Mondex, Blue from American Express1 and EMV '96 or EMV 3.1 1. The most actively developed form of smart card security in the industry today is known as EMV 2000 or EMV 4.x.

[10]

EMV is widely used in the UK (Chip and PIN) and other parts of Europe, but when it is not available in a specific area, ATMs must fallback to using the easy-to-copy magnetic stripe to perform transactions. This fallback behavior can be exploited. Card cloning and skimming can be detected by the implementation of magnetic card reader heads and firmware that can read a signature embedded in all magnetic stripes during the card production process. This signature known as a "MagnePrint" or "BluPrint" can be used in conjunction with common two factor authentication schemes utilized in ATM, debit/retail point-of-sale and prepaid card applications.

[11]

Another ATM fraud issue is ATM card theft which includes credit card trapping and debit card trapping at ATMs. Originating in South America this type of ATM fraud has spread globally.

[12]

Although somewhat replaced in terms of volume by ATM skimming incidents, a re-emergence of card trapping has been noticed in regions such as Europe where EMV Chip and PIN cards have increased in circulation.

[13]

A Lebanese loop is a device used to commit fraud and identity theft by exploiting automated teller machines (ATMs). Its name comes from its regular use amongst Lebanese financial crime perpetrators, although it has now spread to various other international criminal groups. The Lebanese loop is becoming one of the simplest and most widespread forms used to perpetrate ATM fraud by retaining the user's card. In their simplest form, Lebanese loops consist of a strip or sleeve of metal or plastic (even something as simple as a strip of video cassette tape) that is inserted into the ATM's card slot. When the victim inserts their ATM card, the loop is sufficiently long enough for the card to be fully drawn into the machine and read. The victim then enters their PIN as normal, and requests the funds. The ATM then tries to eject the card, but a "lip" folded at the end of the loop prevents the card from being ejected. The machine senses that the card has not been ejected, and draws the card back into the machine. The cash drawer does not open, and the money that has been counted is retained by the machine. In most cases, the victim's account is not debited. The victim believes the machine has malfunctioned or genuinely retained their card. In some cases, the fraudsters attach a small camera to the ATM to record the victim entering their PIN. The video from this camera is then transmitted to the fraudsters, who may be waiting near the machine and viewing the video on a laptop computer meaning they need not approach the victim directly. There have been cases where a fake keypad is fitted to the machine over the top of the real one, and this records the PINs entered. Once the victim has left the ATM, the perpetrator retrieves the loop and the trapped card, and uses it, along with their PIN, to withdraw cash from the victim's account.

[14]

There are different types of cameras used at locations for security purposes. One type is expensive, and does video analytics itself, or is combined with an expensive encoder attached to the camera (the embedded video analytics automatically monitor the video by watching for motion detection, object recognition and many other security threats). The other is much less expensive and just takes video, from which images can be extracted from every set time period.

[15]

In both cases, the cameras run continuously.

[16]

Various approaches are currently used to address the problem of ATM fraud.

[17]

Diebold sell ATM machines. Their card-skimming technology includes ATM card-reader security designed to deter skimmer attachment, an alert system that warns bank personnel thieves have attached a skimming device to an ATM and an electromagnetic field that interferes with a skimmer's ability to capture a card's magnetic-stripe data.

[18]

Diebold's monitoring center also issues real-time e-mail alerts and text messages warning bank employees of skimming attacks.

[19]

Customers have to buy their equipment; therefore it is not a solution for installed base.

[20]

ADT has CPK+ (Card Protection Kit) technology, an advanced anti-skimming protective device installed inside the ATM near the ATM’s card reader. CPK+ helps prevent the skimming of card data by emitting an electromagnetic field to interrupt the operation of an illegal card-reader head, without interrupting the customer transaction or the operation of most ATMs. They also have Surface Detection Kit (SDK); the SDK sensor helps detect foreign devices placed near or over the ATM card-entry slot, whether made of plastic, paper, iron or wood. Upon detection, it relays output signals, triggering silent alarms for monitoring center response, or to coordinate DVR surveillance sequencing of skimming activities.

[21]

ADT Anti-Skim sensing devices can be integrated with ATM or vestibule surveillance DVRs for video documentation and sequencing of skimming activities and corresponding ATM customer transactions.

[22]

ADT also has Monitoring Centers to monitor the ATM security program, and the customer can receive real-time notification of ATM skimming occurrences and law enforcement or security can be dispatched to review ATMs in alarm or remove detected skimming devices.

[23]

ATM Secure has a product Shadow Shield-ECS which offers the ability to provide an electronic shield in the vicinity of the card reader, thus providing a jamming protection shield. This prevents any card reading-skimming device from collecting data, when placed within a 100mm radius of the ATMs card reader. In addition to this, Shadow Shield-ECS transmits a signal that is designed to confuse and corrupt data collected by an attached skimmer.

[24]

They also have SED-E-field which provides an electronic sensing area around the ATM card slot, and can detect the presence of foreign objects like card skimming devices. Once detected, the SED-E-field can send an alarm signal to the security system, alerting of the detection.

[25]

Wincor Nixdorf sells ATM machines. They have increased even further the security in and around ATMs with the software solution ProView for the remote monitoring of self-service banking systems. Monitoring of anti-skimming modules has now been integrated into the bank machine. Anti-skimming modules are equipped with special sensors that check the area around the card insertion point for illicitly installed attachments. If such a module detects anything suspicious, it sends an “event” to ProView, which immediately initiates a variety of protective measures: for instance, it can activate a camera, photograph the perpetrator, take the ATM offline and generate a report to the service provider. If the camera monitoring an ATM fails, ProView can also take the machine offline.

[26]

They also have an anti-skimming mechanism, which is a plastic insert that can be mounted in the card reader slot. The shape of the special insert is designed to prevent tampering with skimming mechanisms but, at the same time, does not restrict ATM usage. The anti-skimming mechanism is equipped with security technology that puts the machine out of service as soon as the insert is destroyed or the machine removed by force.

[27]

Customers have to buy their equipment: therefore it is not a solution for installed base.

[28]

Jitter technology works via a stop start or jitter motion inside the card drive specifically designed to distort the magnetic stripe details should they be copied onto a foreign card reader inserted into the ATM.

[29]

Video Analytics, also known as IVS (Intelligent Video Surveillance) is a new emerging market for security allowing its users to easily monitor and secure areas with security cameras. With this new state of the art technology, businesses can easily monitor places of interest with sophisticated software that makes detecting threats or unwanted visitors simple and effective.

[30]

Intelligent Video Surveillance consists of algorithms that detect movement or changes in live and recorded video to see whether the movement or changes mean a possible threat is about to occur or occurring. These algorithms work by examining each pixel of the video and putting together all the pixel changes. If many pixels are changing in one area and that area is moving in a direction, the software considers this to be motion. Depending on the policies and alerts that have been setup, the bank will be notified of this motion. Other actions can be automatically taken by the as motion tracking which follows the motion until it is no longer detected. It can include Loitering Detection, Queue Length Monitoring and Facial Detection, among other things.

[31]

There are various problems with current solutions. Jitter is a security feature, but it helps only for simple skimmers. With motorized skimmers or extended skimmers, only a sensory solution will offer protection because magnetic stripe data still can be read.

[32]

Sensor detection does not work well, because it can be set off by a customer’s electronic device like a cell phone or iPod.

[33]

New video cameras are very expensive, and can be prohibitive from a cost point of view given the number of ATMs that would have to be fitted with the cameras.

[34]

No current solution targets cash harvesting, which occurs when the thieves take the money out using the fake cards. No current solution can warn of skimmer installation.

[35]

US patent application no. 2008/0303902 describes a system for collecting video data and transactional data and correlating the two. However, this system is bandwidth intensive because the video data is processed along with the transactional data, and is thus not suitable for largescale systems.

[36]

Summary of the Invention According to the present invention there is provided a video analytics system comprising a first database for storing searchable time-stamped transactional data indicative of activity within a monitored system; a second database for storing time-stamped video metadata; a correlation server for comparing the time-stamped transactional data with the time-stamped video metadata to identify correlation events indicating potential activity of interest; and an output subsystem for reporting the correlation events from the correlation server.

[37]

In one embodiment the time-stamped video metadata comprises searchable attributes associated with a raw video data stream, but it could also be data indicative of a customer session, where the customer is detected by analyzing the frames of the video to detect the presence of a person or just mere motion in the video image.

[38]

It will be appreciated that the first and second databases could be physically separate entities, or could in the alternative equally well be part of the same physical storage medium.

[39]

A feature of invention is the conversion of the rawvideo data into searchable, time-stamped metadata, which can then be stored and correlated without consuming large amounts of bandwidth. A large number of facilities can be monitored at a central location without consuming unmanageable amounts of bandwidth. For example, in the case of an ATM machine, the metadata may indicate the presence of a person in front of the machine. This can then be correlated against the transaction data to confirm that a transaction did take place. If there was no transaction, the raw video data, or frames thereof, can be view to identify the person or see whether there were performing an illegal activity, such as installing a skimming device.

[40]

The output subsystem may be a correlation database for storing the correlated data for subsequent review. It could also be part of a monitoring station for attracting the attention of a supervisor.

[41]

The invention is particularly applicable to ATM fraud detection, but this is just one example of a more general embodiment of the invention that takes two or more independent data streams, time stamped data and time stamped video, and correlates them using time and location and runs an analysis to detect when relevant information is present. This correlation and analysis can be applied to other situations as well. It could be done at a point of sale terminal, when a badge is swiped, when a door is opened or closed, when an area (like a store) is open or closed, when a traffic light is red, green or yellow, when a fire alarm is pulled, etc., or any time there is video data and other time sensitive data that is being stored at the same time. In accordance with embodiments of the invention, by correlating the video metadata extracted from the raw video with other transactional data avoids the need to correlate the transactional data directly with the raw video.

[42]

Embodiments of the invention thus provide an advanced data analytic system that uses video/picture metadata and time sensitive data (which could be ATM transactions, badge swiping, retail transactions, traffic light schedules, operating hours, crime patterns, emergency situations, etc.) and uses customizable rules to detect a predefined activity. It uses an algorithm or correlation engine to do a proactive analysis, and when a possible problem is found, it can alerts the appropriate person(s) to the activity discovered.

[43]

The video can be in the form of metadata obtained from embedded analytics created by a camera and stored on site in a DVR. Such intelligent cameras are however expensive and if not already in place would need to be specially installed. Embodiments of the invention also provide a much more economical solution, which is to take existing video shot from normal cameras, and create the metadata at a centralized location. No software has to reside locally where the cameras are, and all processing and storage is done centrally. This then creates a searchable record of the meta-data taken from the images (license plate, height, colors, facial features etc.).

[44]

If a company is already streaming video to a location like a security room (and the video is not stored anywhere), then a ‘filter’ could be put on this stream which can extract analytics or images on the fly and send them to the centralized location. Another way of conserving bandwidth is to have a trigger, such as motion detection or a transaction which tells the system to capture images for a particular time frame to determine the start and stop time for the session.

[45]

Embodiments of the invention can, for example, be used to determining that there is someone at an ATM with no corresponding transaction, suggesting a possible skimmer installation, or determine that the same person did many transactions in a row at the same ATM using different cards, suggesting possible cash harvesting. Another application is to determine that there is an unauthorized person present in a building during off hours by using the guard schedule (either pre-programmed, or done in real time by badge swiping in different areas), suggesting a possible break-in. Further applications include determining that the same license plate is seen at many different ATM branches, suggesting possible cash harvesting, determining that after a badge swipe, the wrong person or the wrong number of people are in an area, suggesting possible theft or fraud, determining that the same person used a stolen card at one or more locations, suggesting possible theft.

[46]

Also, if it is known that a crime took place at a certain date/time/place, video analytics taken from different cameras in the area can be used to track the person, or for example a hit and run in a parking lot, to determine the license plate. Using video analytics from a subway station combined with the subway schedule to determine if there is enough capacity on the trains at various times of the day.

[47]

The solution lets the user configure at least one variable that influences whether an activity is to be flagged or not. This can be done through a user interface, or an Application Programming Interface (API).

[48]

It also can display the results in a unique way making it very easy for a user to find significant instances.

[49]

Embodiments of the invention correlate video and data evidence that might otherwise take days to collect, and present it to a customizable set of users in an organized dashboard. This enables users to, for example, detect fraud crimes quickly, investigate them easily, and ultimately reduce the losses they cause. Once they are alerted, users can drill down into the data and retrieve all the correlated and relevant information. In the case of fraud detection, this dramatically improves investigation capabilities, reduces the time and cost per investigation, and produces superior evidence for prosecution purposes.

[50]

There is no need for local hardware at each site; therefore there are no protocol or encryption issues.

[51]

Embodiments of the invention permit the user to search network-wide on a broad range of customized transaction data fields, use simple refinement tools to drill down into relevant video and data, combine desired data and synchronized video into compelling files with interactive case notes, and export video clips, images, case notes and/or receipt data for easy use by enforcement personnel.

[52]

Another aspect of the invention provides a method of monitoring a site for activity of interest, comprising storing searchable time-stamped transactional data indicative of activity at the site; storing time-stamped video metadata; and performing a rule-based correlation server of the time-stamped transactional data with the time-stamped video metadata to identify correlation events indicating potential activity of interest, and reporting the correlation events.

[53]

In yet another aspect the invention provides a video analytics system comprising a video analytics system comprising a first database for storing searchable time-stamped transactional data indicative of activity within a monitored system; a second database for storing timestamped metadata indicative of the presence of a person at a location monitored by a video camera producing a raw video stream; a correlation server for comparing the time-stamped transactional data with the time-stamped metadata to identify correlation events indicating potential activity of interest; and a module for analyzing video data corresponding to a correlation event.

[54]

In this embodiment, the analytics are triggered by, for example, the detection of a person in the view of the camera. In this embodiment the system detects a person and then determines whether there is a matching transaction.

[55]

In one embodiment the video analytics module is configured to identify the presence of a person by carrying out the following steps:

[56]

1) retrieve any available data with a time-stamp; 2) retrieve one or more video frames corresponding to the time-stamp; 3) analyze the retrieved frames to identify correlation events; and 4) store the retrieved video frames or raw video stream associated with correlation events.

[57]

This embodiment of the invention also conserves bandwidth because the person detection is performed on individual frames rather than the video stream.

[58]

Brief Description of the Drawings The invention will now be described in more detail, by way of example only, with reference to the accompanying drawings, in which: - Figure 1 is a block diagram of a video analytics system in accordance with one embodiment of the invention; Figure 2 is a more detailed block diagram of the correlation server; Figure 3 is a flow diagram illustrating operation of the system; Figure 4 is a block diagram of the correlation server; Figure 5 is the pseudo code for a skimming installation module; Figure 6 is the pseudo code for a person-based harvesting detection module; Figure 7 is the pseudo code for a license-based harvesting detection module; Figure 8 is the pseudo code for an unauthorized access module; Figure 9 is the pseudo code for a capacity detection pseudo code module; Figure 10 is the pseudo code for a pinpoint detection module; Figure 1 1 is the pseudo code for a perpetrator detection module; Figure 12 is the pseudo code for a vagrant on premises detection module; and Figure 13 is a flow chart of an exemplary bandwidth-efficient embodiment for creating a session to perform video analytics.

[59]

Detailed Description of the Invention:

[60]

The invention will be described by reference to one exemplary use of the video analytics system, which is for ATM fraud detection.

[61]

In this embodiment, the ATM video and transaction records are converted into searchable data.

[62]

This can be used to detect people approaching and loitering at an ATM, capture total time at ATM facility, target skimming device installation or removal, detect multiple card use per session as indicator of cash harvesting, and detect cars being used at more than one ATM.

[63]

Investigator performance can be improved by filtering data daily to flag all new threats automatically, consolidating all data and video events in a single, organized dashboard, creating notification of alarms.

[64]

Any bank or credit union is able to providing customizable alarms that can be tailored to detect the specific behaviors each institution requires, allowing new rules to be tested against historical data to more quickly uncover new fraud patterns, and solve a cost problem of expensive cameras that have video analytics by providing a lower cost software solution that does the same thing.

[65]

Referring now to Figure 1, the system contains multiple time sensitive servers 107 connected to the LAN/WAN 105 that contain time sensitive data, such as ATM transactions, or guard schedules, or train/subway schedules. These store the information in Databases 108, and provide an AP1 1 10 to access this data. They can reside in a Branch Office 1 17, or in a Cloud Service or Head Office 118.

[66]

Another source of data comes from Video Data Servers 103 connected to the LAN/WAN These servers provide access to raw video data or video analytics data, derived from the raw video data as searchable metadata, stored in Raw Video/Analytics Databases 102. They can reside in a Branch Office 117, or in a Cloud Service or Head Office 118.

[67]

The Correlation Server 100 contains Crime Detection Software 101, responsible for using the time sensitive data and the video metadata to determine if there is fraudulent activity in accordance with a predetermined, but configurable, set of rules. It can also have a remote function running on a Remote Correlation Server 1 05, which can run the Video Analytics Creation Module 21 1 to create video analytics from raw video. If fraudulent activity is found, it stores the information in correlation Database 104. Database 104 also contains any image references, a copy of the time sensitive data, and the video and video analytics (which can come from the Video Data Server 103, or could have been created by the Remote Correlation Server 105 from raw video).

[68]

The video metadata or video analytics is searchable time-stamped data associated with the video as well as other information about the video. For example, the video analytics may detect a person present in front of an ATM machine. The video metadata includes this searchable tag, which may be stored in association with an image of the person. Also, face recognition software might be used to tag the person with a particular identity, either known or unknown. The latter might be useful, for example, to determine that the same person withdrew cash from different machines within a specified time frame, as this could indicate cash harvesting.

[69]

The metadata can be derived from the video either at the source, using a more expensive camera, or by the system, for example, using the correlation server. To avoid using up unnecessary bandwidth, in a scenario where a bank branch has an installed base of of dumb cameras, the analytics could be done on the local network at the branch so that only the metadata need to be sent to a central monitoring or analysis station. Alternatively the raw video could be sent to the central monitoring station for analysis, but this would require more bandwidth.

[70]

Communications Servers 106 are also connected to the LAN/WAN 105, which provide a standard AP1 113 that lets the Correlation Server 100 handle any communication that is sent to a programmable set of interested parties when an incident occurs. Examples of communications sent are email, SMS, voice calls, tweets, chats, pages, etc.

[71]

Users 112 can access the data in Database 104, set up parameters for the system, configure the rules for determining the fraudulent activity and cause analysis to be done by using web pages 111 that are connected through the LAN/Wan 105 to the Correlation Server 100. The User 112 can also view incidents, search incidents, view the video associated with the incidents, view the data associated with the incidents, and add notes to the incidents. These APIs are implemented using known techniques.

[72]

Figure 2 is a more detailed view of the Correlation Server 1 00. This includes a Time Sensitive Data Interface module 200, which handles the different APIs 110 that are used by the Time Sensitive Data Servers 107. The correlation server 100 also includes a Video Data Interface module 201, which handles the different APIs 109 that are used by the Video Data Servers 103.

[73]

There is a Communications Server Interface module 202, which handles the different APIs 113 that are used by the Communications Servers 107. The correlation server also includes video data interface 203, which interfaces with video data servers 103. These APIs also provide access to the data in Raw Video/Analytics Database 102. These components all managed by the Control Layer 210.

[74]

This control layer is responsible for coordinating all the activity that goes on within the correlation server. The Video Analytics Creation Module 21 1 uses video analytics algorithms to extract data from a given standard video or set of images. It can also be asked by the Get, Decompose and Store Video Data module 301 (see Figure 3) to analyze the video frame by frame as shown in Figure 13. This reduces bandwidth since when it is done, there is no need to download the video, just a couple of frames and the analytic information on what happened during the determined time period. This can be done locally, or it can reside on a remote server and is accessed through a Video Analytics Creation Module Interface 207. There is a Web Interface 203 which talks to Web Pages 111 in a standard way, for example using SOAP 205.

[75]

There is a Correlation Database Interface 204, which uses a standard database API 206 to get and store data in the Correlation Database 1 04. There are a set of pluggable Correlation Modules 213, which does the work of finding incidents and reporting on them.

[76]

Figure 3 shows a Get, Decompose and Store Time Sensitive Data module 300, which when it is told to do so by the Timer/Capacity Queue 302, it is responsible for interfacing to the Time Sensitive Data Interface 208 to get time sensitive data for a particular time frame, break it up into different records, and store these records in the Correlation Database 104, by using the Write/Get Data and Records module 305.

[77]

The Get, Decompose and Store Video Data module 301, when it is told to do so by the Timer/Capacity Queue 302, is responsible for interfacing to the Video Data Interface 208 to get video data for a particular time frame, (and at a specific frame/sec rate in the case of video - to conserve bandwidth), break it up into different records, and store these records in the Correlation Database 104, by using the Write/Get Data and Records module 305. If it is retrieving straight video, it takes the video and passes it to the Video Creation Module 211, which uses standard techniques to extract video analytics from the passed in data, which are given back to be stored in the database.

[78]

The Timer/Capacity Queue 302 is also responsible for deciding when to kick the Correlation Module Kicker 303, based on either time, or a certain capacity of data being reached. It keeps track of which correlation modules need to be kicked, and what the trigger is.

[79]

The Correlation Module Kicker 303 is used to kick or activate the various Correlation Modules 213. It is told to do this either by the Timer/Capacity Queue 302, by the Web Interface 203 (when a User 112 decides they want to), or the Video Data Interface 201, which can be programmed to receive a motion trigger, which in turn can kick the Correlation Module Kicker 303.

[80]

When a User 112 wants to access the data in the Correlation Database 104, the Web Interface 203 uses the Write/Get Data and Records module 305 which in turn talks to the Correlation Database Interface 204 to retrieve the data. The Web Interface 203 is responsible for formatting the data and sending it to the Web Pages 111. The user can also request to have a particular video analyzed, and certain characteristics searched for. The Web Interface 203 kicks the Get, Decompose and Store Video Data module 301 to do this.

[81]

The Write/Get Data and Records module 305 gets requests to store data and records from the Get, Decompose and Store Time Sensitive Data module 300, and the Get, Decompose and Store Video Data module 301. It retrieves data for the Web Interface module 203 and the Correlation Modules 213. The Correlation Modules 213 also use it to store records that they create.

[82]

The Notify module 304 is used by the Correlation Modules 213 to talk to the Communications Server Interface to send out various types of communications to a programmable group of people.

[83]

Figure 4 shows the pluggable Correlation Modules 213 broken down into some example modules. The Control Layer Interface 400 passes the various requests to and from the different plug-in modules. These correlation modules use helper modules that perform standard functions. The helper modules are the Notification Module 408, which has stored the group of people to be notified, and how they should be notified, the Request Further Analytics Module 409, which can request that a video for a particular time period be retrieved, and further analytics be done on it by the Video Creation Module 21 1, and the Create and Store Incident Report Module 410, which knows how to format the information given to it into an incident report and have it stored.

[84]

The Skimming Installation Module 401 uses the data it retrieves to determine if a skimmer has been installed at a card reader site.

[85]

The Unauthorized Access Module 402 uses the data to determine if there is an unauthorized person or persons present.

[86]

The Capacity Detection Module 403 uses the data to determine if enough people are being serviced in a particular time period.

[87]

The Vagrant on Premises Module 404 determines if there is a person staying in one spot for a period of time (i.e. someone sleeping).

[88]

The Harvesting Detection Module 405 uses the data to see if the same person, or the same license plate does more than one transaction in a row using the same card reader, but different cards.

[89]

The Perpetrator Detection Module 406 correlates the use of a stolen card with videos obtained from different sites.

[90]

The Pinpoint Detection Module 407 uses the area where a crime occurred, the time that it occurred, and the video from different cameras to do an analysis to try and pinpoint the perpetrator.

[91]

Other correlation modules can be created and plugged in that use the data available to come to a conclusion and produce a report.

[92]

Figures 5 to 12 give examples of pseudo code executed by the modules 401 to 406 in Figure 4.

[93]

Figure 1 3 illustrates one example of video analytics that might be performed on a video stream by creation module 21 1 in Figure 2.

[94]

At step 1300, a trigger occurs at time T to get an image from the video at step 1301. At step 1302 the image is analyzed and a determination made (step 1303) whether a person is in the image at time T-x, where x is a predefined time period. If not, the process terminates 1 304. If yes, at step 1 305, the creation module retrieves an image from the video and at time T + x and analyzes the video at step 1306 to determine if there is a different person in the image at step 1 307. If the person is the same, the loop repeats from step 1 305 at a later time T + 2x relative to the original time T and so on. If there is a different person in the image, the loop passes to step 1 309, which sets the analysis time at an intermediate time between the first and second steps.

[95]

The loop repeats until no person is found in the image at step 1310.

[96]

This process thus identifies the fact that a person was present at the ATM and for how long.

[97]

This metadata can be sent to the video database and stored along with the relevant images and a link to the associated video. A monitoring station can be notified by the communications server 106 in the event that a person lingers for an unusual amount of time at an ATM machine or is found at several different ATMs within a sort time-frame.

[98]

ATM fraud detection is just an example of taking two or more independent data streams, time stamped data and time stamped video, and correlating them using time and location and running an analysis to detect when relevant information is present. This correlation and analysis can be applied to other situations as well. It could be done at a point of sale terminal, when a badge is swiped, when a door is opened or closed, when an area (like a store) is open or closed, when a traffic light is red, green or yellow, when a fire alarm is pulled, etc. Any time there is video data and other time sensitive data that is being stored at the same time.

[99]

Advantages of described system include the fact that it can be used on any installed system as there is no need to change the data collector and cameras, it combines and correlates and analyzes video analytics with transaction software that is better than just video or data alone., it can be integrated with existing CCTV/Analog video systems or implemented with new state of the art IP network camera, fewer personnel are needed to view video. With standard video systems, someone must always be watching. This decreases labor costs and increases productivity.

[100]

The system also capitalizes on existing video analytics data and combines it with transaction data to expose possible fraud, not detectable by the two individually. It can be used to detect unwanted people in an area, not needing special equipment, just a video camera. It can also be offered as a central solution inside a company that has many video cameras, or as a cloud solution.

[101]

It should be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention. For example, a processor may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, network processor, application specific integrated circuit (ASIC), field programmable gate array (FPGA), read only memory (ROM) for storing software, random access memory (RAM), and non volatile storage. Other hardware, conventional and/or custom, may also be included. The term circuit is used herein to encompass functional blocks that may in practice be implemented in software.



A video analytics system includes a first database for storing searchable time-stamped transactional data indicative of activity within a monitored system, a second database for storing time-stamped video metadata, wherein the time-stamped video metadata comprises searchable attributes associated with a raw video data stream; and a rule-based correlation server for comparing the time-stamped transactional data with the time-stamped video metadata to identify correlation events indicating potential activity of interest. An output subsystem reports the correlation events from the correlation engine. The analytics system is useful for detecting fraud in ATM transactions by comparing the transactional data, for example, the presence of a transaction, with video metadata, for example, indicating whether a transaction occurs when a person is present, for how long the person is there.



1. A video analytics system comprising:

a first database for storing searchable time-stamped transactional data indicative of activity within a monitored system; a second database for storing time-stamped video metadata; a correlation server for comparing the time-stamped transactional data with the timestamped video metadata to identify correlation events indicating potential activity of interest; and an output subsystem for reporting the correlation events from the correlation server.

2. A video analytics system as claimed in claim 1, wherein the time-stamped video metadata comprises searchable attributes associated with a raw video data stream.

3. A video analytics system as claimed in claim 1, wherein the time-stamped video metadata comprises data indicative of the presence of a person at a monitored location.

4. A video analytics system as claimed in any one of claims 1 to 3, further comprising a video analytics module for deriving the time-stamped video metadata by analyzing the raw video stream in accordance with a stored set of rules.

5. A video analytics system as claimed in claim 4, wherein the video analytics module configured to identify the presence of a person in a video image.

6. A video analytics system as claimed in claim 5, wherein the video analytics module configured to perform a loop when a particular person is detected to determine the approximate time of arrival and departure of that person.

7. A video analytics system as claimed in claim 5 or 6, wherein the video analytics module is triggered in response to the detection of a video motion event.

8. A video analytics system as claimed in any one of claim s 4 to 7, wherein the rules are user configurable.

9. A video analytics system as claimed in any one of claims 1 to 8, wherein the output subsystem includes a correlation database for storing the correlation events for later review.

10. A video analytics system as claimed in claim 9, wherein one or more corresponding video frames or corresponding raw video is stored with a correlation event.

11. A video analytics system as claimed in any one of claims 1 to 10, wherein the output subsystem includes a communications server for notifying an interested party of a correlation event.

12. A video analytics system as claimed in any one of claimsl to 1 1, further comprising timesensitive data servers for feeding transactional data to said first database.

13. A video analytics system as claimed in any one of claims 1 to 12, further comprising a client station for displaying correlation events retrieved from said output subsystem to a user as web pages.

14. A video analytics system as claimed in claim in any one of claims 1 to 13, wherein the correlation server comprises a first interface for receiving the time-stamped transactional data, a video interface for receiving the video data, a video analytics module for generating the video metadata, and correlation modules for performing rule-based correlation using a set of rules associated with the type of correlation event associated with each particular module, and a control layer for controlling data flow within the correlation server.

15. A video analytics system as claimed in claim 14, wherein the control layer communicates with a correlation database interface for connection to a correlation database storing correlation events.

16. A video analytics system as claimed in claim 15, wherein the control layer communicates with a video data interface for connection to a video data server.

17. A video analytics system as claimed in any one of claims 14 to 17, wherein the control layer is configured to request further video metadata for additional processing in response to an indeterminate correlation event.

18. A video analytics system as claimed in any one of claims 14 to 17, wherein the transactional data is ATM transactional data and the correlation modules include one or more of the following: skimming installation module, unauthorized access module, vagrant on premises module, and harvesting detection module.

19. A video analytics system as claimed in any one of claims 14 to 17, wherein the correlation modules include one or more of the following: capacity detection module, perpetrator detection module, and pinpoint detection module

20. A video analytics system as claimed in any one of claims 1 to 1 9, wherein the components of the system are geographically distributed and connected over a communications network.

21. A video analytics system as claimed in claim 20, which is configured such that the timestamped video metadata is obtained at a local monitored site and transmitted to another location for analysis by the correlation server.

22. A video analytics system as claimed in claim 21, wherein the time-stamped video metadata is produced by individual cameras at the local monitored site.

23. A video analytics system as claimed in claim 22, wherein the time-stamped video metadata is produced by an analytics engine at the local monitored site, which receives raw video data from one or more cameras.

24. A video analytics system comprising:

a first database for storing searchable time-stamped transactional data indicative of activity within a monitored system; a second database for storing time-stamped metadata indicative of the presence of a person at a location monitored by a video camera producing a raw video stream; a correlation server for comparing the time-stamped transactional data with the timestamped metadata to identify correlation events indicating potential activity of interest; and a module for analyzing video data corresponding to a correlation event.

25. A video analytics system as claimed in claim 24, wherein the video analytics module is configured to identify the presence of a person or object by carrying out the following steps:

1) retrieve any available data with a time-stamp;

2) retrieve one or more video frames corresponding to the time-stamp;

3) analyze the retrieved frames to identify correlation events; and

4) store the retrieved video frames or raw video stream associated with correlation events.

26. A video analytics system as claimed in claim 25, which is programmed to perform a new search of video and metadata in response to the identification of correlation events.

27. A video analytics system as claimed in claim 25 or 26, wherein the video analytics module is configured to perform a loop when a particular person is detected to determine the approximate time of arrival and departure of that person.

28. A correlation server for a video analytics system, comprising:

a first interface for receiving searchable time-stamped transactional data indicative of activity within a monitored system; a second interface for receiving time-stamped video metadata, wherein the timestamped video metadata comprises searchable attributes associated with a raw video data stream; and one or more rule-based correlation modules for comparing the time-stamped transactional data with the time-stamped video metadata to identify correlation events indicating potential activity of interest; and an output interface for reporting the correlation events.

29. A correlation server as claimed in claim 28, further comprising a video analytics module for deriving the time-stamped video metadata by analyzing the raw video stream in accordance with a stored set of rules.

30. A correlation server as claimed in claim 28 or 29, further comprising a user interface for permitting user customization of the rules.

31. A correlation server as claimed in any one of claims 28 to 30, wherein the correlation server further comprises correlation modules for performing rule-based correlation using a set of rules associated with the type of correlation event associated with each particular module, and a control layer for controlling data flow within the correlation server.

32. A correlation server as claimed in claim 31, wherein the control layer communicates with a correlation database interface for connection to a correlation database storing correlation events.

33. A correlation server as claimed in claim 32, wherein the control layer communicates with a video data interface for connection to a video data server.

34. A correlation server system as claimed in any one of claims 31 to 33, wherein the control layer is configured to request further video metadata for additional processing in response to an indeterminate correlation event.

35. A correlation server system as claimed in any one of claims 31 to 34, wherein the transactional data is ATM transactional data and the correlation modules include one or more of the following: skimming installation module, unauthorized access module, vagrant on premises module, and harvesting detection module.

36. A video analytics system as claimed in any one of claims 31 to 35, wherein the correlation modules include one or more of the following: capacity detection module, perpetrator detection module, and pinpoint detection module

37. A method of monitoring a site for activity of interest, comprising:

storing searchable time-stamped transactional data indicative of activity at the site; storing time-stamped video metadata; and performing a rule-based correlation server of the time-stamped transactional data with the time-stamped video metadata to identify correlation events indicating potential activity of interest, and reporting the correlation events.

38. A method as claimed in claim 37, wherein the time-stamped video metadata comprises searchable attributes associated with a raw video data stream taken at the site

39. A method as claimed in claim 38, wherein the time-stamped video metadata is derived by analyzing the raw video stream in accordance with a stored set of rules.

40. A method as claimed in claim 39, wherein the time-stamped video metadata includes an indication of the presence of a person in a video image.

41. A method as claimed in claim 40, wherein the correlation server performs a loop when a particular person is detected to determine the approximate time of arrival and departure of that person.

42. A method as claimed in any one of claims 37 to 41, wherein the correlation events are stored in a correlation database later review.

43. A method as claimed in any one of claims 36 to 40, wherein the correlation server notifies an interested party of a correlation event.

44. A method as claimed in any one of claims 37 to 42, wherein the correlation events are sent to a user as web pages.

45. A method as claimed in any one of claims 37 to 44, wherein the correlation server requests further video metadata for additional processing in response to an indeterminate correlation event.

46. A method as claimed in any one of claims 37 to 45, wherein the transactional data is ATM transactional data and the correlation server performs the following functions: skimming installation detection, unauthorized access detection, vagrant on premises detection, harvesting detection.

47. A method as claimed in any One of claims 37 to 46, wherein various functions are performed at different locations in communication with each other over a communications network.

48. A method as claimed in claim 47, wherein the time-stamped video metadata is obtained at a local monitored site and transmitted to another location for analysis by the correlation server.

49. A method as claimed in claim 48, wherein the time-stamped video metadata is produced by individual cameras at the local monitored site.

50. A method as claimed in claim 48, wherein the time-stamped video metadata is produced by an analytics engine at the local monitored site, which receives raw video data from one or more cameras.