Publication date: 07-02-2019
Number: US20190042761A1
Authors:
Shih-Han Wang,
Yonghong Huang,
Micah Sheller,
Cory Cornelius
Assignee:
Embodiments discussed herein may be generally directed to systems and techniques to generate a quality score based on an observation and an action caused by an actor agent during a testing phase. Embodiments also include determining a temporal difference between the quality score and a previous quality score based on a previous observation and a previous action, determining whether the temporal difference exceeds a threshold value, and generating an attack indication in response to determining the temporal difference exceeds the threshold value.

1. An apparatus, comprising:
memory to store instructions; and
processing circuitry coupled with the memory:
an actor agent, executable by the processing circuitry, to cause an action in a processing environment based on an observation during a testing phase;
a critic agent, executable by the processing circuitry, to generate a quality score based on the observation and the action caused by the actor agent during the testing phase; and
a temporal difference detector, executable by the processing circuitry, to:
    determine a temporal difference between the quality score and a previous quality score based on a previous observation and a previous action,
    determine whether the temporal difference exceeds a threshold value,
    generate an attack indication in response to determining the temporal difference exceeds the threshold value, and
    permit processing of a next observation and a next action in response to determining the temporal difference does not exceed the threshold value.

2. The apparatus of claim 1, wherein the attack indication to indicate an occurrence of an attack via an input in the processing environment, the attack comprising one or more of a Fast Gradient Sign Method (FGSM) attack and a random attack.

3. The apparatus of claim 1, the actor agent to cause a series of actions including the action and the previous action, and the critic agent to determine a sequence of quality scores based ...
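
The threshold check recited above can be illustrated with a minimal sketch. It is not the implementation described in the application. The class name `TemporalDifferenceDetector`, the use of an absolute difference between consecutive quality scores, and the threshold and score values are all assumptions made for illustration; in practice the quality scores would come from a critic agent.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TemporalDifferenceDetector:
    """Hypothetical detector: flags a step whose quality score jumps too far."""

    threshold: float                       # assumed tuning value, not from the source
    previous_score: Optional[float] = None

    def check(self, quality_score: float) -> bool:
        """Return True (attack indication) when the temporal difference
        between this score and the previous one exceeds the threshold."""
        if self.previous_score is None:
            # First step: there is no previous score to compare against.
            self.previous_score = quality_score
            return False
        # Assumption: the temporal difference is taken as an absolute difference.
        temporal_difference = abs(quality_score - self.previous_score)
        self.previous_score = quality_score
        return temporal_difference > self.threshold


# Illustrative quality scores; the jump from 1.05 to 3.0 is the only one above 0.5.
detector = TemporalDifferenceDetector(threshold=0.5)
flags = [detector.check(q) for q in [1.0, 1.1, 1.05, 3.0, 3.1]]
print(flags)  # [False, False, False, True, False]
```

When `check` returns False, the caller would go on to process the next observation and action; when it returns True, the caller would raise the attack indication instead.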