Total found: 3574. Displayed: 199.
Publication date: 24-10-2001

Caching data

Number: GB0000121115D0

Publication date: 21-12-2011

Cache for a multiprocessor system which can treat a local access operation as a shared access operation

Number: GB0002481232A

A data processing system has several processors 300, 320, 340, each with its own cache 310, 330, 350. Accesses by a processor to its cache may be local or shared. The processor contains a flag 307, which causes a local access to be treated as a global access. The operation may be a clean operation, an invalidate operation or a memory barrier operation issued by an operating system. The flag may be set when a hypervisor moves a virtual machine from one processor to another. Local operations by the hypervisor may be treated as being local when the flag is set. The cache may be a data cache or a translation lookaside buffer.

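A minimal sketch in C of the mechanism above, with hypothetical names (cpu_state, cache_clean, broadcast_clean) standing in for the hardware: when the flag is set, a clean issued as a local operation is carried out as a shared (global) one.

#include <stdbool.h>
#include <stdio.h>

/* Per-processor state: the flag (307 in the figure) may be set, e.g.
 * when a hypervisor has moved a virtual machine to another processor. */
struct cpu_state {
    bool treat_local_as_global;
};

static void clean_local_cache(int cpu) { printf("clean local cache of cpu %d\n", cpu); }
static void broadcast_clean(void)      { printf("broadcast clean to all caches\n"); }

/* A clean issued by the operating system as a local operation is
 * promoted to a shared (global) operation whenever the flag is set. */
void cache_clean(struct cpu_state *st, int cpu)
{
    if (st->treat_local_as_global)
        broadcast_clean();
    else
        clean_local_cache(cpu);
}

int main(void)
{
    struct cpu_state st = { .treat_local_as_global = true };
    cache_clean(&st, 0);   /* takes the broadcast path */
    return 0;
}
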
Publication date: 02-03-1993

CACHE WITH AT LEAST TWO DIFFERENT FILL SIZES

Number: CA0001314107C

A method and apparatus for optimizing the performance of a cache memory system is disclosed. During the operation of a computer system whose processor is supported by virtual cache memory, the cache must be cleared and refilled to allow the replacement of old data and instructions with more current instructions and data. The cache is filled with either P or N blocks of data. Numerous methods for dynamically selecting N or P blocks of data are possible. Pursuant to one embodiment, immediately after the cache has been flushed, the miss is refilled with N blocks, allowing data and instructions to be moved to the cache at high speed. Once the cache is mostly full, the miss tends to be refilled with P blocks, which is less than N. This maintains the currency of the data in the cache, while simultaneously avoiding writing-over of data or instructions already in the cache. The invention is particularly advantageous in a multi-user/multi-tasking system where the program being run changes frequently ...

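A back-of-the-envelope sketch of the two fill sizes; the values of N and P and the 75% "mostly full" threshold are illustrative assumptions (the abstract notes that numerous selection methods are possible).

#include <stdio.h>

/* Right after a flush the cache is refilled N blocks at a time; once it
 * is mostly full, smaller P-block fills preserve resident data. */
enum { N_BLOCKS = 8, P_BLOCKS = 2 };

unsigned refill_size(unsigned lines_valid, unsigned lines_total)
{
    /* "mostly full" taken as >= 75% occupancy for illustration */
    return (lines_valid * 4 >= lines_total * 3) ? P_BLOCKS : N_BLOCKS;
}

int main(void)
{
    printf("after flush: fill %u blocks\n", refill_size(10, 1024));
    printf("mostly full: fill %u blocks\n", refill_size(900, 1024));
    return 0;
}
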
Publication date: 04-05-2021

INSTRUCTION CACHE IN A MULTI-THREADED PROCESSOR

Number: CA3040901C
Assignee: GRAPHCORE LIMITED

A processor comprising: a barrel-threaded execution unit for executing concurrent threads, and a repeat cache shared between the concurrent threads. The processor's instruction set includes a repeat instruction which takes a repeat count operand. When the repeat cache is not claimed and the repeat instruction is executed in a first thread, a portion of code is cached from the first thread into the repeat cache, the state of the repeat cache is changed to record it as claimed, and the cached code is executed a number of times. When the repeat instruction is then executed in a further thread, then the already-cached portion of code is again executed a respective number of times, each time from the repeat cache. For each of the first and further instructions, the repeat count operand in the respective instruction specifies the number of times to execute the cached code.

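A rough model of the claim-and-replay behaviour described above; the repeat_cache structure and its fields are assumptions, with a string standing in for the cached portion of code.

#include <stdbool.h>
#include <stdio.h>

/* The first thread to execute the repeat instruction claims the cache
 * and fills it; threads arriving later replay the cached code. */
struct repeat_cache {
    bool claimed;
    const char *code;   /* stands in for the cached instructions */
};

void repeat_exec(struct repeat_cache *rc, const char *code, int repeat_count)
{
    if (!rc->claimed) {          /* first thread: cache and claim */
        rc->code = code;
        rc->claimed = true;
    }
    for (int i = 0; i < repeat_count; i++)   /* per-thread count operand */
        printf("executing %s, iteration %d\n", rc->code, i);
}

int main(void)
{
    struct repeat_cache rc = { false, NULL };
    repeat_exec(&rc, "loop-body", 2);   /* first thread claims and runs */
    repeat_exec(&rc, "unused", 3);      /* further thread replays cached code */
    return 0;
}
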
Publication date: 13-02-2018

MEMORY RESOURCE OPTIMIZATION METHOD AND APPARATUS

Number: CA0002927372C

Embodiments of the present invention provide a memory resource optimization method and apparatus. They relate to the computer field, solve the problem that existing multi-level memory resources affect each other, and optimize an existing single partitioning mechanism. A specific solution is: obtaining performance data of each program in a working set by using a page coloring technology, obtaining a category of each program in light of a memory access frequency, selecting, according to the category of each program, a page coloring-based partitioning policy corresponding to the working set, and writing the page coloring-based partitioning policy to an operating system kernel, to complete corresponding coloring-based partitioning processing. The present invention is used to eliminate or reduce mutual interference of processes or threads on a storage resource in light of a feature of the working set, thereby improving overall performance of a computer.

Publication date: 01-09-2020

Dynamic sharing buffer supporting multiple protocols

Number: CN0111611180A

Publication date: 28-11-2017

Method for determining shared virtual memory page management mode and related devices

Number: CN0107402891A

Publication date: 08-06-2018

Distributed cache dynamic migration

Number: CN0108139974A

Publication date: 29-03-2019

Systems, methods and interfaces for adaptive persistence

Number: CN0104903872B

Publication date: 01-10-2020

ONBOARD SECURE ELEMENT

Number: WO2020193663A1

The present invention concerns an onboard secure element (E) comprising a virtual memory (VRAM), and being configured to implement at least part of a first application (App20) adapted to be implemented by at least one low level operating system (113) of the onboard secure element (E), wherein execution data relating to one or more secondary tasks of said first application (App20) are stored in part of said virtual memory (VRAM) when the execution of said part of the first application (App20) is interrupted by the execution of at least part of a second application (App21).

Publication date: 27-04-2017

SYSTEM AND METHOD FOR A SHARED CACHE WITH ADAPTIVE PARTITIONING

Number: WO2017069907A1

A cache controller adaptively partitions a shared cache. The adaptive partitioning cache controller includes tag comparison and staling logic and selection logic that are responsive to client access requests and various parameters. A component cache is assigned a target occupancy which is compared to a current occupancy. A conditional identification of stale cache lines is used to manage data stored in the shared cache. When a conflict or cache miss is identified, selection logic identifies candidates for replacement preferably among cache lines identified as stale. Each cache line is assigned to a bucket with a fixed number of buckets per component cache. Allocated cache lines are assigned to a bucket as a function of the target occupancy. After a select number of buckets are filled, subsequent allocations result in oldest cache lines being marked stale. Cache lines are deemed stale when their respective component cache active indicator is de-asserted.

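A simplified sketch of the bucket scheme above; the rotating-bucket bookkeeping and all names are assumptions, with the bucket capacity derived from the target occupancy and the oldest bucket's lines reported as stale once allocations wrap around.

#include <stdio.h>

enum { NUM_BUCKETS = 4 };   /* fixed number of buckets per component cache */

struct component_cache {
    unsigned target_occupancy;
    unsigned bucket_fill[NUM_BUCKETS];
    unsigned newest, oldest;
};

static unsigned bucket_capacity(const struct component_cache *cc)
{
    return cc->target_occupancy / NUM_BUCKETS;
}

/* Returns the bucket receiving the new line; *stale_bucket is set to the
 * bucket whose lines should now be marked stale, or -1 if none. */
int allocate_line(struct component_cache *cc, int *stale_bucket)
{
    *stale_bucket = -1;
    if (cc->bucket_fill[cc->newest] >= bucket_capacity(cc)) {
        cc->newest = (cc->newest + 1) % NUM_BUCKETS;
        if (cc->newest == cc->oldest) {        /* wrapped: oldest ages out */
            *stale_bucket = (int)cc->oldest;
            cc->oldest = (cc->oldest + 1) % NUM_BUCKETS;
            cc->bucket_fill[*stale_bucket] = 0;
        }
    }
    cc->bucket_fill[cc->newest]++;
    return (int)cc->newest;
}

int main(void)
{
    struct component_cache cc = { .target_occupancy = 8 };
    for (int i = 0; i < 12; i++) {
        int stale;
        int b = allocate_line(&cc, &stale);
        if (stale >= 0)
            printf("line -> bucket %d, bucket %d marked stale\n", b, stale);
    }
    return 0;
}
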
Publication date: 24-11-2016

SYSTEMS AND METHODS FOR ADDRESSING A CACHE WITH SPLIT-INDEXES

Number: WO2016185272A1
Author: RICHMOND, Richard

Cache memory mapping techniques are presented. A cache may contain an index configuration register. The register may configure the locations of an upper index portion and a lower index portion of a memory address. The portions may be combined to create a combined index. The configurable split-index addressing structure may be used, among other applications, to reduce the rate of cache conflicts occurring between multiple processors decoding a video frame in parallel.

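A small sketch of the split-index combination; the field layout of the index configuration register is an assumption.

#include <stdint.h>
#include <stdio.h>

/* The configuration register selects an upper and a lower bit field of
 * the address; the two fields are concatenated into the set index. */
struct index_config {
    unsigned lo_shift, lo_bits;   /* lower index portion */
    unsigned hi_shift, hi_bits;   /* upper index portion */
};

uint32_t combined_index(uint64_t addr, const struct index_config *cfg)
{
    uint32_t lo = (uint32_t)(addr >> cfg->lo_shift) & ((1u << cfg->lo_bits) - 1);
    uint32_t hi = (uint32_t)(addr >> cfg->hi_shift) & ((1u << cfg->hi_bits) - 1);
    return (hi << cfg->lo_bits) | lo;   /* upper bits above lower bits */
}

int main(void)
{
    struct index_config cfg = { 6, 4, 20, 3 };   /* example split */
    printf("set %u\n", combined_index(0x12345678ULL, &cfg));
    return 0;
}
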
Publication date: 09-06-2020

Dynamic adjustment of a process scheduler in a data storage system based on loading of the data storage system during a preceding sampling time period

Number: US0010678480B1

Technology for dynamically adjusting a process scheduler in a storage processor of a data storage system. An average amount of host data contained in sets of host data processed by host I/O request processing threads is calculated. An average amount of time required for each host I/O request processing thread to execute to completely process the average amount of host data contained in a set of host data is also calculated. Operation of the process scheduler in the storage processor is then adjusted to cause the process scheduler to subsequently allocate the processor in the storage processor to host I/O request processing threads in timeslices having a duration that is at least as large as the average amount of time required for each host I/O request processing thread to execute to completely process the average amount of host data contained in a set of host data.

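A sketch of the adjustment rule in C; the structure and parameter names are assumptions, and the timeslice is simply raised to at least the measured average completion time.

#include <stdio.h>

struct sched_params { double timeslice_us; };

/* Timeslices granted to host I/O request processing threads are made at
 * least as large as the average time a thread needs to completely
 * process an average-sized set of host data. */
void adjust_timeslice(struct sched_params *sp,
                      double total_exec_us, unsigned long sets_processed)
{
    double avg_us = total_exec_us / sets_processed;
    if (sp->timeslice_us < avg_us)
        sp->timeslice_us = avg_us;
}

int main(void)
{
    struct sched_params sp = { .timeslice_us = 100.0 };
    adjust_timeslice(&sp, 5000000.0, 20000);   /* average: 250 us */
    printf("new timeslice: %.0f us\n", sp.timeslice_us);
    return 0;
}
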
Publication date: 21-05-2019

System, apparatus and method for dynamic profiling in a processor

Number: US0010296464B2
Assignee: Intel Corporation

In one embodiment, an apparatus includes: a storage having a plurality of entries each to store address information of an instruction and a count value of a number of executions of the instruction during execution of code including the instruction; and at least one comparator circuit to compare a count value from one of the plurality of entries to a threshold value, where the instruction is a tagged instruction of the code, the tagged instruction tagged by a static compiler prior to execution of the code. Other embodiments are described and claimed.

Publication date: 04-01-2022

Hardware accelerator automatic detection of software process migration

Number: US0011216377B2
Assignee: NXP USA, Inc.

A mechanism is provided by which a hardware accelerator detects migration of a software process among processors and uses this information to write operation results to an appropriate cache memory for faster access by the current processor. This mechanism is provided, in part, by incorporation within the hardware accelerator of a mapping table having entries including a cache memory identifier associated with a processor identifier. The hardware accelerator further includes circuitry configured to receive a processor identifier from a calling processor, and to perform a look-up in the mapping table to determine the cache memory identifier associated with the processor identifier. The hardware accelerator uses the associated cache memory identifier to write results of called operations to the cache memory associated with the calling processor, thereby accelerating subsequent operations by the calling processor that rely upon the hardware accelerator results.

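A minimal sketch of the mapping-table lookup; the table size and names are assumptions.

#include <stdio.h>

enum { MAX_CPUS = 8 };

/* The accelerator keeps a table associating each processor identifier
 * with the identifier of that processor's cache; results of a called
 * operation are written to the caller's cache. */
struct mapping_table { int cache_id[MAX_CPUS]; };

int cache_for_cpu(const struct mapping_table *mt, int cpu_id)
{
    return mt->cache_id[cpu_id];   /* look-up keyed by processor id */
}

int main(void)
{
    struct mapping_table mt = { .cache_id = { 0, 0, 1, 1, 2, 2, 3, 3 } };
    /* a call arriving from processor 5 is steered to cache 2 */
    printf("write results to cache %d\n", cache_for_cpu(&mt, 5));
    return 0;
}
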
Publication date: 18-09-2018

Using leases for entries in a translation lookaside buffer

Number: US0010078588B2

The described embodiments include a computing device with two or more translation lookaside buffers (TLBs) that performs operations for handling entries in the TLBs. During operation, the computing device maintains lease values for entries in the TLBs, the lease values representing times until leases for the entries expire, wherein a given entry in the TLB is invalid when the associated lease has expired. The computing device uses the lease value to control operations that are allowed to be performed using information from the entries in the TLBs. In addition, the computing device maintains, in a page table, longest lease values for page table entries indicating when corresponding longest leases for entries in TLBs expire. The longest lease values are used to determine when and if a TLB shootdown is to be performed.

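A sketch of the two lease checks under assumed structures: an entry is usable only while its lease is unexpired, and a shootdown is needed only while the page table's longest lease still lies in the future.

#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

struct tlb_entry { uint64_t lease_expires; /* plus tag and translation */ };

bool tlb_entry_usable(const struct tlb_entry *e, uint64_t now)
{
    return now < e->lease_expires;   /* expired lease => invalid entry */
}

bool shootdown_needed(uint64_t longest_lease, uint64_t now)
{
    return now < longest_lease;   /* some TLB may still hold the entry */
}

int main(void)
{
    struct tlb_entry e = { .lease_expires = 1000 };
    printf("usable at t=900: %d\n", tlb_entry_usable(&e, 900));
    printf("shootdown at t=1200 (longest lease 1000): %d\n",
           shootdown_needed(1000, 1200));
    return 0;
}
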
Publication date: 04-10-2018

EFFICIENT MULTI-CONTEXT THREAD DISTRIBUTION

Number: US20180285110A1
Assignee: Intel Corporation

Methods and apparatus relating to techniques for avoiding cache lookup for cold cache. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to determine a first number of threads to be scheduled for each context of a plurality of contexts in a multi-context processing system, allocate a second number of streaming multiprocessors (SMs) to the respective plurality of contexts, and dispatch threads from the plurality of contexts only to the streaming multiprocessor(s) allocated to the respective plurality of contexts. Other embodiments are also disclosed and claimed.

Publication date: 07-01-2020

Increased bandwidth of ordered stores in a non-uniform memory subsystem

Number: US0010528253B2

A method, computer program product, and system for maintaining a proper ordering of a data stream that includes two or more sequentially ordered stores, the data stream being moved to a destination memory device, the two or more sequentially ordered stores including at least a first store and a second store, wherein the first store is rejected by the destination memory device. A computer-implemented method includes sending the first store to the destination memory device. A conditional request is sent to the destination memory device for approval to send the second store to the destination memory device, the conditional request dependent upon successful completion of the first store. The second store is cancelled responsive to receiving a reject response corresponding to the first store.

Publication date: 07-03-2019

DEFERRED RESPONSE TO A PREFETCH REQUEST

Number: US2019073309A1

Modifying prefetch request processing. A prefetch request is received by a local computer from a remote computer. The local computer responds to a determination that execution of the prefetch request is predicted to cause an address conflict during an execution of a transaction of the local processor by comparing a priority of the prefetch request with a priority of the transaction. Based on a result of the comparison, the local computer modifies program instructions that govern execution of the program instructions included in the prefetch request to include program instructions to perform one or both of: (i) a quiesce of the prefetch request prior to execution of the prefetch request, and (ii) a delay in execution of the prefetch request for a predetermined delay period.

Publication date: 07-03-2019

CONTROL DEVICE, METHOD AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM

Number: US2019073147A1

A control device includes a nonvolatile memory, a first processor, a first volatile memory coupled to the first processor, a second processor, and a second volatile memory coupled to the second processor, wherein the first processor is configured to transmit first data stored in the first volatile memory to the second processor by using electric power supplied from a backup power supply, the second processor is configured to store the first data in the second volatile memory, after storing the first data in the second volatile memory, the backup power supply stops supplying the electric power to at least one of the first volatile memory and the first processor, and the second processor is configured to store, in the nonvolatile memory, the first data stored in the second volatile memory.

Publication date: 01-03-2018

METHOD AND SYSTEMS FOR MASTER ESTABLISHMENT USING SERVICE-BASED STATISTICS

Number: US20180060236A1

A method and apparatus are described for assigning mastership of nodes to data blocks. A method involves connecting each session of a plurality of sessions to a particular node of a cluster of nodes based on services associated with the plurality of sessions. Each session of the plurality of sessions is associated with a respective service of a plurality of services. The method also involves collecting service-based access statistics aggregated by service and ranges of data block addresses. Each range corresponds to one or more contiguous subranges of data block addresses. The method further involves assigning mastership of the nodes to the data blocks having addresses within said ranges of data block addresses based on services associated with the nodes and the service-based access statistics.

Publication date: 13-02-2018

Method and apparatus for improving non-uniform memory access

Number: US0009892030B2

A method, computer readable medium and apparatus for improving non-uniform memory access are disclosed. For example, the method divides a plurality of stream processing jobs into a plurality of groups of stream processing jobs to match a topology of a non-uniform memory access platform. The method sets a parameter in an operating system kernel of the non-uniform memory access platform to favor an allocation of a local memory, and defines a plurality of processor sets. The method binds one of the plurality of groups to one of the plurality of processor sets, and runs the one group of stream processing jobs on the one processor set.

Publication date: 03-07-2018

Transactional execution processor having a co-processor accelerator, both sharing a higher level cache

Number: US0010013351B2

A higher level shared cache of a hierarchical cache of a multi-processor system utilizes transaction identifiers to manage memory conflicts in corresponding transactions. The higher level cache is shared with two or more processors. A processor may have a corresponding accelerator that performs operations on behalf of the processor. Transaction indicators are set in the higher level cache corresponding to the cache lines being accessed. The transaction aborts if a memory conflict with the transaction's cache lines from another transaction is detected, and the corresponding cache lines are invalidated. For a successfully completing transaction, the corresponding cache lines are committed and the data from store operations is stored.

Publication date: 03-01-2023

Semiconductor device, control system, and control method of semiconductor device

Number: US0011544192B2
Assignee: RENESAS ELECTRONICS CORPORATION

A semiconductor device includes first and second CPUs, first and second SPUs for controlling a snoop operation, a controller supporting ASIL D of a functional safety standard, and a memory. The controller sets permission of the snoop operation to the first and second SPUs when a software lock-step is not performed. The controller sets prohibition of the snoop operation to the first and second SPUs when the software lock-step is performed. The first CPU executes a first software for the software lock-step, and writes an execution result in a first area of the memory. The second CPU executes a second software for the software lock-step, and writes an execution result in a second area of the memory. The execution result written in the first area is compared with the execution result written in the second area.

Publication date: 07-04-2022

CACHE PROBE TRANSACTION FILTERING

Number: US20220107897A1

Examples described herein relate to circuitry to selectively disable cache snoop operations issued by a particular processor or its cache manager based on data in a memory address range, to be accessed by the particular processor, having been flushed from one or more other cache devices accessible to other processors. At or after completion of flushing or scrubbing data in the memory address range to memory, the particular processor or its cache manager does not issue snoop operations for accesses to the memory address range. In response to an access by some other device to the memory address range, the processor or cache manager may resume issuing snoop operations.

Publication date: 16-07-2014

Partitioning a shared cache using masks associated with threads to avoid thrashing

Number: GB0002509755A

The invention relates to fill partitioning of a shared cache. In one embodiment, all threads running on a processor are able to access any data stored in the shared cache; however, in the event of a cache miss, a thread may be restricted such that it can only store data in a portion of the shared cache. The restrictions may be implemented for all cache miss events or for only a subset of those events. For example, the restrictions may be implemented only when the shared cache is full and/or only for particular threads. The restrictions may also be applied dynamically, for example, based on conditions associated with the cache. Different portions may be defined for different threads (e.g. in a multi-threaded processor) and these different portions may, for example, be separate and non-overlapping. Fill partitioning may be applied to any on-chip cache, for example, an L1 cache.

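A sketch of the fill restriction with hypothetical names: lookups may hit in any way, but a miss may only allocate into the ways enabled in the thread's fill mask while the restriction (here, a full cache) is active.

#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

enum { NUM_WAYS = 8 };

/* Returns the set of ways a miss by this thread may fill. */
uint8_t fill_ways(uint8_t thread_fill_mask, bool cache_full)
{
    return cache_full ? thread_fill_mask
                      : (uint8_t)((1u << NUM_WAYS) - 1);  /* unrestricted */
}

int main(void)
{
    /* this thread restricted to ways 0-3 once the cache is full */
    printf("victim ways: 0x%02x\n", fill_ways(0x0F, true));
    printf("victim ways: 0x%02x\n", fill_ways(0x0F, false));
    return 0;
}
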
Publication date: 26-09-2012

Scheduling graphics processor tasks in parallel with corresponding cache coherency operations based on data dependencies

Number: GB0002489278A

A data processing system has several processors of different types, such as a central processing unit (CPU) and a graphics processing unit (GPU). Each processor has a cache for data from the main memory. Cache consistency operations are used to ensure that data in the cache of one processor can be correctly accessed by other processors. Tasks T1–T7, without data dependencies, are scheduled for one of the processors. All the consistency operations C1–C7 for a task are executed before a task starts. The consistency operations for one task are performed while another task is being executed. The data dependencies of the tasks may be re-evaluated after the execution of a task. The tasks may be split into sub-tasks without data dependencies. The first task may be selected to minimise the initial latency due to its consistency operations.

Publication date: 24-05-1995

Cache affinity scheduler

Number: GB0002284081A

A computing system 50 includes N symmetrical computing engines having N cache memories joined by a system bus 12. The computing system includes a global run queue (54), an FPA global run queue, and N affinity run queues (58). Each engine is associated with one affinity run queue, which includes multiple slots. When a process first becomes runnable, it is typically attached to one of the global run queues. A scheduler allocates engines to processes and schedules the processes to run on the basis of priority and engine availability. The system keeps track of the number of processes queued to each processor and can transfer a process from one processor to another with a shorter queue. ...

Publication date: 03-08-2005

A simultaneous multi-threading processor accessing a cache in different power modes according to a number of threads

Number: GB0002410584A

A cache memory associated with a Simultaneous Multi-Threading (SMT) processor, which comprises a tag memory and a data memory. The tag and data memories are accessed in two modes: concurrently, with each accessed at the same time, or sequentially, with the tag memory being accessed before the data memory. The mode of memory access is chosen according to the number of threads running on the processor, allowing the processor to operate in a high-power or low-power mode, thus scaling power consumption according to activity.

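A sketch of the mode selection; the enum and the single-thread threshold are assumptions reflecting the low-power sequential case and the high-power concurrent case.

#include <stdio.h>

/* With one thread, tags are read before data so only the matching way is
 * accessed (low power); with several threads both arrays are accessed
 * concurrently to minimise latency (high power). */
enum access_mode { TAG_THEN_DATA, TAG_AND_DATA_CONCURRENT };

enum access_mode choose_access_mode(int running_threads)
{
    return running_threads > 1 ? TAG_AND_DATA_CONCURRENT : TAG_THEN_DATA;
}

int main(void)
{
    printf("1 thread  -> mode %d (sequential, low power)\n",
           choose_access_mode(1));
    printf("2 threads -> mode %d (concurrent, high power)\n",
           choose_access_mode(2));
    return 0;
}
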
Publication date: 01-06-2005

Caching data

Number: GB0002379294B
Author: DANAN ITAI
Assignee: DISCREET LOGIC INC

Publication date: 11-05-2011

Improving the scheduling of tasks to be performed by a non-coherent device

Number: GB0201104958D0

Publication date: 30-01-2019

Cache memory access

Number: GB0002564994A

A multiprocessor data processing system includes multiple vertical cache hierarchies supporting a plurality of processor cores, a system memory, and a system interconnect. In response to a load-and-reserve request from a first processor core, a first cache memory supporting the first processor core issues on the system interconnect a memory access request for a target cache line of the load-and-reserve request. Responsive to the memory access request and prior to receiving a system-wide coherence response for the memory access request, the first cache memory receives from a second cache memory in a second vertical cache hierarchy, by cache-to-cache intervention, the target cache line and an early indication of the system-wide coherence response for the memory access request. In response to the early indication and prior to receiving the system-wide coherence response, the first cache memory initiates processing to update the target cache line in the first cache memory.

Publication date: 14-09-2022

Providing direct data access between accelerators and storage in computing environment

Number: GB0002604785A

A method for providing direct access to non-volatile memory in a computing environment by a processor comprises providing one or more accelerators, via an application programming interface ("API"), direct access to non-volatile storage independent of a host central processing unit ("CPU") on a control path or data path to perform a read operation and write operation of data.

Publication date: 04-06-2001

Buffer memories, methods and systems for buffering having separate buffer memories for each of a plurality of tasks

Number: AU0007728300A
Author: DENT PAUL W

Publication date: 27-06-2020

INSTRUCTION CACHE IN A MULTI-THREADED PROCESSOR

Number: CA0003040901A1
Assignee: BERESKIN & PARR LLP/S.E.N.C.R.L.,S.R.L.

A processor comprising: a barrel-threaded execution unit for executing concurrent threads, and a repeat cache shared between the concurrent threads. The processor's instruction set includes a repeat instruction which takes a repeat count operand. When the repeat cache is not claimed and the repeat instruction is executed in a first thread, a portion of code is cached from the first thread into the repeat cache, the state of the repeat cache is changed to record it as claimed, and the cached code is executed a number of times. When the repeat instruction is then executed in a further thread, then the already-cached portion of code is again executed a respective number of times, each time from the repeat cache. For each of the first and further instructions, the repeat count operand in the respective instruction specifies the number of times to execute the cached code.

Publication date: 27-02-2013

Method and apparatus for memory utilization

Number: CN101196833B

Publication date: 11-08-2023

Network-on-chip system and control method thereof

Number: CN116578523A
Author: ZHU HAIJIE

The embodiment of the invention discloses a network-on-chip system and a control method thereof. The network-on-chip system comprises a first network layer and a second network layer, wherein the first network layer comprises a routing node array, a processing node array and a cache consistency node array, and each routing node in the routing node array is respectively connected with one corresponding processing node in the processing node array and one corresponding cache consistency node in the cache consistency node array; the routing node is used for forwarding the communication transaction request of the processing node to the cache consistency node or the cache consistency nodes corresponding to other routing nodes; the second network layer is connected with the first network layer through a bonding layer and comprises a cache node array, and cache nodes in the cache node array are connected with one cache consistency node in the cache consistency node array through bonding contacts ...

Publication date: 14-05-2021

CONFIDENTIAL COMPUTING MECHANISM

Number: WO2021091744A1

According to a first aspect, execution logic is configured to perform a linear capability transfer operation which transfers a physical capability from a partition of a first software module to a partition of a second software module without retaining it in the partition of the first. According to a second, alternative or additional aspect, the execution logic is configured to perform a sharding operation whereby a physical capability is divided into at least two instances, which may later be combined.

Publication date: 30-03-2017

METHOD AND APPARATUS FOR CACHE LINE DEDUPLICATION VIA DATA MATCHING

Number: WO2017053109A1

A cache fill line is received, including an index, a thread identifier, and cache fill line data. The cache is probed, using the index and a different thread identifier, for a potential duplicate cache line. The potential duplicate cache line includes cache line data and the different thread identifier. Upon the cache fill line data matching the cache line data, duplication is identified. The potential duplicate cache line is set as a shared resident cache line, and the thread share permission tag is set to a permission state.

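A sketch of the duplicate check under assumed structures: a fill for one thread probes the resident line stored under a different thread identifier and, on a data match, marks it shared instead of filling a duplicate.

#include <stdbool.h>
#include <stdio.h>
#include <string.h>

struct cache_line {
    unsigned thread_id;
    unsigned char data[64];
    bool shared;               /* thread share permission tag */
};

/* Returns true when the resident line can serve the fill as a shared
 * resident line, so the duplicate fill is dropped. */
bool try_dedup(struct cache_line *resident,
               unsigned fill_thread, const unsigned char *fill_data)
{
    if (resident->thread_id != fill_thread &&
        memcmp(resident->data, fill_data, sizeof resident->data) == 0) {
        resident->shared = true;   /* set share permission, skip the fill */
        return true;
    }
    return false;
}

int main(void)
{
    struct cache_line resident = { .thread_id = 0, .data = { 1, 2, 3 } };
    unsigned char fill[64] = { 1, 2, 3 };
    bool dedup = try_dedup(&resident, 1, fill);
    printf("dedup=%d shared=%d\n", dedup, resident.shared);
    return 0;
}
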
Publication date: 12-05-2020

Image processing device, image processing method, and non-transitory computer readable medium for image processing

Number: US0010650481B2

An image processing device executes image processing by each object of an object group in which plural objects are connected to each other in a directed acyclic graph form. The image processing device includes: a division unit that divides image data as an image processing target into division image data having a first size; a subdivision unit that subdivides the division image data into subdivision image data having a second size smaller than the first size for each partial processing which is image processing to be performed on the division image data, the division image data corresponding to the partial processing which is determined as executable processing based on a pre-and-post dependency relationship; and a control unit that performs control for causing plural computation devices to execute subdivision partial processing which is image processing to be performed on the subdivision image data, in parallel.

Publication date: 29-12-2015

Data cache method, device, and system in a multi-node system

Number: US0009223712B2

A data cache method, device, and system in a multi-node system are provided. The method includes: dividing a cache area of a cache medium into multiple sub-areas, where each sub-area corresponds to a node in the system; dividing each of the sub-areas into a thread cache area and a global cache area; when a process reads a file, detecting a read frequency of the file; when the read frequency of the file is greater than a first threshold and the size of the file does not exceed a second threshold, caching the file in the thread cache area; or when the read frequency of the file is greater than the first threshold and the size of the file exceeds the second threshold, caching the file in the global cache area. Thus overheads of remote access of a system are reduced, and I/O performance of the system is improved.

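A sketch of the two-threshold placement rule; names and the example thresholds are assumptions.

#include <stdio.h>

/* Cold files are not cached; hot files go to the thread cache area when
 * small and to the global cache area when large. */
enum placement { DONT_CACHE, THREAD_AREA, GLOBAL_AREA };

enum placement place_file(unsigned read_freq, unsigned long long size,
                          unsigned freq_thresh, unsigned long long size_thresh)
{
    if (read_freq <= freq_thresh)
        return DONT_CACHE;                  /* not read often enough */
    return size <= size_thresh ? THREAD_AREA : GLOBAL_AREA;
}

int main(void)
{
    /* hot 4 KiB file vs hot 1 GiB file, thresholds 10 reads / 64 MiB */
    printf("%d %d\n",
           place_file(50, 4096ULL, 10, 64ULL << 20),
           place_file(50, 1ULL << 30, 10, 64ULL << 20));
    return 0;
}
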
Publication date: 03-11-2020

Balanced, opportunistic multicore I/O scheduling from non-SMP applications

Number: US0010826848B2
Assignee: NETAPP, INC.

A system for dynamically configuring and scheduling input/output (I/O) workloads among processing cores is disclosed. Resources for an application that are related to each other and/or not multicore safe are grouped together into work nodes. When these need to be executed, the work nodes are added to a global queue that is accessible by all of the processing cores. Any processing core that becomes available can pull and process the next available work node through to completion, so that the work associated with that work node software object is all completed by the same core, without requiring additional protections for resources that are not multicore safe. Indexes track the location of both the next work node in the global queue for processing and the next location in the global queue for new work nodes to be added for subsequent processing.

Publication date: 21-11-2017

Per thread cacheline allocation mechanism in shared partitioned caches in multi-threaded processors

Number: US0009824013B2

Systems and methods for allocation of cache lines in a shared partitioned cache of a multi-threaded processor. A memory management unit is configured to determine attributes associated with an address for a cache entry associated with a processing thread to be allocated in the cache. A configuration register is configured to store cache allocation information based on the determined attributes. A partitioning register is configured to store partitioning information for partitioning the cache into two or more portions. The cache entry is allocated into one of the portions of the cache based on the configuration register and the partitioning register.

Publication date: 16-04-2020

EFFICIENT MULTI-CONTEXT THREAD DISTRIBUTION

Number: US20200117455A1
Assignee: Intel Corporation

Methods and apparatus relating to techniques for avoiding cache lookup for cold cache. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to determine a first number of threads to be scheduled for each context of a plurality of contexts in a multi-context processing system, allocate a second number of streaming multiprocessors (SMs) to the respective plurality of contexts, and dispatch threads from the plurality of contexts only to the streaming multiprocessor(s) allocated to the respective plurality of contexts. Other embodiments are also disclosed and claimed.

Publication date: 23-11-2023

COMPILE TIME LOGIC FOR DETECTING AND RESOLVING MEMORY LAYOUT CONFLICTS

Number: US20230376292A1
Assignee: SambaNova Systems, Inc.

The technology disclosed relates to automatically assigning and optimizing the physical memory layouts of all intermediate dense tensor data in a program. The technology disclosed is an implementation of a compiler analysis and transformation pass which automatically determines required physical layouts in light of kernel operation and performance requirements. The proposed solution also inserts physical layout conversion operations where necessary in cases of unresolvable incompatibilities. The pass takes as input a program acyclic dataflow graph and a set of physical layout constraints for every known operation.

Publication date: 25-07-2018

DISTRIBUTED CACHE LIVE MIGRATION

Number: EP3350713A1

Publication date: 23-11-2000

Method for increasing the useful performance of multiprocessor systems

Number: DE0019833221C2

The principle of "natural affinity" is supplemented by a further control mechanism: program sections of different processes that require data from the same memory regions are uniformly marked and, on the basis of this marking, are assigned to the same processor (CPU) when their accesses compete in time. The affinity between different processes created in this way further reduces the number of memory accesses to the main memory (MM) and to the caches (SIC) of the other processors (CPU), and thus increases the useful performance of the multiprocessor system. If no affine process can be assigned to an idle processor (CPU), a non-affine process is assigned instead in order to avoid idle losses, and an indicator is set which leads to an interruption as soon as an affine process becomes available.

Publication date: 26-02-2015

IMPROVED USE OF MEMORY RESOURCES

Number: DE102014012155A1

Methods for increasing the efficiency of memory resources in a processor are described. In one embodiment, instead of including a dedicated DSP indirect-register resource for storing data associated with DSP instructions, this data is stored in an allocated and locked region of the cache. The state of all cache lines used for storing DSP data is then set so as to prevent the data from being written out to memory. The size of the allocated region in the cache can vary according to the amount of DSP data to be stored, and when no DSP instructions are running, no cache resources are allocated for storing the DSP data.

Publication date: 13-10-2010

Keeping a cache consistent with a main memory in a system with simultaneous multithreading

Number: GB0002469299A

A multithreaded system has a processor core, a cache and a main memory. Between the cache and the main memory are an incoherency detection module and a memory arbiter. The incoherency detection module has a memory which stores the addresses of write requests. When the incoherency detection module receives a read request, it compares the address with the write addresses for other tasks stored in the memory. If there is a match, it sends a barrier request to the memory arbiter for each of the tasks with a matching write. It then annotates the read request with sideband data to indicate which tasks had matching writes. The memory arbiter does not send the read request to main memory until all the requests for the tasks indicated in the sideband data which are ahead of the barrier requests have been processed.

Publication date: 04-10-1995

Computing system

Number: GB0002284081B

Publication date: 24-12-2014

Speculative load issue

Number: GB0002501582B
Author: JACKSON HUGH

Publication date: 28-08-2013

Virtual Machine Backup

Number: GB0201312422D0

Publication date: 15-09-2007

INTEGRATED CIRCUIT WITH DYNAMIC MEMORY DISPATCHING

Number: AT0000372548T

Publication date: 25-01-2019

Method and apparatus for distributing load instructions in a program to the data cache

Number: CN0105808211B

Publication date: 07-08-2018

Microprocessor and method for managing its performance and power consumption

Number: CN0104572500B

Publication date: 08-06-2018

System and method for split-index cache addressing

Number: CN0108139975A

Publication date: 16-05-2023

Caching method, caching architecture, heterogeneous architecture and electronic equipment

Number: CN116126747A

The invention provides a caching method, a caching architecture, a heterogeneous architecture and electronic equipment, applied to the technical field of computers and chips. The caching architecture comprises a cache read processing module, a cache write processing module, a cold item detection module, a memory read processing module and a memory write processing module. The cache read processing module, the cache write processing module and the cold item detection module are realized on the coprocessor side, and the memory read processing module and the memory write processing module are realized on the general processor side. A novel cache architecture is arranged in a heterogeneous architecture, so that part of the functions of the coprocessor are moved up to the general processor, the overall process of data caching is smooth and highly efficient, the influence of bottlenecks such as area, power consumption and electric leakage on the coprocessor is reduced, the cost ...

Publication date: 06-07-2017

MEMORY NODE WITH CACHE FOR EMULATED SHARED MEMORY COMPUTERS

Number: WO2017115007A1
Author: FORSELL, Martti

Data memory node (400) for ESM (Emulated Shared Memory) architectures (100, 200), comprising a data memory module (402) containing data memory for storing input data therein and retrieving stored data therefrom responsive to predetermined control signals, a multi-port cache (404) for the data memory, said cache being provided with at least one read port (404A, 404B) and at least one write port (404C, 404D, 404E), said cache (404) being configured to hold recently and/or frequently used data stored in the data memory (402), and an active memory unit (406) at least functionally connected to a plurality of processors via an interconnection network (108), said active memory unit (406) being configured to operate the cache (404) upon receiving a multioperation reference (410) incorporating a memory reference to the data memory of the data memory module from a number of processors of said plurality, wherein responsive to the receipt of the multioperation reference the active memory unit (406) ...

Publication date: 04-02-2021

CACHE USAGE MEASURE CALCULATION DEVICE, CACHE USAGE MEASURE CALCULATION METHOD, AND CACHE USAGE MEASURE CALCULATION PROGRAM

Number: WO2021019674A1

A cache usage measure calculation device (1) is provided with: a memory from which data is read and to which data is written; a cache which can be accessed at a higher speed than the memory; a central processing unit which performs processing by reading from and writing to the memory and the cache; a usage status measurement unit which measures the status of the usage of the cache by applications (11a, 11b) executed by the central processing unit; a performance measurement unit which measures the cache sensitivity and/or the cache pollution level with respect to the applications (11a, 11b); and a measure calculation unit which calculates a measure of the cache sensitivity and/or the cache pollution level for each of a plurality of pre-selected applications from performance degradation of the pre-selected applications and the usage status of the cache.

Publication date: 19-07-2018

PARTITIONING TLB OR CACHE ALLOCATION

Number: WO2018130802A1

A request for data from a cache (TLB or data/instruction cache) specifies a partition identifier allocated to a software execution environment associated with the request. Allocation of data to the cache is controlled based on a set of configuration information selected based on the partition identifier specified by the request. For a TLB, this allows different allocation policies to be used for requests associated with different software execution environments. In one example, the cache allocation is controlled based on an allocation threshold specified by the selected set of configuration information, which limits the maximum number of cache entries allowed to be allocated with data associated with the corresponding partition identifier.

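A minimal sketch of the threshold check, assuming a configuration record selected by partition identifier; the names are assumptions.

#include <stdbool.h>
#include <stdio.h>

/* Each partition identifier selects a set of configuration information
 * whose allocation threshold caps the number of cache (or TLB) entries
 * the corresponding software execution environment may occupy. */
struct partition_config { unsigned alloc_threshold; };

bool may_allocate(const struct partition_config *cfg, unsigned entries_in_use)
{
    return entries_in_use < cfg->alloc_threshold;
}

int main(void)
{
    struct partition_config cfg = { .alloc_threshold = 256 };
    printf("allocate? %d\n", may_allocate(&cfg, 100));   /* below the cap */
    printf("allocate? %d\n", may_allocate(&cfg, 256));   /* at the cap: no */
    return 0;
}
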
Publication date: 31-01-2019

LOCK ADDRESS CONTENTION PREDICTOR

Number: WO2018057293A3

Techniques for selectively executing a lock instruction speculatively or non-speculatively based on lock address prediction and/or temporal lock prediction, including methods and devices for locking an entry in a memory device. In some techniques, a lock instruction executed by a thread for a particular memory entry of a memory device is detected. Whether contention occurred for the particular memory entry during an earlier speculative lock is detected on a condition that the lock instruction comprises a speculative lock instruction. The lock is executed non-speculatively if contention occurred for the particular memory entry during an earlier speculative lock. The lock is executed speculatively if contention did not occur for the particular memory entry during an earlier speculative lock.

Publication date: 07-04-2020

Tracking modifications to a virtual machine image that occur during backup of the virtual machine

Number: US0010613940B2

A computer system comprises a processor unit arranged to run a hypervisor running one or more virtual machines; a cache connected to the processor unit and comprising a plurality of cache rows, each cache row comprising a memory address, a cache line and an image modification flag; and a memory connected to the cache and arranged to store an image of at least one virtual machine. The processor unit is arranged to define a log in the memory and the cache further comprises a cache controller arranged to set the image modification flag for a cache line modified by a virtual machine being backed up, but not for a cache line modified by the hypervisor operating in privilege mode; periodically check the image modification flags; and write only the memory address of the flagged cache rows in the defined log.

Publication date: 23-03-2021

Temporarily suppressing processing of a restrained storage operand request

Number: US0010956337B2

Processing of a storage operand request identified as restrained is selectively, temporarily suppressed. The processing includes determining whether a storage operand request to a common storage location shared by multiple processing units of a computing environment is restrained, and based on determining that the storage operand request is restrained, then temporarily suppressing requesting access to the common storage location pursuant to the storage operand request. The processing unit performing the processing may proceed with processing of the restrained storage operand request, without performing the suppressing, where the processing can be accomplished using cache private to the processing unit. Otherwise the suppressing may continue until an instruction, or operation of an instruction, associated with the storage operand request is next to complete.

Publication date: 01-02-2018

MULTIPLE CHANNEL CACHE MEMORY AND SYSTEM MEMORY DEVICE UTILIZING A PSEUDO-MULTIPLE PORT FOR COMMANDS AND ADDRESSES AND A MULTIPLE FREQUENCY BAND QAM SERIALIZER/DESERIALIZER FOR DATA

Number: US20180032436A1
Author: Sheau-Jiung Lee

A high performance, low power, and cost effective multiple channel cache-system memory system is disclosed.

1. A computing device comprising: a first chip comprising one or more CPU cores, a memory controller coupled to the one or more CPU cores, and a first serializer-deserializer device; a second chip comprising cache memory managed by the memory controller, a data router, and a second serializer-deserializer device; system memory separate from the first chip and the second chip and managed by the memory controller; and an analog interface coupled to the first chip and the second chip, wherein the first serializer-deserializer device and the second serializer-deserializer device exchange data over the interface using quadrature amplitude modulation; wherein a memory request from the one or more CPU cores is serviced by the memory controller and the data router by providing data to the one or more CPU cores from the cache memory or the system memory.
2. The computing device of claim 1, wherein the memory controller is coupled to the one or more CPU cores with a processor bus.
3. The computing device of claim 2, further comprising a system bus coupled to the memory controller.
4. The computing device of claim 3, wherein the system bus is coupled to one or more graphics processor unit (GPU) cores.
5. The computing device of claim 4, wherein the memory controller comprises an arbiter for managing control of the analog interface.
6. A computing device comprising: a first chip comprising one or more CPU cores, a memory controller coupled to the one or more CPU cores, and a first serializer-deserializer device; a second chip comprising cache memory managed by the memory controller, a data router, and a second serializer-deserializer device; system memory separate from the first chip and the second chip and managed by the memory controller; an analog interface coupled to the first chip and the second chip, wherein the first serializer-deserializer device and the second serializer-...

Publication date: 22-05-2018

Parallel computing apparatus, compiling apparatus, and parallel processing method for enabling access to data in stack area of thread by another thread

Number: US0009977759B2
Assignee: FUJITSU LIMITED

A parallel computing apparatus includes a first processor that executes a first thread, a second processor that executes a second thread, and a memory. The memory includes a first private area that corresponds to the first thread, a second private area that corresponds to the second thread, and a shared area. The first processor stores first data in the first private area and stores address information that enables access to the first data in the shared area. The second processor stores second data in the second private area, accesses the first data based on the address information, and generates third data based on the first and second data.

Publication date: 26-06-2018

System and method for cache replacement using conservative set dueling

Number: US0010007620B2
Assignee: Intel Corporation

A processor includes a set associative cache and a cache controller. The cache controller makes an initial association between first and second groups of sampled sets in the cache and first and second cache replacement policies. Follower sets in the cache are initially associated with the more conservative of the two policies. Following cache line insertions in a first epoch, the associations between the groups of sampled sets and cache replacement policies are swapped for the next epoch. If the less conservative policy outperforms the more conservative policy during two consecutive epochs, the follower sets are associated with the less conservative policy for the next epoch. Subsequently, if the more conservative policy outperforms the less conservative policy during any epoch, the follower sets are again associated with the more conservative policy. Performance may be measured based on the number of cache misses associated with each policy.

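A simplified model of the conservative set-dueling decision at the end of each epoch; the structure and the per-policy miss counters are assumptions.

#include <stdbool.h>
#include <stdio.h>

/* Two groups of sampled sets run the two policies and swap roles each
 * epoch; follower sets adopt the less conservative policy only after it
 * wins two consecutive epochs, and fall back as soon as it loses one. */
struct dueling {
    bool groups_swapped;        /* which sampled group runs which policy */
    int  less_conservative_wins;
    bool followers_less_conservative;
};

void end_of_epoch(struct dueling *d,
                  unsigned misses_conservative, unsigned misses_less)
{
    if (misses_less < misses_conservative) {
        if (++d->less_conservative_wins >= 2)
            d->followers_less_conservative = true;   /* two wins in a row */
    } else {
        d->less_conservative_wins = 0;
        d->followers_less_conservative = false;      /* immediate fallback */
    }
    d->groups_swapped = !d->groups_swapped;   /* swap group/policy mapping */
}

int main(void)
{
    struct dueling d = { 0 };
    end_of_epoch(&d, 120, 100);
    end_of_epoch(&d, 130, 100);   /* second consecutive win */
    printf("followers use less conservative policy: %d\n",
           d.followers_less_conservative);
    return 0;
}
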
Publication date: 23-05-2023

Device and method for maintaining summary consistency in caches

Number: US0011656991B2
Author: Aviv Kuvent, Yair Toaff
Assignee: Huawei Technologies Co., Ltd.

An information processing device comprises: a memory comprising a cache for storing information related to an object from a plurality of objects, and a summary structure configured to store a summary for the object; a volume configured to store a merge file including the plurality of objects, and a set of dump-files, each dump-file being associated with a specific cache-dump operation of the cache; and a processor configured to assign, to the cache, a first identifier; perform a cache-dump operation based on generating a dump-file associated with the first identifier and storing the information related to the object from the cache to the generated dump-file; and assign, to the cache, a second identifier, wherein the second identifier is larger than the first identifier.

Publication date: 03-10-2023

Extended tags for speculative and normal executions

Number: US0011775308B2
Assignee: Micron Technology, Inc.

A cache system having cache sets, registers associated with the cache sets respectively, and a logic circuit coupled to a processor to control the cache sets according to the registers. When a connection to an address bus of the system receives a memory address from the processor, the logic circuit can be configured to: generate an extended tag from at least the memory address; and determine whether the generated extended tag matches with a first extended tag for a first cache set or a second extended tag for a second cache set of the system. Also, the logic circuit can also be configured to implement a command received from the processor via the first cache set in response to the generated extended tag matching with the first extended tag and via the second cache set in response to the generated extended tag matching with the second extended tag.

Publication date: 26-09-2023

Scheduling of threads for execution utilizing load balancing of thread groups

Number: US0011768687B2
Assignee: Intel Corporation

An apparatus to facilitate thread scheduling is disclosed. The apparatus includes logic to store barrier usage data based on a magnitude of barrier messages in an application kernel and a scheduler to schedule execution of threads across a plurality of multiprocessors based on the barrier usage data.

Publication date: 16-05-2024

CACHE OPTIMIZATION MECHANISM

Number: US20240160581A1
Assignee: Intel Corporation

An apparatus includes a central processing unit (CPU) including a plurality of processing cores, each having a cache memory; a fabric interconnect coupled to the plurality of processing cores; and cryptographic circuitry coupled to the fabric interconnect, including a mesh stop station to receive memory data and determine a destination of the memory data, and encryption circuitry to encrypt/decrypt the memory data based on the destination of the memory data.

Publication date: 03-07-2019

MULTI-LEVEL SYSTEM MEMORY CONFIGURATIONS TO OPERATE HIGHER PRIORITY USERS OUT OF A FASTER MEMORY LEVEL

Number: EP3506112A1

A method is described. The method includes recognizing higher priority users of a multi-level system memory characterized by a faster higher level and a slower lower level, in which the higher level is to act as a cache for the lower level and in which a first capacity of the higher level is less than a second capacity of the lower level, such that caching resources of the higher level are oversubscribe-able. The method also includes performing at least one of: declaring an amount of the second capacity unusable to reduce oversubscription of the caching resources; allocating system memory address space of the multi-level system memory so that requests associated with lower priority users will not compete with requests associated with the higher priority users for the caching resources.

Publication date: 15-01-2020

MEMORY RESOURCE OPTIMIZATION METHOD AND APPARATUS

Number: EP3388947B1
Assignee: Huawei Technologies Co., Ltd.

Publication date: 01-03-2018

SYSTEMS AND METHODS FOR ADDRESSING A CACHE WITH SPLIT INDEXES

Number: DE112016002247T5

Cache mapping techniques are presented. A cache may contain an index configuration register. The register may configure the positions of an upper index portion and a lower index portion of a memory address. The portions may be combined to create a combined index. The configurable split-index addressing structure may be used, among other applications, to reduce the rate of cache conflicts occurring between multiple processes decoding a video frame in parallel.

Publication date: 15-07-2011

CIRCUIT AND METHOD WITH CACHE COHERENCE LOAD CONTROL

Number: AT0000516542T

Publication date: 31-05-2017

Address access method and device

Number: CN0106776366A

Publication date: 07-09-2018

INFORMATION PROCESSING DEVICE, METHOD FOR CONTROL OF INFORMATION PROCESSING DEVICE, AND PROGRAM FOR CONTROL OF INFORMATION PROCESSING DEVICE

Number: WO2018159365A1
Author: MAEDA, Munenori

[Problem] To suppress the occurrence of memory leaks. [Solution] Provided is an information processing device comprising an execution unit, a first storage unit, a second storage unit, a migration processing unit, and an information handoff unit. The execution unit executes threads. The first storage unit stores thread information which may be used in the execution of the threads. The second storage unit stores the thread information which is migrated from the first storage unit. From among the threads which the execution unit executes, after completion of the execution of a thread which is subject to migration for which the thread information is migrated, the migration processing unit migrates the thread information of the thread which is subject to migration from the first storage unit to the second storage unit. In a case where the thread being executed uses the thread information which has been migrated to the second storage unit, the information handoff unit transfers the thread information ...

Publication date: 16-05-2017

System and method for managing a cache pool

Number: US0009652394B2
Assignee: Dell Products L.P.

In one embodiment, a system includes a processor and a memory communicatively coupled to the processor. The processor is configured to receive a write request associated with a cache pool, which comprises a plurality of disks. The write request comprises data associated with the write request. The processor is additionally configured to select a first disk from the plurality of disks using a life parameter associated with the first disk. The processor is further configured to cause the data associated with the write request to be written to the first disk.

Подробнее
22-03-2005 дата публикации

Cache system

Номер: US0006871266B2

A cache system is provided which includes a cache memory and a cache refill mechanism which allocates one or more of a set of cache partitions in the cache memory to an item in dependence on the address of the item in main memory. This is achieved in one of the described embodiments by including with the address of an item a set of partition selector bits which allow a partition mask to be generated to identify into which cache partition the item may be loaded.
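
A short sketch of how partition selector bits could yield a partition mask; the two-bit selector and the one-partition-per-value rule are assumptions, not the patent's exact encoding.

# Sketch: partition selector bits -> partition mask -> allowed cache partitions.
NUM_PARTITIONS = 4

def partition_mask(selector):
    """Selector bits name one partition; each 2-bit selector value enables
    exactly one of four partitions into which the item may be loaded."""
    return 1 << (selector & (NUM_PARTITIONS - 1))

def allowed_partitions(mask):
    return [p for p in range(NUM_PARTITIONS) if mask & (1 << p)]

addr_selector = 0b10   # carried alongside the item's main-memory address
print(allowed_partitions(partition_mask(addr_selector)))   # [2]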

Подробнее
18-09-2018 дата публикации

Multiple-core computer processor for reverse time migration

Номер: US0010078593B2

A multi-core computer processor including a plurality of processor cores interconnected in a Network-on-Chip (NoC) architecture, a plurality of caches, each of the plurality of caches being associated with one and only one of the plurality of processor cores, and a plurality of memories, each of the plurality of memories being associated with a different set of at least one of the plurality of processor cores and each of the plurality of memories being configured to be visible in a global memory address space such that the plurality of memories are visible to two or more of the plurality of processor cores, wherein at least one of a number of the processor cores, a size of each of the plurality of caches, or a size of each of the plurality of memories is configured for performing a reverse-time-migration (RTM) computation.

Подробнее
14-09-2021 дата публикации

Synchronized access to data in shared memory by protecting the load target address of a fronting load

Номер: US0011119781B2

A data processing system includes multiple processing units all having access to a shared memory. A processing unit of the data processing system includes a processor core including an upper level cache, core reservation logic that records addresses in the shared memory for which the processor core has obtained reservations, and an execution unit that executes memory access instructions including a fronting load instruction. Execution of the fronting load instruction generates a load request that specifies a load target address. The processing unit further includes lower level cache that, responsive to receipt of the load request and based on the load request indicating an address match for the load target address in the core reservation logic, protects the load target address against access by any conflicting memory access request during a protection interval following servicing of the load request.

Подробнее
24-10-2019 дата публикации

STORAGE DEVICE AND OPERATING METHOD THEREOF

Номер: US20190324693A1
Принадлежит:

A memory controller having improved cache program operation performance controls a memory device. The memory controller includes: a command queue for sequentially storing commands to be executed by the memory device; a cache program determiner for determining, when a first command that is a program command stored in the command queue is provided to the memory device, whether a second command to be executed next after the first command is a program command; and a program operation controller for controlling the memory device to perform the program operation according to the first command as either a normal program operation or a cache program operation, depending on whether the second command is a program command.
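
A minimal sketch of the determiner's lookahead, assuming a simple list-backed command queue and hypothetical command tags.

# Sketch: decide between normal and cache program based on the next command.
from collections import deque

def issue_program(queue):
    """When the command at the head is a program command, peek at the next
    queued command: a second program command lets the device overlap data
    loading with programming (cache program); otherwise program normally."""
    first = queue.popleft()
    assert first == "PROGRAM"
    next_is_program = bool(queue) and queue[0] == "PROGRAM"
    return "CACHE_PROGRAM" if next_is_program else "NORMAL_PROGRAM"

q = deque(["PROGRAM", "PROGRAM", "READ"])
print(issue_program(q))   # CACHE_PROGRAM
print(issue_program(q))   # NORMAL_PROGRAM (next command is READ)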

Подробнее
03-10-2017 дата публикации

Techniques for surfacing host-side storage capacity to virtual machines when performing VM suspend or snapshot operations

Номер: US0009778847B2
Принадлежит: VMware, Inc., VMWARE INC

Techniques for surfacing host-side flash storage capacity to a plurality of VMs running on a host system are provided. In one embodiment, the host system creates, for each VM in the plurality of VMs, a flash storage space allocation in a flash storage device that is locally attached to the host system. The host system then causes the flash storage space allocation to be readable and writable by the VM as a virtual flash memory device.

Подробнее
08-12-2016 дата публикации

STORE FORWARDING CACHE

Номер: US20160357679A1
Принадлежит:

A load request is received to retrieve a piece of data from a location in memory, where the load request follows one or more store requests, in a set of instructions, to store data in that location in memory. One or more possible locations in a cache for data corresponding to the location in memory are determined. It is then determined whether at least one of the possible locations contains data to be stored in the location in memory. Data in one such location is loaded; the data in that location comes from the store request, of the one or more store requests, that is closest in the set of instructions to the load request.
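
A minimal sketch of the final selection step, using a plain list of pending stores as a stand-in for the store-forwarding cache; the sequence numbers model position in the set of instructions.

# Sketch: forward data from the youngest store, older than the load,
# that targets the same memory location.
def forward(pending_stores, load_addr, load_seq):
    """pending_stores: list of (seq, addr, data) in program order.
    Return the data of the matching store closest (in the instruction
    sequence) to the load, or None to read from cache/memory instead."""
    hits = [(seq, data) for seq, addr, data in pending_stores
            if addr == load_addr and seq < load_seq]
    return max(hits)[1] if hits else None

stores = [(1, 0x100, b"old"), (3, 0x100, b"new"), (4, 0x200, b"x")]
print(forward(stores, 0x100, load_seq=5))   # b'new' (youngest older store)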

Подробнее
03-01-2002 дата публикации

Cache system for concurrent processes

Номер: US2002002657A1
Автор:
Принадлежит:

A method of operating a cache memory is described in a system in which a processor is capable of executing a plurality of processes, each process including a sequence of instructions. In the method a cache memory is divided into cache partitions, each cache partition having a plurality of addressable storage locations for holding items in the cache memory. A partition indicator is allocated to each process identifying which, if any, of said cache partitions is to be used for holding items for use in the execution of that process. When the processor requests an item from main memory during execution of a current process and that item is not held in the cache memory, the item is fetched from main memory and loaded into one of the plurality of addressable storage locations in the identified cache partition.

Подробнее
04-05-2021 дата публикации

Asymmetric coherency protocol for first and second processing circuitry having different levels of fault protection or fault detection

Номер: US0010997076B2
Принадлежит: ARM Limited, ADVANCED RISC MACH LTD, ARM LIMITED

An apparatus has first processing circuitry and second processing circuity. The second processing circuitry has at least one hardware mechanism providing a greater level of fault protection or fault detection than is provided for the first processing circuitry. Coherency control circuitry controls access to data from at least part of a shared address space by the first and second processing circuitry according to an asymmetric coherency protocol in which a local-only update of data in a local cache of the first processing circuitry is restricted in comparison to a local-only update of data in a local cache of the second processing circuitry.

Подробнее
08-03-2018 дата публикации

MULTITHREADED TRANSACTIONS

Номер: US20180067762A1
Принадлежит:

Embodiments relate to multithreaded transactions. An aspect includes assigning a same transaction identifier (ID) corresponding to the multithreaded transaction to a plurality of threads of the multithreaded transaction, wherein the plurality of threads execute the multithreaded transaction in parallel. Another aspect includes determining one or more memory areas that are owned by the multithreaded transaction. Another aspect includes receiving a memory access request from a requester that is directed to a memory area that is owned by the transaction. Yet another aspect includes based on determining that the requester has a transaction ID that matches the transaction ID of the multithreaded transaction, performing the memory access request without aborting the multithreaded transaction.

Подробнее
17-05-2018 дата публикации

SEQUENTIAL DATA WRITES TO INCREASE INVALID TO MODIFIED PROTOCOL OCCURRENCES IN A COMPUTING SYSTEM

Номер: US20180137053A1
Принадлежит:

An example system on a chip (SoC) includes a cache, a processor, and a predictor circuit. The cache may store data. The processor may be coupled to the cache and store a first data set at a first location in the cache and receive a first request from an application to write a second data set to the cache. The predictor circuit may be coupled to the processor and determine that a second location where the second data set is to be written to in the cache is nonconsecutive to the first location, where the processor is to perform a request-for-ownership (RFO) operation for the second data set and write the second data set to the cache.
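
A minimal sketch of the predictor's consecutive/nonconsecutive test, assuming 64-byte lines and treating the first write, like any jump, as requiring an RFO; the policy details are illustrative.

# Sketch: predict whether a write needs a request-for-ownership (RFO).
LINE = 64

class WritePredictor:
    def __init__(self):
        self.last_line = None

    def needs_rfo(self, addr):
        """Sequential writes that fill consecutive lines can skip the
        ownership read; a jump to a nonconsecutive line triggers an RFO."""
        line = addr // LINE
        consecutive = (self.last_line is not None
                       and line in (self.last_line, self.last_line + 1))
        self.last_line = line
        return not consecutive

p = WritePredictor()
print(p.needs_rfo(0x0000))   # True  (first write: RFO)
print(p.needs_rfo(0x0040))   # False (next line, consecutive)
print(p.needs_rfo(0x1000))   # True  (nonconsecutive: RFO issued)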

Подробнее
03-03-2020 дата публикации

Reducing overhead of managing cache areas

Номер: US0010579529B2

Maintaining multiple cache areas in a storage device having multiple processors includes loading data from a specific portion of non-volatile storage into a local cache slot in response to a specific processor of a first subset of the processors performing a read operation to the specific portion of non-volatile storage, where the local cache slot is accessible to the first subset of the processors and is inaccessible to a second subset of the processors that is different than the first subset of the processors, and includes converting the local cache slot into a global cache slot in response to one of the processors performing a write operation to the specific portion of non-volatile storage, wherein the global cache slot is accessible to the first subset of the processors and to the second subset of the processors. Different ones of the processors may be placed on different directors.

Подробнее
07-08-2018 дата публикации

Control system and method for cache coherency

Номер: US0010044829B2

Control systems and methods for cache coherency are provided. One control method includes the steps of: transmitting, by a first electrical device, a link-connect request to a second electrical device when the first electrical device is coupled to the second electrical device by a cache coherency (CC) interface; establishing a link between the first electrical device and the second electrical device according to the link-connect request via the CC interface; and operating a first operating system of the first electrical device by a second processing unit of the second electrical device after establishing the link.

Подробнее
25-07-2018 дата публикации

INSTRUCTION AND LOGIC FOR MEMORY ACCESS IN A CLUSTERED WIDE-EXECUTION MACHINE

Номер: RU2662394C2
Принадлежит: ИНТЕЛ КОРПОРЕЙШН (US)

The group of inventions relates to the field of information processing logic. The technical result is increased performance. A processor includes a level-two (L2) cache, first and second execution unit clusters, and first and second data cache units (DCUs) communicatively coupled to the respective execution unit clusters and to the L2 cache. Each DCU includes a data cache and logic to receive a memory operation from an execution unit, respond to the memory operation with information from the data cache when the information is available in the data cache, and retrieve the information from the L2 cache when the information is not available in the data cache. The processor further includes logic to keep the data cache of the first DCU equal to the content of the data cache of the second DCU on all clock cycles of processor operation. 3 independent and 17 dependent claims, 31 drawings.

Подробнее
26-09-2017 дата публикации

METHOD AND APPARATUS FOR MEMORY RESOURCE OPTIMIZATION

Номер: RU2631767C1

The invention relates to the field of memory resource usage. The technical result is optimization of memory resource usage. A memory resource optimization method comprises the steps of: acquiring performance data of each program in a working set; categorizing each program according to the performance data of the program and the memory access frequency of the program obtained through statistics collection, the performance data of each program being the variation generated when a preset performance indicator of the program changes depending on the capacity of the allocated last-level cache (LLC) resource; and selecting, based on the categorization of each program in the working set and a preset decision-making policy, a page-coloring-based partitioning policy corresponding to the working set, the page-coloring-based partitioning policy comprising a page-coloring-based ...
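
Page coloring in general can be sketched briefly; the color count, the program-class names, and the color split below are hypothetical, not the patent's decision policy.

# Sketch: page coloring, steering a program's pages to an LLC partition.
# Illustrative parameters: 8 colors taken from low physical-frame bits.
COLORS = 8

def page_color(phys_frame):
    """The color comes from physical-frame bits that also feed the LLC set
    index, so frames of one color occupy one slice of the LLC."""
    return phys_frame % COLORS

def frames_for(program_class, free_frames):
    """Cache-sensitive programs get most colors; programs categorized as
    thrashing are confined to a small partition (split is illustrative)."""
    allowed = range(0, 6) if program_class == "sensitive" else range(6, 8)
    return [f for f in free_frames if page_color(f) in allowed]

free = list(range(64))
print(len(frames_for("sensitive", free)), len(frames_for("thrashing", free)))   # 48 16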

Подробнее
04-02-1999 дата публикации

Performance improvement method using natural affinity for multi-processor cache system

Номер: DE0019833221A1
Принадлежит:

The method involves increasing the performance of a multi-processor system with a common working memory (MM) and processor-specific cache memories (SIC). An interrupted program execution (PL) is continued on the same processor (CPU) on which it was started if that processor becomes free within a predetermined period of time after the interruption. Another one of the processors may take over the interrupted program execution only after the predetermined period of time has elapsed. Program parts of different program executions that require data from the same memory areas of the working memory are marked and, because of this marking, are assigned to the same processor.
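
A minimal sketch of the affinity window, with a hypothetical 5 ms grace period; the dispatch policy is illustrative only.

# Sketch: natural-affinity dispatch: keep an interrupted execution on its
# last CPU if that CPU frees up within a grace period.
import time

GRACE = 0.005   # 5 ms affinity window (assumed)

def pick_cpu(task, cpus, now=None):
    """cpus: dict cpu_id -> free (bool). The task remembers where it last ran
    and when it was interrupted; within the grace period only that CPU,
    whose cache is still warm, may take it."""
    now = time.monotonic() if now is None else now
    last = task["last_cpu"]
    if now - task["interrupted_at"] < GRACE:
        return last if cpus.get(last) else None   # hold for the warm cache
    return next((c for c, free in cpus.items() if free), None)

t = {"last_cpu": 2, "interrupted_at": 0.0}
print(pick_cpu(t, {1: True, 2: False}, now=0.001))   # None: wait for CPU 2
print(pick_cpu(t, {1: True, 2: False}, now=0.010))   # 1: window expired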

Подробнее
07-06-2012 дата публикации

Apparatus, method, and system for instantaneous cache state recovery from speculative abort/commit

Номер: US20120144126A1
Принадлежит: Intel Corp

An apparatus and method are described herein for providing instantaneous, efficient cache state recovery upon an end of speculative execution. Speculatively accessed entries of a cache memory are marked as speculative, which may be on a thread-specific basis. Upon an end of speculation, the speculatively marked entries are transitioned in parallel by a speculative port to their appropriate, thread-specific, non-speculative coherency state; these parallel transitions allow for instantaneous commit or recovery of speculative memory state.

Подробнее
21-06-2012 дата публикации

Direct Access To Cache Memory

Номер: US20120159082A1
Принадлежит: International Business Machines Corp

Methods and apparatuses are disclosed for direct access to cache memory. Embodiments include receiving, by a direct access manager that is coupled to a cache controller for a cache memory, a region scope zero command describing a region scope zero operation to be performed on the cache memory; in response to receiving the region scope zero command, generating a direct memory access region scope zero command, the direct memory access region scope zero command having an operation code and an identification of the physical addresses of the cache memory on which the operation is to be performed; sending the direct memory access region scope zero command to the cache controller for the cache memory; and performing, by the cache controller, the direct memory access region scope zero operation in dependence upon the operation code and the identification of the physical addresses of the cache memory.

Подробнее
12-07-2012 дата публикации

Global instructions for spiral cache management

Номер: US20120179872A1
Автор: Volker Strumpen
Принадлежит: International Business Machines Corp

A method of operation of a pipelined cache memory supports global operations within the cache. The cache may be a spiral cache, with a move-to-front M2F network for moving values from a backing store to a front-most tile coupled to a processor or lower-order level of a memory hierarchy and a spiral push-back network for pushing out modified values to the backing-store. The cache controller manages application of global commands by propagating individual commands to the tiles. The global commands may provide zeroing, flushing and reconciling of the given tiles. Commands for interrupting and resuming interrupted global commands may be implemented, to reduce halting or slowing of processing while other global operations are in process. A line detector within each tile supports reconcile and flush operations, and a line patcher in the controller provides for initializing address ranges with no processor intervention.

Подробнее
12-07-2012 дата публикации

Mechanism to support flexible decoupled transactional memory

Номер: US20120179877A1
Принадлежит: UNIVERSITY OF ROCHESTER

The present invention employs three decoupled hardware mechanisms: read and write signatures, which summarize per-thread access sets; per-thread conflict summary tables, which identify the threads with which conflicts have occurred; and a lazy versioning mechanism, which maintains the speculative updates in the local cache and employs a thread-private buffer (in virtual memory) only in the rare event of an overflow. The conflict summary tables allow lazy conflict management to occur locally, with no global arbitration (they also support eager management). All three mechanisms are kept software-accessible, to enable virtualization and to support transactions of arbitrary length.
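
The signature mechanism resembles a small Bloom filter; a sketch under that reading, with toy hash functions and a 64-bit vector (false positives are possible, false negatives are not).

# Sketch: a per-thread access signature as a small Bloom-style bit vector.
def h(addr, seed):   # two cheap hash functions over the line address
    return ((addr >> 4) * (seed * 2 + 1)) % 64

class Signature:
    def __init__(self):
        self.bits = 0
    def add(self, addr):
        self.bits |= (1 << h(addr, 1)) | (1 << h(addr, 2))
    def might_contain(self, addr):
        need = (1 << h(addr, 1)) | (1 << h(addr, 2))
        return (self.bits & need) == need   # conservative membership test

# Conflict check: another thread's write against this thread's read signature.
reads = Signature()
reads.add(0x1000)
print(reads.might_contain(0x1000))   # True: potential conflict recorded
print(reads.might_contain(0x9990))   # usually False: no conflict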

Подробнее
26-07-2012 дата публикации

Managing Access to a Cache Memory

Номер: US20120191917A1
Принадлежит: International Business Machines Corp

Managing access to a cache memory includes dividing said cache memory into multiple of cache areas, each cache area having multiple entries; and providing at least one separate lock attribute for each cache area such that only a processor thread having possession of the lock attribute corresponding to a particular cache area can update that cache area.
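
A minimal sketch of per-area lock attributes (striped locking); the area count and the hash-based area assignment are illustrative.

# Sketch: one lock attribute per cache area; a thread must hold the area's
# lock to update entries in that area.
import threading

AREAS = 8
locks = [threading.Lock() for _ in range(AREAS)]
cache = [dict() for _ in range(AREAS)]

def area_of(key):
    return hash(key) % AREAS

def update(key, value):
    a = area_of(key)
    with locks[a]:             # only the lock holder may update this area;
        cache[a][key] = value  # threads touching other areas run in parallel

update("alpha", 1)
update("beta", 2)
print(sum(len(c) for c in cache))   # 2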

Подробнее
02-08-2012 дата публикации

Address-based hazard resolution for managing read/write operations in a memory cache

Номер: US20120198178A1
Принадлежит: International Business Machines Corp

One embodiment provides a cached memory system including a memory cache and a plurality of read-claim (RC) machines configured for performing read and write operations dispatched from a processor. According to control logic provided with the cached memory system, a hazard is detected between first and second read or write operations being handled by first and second RC machines. The second RC machine is suspended and a subset of the address bits of the second operation at specific bit positions are recorded. The subset of address bits of the first operation at the specific bit positions are broadcast in response to the first operation being completed. The second operation is then re-requested.

Подробнее
23-08-2012 дата публикации

Cache and a method for replacing entries in the cache

Номер: US20120215985A1
Автор: Douglas B. Hunt
Принадлежит: Advanced Micro Devices Inc

A processor is provided. The processor includes a cache having a plurality of entries, each of the plurality of entries having a tag array and a data array, and a remapper configured to create at least one identifier, each identifier being unique to a process of the processor, and to assign a respective identifier to the tag array for the entries related to a respective process; the remapper is further configured to determine a replacement value for the entries related to each identifier.

Подробнее
13-09-2012 дата публикации

Managing shared memory used by compute nodes

Номер: US20120233409A1
Автор: Jonathan Ross, Jork Loeser
Принадлежит: Microsoft Corp

A technology can be provided for managing shared memory used by a plurality of compute nodes. An example system can include a shared globally addressable memory to enable access to shared data by the plurality of compute nodes. A memory interface can process memory requests sent to the shared globally addressable memory from the plurality of processors. A memory write module can be included for the memory interface to allocate memory locations in the shared globally addressable memory and write read-only data to the globally addressable memory from a writing compute node. In addition, a read module for the memory interface can map read-only data in the globally addressable shared memory as read-only for subsequent accesses by the plurality of compute nodes.

Подробнее
04-10-2012 дата публикации

Method of generating code executable by processor

Номер: US20120254551A1
Принадлежит: WASEDA UNIVERSITY

Provided is a method of generating code with a compiler, including the steps of: analyzing a program executed by a processor; analyzing the data necessary to execute the respective tasks included in the program; determining, based on the results of the analysis, whether a boundary of the data used by the divided tasks is consistent with a management unit of a cache memory; and, in a case where it is determined that the boundary of the data used by the divided tasks is not consistent with the management unit of the cache memory, generating code that provides a non-cacheable area so that data belonging to the management unit containing the boundary is not temporarily stored in the cache memory, and code that stores an arithmetic processing result belonging to the management unit containing the boundary into the non-cacheable area.

Подробнее
06-12-2012 дата публикации

Multiprocessor and image processing system using the same

Номер: US20120311266A1
Автор: Hirokazu Takata
Принадлежит: Renesas Electronics Corp

To provide a multiprocessor capable of easily sharing data and buffering data to be transferred. Each of a plurality of shared local memories is connected to two processors of a plurality of processor units, and the processor units and the shared local memories are connected in a ring. Consequently, it becomes possible to easily share data and buffer data to be transferred.

Подробнее
17-01-2013 дата публикации

Multi-core processor system, memory controller control method, and computer product

Номер: US20130019069A1
Принадлежит: Fujitsu Ltd

A multi-core processor system includes a memory controller that includes multiple ports and shared memory that includes physical address spaces divided among the ports. A CPU acquires from a parallel degree information table, the number of CPUs to which software that is to be executed by the multi-core processor system, is to be assigned. After this acquisition, the CPU determines the CPUs to which the software to be executed is to be assigned and sets for each CPU, physical address spaces corresponding to logical address spaces defined by the software to be executed. After this setting, the CPU notifies an address converter of the addresses and notifies the software to be executed of the start of execution.

Подробнее
24-01-2013 дата публикации

Method and apparatus for adaptive cache frame locking and unlocking

Номер: US20130024620A1
Принадлежит: Agere Systems LLC

Most recently accessed frames are locked in a cache memory. The most recently accessed frames are likely to be accessed by a task again in the near future and may be locked at the beginning of a task switch or interrupt to improve cache performance. The list of most recently used frames is updated as a task executes and may be embodied as a list of frame addresses or a flag associated with each frame. The list of most recently used frames may be separately maintained for each task if multiple tasks may interrupt each other. An adaptive frame unlocking mechanism is also disclosed that automatically unlocks frames that may cause a significant performance degradation for a task. The adaptive frame unlocking mechanism monitors a number of times a task experiences a frame miss and unlocks a given frame if the number of frame misses exceeds a predefined threshold.
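
A minimal sketch combining both ideas, with a hypothetical MRU depth of four frames and a miss threshold of three.

# Sketch: lock most-recently-used frames on a task switch; adaptively unlock
# a frame when it causes too many misses.
from collections import OrderedDict

class FrameLocker:
    def __init__(self, keep=4, miss_limit=3):
        self.mru = OrderedDict()          # frame -> None, most recent last
        self.locked = set()
        self.misses = {}
        self.keep, self.miss_limit = keep, miss_limit

    def touch(self, frame):
        self.mru.pop(frame, None)
        self.mru[frame] = None

    def on_task_switch(self):
        self.locked = set(list(self.mru)[-self.keep:])   # pin the MRU frames

    def on_frame_miss(self, frame):
        """Count misses attributed to a locked frame; unlock it past the limit."""
        self.misses[frame] = self.misses.get(frame, 0) + 1
        if self.misses[frame] > self.miss_limit:
            self.locked.discard(frame)

fl = FrameLocker()
for f in (1, 2, 3, 4, 5):
    fl.touch(f)
fl.on_task_switch()
print(sorted(fl.locked))   # [2, 3, 4, 5]
for _ in range(4):
    fl.on_frame_miss(2)
print(sorted(fl.locked))   # [3, 4, 5] (frame 2 adaptively unlocked)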

Подробнее
04-04-2013 дата публикации

System and method for supporting a tiered cache

Номер: US20130086326A1
Автор: Naresh Revanuru
Принадлежит: Oracle International Corp

A computer-implemented method and system can support a tiered cache, which includes a first cache and a second cache. The first cache operates to receive a request to at least one of update and query the tiered cache; and the second cache operates to perform at least one of an updating operation and a querying operation with respect to the request via at least one of a forward strategy and a listening scheme.

Подробнее
11-04-2013 дата публикации

Early Cache Eviction in a Multi-Flow Network Processor Architecture

Номер: US20130091330A1
Принадлежит: LSI Corporation

Described embodiments provide an input/output interface of a network processor that generates a request to store received packets to a system cache. If an entry associated with the received packet does not exist in the system cache, the system cache determines whether a backpressure indicator of the system cache is set. If the backpressure indicator is set, the received packet is written to the shared memory. If the backpressure indicator is not set, the system cache determines whether to evict data from the system cache in order to store the received packet. If an eviction rate of the system cache has reached a threshold, the system cache sets a backpressure indicator and writes the received packet to the shared memory. If the eviction rate has not reached the threshold, the system cache determines an available entry and writes the received packet to the available entry in the system cache.
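
The admit-or-spill decision can be sketched directly from the abstract; the capacity, the way the eviction rate is measured (evictions per request), and the threshold are illustrative assumptions.

# Sketch: system-cache admission with an eviction-rate backpressure flag.
class SystemCache:
    def __init__(self, capacity=4, rate_threshold=0.3):
        self.entries, self.capacity = {}, capacity
        self.evictions = self.requests = 0
        self.backpressure = False
        self.rate_threshold = rate_threshold

    def store(self, flow, packet, shared_memory):
        self.requests += 1
        if flow in self.entries:                   # existing entry: update
            self.entries[flow] = packet
            return "cache"
        if self.backpressure:                      # spill while backpressured
            shared_memory.append((flow, packet))
            return "shared memory"
        if len(self.entries) < self.capacity:      # free entry available
            self.entries[flow] = packet
            return "cache"
        if self.evictions / self.requests >= self.rate_threshold:
            self.backpressure = True               # evicting too often: spill
            shared_memory.append((flow, packet))
            return "shared memory"
        self.entries.pop(next(iter(self.entries))) # evict oldest, then admit
        self.evictions += 1
        self.entries[flow] = packet
        return "cache"

mem = []
sc = SystemCache(capacity=1)
print([sc.store(f, b"pkt", mem) for f in "abc"])   # cache, cache, shared memory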

Подробнее
18-04-2013 дата публикации

MULTI-CORE PROCESSOR SYSTEM, COMPUTER PRODUCT, AND CONTROL METHOD

Номер: US20130097382A1
Принадлежит: FUJITSU LIMITED

A multi-core processor system includes a first processor that among cores of the multi-core processor, identifies other cores having a cache miss-hit rate lower than that of a given core storing a specific program in a cache, based on a task information volume of each core; a control circuit that migrates the specific program from the cache of the given core to a cache of the identified core; and a second processor that, after the specific program is migrated to the cache of the identified core, sets as a write-inhibit area, an area that is of the cache of the identified core and to which the specific program is stored.

Подробнее
25-04-2013 дата публикации

Optimizing Memory Copy Routine Selection For Message Passing In A Multicore Architecture

Номер: US20130103905A1
Принадлежит:

In one embodiment, the present invention includes a method to obtain topology information regarding a system including at least one multicore processor, provide the topology information to a plurality of parallel processes, generate a topological map based on the topology information, access the topological map to determine a topological relationship between a sender process and a receiver process, and select a given memory copy routine to pass a message from the sender process to the receiver process based at least in part on the topological relationship. Other embodiments are described and claimed.
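
A minimal sketch of the selection logic, with a hypothetical topology map of rank -> (package, shared-cache id) and invented routine names; the size thresholds are illustrative.

# Sketch: pick a memory-copy routine from sender/receiver topology.
def select_copy_routine(topo, sender, receiver, msg_size):
    """topo maps a process rank to (package id, shared-cache id)."""
    s_pkg, s_cache = topo[sender]
    r_pkg, r_cache = topo[receiver]
    if s_cache == r_cache:                 # share a cache: cheapest path
        return "cache_copy" if msg_size < 64 * 1024 else "cache_copy_large"
    if s_pkg == r_pkg:                     # same package, different cache
        return "intra_package_copy"
    return "ntstore_copy" if msg_size >= 256 * 1024 else "inter_package_copy"

topo = {0: (0, 0), 1: (0, 0), 2: (0, 1), 3: (1, 2)}
print(select_copy_routine(topo, 0, 1, 4096))      # cache_copy
print(select_copy_routine(topo, 0, 3, 1 << 20))   # ntstore_copy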

Подробнее
02-05-2013 дата публикации

METHOD FOR ACCESSING CACHE AND PSEUDO CACHE AGENT

Номер: US20130111142A1
Принадлежит: Huawei Technologies Co., Ltd.

Embodiments of the present invention disclose a method for accessing a cache and a pseudo cache agent (PCA). The method of the present invention is applied to a multiprocessor system, where the system includes at least one NC, at least one PCA conforming to a processor micro-architecture level interconnect protocol is embedded in the NC, the PCA is connected to at least one PCA storage device, and the PCA storage device stores data shared among memories in the multiprocessor system. The method of the present invention includes: if the NC receives a data request, obtaining, by the PCA, target data required in the data request from the PCA storage device connected to the PCA; and sending the target data to a sender of the data request. Embodiments of the present invention are mainly applied to a process of accessing cache data in the multiprocessor system.

Подробнее
23-05-2013 дата публикации

INFORMATION PROCESSING SYSTEM

Номер: US20130132678A1
Принадлежит: FUJITSU LIMITED

An information processing system has a plurality of nodes, each of which uses a snoop cache memory. A directory, which maintains cache coherence across the snoop cache memories of the plurality of nodes, has a first directory and a second directory; the second directory has a format different from that of the first directory and is used only for the shared state. A node searches the first and second directories and determines the other node to which to transmit a snoop.

Подробнее
23-05-2013 дата публикации

STORAGE SYSTEM, CONTROL PROGRAM AND STORAGE SYSTEM CONTROL METHOD

Номер: US20130132679A1
Принадлежит: Hitachi, Ltd.

There is provided a storage system including one or more LDEVs, one or more processors, a local memory or memories corresponding to the processor or processors, and a shared memory, which is shared by the processors, wherein control information on I/O processing or application processing is stored in the shared memory, and the processor caches a part of the control information in different storage areas on a type-by-type basis in the local memory or memories corresponding to the processor or processors in referring to the control information stored in the shared memory.

Подробнее
13-06-2013 дата публикации

INTERFACE AND METHOD FOR INTER-THREAD COMMUNICATION

Номер: US20130151783A1

The interface for inter-thread communication between a plurality of threads including a number of producer threads for producing data objects and a number of consumer threads for consuming the produced data objects includes a specifier and a provider. The specifier is configured to specify a certain relationship between a certain producer thread of the number of producer threads which is adapted to produce a certain data object and a consumer thread of the number of consumer threads which is adapted to consume the produced certain data object. Further, the provider is configured to provide direct cache line injection of a cache line of the produced certain data object to a cache allocated to the certain consumer thread related to the certain producer thread by the specified certain relationship.

Подробнее
11-07-2013 дата публикации

TECHNIQUE FOR PRESERVING CACHED INFORMATION DURING A LOW POWER MODE

Номер: US20130179639A1
Принадлежит:

A technique to retain cached information during a low power mode, according to at least one embodiment. In one embodiment, information stored in a processor's local cache is saved to a shared cache before the processor is placed into a low power mode, such that other processors may access information from the shared cache instead of causing the low power mode processor to return from the low power mode to service an access to its local cache.

Подробнее
01-08-2013 дата публикации

SYSTEMS AND METHODS FOR A DE-DUPLICATION CACHE

Номер: US20130198459A1
Принадлежит: Fusion-io, Inc.

A de-duplication cache is configured to cache data for access by a plurality of different storage clients, such as virtual machines. A virtual machine may comprise a virtual machine de-duplication module configured to identify data for admission into the de-duplication cache. Data admitted into the de-duplication cache may be accessible by two or more storage clients. Metadata pertaining to the contents of the de-duplication cache may be persisted and/or transferred with respective storage clients such that the storage clients may access the contents of the de-duplication cache after rebooting, being power cycled, and/or being transferred between hosts.
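
A minimal sketch of a de-duplication cache in which a single stored copy is shared by multiple clients; keying by a SHA-256 content hash is an assumption, not the patent's stated mechanism, and byte-by-byte verification of matches is omitted.

# Sketch: a de-duplication cache keyed by content-derived identifiers.
import hashlib

class DedupCache:
    def __init__(self):
        self.by_id = {}                  # content id -> single cached copy

    def admit(self, data):
        cid = hashlib.sha256(data).hexdigest()   # context-independent id
        if cid not in self.by_id:                # already admitted? share it
            self.by_id[cid] = bytes(data)
        return cid                               # clients keep only the id

    def read(self, cid):
        return self.by_id.get(cid)

cache = DedupCache()
id_vm1 = cache.admit(b"guest OS library page")
id_vm2 = cache.admit(b"guest OS library page")   # second VM, same content
print(id_vm1 == id_vm2, len(cache.by_id))        # True 1: one shared copy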

Подробнее
08-08-2013 дата публикации

MULTICORE COMPUTER SYSTEM WITH CACHE USE BASED ADAPTIVE SCHEDULING

Номер: US20130205092A1
Автор: Datta Soumya, Roy Shaibal
Принадлежит: EMPIRE TECHNOLOGY DEVELOPMENT LLC

An example multicore environment generally described herein may be adapted to improve use of a shared cache by a plurality of processing cores in a multicore processor. For example, where a producer task associated with a first core of the multicore processor places data in a shared cache at a faster rate than a consumer task associated with a second core of the multicore processor, relative task execution rates can be adapted to prevent eventual increased cache misses by the consumer task.
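
One way to adapt the rates is to count "just-missed" misses, misses that land on data recently evicted from the shared cache, and throttle the producer when the count rises; the victim-list depth and the throttle policy below are illustrative.

# Sketch: counting just-missed misses against a recent-victim list.
from collections import deque

class JustMissedTracker:
    def __init__(self, victim_depth=4):
        self.recent_victims = deque(maxlen=victim_depth)  # recently evicted lines
        self.just_missed = 0

    def on_evict(self, line_addr):
        self.recent_victims.append(line_addr)

    def on_miss(self, line_addr):
        if line_addr in self.recent_victims:   # missed only because the line
            self.just_missed += 1              # was evicted moments ago

t = JustMissedTracker()
t.on_evict(0x40); t.on_evict(0x80)
t.on_miss(0x80)                    # consumer asks for freshly evicted data
if t.just_missed > 0:              # scheduler reaction (illustrative policy)
    print("throttle producer task relative to consumer")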

Подробнее
15-08-2013 дата публикации

SELECTIVELY READING DATA FROM CACHE AND PRIMARY STORAGE

Номер: US20130212332A1
Принадлежит: ORACLE INTERNATIONAL CORPORATION

Techniques are provided for using an intermediate cache to provide some of the items involved in a scan operation, while other items involved in the scan operation are provided from primary storage. Techniques are also provided for determining whether to service an I/O request for an item with a copy of the item that resides in the intermediate cache based on factors such as a) an identity of the user for whom the I/O request was submitted, b) an identity of a service that submitted the I/O request, c) an indication of a consumer group to which the I/O request maps, or d) whether the intermediate cache is overloaded. Techniques are also provided for determining whether to store items in an intermediate cache in response to the items being retrieved, based on logical characteristics associated with the requests that retrieve the items.

Подробнее
15-08-2013 дата публикации

INFORMATION PROCESSING APPARATUS, METHOD OF CONTROLLING MEMORY, AND MEMORY CONTROLLING APPARATUS

Номер: US20130212333A1
Принадлежит: FUJITSU LIMITED

An information processing apparatus provided with a plurality of nodes each including at least one processor, a system controller, and a main memory, includes a status storage unit that stores statuses of a plurality of cache lines and that is capable of reading statuses of a plurality of cache lines by one reading operation, a recording unit that is provided in a system controller in at least one node and that records all or part of the statuses stored in the status storage unit, wherein the system controller records obtained statuses in the recording unit on a condition that all of the statuses of the plurality of cache lines obtained by reading the status storage unit are invalid statuses or shared statuses in different nodes when the system controller has read the status storage unit in response to a request.

Подробнее
19-09-2013 дата публикации

METHOD AND SYSTEM FOR DYNAMICALLY POWER SCALING A CACHE MEMORY OF A MULTI-CORE PROCESSING SYSTEM

Номер: US20130246825A1
Принадлежит: RESEARCH IN MOTION LIMITED

A system and method of power scaling cache memory of a multi-core processing system includes a plurality of core processors, a cache memory and a controller. The cache memory includes partitioned cache and shared cache. The shared cache can be partitioned into the partitioned cache. Each core processor is communicatively coupled to at least one corresponding partitioned cache and the shared cache. The controller is communicatively coupled to each of the core processors, to the partitioned cache, and to the shared cache. The controller is configured to cause the at least one corresponding partitioned cache to power down in response to the corresponding core processor powering down. The controller can also be configured to flush the cache lines of the partitioned cache prior to powering down the partitioned cache in response to the corresponding processor powering down.

Подробнее
03-10-2013 дата публикации

Translation lookaside buffer for multiple context compute engine

Номер: US20130262816A1
Принадлежит: Intel Corp

Some implementations disclosed herein provide techniques and arrangements for a specialized logic engine that includes a translation lookaside buffer to support multiple threads executing on multiple cores. The translation lookaside buffer enables the specialized logic engine to directly access a virtual address of a thread executing on one of the plurality of processing cores. For example, an acceleration compute engine may receive one or more instructions from a thread executed by a processing core. The acceleration compute engine may retrieve, based on an address space identifier associated with the one or more instructions, a physical address associated with the one or more instructions from the translation lookaside buffer to execute the one or more instructions using the physical address.

Подробнее
17-10-2013 дата публикации

CACHING FOR HETEROGENEOUS PROCESSORS

Номер: US20130275681A1
Принадлежит:

A multi-core processor providing heterogeneous processor cores and a shared cache is presented.

Подробнее
24-10-2013 дата публикации

MANAGING CONCURRENT ACCESSES TO A CACHE

Номер: US20130282985A1

Various embodiments of the present invention allow concurrent accesses to a cache. A request to update an object stored in a cache is received. A first data structure comprising a new value for the object is created in response to receiving the request. A cache pointer is atomically modified to point to the first data structure. A second data structure comprising an old value for the cached object is maintained until a process which holds a pointer to the old value of the cached object either ends or indicates that the old value is no longer needed.
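
A minimal sketch of the pointer-swap update; in CPython a single attribute rebind is atomic, while a real system would use an atomic compare-and-swap plus reference counting or RCU-style grace periods, so this is an analogy, not the patent's mechanism.

# Sketch: update a cached object by atomically swapping a pointer to a new
# value while readers holding the old pointer keep a stable snapshot.
class CachedObject:
    def __init__(self, value):
        self.value = value                    # treated as an immutable snapshot

class Cache:
    def __init__(self):
        self.slot = CachedObject("v1")

    def update(self, new_value):
        self.slot = CachedObject(new_value)   # one-shot pointer swap; the old
                                              # object lives on while readers
                                              # still hold a reference to it

c = Cache()
reader_view = c.slot                 # a concurrent reader grabs the pointer
c.update("v2")                       # writer swaps in the new structure
print(reader_view.value, c.slot.value)   # v1 v2; the old value is retained
                                         # until the reader drops its reference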

05-12-2013 publication date

INFORMATION PROCESSING APPARATUS, MEMORY APPARATUS, AND DATA MANAGEMENT METHOD

Number: US20130326146A1
Author: ABE Tomonori
Assignee:

An information processing apparatus that appropriately manages data of an auxiliary memory apparatus is provided to prevent data from leaking. The information processing apparatus includes a first memory apparatus, a second memory apparatus, and a caching unit. The caching unit stores write data to be written on the second memory apparatus in a cache area ensured on the first memory apparatus. When a first event occurs, the caching unit initializes a management information table, in which the address of the cache area in which the write data is stored is associated with the address of the second memory apparatus in which the write data is to be stored, and restores the second memory apparatus to a state previous to the state in which the data was written.

1. An information processing apparatus comprising: a first memory apparatus; a second memory apparatus; and a caching unit that stores write data to be written on the second memory apparatus in a cache area ensured on the first memory apparatus, wherein, when a first event occurs, the caching unit initializes a management information table, in which an address of the cache area in which the write data is stored is associated with an address of the second memory apparatus in which the write data is to be stored, so as to restore the second memory apparatus to the state before the writing of the write data.
2. The information processing apparatus according to claim 1, wherein the second memory apparatus is a removable device which is able to be ejected, and the first event is a request for detaching the second memory apparatus.
3. The information processing apparatus according to claim 1, wherein, when a second event occurs, the caching unit writes the write data stored in the cache area on the second memory apparatus based on the management information table and updates the second memory apparatus to a latest state.
4. The information processing apparatus according to claim 3, ...
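A short Python sketch of the management-information-table idea follows; every name here is an assumption for illustration, with dict-like objects standing in for the cache area and the removable device:

    class CachingUnit:
        def __init__(self, cache_area):
            self.cache_area = cache_area   # first memory apparatus
            self.table = {}                # device address -> cache-area address

        def write(self, device_addr, cache_addr, data):
            self.cache_area[cache_addr] = data    # data lands only in the cache
            self.table[device_addr] = cache_addr

        def on_detach_request(self):
            # First event: initializing the table discards the pending writes,
            # leaving the removable device exactly as it was before any write.
            self.table.clear()

        def on_commit_request(self, device):
            # Second event: replay the table so the device reaches its latest state.
            for device_addr, cache_addr in self.table.items():
                device[device_addr] = self.cache_area[cache_addr]
            self.table.clear()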

05-12-2013 publication date

Short circuit of probes in a chain

Number: US20130326147A1
Assignee: Intel Corp

A multi-core processing apparatus may provide a cache probe and data retrieval method. The method may comprise sending a memory request from a requester to a record keeping structure. The memory request may have a memory address of a memory that stores requested data. The method may further comprise determining that a local last accessor of the memory address may have a copy of the requested data up to date with the memory. The local last accessor may be within a local domain that the requester belongs to. The method may further comprise sending a cache probe to the local last accessor and retrieving a latest value of the requested data from the local last accessor to the requester.

19-12-2013 publication date

IDENTIFYING AND PRIORITIZING CRITICAL INSTRUCTIONS WITHIN PROCESSOR CIRCUITRY

Number: US20130339595A1
Assignee:

In one embodiment, the present invention includes a method for identifying a memory request corresponding to a load instruction as a critical transaction if an instruction pointer of the load instruction is present in a critical instruction table associated with a processor core, sending the memory request to a system agent of the processor with a critical indicator to identify the memory request as a critical transaction, and prioritizing the memory request ahead of other pending transactions responsive to the critical indicator. Other embodiments are described and claimed. 1. A processor comprising:a first core to execute instructions, the first core including a pipeline having a reorder buffer (ROB) including a plurality of entries each associated with an instruction received in the pipeline, and a critical instruction logic to determine whether a load instruction is a critical instruction and if so to send a memory request transaction associated with the load instruction to a system agent of the processor with a critical indicator to indicate the critical instruction; andthe system agent coupled to the first core and including a distributed cache controller having a plurality of portions each associated with a corresponding portion of a distributed shared cache memory, a memory controller to interface with a system memory coupled to the processor, and an interconnect to couple the distributed shared cache memory and the distributed cache controller with the first core, wherein the system agent is to prioritize the memory request transaction when indicated to be a critical instruction.2. The processor of claim 1 , wherein the system agent comprises a first arbiter associated with the interconnect claim 1 , the first arbiter to prioritize the memory request transaction including the critical indicator ahead of at least one other transaction present in a buffer to store pending transactions for insertion onto the interconnect.3. The processor of claim 2 , wherein ...
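The mechanism is easy to picture in software terms. Below is a hedged Python sketch (the table contents, request format, and agent queue are illustrative assumptions, not the patent's microarchitecture) of tagging a load whose instruction pointer appears in a critical instruction table and arbitrating it ahead of other pending transactions:

    # instruction pointers previously identified as critical (illustrative values)
    critical_instruction_table = {0x400A10, 0x400B54}

    class SystemAgent:
        def __init__(self):
            self.pending = []

        def enqueue(self, request):
            self.pending.append(request)
            # critical transactions move ahead of non-critical ones;
            # the sort is stable, so each class stays in arrival order
            self.pending.sort(key=lambda r: not r["critical"])

    def issue_load(ip, address, agent):
        agent.enqueue({
            "addr": address,
            "critical": ip in critical_instruction_table,  # the critical indicator
        })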

19-12-2013 publication date

MANAGING TRANSACTIONAL AND NON-TRANSACTIONAL STORE OBSERVABILITY

Number: US20130339616A1

Embodiments relate to controlling observability of transactional and non-transactional stores. An aspect includes receiving one or more store instructions. The one or more store instructions are initiated within an active transaction and include store data. The active transaction effectively delays committing stores to memory until successful completion of the active transaction. The store data is stored in a local storage buffer causing alterations to the local storage buffer from a first state to a second state. A signal is received that the active transaction has terminated. If the active transaction has terminated abnormally then: the local storage buffer is reverted back to the first state if the store data was stored by a transactional store instruction, and is propagated to a shared cache if the store instruction is non-transactional. 1. A method for controlling observability of transactional and non-transactional stores , the method comprising:receiving, by a processing circuit, one or more store instructions, the one or more store instructions initiated within an active transaction and including store data, the active transaction effectively delaying committing stores to memory until successful completion of the active transaction;storing the store data in a local storage buffer, the storing causing alterations to the local storage buffer from a first state to a second state;receiving a signal that the active transaction has terminated;based on determining that the active transaction terminated abnormally, for each stored data of the one or more store instructions performing:based on determining that the store data was stored by a transactional store instruction, reverting the local storage buffer back to the first state; andbased on determining that the stored data was stored by a non-transactional store instruction, propagating the second state having the stored data to a shared cache.2. The method of claim 1 , wherein all storage alterations by all of ...

02-01-2014 publication date

Method and Apparatus For Bus Lock Assistance

Number: US20140006661A1
Assignee: Intel Corp

A method is described that includes detecting that an instruction of a thread is a locked instruction. The method also includes determining that execution of said instruction includes imposing a bus lock. The method also includes executing a bus lock assistance function in response to said determining, said bus lock assistance function including a function associated with said bus lock other than implementation of a bus lock protocol.

02-01-2014 publication date

Data control using last accessor information

Number: US20140006716A1
Assignee: Intel Corp

In some implementations, a shared cache structure may be provided for sharing data among a plurality of processor cores. A data structure may be associated with the shared cache structure, and may include a plurality of entries, with each entry corresponding to one of the cache lines in the shared cache. Each entry in the data structure may further include a field to identify a processor core that most recently requested the data of the cache line corresponding to the entry. When a request for a particular cache line is received, a request for the data may be sent to a particular processor core identified in the data structure as the last accessor of the data.
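A compact Python sketch of the last-accessor bookkeeping is given below (structure and names are assumptions, not the patent's implementation):

    class SharedCacheDirectory:
        """One entry per cache line, recording the core that last requested it."""
        def __init__(self, num_lines):
            self.last_accessor = [None] * num_lines

        def request(self, line, requester, send_probe):
            owner = self.last_accessor[line]
            if owner is not None and owner != requester:
                send_probe(owner, line)   # fetch the data from the last accessor
            self.last_accessor[line] = requester

    directory = SharedCacheDirectory(num_lines=4)
    directory.request(0, requester=1, send_probe=lambda c, l: None)
    directory.request(0, requester=2, send_probe=lambda c, l: print("probe core", c))

This prints "probe core 1": the second request is steered to the core named in the directory entry rather than broadcast to all cores.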

09-01-2014 publication date

ENSURING CAUSALITY OF TRANSACTIONAL STORAGE ACCESSES INTERACTING WITH NON-TRANSACTIONAL STORAGE ACCESSES

Number: US20140013055A1

A data processing system implements a weak consistency memory model for a distributed shared memory system. The data processing system concurrently executes, on a plurality of processor cores, one or more transactional memory instructions within a memory transaction and one or more non-transactional memory instructions. The one or more non-transactional memory instructions include a non-transactional store instruction. The data processing system commits the memory transaction to the distributed shared memory system only in response to enforcement of causality of the non-transactional store instruction with respect to the memory transaction. 1. A method , comprising:in a data processing system implementing a weak consistency memory model for a distributed shared memory system, concurrently executing on a plurality of processor cores one or more transactional memory instructions within a memory transaction and one or more non-transactional memory instructions, wherein the one or more non-transactional memory instructions include a non-transactional store instruction; andcommitting the memory transaction to the distributed shared memory system of the data processing system only in response to enforcement of causality of the non-transactional store instruction with respect to the memory transaction.2. The method of claim 1 , wherein:the data processing system includes a system fabric that distributes memory access requests to the shared distributed memory system;the memory transaction includes a transactional load instruction that reads a value stored to the distributed shared memory system by the non-transactional store instruction;the method further includes snooping, on the system fabric, a store request corresponding to the non-transactional store instruction and snooping, on the system fabric, a read request corresponding to the transactional load instruction; and 'in response to snooping the read request while the store request is pending, preventing the memory ...

09-01-2014 publication date

Systems, methods and apparatus for cache transfers

Number: US20140013059A1
Assignee: Fusion IO LLC

A virtual machine cache provides for maintaining a working set of the cache during a transfer between virtual machine hosts. In response to a virtual machine transfer, the previous host of the virtual machine is configured to retain cache data of the virtual machine, which may include both cache metadata and data that has been admitted into the cache. The cache data may be transferred to the destination host via a network (or other communication mechanism). The destination host populates a virtual machine cache with the transferred cache data to thereby reconstruct the working state of the cache.

09-01-2014 publication date

Ensuring causality of transactional storage accesses interacting with non-transactional storage accesses

Number: US20140013060A1
Assignee: International Business Machines Corp

A data processing system implements a weak consistency memory model for a distributed shared memory system. The data processing system concurrently executes, on a plurality of processor cores, one or more transactional memory instructions within a memory transaction and one or more non-transactional memory instructions. The one or more non-transactional memory instructions include a non-transactional store instruction. The data processing system commits the memory transaction to the distributed shared memory system only in response to enforcement of causality of the non-transactional store instruction with respect to the memory transaction.

23-01-2014 publication date

Technique for using memory attributes

Number: US20140025901A1
Assignee: Individual

A technique for using memory attributes to relay information to a program or other agent. More particularly, embodiments of the invention relate to using memory attribute bits to check various memory properties in an efficient manner.

30-01-2014 publication date

Sharing Pattern-Based Directory Coherence for Multicore Scalability ("SPACE")

Number: US20140032848A1
Assignee: UNIVERSITY OF ROCHESTER

A method and directory system that recognizes and represents the subset of sharing patterns present in an application is provided. As used herein, the term sharing pattern refers to a group of processors accessing a single memory location in an application. The sharing pattern is decoupled from each cache line and held in a separate directory table. The sharing pattern of a cache block is the bit vector representing the processors that share the block. Multiple cache lines that have the same sharing pattern point to a common entry in the directory table. In addition, when the table capacity is exceeded, patterns that are similar to each other are dynamically collated into a single entry.

1. A system for decoupling the metadata representing sharing patterns from the address tags representing the data blocks, in a cache coherent shared memory computer, comprising: a directory table for storing in each entry of the directory table a unique sharing pattern that an application exhibits when it is executing; and a cache having a pointer associated with the address tag of each of its data blocks (cache lines), where each pointer points to an entry in the directory table.
2. A method of decoupling the metadata representing sharing patterns from the address tags representing the data blocks, in a cache coherent shared memory computer, comprising: providing a directory table for storing in each entry of the directory table a unique sharing pattern that an application exhibits when it is executing; and providing a cache having a pointer associated with the address tag of each of its data blocks (cache lines), where each pointer points to an entry in the directory table.
3. The method of claim 2, further comprising determining when a sharing pattern in the directory table is no longer in use in the pattern table.
4. The method of claim 3, wherein determining when a sharing pattern in the directory table is no longer in use in the pattern table comprises: providing a reference counter ...
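The decoupling amounts to interning sharing bit-vectors. The Python sketch below (names are ours; real hardware would bound the table and collate similar patterns when it fills) shows multiple lines sharing one table entry, with a reference counter detecting when a pattern falls out of use:

    class PatternTable:
        def __init__(self):
            self.ids = {}        # sharing pattern (bit vector as int) -> entry id
            self.refcount = {}   # entry id -> number of cache lines pointing at it
            self.next_id = 0

        def intern(self, pattern):
            """Return the table entry id for a pattern; lines store this pointer."""
            if pattern not in self.ids:
                self.ids[pattern] = self.next_id
                self.refcount[self.next_id] = 0
                self.next_id += 1
            entry = self.ids[pattern]
            self.refcount[entry] += 1
            return entry

        def release(self, pattern):
            entry = self.ids[pattern]
            self.refcount[entry] -= 1
            if self.refcount[entry] == 0:      # pattern no longer in use
                del self.refcount[entry]
                del self.ids[pattern]

    table = PatternTable()
    a = table.intern(0b0011)   # line shared by processors 0 and 1
    b = table.intern(0b0011)   # second line with the same pattern
    assert a == b              # both lines point at one directory entry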

06-02-2014 publication date

SYSTEM AND METHOD OF CACHING INFORMATION

Number: US20140040559A1
Assignee: GOOGLE INC.

A system and method is provided wherein, in one aspect, a currently-requested item of information is stored in a cache based on whether it has been previously requested and, if so, the time of the previous request. If the item has not been previously requested, it may not be stored in the cache. If the subject item has been previously requested, it may or may not be cached based on a comparison of durations, namely (1) the duration of time between the current request and the previous request for the subject item and (2) for each other item in the cache, the duration of time between the current request and the previous request for the other item. If the duration associated with the subject item is less than the duration of another item in the cache, the subject item may be stored in the cache. 1. A method comprising:determining whether an item of information has been previously requested within a predetermined period;processing, with one or more processors, the item of information without storing the item in a cache when the item has not been previously requested within the predetermined period;processing, with the one or more processors, the item of information without storing the item of information in the cache when the item of information has been previously requested within the predetermined period and the time of the previous request is earlier than a latest request for each item within a set of items stored in the cache; andprocessing, with the one or more processors, the item of information and storing the item of information in the cache when the item of information has been previously requested within the predetermined period and the time of the previous request is later than the latest request of at least one item within the set of items stored in the cache.2. The method of claim 1 , wherein the item of information comprises audio or visual data to be rendered at the client device.3. The method of claim 1 , wherein the item of information comprises a file. ...
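The admission rule is concrete enough to simulate. Here is a hedged Python sketch (structure and names assumed) that caches a re-requested item only if its request-to-request interval beats the staleness of some already-cached item:

    import time

    class AdmissionCache:
        def __init__(self, capacity):
            self.capacity = capacity
            self.cached = {}    # item -> time of its latest request
            self.history = {}   # item -> time of its previous request

        def request(self, item):
            """Returns True on a cache hit, False otherwise."""
            now = time.monotonic()
            previous = self.history.get(item)
            self.history[item] = now
            if item in self.cached:
                self.cached[item] = now
                return True
            if previous is None:
                return False                     # first request: do not cache
            if len(self.cached) < self.capacity:
                self.cached[item] = now
                return False
            # compare the item's request interval against the time since the
            # least-recently-requested cached item was last requested
            stalest = min(self.cached, key=self.cached.get)
            if now - previous < now - self.cached[stalest]:
                del self.cached[stalest]
                self.cached[item] = now
            return False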

13-02-2014 publication date

System and method of caching information

Number: US20140047191A1
Assignee: Google LLC

A system and method is provided wherein, in one aspect, a currently-requested item of information is stored in a cache based on whether it has been previously requested and, if so, the time of the previous request. If the item has not been previously requested, it may not be stored in the cache. If the subject item has been previously requested, it may or may not be cached based on a comparison of durations, namely (1) the duration of time between the current request and the previous request for the subject item and (2) for each other item in the cache, the duration of time between the current request and the previous request for the other item. If the duration associated with the subject item is less than the duration of another item in the cache, the subject item may be stored in the cache.

13-02-2014 publication date

Providing service address space for diagnostics collection

Number: US20140047204A1
Assignee: International Business Machines Corp

A method and system are provided for providing a service address space for diagnostics collection. The system includes: a service co-processor attached to a main processor, wherein the service co-processor maintains an independent copy of the main processor's address space in the form of a service address space; and a storage update receiving component for updating the service address space by receiving storage update packets from the main processor and applying these to the service address space. An instruction pipe may be provided between the main processor and the service co-processor. The main processor may include: a service delegation component for delegating collection of diagnostic data to the co-processor by sending a collection command from the main processor to the service co-processor for collection of data from the service address space.

20-02-2014 publication date

PROCESSOR AND CONTROL METHOD FOR PROCESSOR

Number: US20140052923A1
Author: IKEDA Yoshiro
Assignee:

A processor includes a plurality of nodes arranged two dimensionally in the X-axis direction and in the Y-axis direction, and each of the nodes includes a processor core and a distributed shared cache memory. The processor also includes a first connecting unit and a second connecting unit. The first connecting unit connects adjacent nodes in the X-axis direction among the nodes, in a ring shape. The second connecting unit connects adjacent nodes in the Y-axis direction among the nodes, in a ring shape. The cache memories included in the respective nodes are divided into banks in the Y-axis direction. Coherency of the cache memories in the X-axis direction is controlled by a snoop system. The cache memories are shared by the nodes. 1. A processor in which a plurality of nodes , each including a processor core and a distributed shared cache memory , are arranged two-dimensionally in an X-axis direction and a Y-axis direction , the processor comprising:a first connecting unit that connects adjacent nodes in the X-axis direction among the nodes, in a ring shape; anda second connecting unit that connects adjacent nodes in the Y-axis direction among the nodes, in a ring shape, whereinthe cache memories included in the respective nodes are divided into banks in the Y-axis direction, coherency of the cache memories in the X-axis direction is controlled by a snoop system, and the cache memories are shared by the nodes.2. The processor according to claim 1 , wherein connects a node located at a position other than both ends in the X-axis direction and a node located adjacent to the node located at the position other than both ends in the X-axis direction,', 'connects a node located at the either end in the X-axis direction and a node adjacent to the node located at the either end in the X-axis direction and connects the node located at the either end in the X-axis direction and a node adjacent to the node adjacent to the node located at the either end in the X-axis direction, ...

06-03-2014 publication date

Systems, methods, and interfaces for adaptive cache persistence

Number: US20140068197A1
Assignee: Fusion IO LLC

A storage module may be configured to service I/O requests according to different persistence levels. The persistence level of an I/O request may relate to the storage resource(s) used to service the I/O request, the configuration of the storage resource(s), the storage mode of the resources, and so on. In some embodiments, a persistence level may relate to a cache mode of an I/O request. I/O requests pertaining to temporary or disposable data may be serviced using an ephemeral cache mode. An ephemeral cache mode may comprise storing I/O request data in cache storage without writing the data through (or back) to primary storage. Ephemeral cache data may be transferred between hosts in response to virtual machine migration.

03-04-2014 publication date

PERFORMANCE-DRIVEN CACHE LINE MEMORY ACCESS

Number: US20140095796A1

According to one aspect of the present disclosure, a method and technique for performance-driven cache line memory access is disclosed. The method includes: receiving, by a memory controller of a data processing system, a request for a cache line; dividing the request into a plurality of cache subline requests, wherein at least one of the cache subline requests comprises a high priority data request and at least one of the cache subline requests comprises a low priority data request; servicing the high priority data request; and delaying servicing of the low priority data request until a low priority condition has been satisfied. 1. A method , comprising:receiving, by a memory controller of a data processing system, a request for a cache line;dividing the request into a plurality of cache subline requests, wherein at least one of the cache subline requests comprises a high priority data request and at least one of the cache subline requests comprises a low priority data request;servicing the high priority data request; anddelaying servicing of the low priority data request until a low priority condition has been satisfied.2. The method of claim 1 , further comprising:placing the low priority data request into a queue;initiating a timer; andresponsive to expiration of the timer, servicing the low priority data request.3. The method of claim 1 , further comprising:placing the low priority data request into a queue;determining bus utilization; andresponsive to the bus utilization being below a threshold, servicing the low priority data request.4. The method of claim 1 , further comprising:placing the low priority data request into a queue; andresponsive to a processor core cancelling the low priority data request, removing the low priority data request from the queue.5. The method of claim 1 , further comprising:placing the low priority data request into a low priority queue; andresponsive to receiving a sector address request corresponding to the low priority data ...
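As a rough illustration, the split can be sketched in a few lines of Python (the sector count, sizes, and queue policy below are assumptions, not the patent's parameters):

    def split_request(line_addr, critical_sector, sectors=4, sector_bytes=32):
        """Split one cache-line request into prioritized subline requests."""
        return [
            {"addr": line_addr + s * sector_bytes,
             "priority": "high" if s == critical_sector else "low"}
            for s in range(sectors)
        ]

    requests = split_request(0x1000, critical_sector=1)
    high = [r for r in requests if r["priority"] == "high"]
    low = [r for r in requests if r["priority"] == "low"]
    # `high` is serviced immediately; per the claims, `low` waits in a queue
    # until a timer expires, bus utilization drops below a threshold, or the
    # requesting core cancels it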

05-01-2017 publication date

SYSTEM PERFORMANCE MANAGEMENT USING PRIORITIZED COMPUTE UNITS

Number: US20170004080A1
Assignee: Advanced Micro Devices, Inc.

Methods, devices, and systems for managing performance of a processor having multiple compute units. An effective number of the multiple compute units may be determined to designate as having priority. On a condition that the effective number is nonzero, the effective number of the multiple compute units may each be designated as a priority compute unit. Priority compute units may have access to a shared cache whereas non-priority compute units may not. Workgroups may be preferentially dispatched to priority compute units. Memory access requests from priority compute units may be served ahead of requests from non-priority compute units.

1. A method for managing performance of a processor having multiple compute units, the method comprising: determining an effective number of the multiple compute units to designate as having priority; and, when the effective number is nonzero, designating the effective number of the multiple compute units each as a priority compute unit.
2. The method of claim 1, further comprising allowing each priority compute unit to allocate into a shared cache.
3. The method of claim 1, further comprising disallowing a compute unit which is not a priority compute unit to allocate into a shared cache.
4. The method of claim 1, further comprising prioritizing access to a memory by a priority compute unit over a compute unit which is not a priority compute unit.
5. The method of claim 1, further comprising serving a pending request for access to a memory by a priority compute unit prior to serving any pending request for access to the memory by a compute unit which is not a priority compute unit.
6. The method of claim 1, wherein the determining is performed dynamically.
7. The method of claim 1, wherein the determining comprises set dueling.
8. The method of claim 1, further comprising dispatching a workgroup to a priority compute unit preferentially to dispatching the workgroup to a compute unit which is not a priority compute unit.
9. A ...

07-01-2016 publication date

Detecting cache conflicts by utilizing logical address comparisons in a transactional memory

Number: US20160004643A1
Assignee: International Business Machines Corp

A processor in a multi-processor configuration is configured to perform dynamic address translation from logical addresses to real addresses and to detect memory conflicts for shared logical memory in transactional memory based on logical (virtual) address comparisons.

07-01-2016 publication date

SYSTEM AND METHOD OF ARBITRATING CACHE REQUESTS

Number: US20160004651A1
Author: WANG CHUNLIN
Assignee:

This disclosure relates to arbitration of different types of requests to access a cache. Features of this disclosure can be implemented in a graphics processing unit (GPU). In one embodiment, an arbiter can receive requests from a color processor and a depth processor and determine which of the received requests has the highest priority. The request with the highest priority can then be provided to the cache. The priority can be configurable. The arbiter can determine priority, for example, based on whether a location in the cache associated with a request is available, a weight associated with the request, a number of requests of a particular type processed by the arbiter, or any combination thereof.

1. An apparatus comprising: a cache configured to store data; and an arbiter comprising electronic hardware, the arbiter configured to: assign weights to different types of cache requests based on information received by the arbiter; receive a request of a first type to access the cache; receive a request of a second type to access the cache; determine which of the received requests has a higher priority based at least partly on the weights assigned to the first type of request and the second type of request; and provide the cache with the received request determined to have the higher priority.
2. The apparatus of claim 1, wherein the arbiter is configured to determine the higher priority based at least partly on an indication of whether a location in the cache associated with the first request is available.
3. The apparatus of claim 1, wherein the arbiter comprises a plurality of input counters, each of the plurality of input counters configured to count a number of requests of a respective one of the different types of cache requests processed by the arbiter.
4. The apparatus of claim 3, wherein the arbiter is configured to determine the higher priority based at least partly on a comparison of a selected ...
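A toy Python model of such an arbiter is shown below (the weight scheme and tie-breaking are our assumptions; real hardware would use counters and comparators):

    class Arbiter:
        def __init__(self, weights):
            self.weights = dict(weights)            # request type -> weight
            self.served = {t: 0 for t in weights}   # per-type input counters

        def pick(self, pending):
            """pending: request type -> request or None; returns the winner."""
            candidates = [t for t, r in pending.items() if r is not None]
            # favor the type whose share of service is furthest below its weight
            winner = min(candidates, key=lambda t: self.served[t] / self.weights[t])
            self.served[winner] += 1
            return pending[winner]

    arb = Arbiter({"color": 2, "depth": 1})
    for _ in range(6):
        print(arb.pick({"color": "color-req", "depth": "depth-req"}))
    # color requests win roughly twice as often as depth requests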

04-01-2018 publication date

PROGRESSIVE FINE TO COARSE GRAIN SNOOP FILTER

Number: US20180004663A1
Assignee: ARM LIMITED

A data processing system includes a snoop filter organized as a number of lines, each storing an address tag associated with the address of data stored in one or more caches of the system, a coherency state of the data, and presence data. A snoop controller sends snoop messages in response to data access requests. The presence data is configurable in a first format, in which the value of a bit in the presence data is indicative of a subset of the nodes for which at least one node in the subset has a copy of the data in its local cache, and in a second format, in which the presence data comprises a unique identifier of a node having a copy of the data in its local cache. The snoop controller sends snoop messages to the nodes indicated by the presence data.

1. A method of operation of a snoop filter of a data processing system having a plurality of nodes, where each request node has a local cache, where the plurality of nodes are grouped in a plurality of subsets and where each subset consists of one or more nodes, the method comprising: accessing the snoop filter, dependent upon a data address received from a first node of the plurality of nodes, to retrieve format data and presence data, where the format data is indicative of a format of the presence data; when the format data indicates a first format for the presence data: identifying, from positions of set bits within the retrieved presence data, one or more subsets of the plurality of subsets, and sending a snoop message to each node in each subset of the identified one or more subsets; and when the format data indicates a second format for the presence data: determining, from the retrieved presence data, one or more unique identifiers of nodes, each of the one or more identified nodes having a copy of data associated with the data address in its local cache, and sending a snoop message to each of the one or more nodes.
2. The method of claim 1, where the format data ...
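The two presence-data formats decode naturally as shown below; this Python sketch uses an invented encoding purely for illustration (the actual bit layout is not specified in the abstract):

    def snoop_targets(format_bit, presence, subsets):
        """Decode presence data into the list of nodes to snoop."""
        if format_bit == 0:
            # coarse format: bit i set => snoop every node in subsets[i]
            targets = []
            for i, subset in enumerate(subsets):
                if presence & (1 << i):
                    targets.extend(subset)
            return targets
        # fine format: presence is the unique identifier of a single node
        return [presence]

    subsets = [[0, 1], [2, 3], [4, 5]]
    print(snoop_targets(0, 0b101, subsets))   # coarse: nodes 0, 1, 4, 5
    print(snoop_targets(1, 3, subsets))       # fine: node 3 only

The "progressive fine to coarse" of the title suggests a line can start in the precise single-node format and fall back to the coarse subset format as more sharers appear.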

04-01-2018 publication date

IDENTIFICATION OF A COMPUTING DEVICE ACCESSING A SHARED MEMORY

Number: US20180004665A1
Assignee:

A method for identifying, in a system including two or more computing devices that are able to communicate with each other, with each computing device having a cache and connected to a corresponding memory, a computing device accessing one of the memories, includes monitoring memory access to any of the memories; monitoring cache coherency commands between computing devices; and identifying the computing device accessing one of the memories by using information related to the memory access and cache coherency commands.

1. A method for identifying, in a system including two or more computing devices that are able to communicate with each other, with each computing device having a cache and connected to a corresponding memory, the computing device accessing one of the memories, the method comprising: monitoring memory access to any of the memories, wherein monitoring memory access comprises identifying respective computing devices that access respective memories; monitoring cache coherency commands between computing devices; and identifying the computing device accessing one of the memories by using information related to the memory access and cache coherency commands.
2. The method of claim 1, wherein monitoring memory access to any of the memories further comprises collecting an access time, a type of command, and a first memory address from a first memory read access to a first memory based on information acquired from a first probe attached to a first bus connecting the first memory to a first computing device, wherein the first memory is remote from the first computing device, and wherein a cache line in the first computing device is in an invalid state; wherein monitoring cache coherency commands further comprises monitoring a first cache coherency command sent from the first computing device to at least a second computing device at a first cache coherency time based on information acquired from a second probe attached to ...

04-01-2018 publication date

IDENTIFICATION OF A COMPUTING DEVICE ACCESSING A SHARED MEMORY

Number: US20180004666A1
Assignee:

A method for identifying, in a system including two or more computing devices that are able to communicate with each other, with each computing device having a cache and connected to a corresponding memory, a computing device accessing one of the memories, includes monitoring memory access to any of the memories; monitoring cache coherency commands between computing devices; and identifying the computing device accessing one of the memories by using information related to the memory access and cache coherency commands.

1. A method for identifying, in a system including two or more computing devices that are able to communicate with each other via an interconnect, with each computing device provided with a cache and connected to the corresponding memory, the computing device accessing a first memory being one of the memories, the method comprising: monitoring memory access to the first memory via a memory device connected to the first memory, wherein monitoring memory access comprises identifying respective computing devices that access respective memories; monitoring cache coherency commands between computing devices via an interconnect between the computing devices and storing information related to the commands; identifying a command from a history of information related to the commands including a memory address identical to the memory address in the memory access to the first memory; and identifying, as the computing device accessing the first memory, the computing device issuing the identified command at the timing closest to the timing of the memory access to the first memory.
2. The method of claim 1, wherein monitoring memory access to the first memory further comprises collecting an access time, a type of command, and a first memory address from a first memory read access to the first memory based on information acquired from a first probe attached to a first bus connecting the first memory to a first computing device, wherein the ...

07-01-2021 publication date

HARDWARE/SOFTWARE CO-OPTIMIZATION TO IMPROVE PERFORMANCE AND ENERGY FOR INTER-VM COMMUNICATION FOR NFVS AND OTHER PRODUCER-CONSUMER WORKLOADS

Number: US20210004328A1
Assignee:

Methods and apparatus implementing Hardware/Software co-optimization to improve performance and energy for inter-VM communication for NFVs and other producer-consumer workloads. The apparatus include multi-core processors with multi-level cache hierarchies including an L1 and L2 cache for each core and a shared last-level cache (LLC). One or more machine-level instructions are provided for proactively demoting cachelines from lower cache levels to higher cache levels, including demoting cachelines from L1/L2 caches to an LLC. Techniques are also provided for implementing hardware/software co-optimization in multi-socket NUMA architecture systems, wherein cachelines may be selectively demoted and pushed to an LLC in a remote socket. In addition, techniques are disclosed for implementing early snooping in multi-socket systems to reduce latency when accessing cachelines on remote sockets.

1. A processor, configured to be implemented in a computer system, comprising: a plurality of cores, each having at least one associated cache occupying a respective level in a cache hierarchy; a last level cache (LLC), communicatively coupled to the plurality of cores; and a memory controller, communicatively coupled to the plurality of cores, configured to support access to external system memory when the processor is installed in the computer system; wherein each of the caches associated with a core, and the LLC, include a plurality of cacheline slots for storing cacheline data, and wherein the processor is further configured to support a machine instruction that when executed causes the processor to demote a cacheline from a lower-level cache to a higher-level cache.

The present application claims priority to U.S. patent application Ser. No. 14/583,389, entitled "HARDWARE/SOFTWARE CO-OPTIMIZATION TO IMPROVE PERFORMANCE AND ENERGY FOR INTER-VM COMMUNICATION FOR NFVS AND OTHER PRODUCER-CONSUMER WORKLOADS," and filed on Dec. 26, 2014, the entirety of which is incorporated by reference ...

02-01-2020 publication date

CACHE MANAGEMENT IN A STREAM COMPUTING ENVIRONMENT THAT USES A SET OF MANY-CORE HARDWARE PROCESSORS

Number: US20200004682A1
Assignee:

Disclosed aspects relate to cache management in a stream computing environment that uses a set of many-core hardware processors to process a stream of tuples by a plurality of processing elements which operate on the set of many-core hardware processors. The stream of tuples to be processed by the plurality of processing elements which operate on the set of many-core hardware processors may be received. A tuple-processing hardware-route on the set of many-core hardware processors may be determined based on a cache factor associated with the set of many-core hardware processors. The stream of tuples may be routed based on the tuple-processing hardware-route on the set of many-core hardware processors. The stream of tuples may be processed by the plurality of processing elements which operate on the set of many-core hardware processors. 1. A computer-implemented method for cache management in a stream computing environment that uses a set of many-core hardware processors to process a stream of tuples by a plurality of processing elements which operate on the set of many-core hardware processors , the method comprising:determining, based on a cache factor associated with the set of many-core hardware processors, a tuple-processing hardware-route on the set of many-core hardware processors; andprocessing, utilizing the set of many-core hardware processors, a stream of tuples by the plurality of processing elements which operate on the set of many-core hardware processors, wherein the stream of tuples is routed based on the tuple-processing hardware-route.2. The method of claim 1 , further comprising:computing, for cache management in the stream computing environment, a first cache utilization factor for a first cache of a first core of the set of many-core hardware processors;computing, for cache management in the stream computing environment, a second cache utilization factor for a second cache of the first core of the set of many-core hardware processors;resolving, by ...

02-01-2020 publication date

CACHE PARTITIONING MECHANISM

Number: US20200004683A1
Assignee: Intel Corporation

An apparatus to facilitate cache partitioning is disclosed. The apparatus includes a set associative cache to receive access requests from a plurality of agents and partitioning logic to partition the set associative cache by assigning sub-components of a set address to each of the plurality of agents.

1. An apparatus to facilitate cache partitioning, comprising: a set associative cache to receive access requests from a plurality of agents; and partitioning logic to partition the set associative cache by assigning sub-components of a set address to each of the plurality of agents.
2. The apparatus of claim 1, wherein the partitioning logic comprises target conversion logic to assign the sub-components of the set address by separating address bits of a received memory request.
3. The apparatus of claim 2, wherein the partitioning logic separates the address bits of a received memory request into original tag bits, fixed set bits, and variable set bits.
4. The apparatus of claim 3, wherein the target conversion logic calculates updated set bits based on the variable set bits.
5. The apparatus of claim 4, wherein the partitioning logic further comprises a partition assignment table having entries associated with each of the plurality of agents.
6. The apparatus of claim 5, wherein the target conversion logic calculates the updated set bits based on a received client ID associated with an agent and an entry in the partition assignment table associated with the client ID.
7. The apparatus of claim 4, wherein the target conversion logic further calculates updated tag bits.
8. The apparatus of claim 7, wherein the target conversion logic calculates the updated tag bits by adding the variable set bits to the original tag bits.
9. The apparatus of claim 8, wherein the partitioning logic accesses the set associative cache using the updated tag bits and the updated set bits.
10. The apparatus of claim 3, wherein the set associative cache is a translation ...
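The bit manipulation in claims 3 through 9 can be sketched directly; the field widths and table contents below are assumptions chosen for illustration:

    def convert_address(addr, client_id, partition_table,
                        set_bits=10, var_bits=2, offset_bits=6):
        """Replace the variable set bits with a per-agent value and fold the
        displaced bits into the tag, so different agents land in disjoint sets."""
        set_index = (addr >> offset_bits) & ((1 << set_bits) - 1)
        original_tag = addr >> (offset_bits + set_bits)

        fixed = set_index & ((1 << (set_bits - var_bits)) - 1)   # fixed set bits
        variable = set_index >> (set_bits - var_bits)            # variable set bits

        updated_set = (partition_table[client_id] << (set_bits - var_bits)) | fixed
        updated_tag = (original_tag << var_bits) | variable      # keeps lookups unique
        return updated_tag, updated_set

    partition_table = {0: 0b00, 1: 0b01}   # agent -> assigned set region
    print(convert_address(0x12345, client_id=1, partition_table=partition_table))

Folding the variable bits into the tag matters: two addresses that differ only in those bits would otherwise collide once the set index is overwritten.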

02-01-2020 publication date

CACHE REPLACING METHOD AND APPARATUS, HETEROGENEOUS MULTI-CORE SYSTEM AND CACHE MANAGING METHOD

Number: US20200004692A1
Assignee:

This disclosure provides a cache replacing method applied to a heterogeneous multi-core system, the method including: determining whether a first application currently running is an application running on the GPU; when it is determined that the first application currently running is an application running on the GPU, determining a cache priority of first data accessed by the first application according to a performance parameter of the first application, the cache priority of the first data including a priority other than a predefined highest cache priority; and caching the first data into a cache queue of the shared cache according to a predetermined cache replacement algorithm and the cache priority of the first data, and replacing data in the cache queue. 1. A cache replacing method applied to a heterogeneous multi-core system , the heterogeneous multi-core system including at least one central processing unit CPU , at least one graphic processing unit GPU and a shared cache , the method including:determining whether a first application currently running is an application running on the GPU;when it is determined that the first application currently running is an application running on the GPU, determining a cache priority of first data accessed by the first application according to a performance parameter of the first application, the cache priority of the first data including a priority other than a predefined highest cache priority; andcaching the first data into a cache queue of the shared cache according to a predetermined cache replacement algorithm and the cache priority of the first data, and replacing data in the cache queue.2. The method of claim 1 , further including:determining whether a second application currently running is an application running on the CPU;when it is determined that the second application currently running is an application running on the CPU, determining a cache priority of second data accessed by the second application according ...

03-01-2019 publication date

VIRTUAL MACHINE BACKUP

Number: US20190004902A1
Assignee:

A computer system comprises a processor unit arranged to run a hypervisor running one or more virtual machines; a cache connected to the processor unit and comprising a plurality of cache rows, each cache row comprising a memory address, a cache line and an image modification flag; and a memory connected to the cache and arranged to store an image of at least one virtual machine. The processor unit is arranged to define a log in the memory and the cache further comprises a cache controller arranged to set the image modification flag for a cache line modified by a virtual machine being backed up, but not for a cache line modified by the hypervisor operating in privilege mode; periodically check the image modification flags; and write only the memory address of the flagged cache rows in the defined log.

1. A computer system for virtual machine backup, the computer system comprising: a processor unit arranged to run a hypervisor running one or more virtual machines, to run multiple execution threads, and to define a log in memory; a cache connected to the processor unit and comprising a plurality of cache rows, each cache row comprising a memory address, a cache line, and an image modification flag; and a memory connected to the cache, wherein the hypervisor is arranged to maintain a thread mask flagging those threads that relate to one or more virtual machines being backed up; and the cache further comprises a cache controller arranged to: set the image modification flag for a cache line modified by a virtual machine being backed up, by reference to the thread mask; and write only the memory address of the flagged cache rows in the defined log.
2. The computer system of claim 1, wherein the cache controller is further arranged to write the memory address of a flagged cache line in the defined log upon the eviction of the flagged cache row from the cache.
3. The computer system of claim 2, wherein the cache controller is further arranged to write a thread ID of a ...
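A small Python sketch of the flag-and-log behavior follows (the thread mask, row layout, and periodic sweep are modeled loosely; the names are ours):

    class BackupCache:
        def __init__(self):
            self.rows = {}   # memory address -> {"line": data, "modified": flag}
            self.log = []    # the log the processor unit defined in memory

        def write(self, addr, data, thread_id, backup_thread_mask):
            row = self.rows.setdefault(addr, {"line": None, "modified": False})
            row["line"] = data
            # flag only writes by threads of a VM being backed up; writes by
            # the hypervisor in privilege mode leave the flag untouched
            if backup_thread_mask & (1 << thread_id):
                row["modified"] = True

        def periodic_check(self):
            for addr, row in self.rows.items():
                if row["modified"]:
                    self.log.append(addr)   # only the address goes in the log
                    row["modified"] = False

Logging addresses rather than data keeps the log small; a backup process can read the current line contents for each logged address afterwards.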

03-01-2019 publication date

MEMORY SYSTEM, MEMORY CONTROLLER FOR MEMORY SYSTEM, OPERATION METHOD OF MEMORY CONTROLLER, AND OPERATION METHOD OF USER DEVICE INCLUDING MEMORY DEVICE

Number: US20190004949A1
Assignee:

A system includes: a nonvolatile memory; a memory controller configured to control the nonvolatile memory, the memory controller including a first buffer memory for temporarily storing write data to be written to the nonvolatile memory; and a second buffer memory having a lower operational speed and a higher memory capacity than the first buffer memory. The memory controller is configured to transmit the write data from the first buffer memory to the second buffer memory and to the nonvolatile memory, and to release an operational state of the first buffer memory after transmitting the write data from the first buffer memory to the second buffer memory and to the nonvolatile memory. Writing additional write data to the first buffer memory is prohibited prior to the release of the operational state of the first buffer memory, and is permitted after the release of the operational state of the first buffer memory. 1. A system , comprising:a nonvolatile memory;a memory controller configured to control the nonvolatile memory, the memory controller including a first buffer memory for temporarily storing write data to be written to the nonvolatile memory; anda second buffer memory having a lower operational speed and a higher memory capacity than the first buffer memory,the memory controller being configured to transmit the write data from the first buffer memory to the second buffer memory and to the nonvolatile memory, and to release an operational state of the first buffer memory after transmitting the write data from the first buffer memory to the second buffer memory and to the nonvolatile memory,wherein writing additional write data to the first buffer memory is prohibited prior to the release of the operational state of the first buffer memory, and writing the additional write data to the first buffer memory is permitted after the release of the operational state of the first buffer memory.2. The system of claim 1 , wherein the memory controller is configured to ...

03-01-2019 publication date

MEMORY NODE WITH CACHE FOR EMULATED SHARED MEMORY COMPUTERS

Number: US20190004950A1
Author: Forsell Martti
Assignee: TEKNOLOGIAN TUTKIMUSKESKUS VTT OY

Data memory node for ESM (Emulated Shared Memory) architectures, comprising a data memory module containing data memory for storing input data therein and retrieving stored data therefrom responsive to predetermined control signals, a multi-port cache for the data memory, said cache being provided with at least one read port and at least one write port, said cache being configured to hold recently and/or frequently used data stored in the data memory, and an active memory unit at least functionally connected to a plurality of processors via an interconnection network, said active memory unit being configured to operate the cache upon receiving a multioperation reference incorporating a memory reference to the data memory of the data memory module from a number of processors of said plurality, wherein responsive to the receipt of the multioperation reference the active memory unit is configured to process the multioperation reference according to the type of the multioperation indicated in the reference, utilizing cached data in accordance with the memory reference and data provided in the multioperation reference. A method to be performed by the memory node is also presented.

1. A data memory node for use in ESM (Emulated Shared Memory) architectures, comprising: a data memory module containing data memory for storing input data therein and retrieving stored data therefrom responsive to predetermined control signals; a multi-port cache for the data memory, said cache being provided with at least one read port and at least one write port, said cache being configured to hold recently and/or frequently used data stored in the data memory; and an active memory unit at least functionally connected to a plurality of processors via an interconnection network, said active memory unit being configured to operate the cache upon receiving a multioperation reference incorporating a memory reference to the data memory of the data memory module from a number of processors of said plurality, wherein responsive to the receipt of the multioperation reference the active memory unit is configured to process the multioperation reference according to the type of the multioperation indicated in the reference, utilizing cached data in accordance with the memory reference and data provided in the multioperation reference.

01-01-2015 publication date

CACHING DATA BETWEEN A DATABASE SERVER AND A STORAGE SYSTEM

Number: US20150006813A1
Assignee:

Techniques are provided for using an intermediate cache between the shared cache of an application and the non-volatile storage of a storage system. The application may be any type of application that uses a storage system to persistently store data. The intermediate cache may be local to the machine upon which the application is executing, or may be implemented within the storage system. In one embodiment where the application is a database server, the database system includes both a DB server-side intermediate cache, and a storage-side intermediate cache. The caching policies used to populate the intermediate cache are intelligent, taking into account factors that may include which object an item belongs to, the item type of the item, a characteristic of the item, or the type of operation in which the item is involved.

1. A method comprising: at a storage system, responding to input/output (I/O) requests from one or more database servers by retrieving requested disk blocks from one or more storage devices within the storage system, the requested disk blocks storing data representative of database objects with respect to which the one or more database servers perform database operations; for a given disk block of the requested disk blocks, the storage system determining whether to cache the given disk block in an intermediate cache within the storage system, the determining being based at least partially upon one or more of: whether a given database object, for which the given disk block stores data, is associated with a particular designation; whether the given disk block is of an index block type; whether the given disk block is of a data block type; whether the given disk block is of an undo block type; whether the given disk block is encrypted; whether the given disk block is a secondary copy of a mirrored item; or whether the given disk block is involved in a table scan operation; when a particular disk block is cached in the intermediate ...
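The listed factors suggest a simple admission function. The sketch below is illustrative only: the field names, and which way each factor cuts, are our assumptions rather than the patent's actual policy.

    def should_cache(block, keep_objects):
        """Decide whether a retrieved disk block enters the intermediate cache."""
        if block["object"] in keep_objects:   # object carries a special designation
            return True
        if block["in_table_scan"]:            # large scans would thrash the cache
            return False
        if block["mirror_secondary"]:         # secondary mirror copies add little value
            return False
        if block["type"] == "index":          # index blocks tend to be re-read
            return True
        return block["type"] == "data" and not block["encrypted"]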

27-01-2022 publication date

OUT OF ORDER MEMORY REQUEST TRACKING STRUCTURE AND TECHNIQUE

Number: US20220027160A1
Assignee:

In a streaming cache, multiple, dynamically sized tracking queues are employed. Request tracking information is distributed among the plural tracking queues to selectively enable out-of-order memory request returns. A dynamically controlled policy assigns pending requests to tracking queues, providing for example in-order memory returns in some contexts and/or for some traffic and out of order memory returns in other contexts and/or for other traffic. 1. A memory request tracking circuit for use with a streaming cache memory , the memory request tracking circuit comprising:a tag check configured to detect cache misses;plural tracking queues; anda queue mapper coupled to the tag check and the plural tracking queues, the queue mapper being configured to distribute request tracking information to the plural tracking queues to enable in-order and out-of-order memory request returns.2. The memory request tracking circuit of wherein the queue mapper is programmable to preserve in-order memory request return handling for a first type of memory requests and to enable out-of-order memory request return handling for a second type of memory requests different from the first type of memory requests.3. The memory request tracking circuit of wherein the first and second types of memory requests are selected from the group consisting of loads from local or global memory; texture memory/storage; and acceleration data structure storage.4. The memory request tracking circuit of wherein the plural tracking queues comprise first through N tracking queues claim 1 , and the queue mapper allocates a first tracking queue to a particular warp and distributes certain types of memory requests evenly across second through N tracking queues.5. The memory request tracking circuit of wherein the plural tracking queues each comprise a first-in-first-out storage.6. The memory request tracking circuit of further includes a pipelined checker picker that selects tracking queue outputs for application ...
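The queue-mapper idea can be miniaturized in Python as follows (the in-order/out-of-order policy shown, and all names, are assumptions for illustration; the claims describe FIFO tracking queues and a per-warp or per-type distribution):

    from collections import deque

    class QueueMapper:
        def __init__(self, n_queues):
            self.queues = [deque() for _ in range(n_queues)]
            self.rr = 0   # round-robin cursor for the out-of-order queues

        def track(self, miss, in_order):
            if in_order:
                q = 0   # a single FIFO preserves request-return order
            else:
                # spread across the remaining queues; returns from different
                # queues may then complete out of order with respect to issue
                q = 1 + self.rr % (len(self.queues) - 1)
                self.rr += 1
            self.queues[q].append(miss)
            return q

    mapper = QueueMapper(n_queues=4)
    print(mapper.track("texture fetch", in_order=False))   # queue 1
    print(mapper.track("local load", in_order=True))       # queue 0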

27-01-2022 publication date

CONCURRENT MEMORY MANAGEMENT IN A COMPUTING SYSTEM

Number: US20220027264A1
Assignee:

An example method of memory management in a computing system having a plurality of processors includes: receiving a first memory allocation request at a memory manager from a process executing on a processor of the plurality of processors in the computing system; allocating a local memory pool for the processor from a global memory pool for the plurality of processors in response to the first memory allocation request; and allocating memory from the local memory pool for the processor in response to the first memory allocation request without locking the local memory pool. 1. A method of memory management in a computing system having a plurality of processors , the method comprising:receiving a first memory allocation request at a memory manager from a process executing on a processor of the plurality of processors in the computing system;allocating a local memory pool for the processor from a global memory pool for the plurality of processors in response to the first memory allocation request; andallocating memory from the local memory pool for the processor in response to the first memory allocation request without locking the local memory pool.2. The method of claim 1 , wherein the step of allocating the local memory pool comprises:locking the global memory pool;allocating an amount of memory from the global memory pool to the local memory pool; andreducing the global memory pool by the amount.3. The method of claim 1 , wherein the step of allocating the local memory pool comprises:determining insufficient memory in the global memory pool to satisfy allocation of the local memory pool;adding a request for allocation of the local memory pool to a global wait queue; andallocating an amount of memory from the global memory pool to the local memory pool in response to the request in the global wait queue and in response to sufficient memory becoming available in the global memory pool.4. The method of claim 1 , further comprising:receiving a second memory allocation ...
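The local/global split is easy to express in code. In this hedged Python sketch (names and the refill size are assumptions), only refills from the global pool take a lock; allocations from a processor's own local pool do not:

    import threading

    class GlobalPool:
        def __init__(self, total_bytes):
            self.free = total_bytes
            self.lock = threading.Lock()

        def carve(self, amount):
            with self.lock:                  # the global pool is locked...
                if self.free < amount:
                    # a real implementation would park the request on a
                    # global wait queue until memory is returned
                    raise MemoryError("global pool exhausted")
                self.free -= amount
                return amount

    class LocalPool:
        """Owned by one processor, so allocations need no locking."""
        def __init__(self, global_pool, chunk=1 << 20):
            self.global_pool = global_pool
            self.chunk = chunk
            self.free = 0

        def alloc(self, size):
            if self.free < size:             # refill from the global pool on demand
                self.free += self.global_pool.carve(max(self.chunk, size))
            self.free -= size                # ...but the local pool is not locked
            return size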

More
Publication date: 27-01-2022

Memory pipeline control in a hierarchical memory system

Number: US20220027275A1
Assignee: Texas Instruments Inc

In described examples, a processor system includes a processor core generating memory transactions, a lower level cache memory with a lower memory controller, and a higher level cache memory with a higher memory controller having a memory pipeline. The higher memory controller is connected to the lower memory controller by a bypass path that skips the memory pipeline. The higher memory controller: determines whether a memory transaction is a bypass write, which is a memory write request indicated not to result in a corresponding write being directed to the higher level cache memory; if the memory transaction is determined to be a bypass write, determines whether a memory transaction that prevents passing is in the memory pipeline; and if no transaction that prevents passing is determined to be in the memory pipeline, sends the memory transaction to the lower memory controller using the bypass path.

More
Publication date: 12-01-2017

SYSTEMS AND METHODS FACILITATING REDUCED LATENCY VIA STASHING IN SYSTEM ON CHIPS

Number: US20170010966A1
Author: Mittal Millind
Assignee:

Systems and methods that facilitate reduced latency via stashing in multi-level cache memory architectures of systems on chips (SoCs) are provided. One method involves stashing, by a device comprising a plurality of multi-processor central processing unit cores, first data into a first cache memory of a plurality of cache memories, the plurality of cache memories being associated with a multi-level cache memory architecture. The method also includes generating control information including: a first instruction to cause monitoring contents of a second cache memory of the plurality of cache memories to determine whether a defined condition is satisfied for the second cache memory; and a second instruction to cause prefetching the first data into the second cache memory of the plurality of cache memories based on a determination that the defined condition is satisfied.

1. A method, comprising: stashing, by a device comprising a plurality of multi-processor central processing unit cores, first data into a first cache memory of a plurality of cache memories, the plurality of cache memories being associated with a multi-level cache memory architecture; generating control information comprising a first instruction to cause monitoring contents of a second cache memory of the plurality of cache memories to determine whether a defined condition is satisfied for the second cache memory; and prefetching the first data from the first cache memory to the second cache memory based on execution of the first instruction.
2. The method of claim 1, wherein the prefetching and the generating are performed concurrently.
3. The method of claim 1, wherein the defined condition comprises the second cache memory failing to store the first data associated with a defined address.
4. The method of claim 1, wherein the first cache memory is a shared cache memory for two or more of the plurality of multi-processor CPU cores.
5. The method of claim 1, wherein the second cache memory is a per ...

More
Publication date: 12-01-2017

SYSTEM AND METHOD FOR DATA CACHING IN PROCESSING NODES OF A MASSIVELY PARALLEL PROCESSING (MPP) DATABASE SYSTEM

Number: US20170010968A1
Assignee:

The present technology relates to managing data caching in processing nodes of a massively parallel processing (MPP) database system. A directory is maintained that includes a list and a storage location of the data pages in the MPP database system. Memory usage is monitored in processing nodes by exchanging memory usage information with each other. Each of the processing nodes manages a list and a corresponding amount of available memory in each of the processing nodes based on the memory usage information. Data pages are read from a memory of the processing nodes in response to receiving a request to fetch the data pages, and a remote memory manager is queried for available memory in each of the processing nodes in response to receiving the request. The data pages are distributed to the memory of the processing nodes having sufficient space available for storage during data processing.

1. A method of managing data caching in processing nodes of a massively parallel processing (MPP) database system, comprising: maintaining a directory including a list of data pages, the list of data pages stored in one or more data tables, and a storage location of the data pages in the MPP database system; monitoring memory usage in one or more of the processing nodes of the MPP database system by exchanging memory usage information with each of the one or more processing nodes in the MPP database system, each of the one or more processing nodes managing a list of the one or more processing nodes and a corresponding amount of available memory in each of the one or more processing nodes based on the memory usage information; reading data pages from a memory of the one or more processing nodes in response to receiving a request to fetch the data pages; and querying a remote memory manager for available memory in each of the one or more processing nodes in response to receiving a request and distributing the data pages to the memory of one of the one or more processing nodes having ...

More
Publication date: 12-01-2017

FACILITATING PREFETCHING FOR DATA STREAMS WITH MULTIPLE STRIDES

Number: US20170010970A1
Author: Chou Yuan C.
Assignee: ORACLE INTERNATIONAL CORPORATION

The disclosed embodiments relate to a system that generates prefetches for a stream of data accesses with multiple strides. During operation, while a processor is generating the stream of data accesses, the system examines a sequence of strides associated with the stream of data accesses. Next, upon detecting a pattern having a single constant stride in the examined sequence of strides, the system issues prefetch instructions to prefetch a sequence of data cache lines consistent with the single constant stride. Similarly, upon detecting a recurring pattern having two or more different strides in the examined sequence of strides, the system issues prefetch instructions to prefetch a sequence of data cache lines consistent with the recurring pattern having two or more different strides.

1. A method for generating prefetches for a stream of data accesses with multiple strides, comprising: while a processor is generating the stream of data accesses, examining a sequence of strides associated with data addresses for the stream of data accesses; upon detecting a pattern having a single constant stride in the examined sequence of strides, issuing prefetch instructions to prefetch a sequence of data cache lines consistent with the single constant stride; and upon detecting a recurring pattern having two or more different strides in the examined sequence of strides, issuing prefetch instructions to prefetch a sequence of data cache lines consistent with the recurring pattern having two or more different strides.
2. The method of claim 1, wherein prior to examining the sequence of strides, the method further comprises generating the sequence of strides, wherein each stride indicates a distance between addresses for consecutive memory references associated with the stream of data accesses.
3. The method of claim 2, wherein while generating the sequence of strides, the method keeps track of data cache misses in a prefetch learning table (PLT), wherein each entry ...
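
The detection step can be modeled compactly. This is an illustrative reading rather than the patented logic: it finds the shortest stride pattern that repeats over the last two full periods (period 1 being the single-constant-stride case) and then issues addresses that continue the pattern. max_period, the two-period confirmation window, and the assumption that the history ends on a period boundary are all invented parameters.

def detect_pattern(strides, max_period=4):
    # Return the shortest repeating stride pattern, or None.
    for period in range(1, max_period + 1):
        if len(strides) < 2 * period:
            continue
        pattern = strides[-period:]
        if all(strides[-i - 1] == pattern[-(i % period) - 1]
               for i in range(2 * period)):
            return pattern
    return None

def issue_prefetches(addr, strides, count=4):
    # Assumes the observed history ends on a period boundary.
    pattern = detect_pattern(strides)
    if pattern is None:
        return []
    out = []
    for i in range(count):
        addr += pattern[i % len(pattern)]
        out.append(addr)
    return out

# 64, 64, 192 repeating: three loads within a region, then a jump
print(issue_prefetches(0x1000, [64, 64, 192, 64, 64, 192]))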

More
Publication date: 08-01-2015

REDUCING MEMORY TRAFFIC IN DRAM ECC MODE

Number: US20150012705A1
Assignee: NVIDIA CORPORATION

A method for managing memory traffic includes causing first data to be written to a data cache memory, where a first write request comprises a partial write and writes the first data to a first portion of the data cache memory, and further includes tracking the number of partial writes in the data cache memory. The method further includes issuing a fill request for one or more partial writes in the data cache memory if the number of partial writes in the data cache memory is greater than a predetermined first threshold.

1. A method for managing memory traffic, comprising: causing first data to be written to a data cache memory, wherein a first write request comprises a partial write and writes the first data to a first portion of the data cache memory; tracking the number of partial writes in the data cache memory; and issuing a fill request for one or more partial writes in the data cache memory if the number of partial writes in the data cache memory is greater than a predetermined first threshold.
2. The method of claim 1, further comprising causing second data to be written to the data cache memory, wherein a second write request comprises a partial write and writes the second data to a second portion of the data cache memory to create a full write in the data cache memory.
3. The method of claim 1, further comprising causing second data to be written to the data cache memory, wherein a second write request comprises a partial write and writes the second data to a second portion of the data cache memory without creating a full write in the data cache memory.
4. The method of claim 1, wherein a fill request is not issued for any partial write in the data cache memory if the number of partial writes in the data cache memory is less than the predetermined first threshold.
5. The method of claim 2, further comprising transmitting the data in the full write to a memory.
6. The method of claim 5, further comprising computing an error control checksum for ...
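
A toy model of the thresholding policy, assuming four sectors per cache line and an invented threshold: partial writes accumulate until later partials complete a line (no fill needed, and ECC can be computed over full data) or until the count of outstanding partials crosses the threshold, at which point the oldest partial gets a fill request.

class WriteCoalescer:
    def __init__(self, threshold=8):
        self.partials = {}  # line -> set of written sector offsets
        self.threshold = threshold
        self.fills = []     # fill requests issued so far

    def write(self, line, sectors, sectors_per_line=4):
        written = self.partials.setdefault(line, set())
        written.update(sectors)
        if len(written) == sectors_per_line:
            del self.partials[line]   # merged into a full write
        elif len(self.partials) > self.threshold:
            victim = next(iter(self.partials))  # oldest partial
            del self.partials[victim]
            self.fills.append(victim)  # a fill completes the line

wc = WriteCoalescer(threshold=2)
wc.write("A", {0}); wc.write("B", {1}); wc.write("C", {2})
print(wc.fills)  # ['A']: the oldest partial got a fill request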

More
Publication date: 08-01-2015

CACHE STICKINESS INDEX FOR CONTENT DELIVERY NETWORKING SYSTEMS

Number: US20150012710A1
Assignee: FACEBOOK, INC.

Various embodiments of the present disclosure relate to a cache stickiness index for providing measurable metrics associated with caches of a content delivery networking system. In one embodiment, a method for generating a cache stickiness index, including a cluster stickiness index and a region stickiness index, is disclosed. In embodiments, the cluster stickiness index is generated by comparing cache keys shared among a plurality of front-end clusters. In embodiments, the region stickiness index is generated by comparing cache keys shared among a plurality of data centers. In one embodiment, a system comprising means for generating a stickiness index is disclosed.

1. A method comprising: determining a plurality of working sets, each associated with one or more cache devices of a networking system; generating an index based on the plurality of working sets by identifying a shared cache key across the plurality of working sets; and storing the index in a database of the networking system for assisting content delivery.
2. A method according to claim 1, wherein generating the index based on the plurality of working sets includes generating a cluster stickiness index, wherein generating the cluster stickiness index comprises: determining a shared percentage for each working set of the plurality of working sets, the shared percentage being a percent of each working set that contains a cache key shared with remaining working sets from the plurality of working sets; and computing a cluster index value for each working set based on the shared percentage.
3. A method according to claim 2, wherein each working set corresponds to a front-end cluster of the networking system, wherein the front-end cluster is implemented by the one or more cache devices, such that each working set contains an aggregation of unique cached items of the one or more cache devices.
4. A method according to claim 3, wherein determining the plurality of working sets includes: ...
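
The cluster stickiness computation of claim 2 can be sketched directly: for each working set, the shared percentage is the fraction of its keys that also appear in some other cluster's working set. The dict-of-sets representation of working sets is an assumption made for illustration.

def cluster_stickiness(working_sets):
    # working_sets maps cluster name -> set of cache keys
    index = {}
    for name, keys in working_sets.items():
        others = set().union(*(v for k, v in working_sets.items()
                               if k != name))
        index[name] = len(keys & others) / len(keys) if keys else 0.0
    return index

ws = {"c1": {"a", "b", "c"}, "c2": {"b", "c"}, "c3": {"c", "d"}}
print(cluster_stickiness(ws))  # c1: 2/3 shared, c2: 1.0, c3: 0.5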

More
Publication date: 08-01-2015

System and method for atomically updating shared memory in multiprocessor system

Number: US20150012711A1
Assignee: Individual

A system for operating a shared memory of a multiprocessor system includes a set of processor cores and a corresponding set of core local caches, a set of I/O devices and a corresponding set of I/O device local caches. Read and write operations performed on a core local cache, an I/O device local cache, and the shared memory are governed by a cache coherence protocol (CCP) that ensures that the shared memory is updated atomically.

More
Publication date: 14-01-2016

MEMORY SEQUENCING WITH COHERENT AND NON-COHERENT SUB-SYSTEMS

Number: US20160011977A1
Assignee:

Operations associated with a memory and operations associated with one or more functional units may be received. A dependency between the operations associated with the memory and the operations associated with one or more of the functional units may be determined. A first ordering may be created for the operations associated with the memory. Furthermore, a second ordering may be created for the operations associated with one or more of the functional units based on the determined dependency and the first ordering of the operations associated with the memory.

1. A processor comprising: a memory; one or more functional units coupled to the memory; and a memory stream module coupled to the memory and the one or more functional units and to: receive a plurality of operations associated with the memory; receive a plurality of operations associated with the one or more functional units; determine a dependency between the plurality of operations associated with the memory and the plurality of operations associated with the one or more functional units; create a first ordering of the plurality of operations associated with the memory; and create a second ordering of the plurality of operations associated with the one or more functional units based on the dependency and the first ordering of the plurality of operations associated with the memory.
2. The processor of claim 1, wherein the dependency between the plurality of operations associated with the memory and the plurality of operations associated with the one or more functional units specifies that at least one operation of the plurality of operations associated with the one or more functional units is not to be executed until an operation of the plurality of operations associated with the memory has been executed.
3. The processor of claim 1, wherein the plurality of operations associated with the memory comprises read and write operations associated with the memory and wherein the plurality of operations ...

More
Publication date: 11-01-2018

MEMORY RESOURCE OPTIMIZATION METHOD AND APPARATUS

Number: US20180011638A1
Assignee: Huawei Technologies CO.,Ltd.

Embodiments of the present invention provide a memory resource optimization method and apparatus that relate to the computer field, solve the problem that existing multi-level memory resources affect each other, and optimize an existing single partitioning mechanism. A specific solution is: obtaining performance data of each program in a working set by using a page coloring technology, obtaining a category of each program in light of a memory access frequency, selecting, according to the category of each program, a page coloring-based partitioning policy corresponding to the working set, and writing the page coloring-based partitioning policy to an operating system kernel, to complete corresponding page coloring-based partitioning processing. The present invention is used to eliminate or reduce mutual interference of processes or threads on a memory resource in light of a feature of the working set, thereby improving overall performance of a computer.

1. A computer system, comprising an allocated last level cache (LLC), a dynamic random access memory bank (DRAM Bank), and a processor coupled to the LLC and the DRAM Bank, the processor to execute a first working set of programs via the LLC and the DRAM Bank, wherein the processor is configured to: partition processing resources according to a page coloring-based collaborative partitioning policy for the first working set, the processing resources including both the LLC and the DRAM Bank.
2. The computer system according to claim 1, wherein the page coloring-based collaborative partitioning policy is based on overlapped address bits (O-bits) of index bits of the LLC and index bits of the DRAM Bank in a physical page frame, the O-bits to index page coloring-based partitioning for capacity of the LLC and capacity of the DRAM Bank.
3. The computer system according to claim 2, wherein the processor is further configured to: acquire performance data of each program in the first working set of programs, wherein ...
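
A sketch of how O-bit page coloring might drive allocation, with invented bit positions: the color is read from physical-address bits that index both the LLC set and the DRAM bank, so restricting a program's pages to a set of colors partitions both resources at once. Real bit positions depend on the LLC geometry and the DRAM address mapping of the target machine.

def page_color(phys_addr, o_bit_lo=14, o_bit_mask=0x3):
    # Extract the overlapped address bits (O-bits); positions assumed.
    return (phys_addr >> o_bit_lo) & o_bit_mask

def pick_page(free_pages, allowed_colors, page_shift=12):
    # Return a free physical page whose color falls in the partition
    # assigned to the requesting program's category.
    for pfn in free_pages:
        if page_color(pfn << page_shift) in allowed_colors:
            return pfn
    return None

print(pick_page(range(16), allowed_colors={2, 3}))  # 8: first PFN with color 2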

More
Publication date: 11-01-2018

Apparatus to optimize gpu thread shared local memory access

Number: US20180011711A1
Assignee: Intel Corp

One embodiment provides for a graphics processor comprising first logic coupled with a first execution unit, the first logic to receive a first single instruction multiple data (SIMD) message from the first execution unit; second logic coupled with a second execution unit, the second logic to receive a second SIMD message from the second execution unit; and third logic coupled with a bank of shared local memory (SLM), the third logic to receive a first request to access the bank of SLM from the first logic, a second request to access the bank of SLM from the second logic, and in a single access cycle, schedule a read access to a read port for the first request and a write access to a write port for the second request.

More
Publication date: 11-01-2018

CONTROL STATE PRESERVATION DURING TRANSACTIONAL EXECUTION

Number: US20180011765A1
Assignee:

A method includes saving a control state for a processor in response to commencing a transactional processing sequence, wherein saving the control state produces a saved control state. The method also includes permitting updates to the control state for the processor while executing the transactional processing sequence. Examples of updates to the control state include key mask changes, primary region table origin changes, primary segment table origin changes, CPU tracing mode changes, and interrupt mode changes. The method also includes restoring the control state for the processor to the saved control state in response to encountering a transactional error during the transactional processing sequence. In some embodiments, saving the control state comprises saving the current control state to memory corresponding to internal registers for an unused thread or another level of virtualization. A corresponding computer system and computer program product are also disclosed herein.

1. A method comprising: saving a control state for a processor in response to commencing a transactional processing sequence, wherein saving the control state produces a saved control state; permitting updates to the control state for the processor while executing the transactional processing sequence; and restoring the control state for the processor to the saved control state in response to encountering a transactional error during the transactional processing sequence.
2. The method of claim 1, wherein saving the control state comprises saving the current control state to a backup set of internal control registers or registers corresponding to an unused thread or another level of virtualization.
3. The method of claim 1, wherein saving the control state comprises saving the current control state to a private location in memory.
4. The method of claim 3, wherein the private location is owned by an operating system thread or the central processing unit (CPU).
5. The method of claim 1, wherein ...

More
Publication date: 09-01-2020

Method and apparatus for caching data

Number: US20200012438A1
Assignee: EMC IP Holding Co LLC

Embodiments of the present disclosure relate to methods and apparatuses for caching data. A method comprises writing data into a first cache module on a first processor in response to receiving a first request for caching the data from a client module running on the first processor. The method further comprises transmitting, to the client module, a first indication that the data has been written into the first cache module. The method further comprises, in response to receiving from the client module a second request for synchronizing the data to a second processor, transmitting to the second processor a first command for causing the data to be written into a second cache module on the second processor. In addition, the method further comprises transmitting to the client module a second indication that the data has been synchronized.

More
Publication date: 19-01-2017

Compensating for Aging in Integrated Circuits

Number: US20170017572A9
Assignee:

An age compensation method and apparatus for an integrated circuit (IC). An IC may be configured to operate at an initial operating voltage at the beginning of its operational life. Various circuits may be used to detect aging of the IC, and indications of aging may be stored to determine the aging of the IC. The information indicative of the determined aging of the IC may be compared to an aging threshold. If the information indicates that the aging is greater than or equal to the determined aging threshold, the operating voltage of the IC may be increased. This process may be repeated over the life of the IC, increasing the operating voltage as the IC ages. Raising the operating voltage in response to aging may compensate for various age related degradation mechanisms that can occur over the operational life of the IC.

1-20. (canceled)
21. A method comprising: operating an integrated circuit (IC) at a first operating voltage; monitoring aging of the IC using one or more age detection circuits; determining if the aging of the IC is greater than or equal to a first aging threshold; and operating the IC at a second operating voltage responsive to determining that the aging of the IC is greater than or equal to the first aging threshold, wherein the second operating voltage is greater than the first operating voltage.
22. The method as recited in claim 21, further comprising: monitoring the aging of the IC subsequent to operating the IC at the second operating voltage; determining if the aging of the IC is greater than or equal to a second aging threshold; and operating the IC at a third operating voltage responsive to determining that the aging of the IC is greater than or equal to the second aging threshold, wherein the third operating voltage is greater than the second operating voltage.
23. The method as recited in claim 21, further comprising continuing operation of the IC at the first operating voltage responsive to determining that the aging of the IC is less than the ...

More
Publication date: 21-01-2016

Method and Apparatus For Flexible Cache Partitioning By Sets And Ways Into Component Caches

Number: US20160019157A1
Assignee:

Aspects include computing devices, systems, and methods for partitioning a system cache by sets and ways into component caches. A system cache memory controller may manage the component caches and manage access to the component caches. The system cache memory controller may receive system cache access requests specifying component cache identifiers, and match the component cache identifiers with records correlating traits of the component cache identifiers within a component cache configuration table. The component cache traits may include a set shift trait, set offset trait, and target ways, which may define the locations of the component caches in the system cache. The system cache memory controller may also receive a physical address for the system cache in the system cache access request, determine an indexing mode for the component cache, and translate the physical address for the component cache.
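
One plausible reading of the set shift and set offset traits, sketched below with invented encodings: set shift scales a component cache down to a power-of-two fraction of the system cache's sets, and set offset relocates it, so each component occupies a disjoint window of sets (ways would be masked separately by the target-ways trait, omitted here).

def component_cache_index(phys_addr, set_shift, set_offset, num_sets,
                          line_bytes=64):
    # Assumed semantics: the component owns num_sets >> set_shift sets,
    # starting at set_offset within the system cache.
    global_set = (phys_addr // line_bytes) % num_sets
    component_sets = num_sets >> set_shift
    return (set_offset + (global_set % component_sets)) % num_sets

# a component using 1/4 of a 1024-set cache, placed at set 512
print(component_cache_index(0x12345678, set_shift=2, set_offset=512,
                            num_sets=1024))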

More
Publication date: 21-01-2016

Method And Apparatus For A Shared Cache With Dynamic Partitioning

Number: US20160019158A1
Assignee:

Aspects include computing devices, systems, and methods for dynamically partitioning a system cache by sets and ways into component caches. A system cache memory controller may manage the component caches and manage access to the component caches. The system cache memory controller may receive system cache access requests and reserve locations in the system cache corresponding to the component caches correlated with component cache identifiers of the requests. Reserving locations in the system cache may activate the locations in the system cache for use by a requesting client, and may also prevent other clients from using the reserved locations in the system cache. Releasing the locations in the system cache may deactivate the locations in the system cache and allow other clients to use them. A client reserving locations in the system cache may change the amount of locations it has reserved within its component cache.

1. A method for dynamically partitioning a system cache, comprising: receiving a system cache access request comprising a component cache identifier from a client; retrieving a set shift trait and a set offset trait from a component cache configuration table correlated with the component cache identifier in the component cache configuration table; and activating locations in the system cache correlating to at least a portion of a group of ways of a component cache within a group of sets indicated by the set shift trait and the set offset trait.
2. The method of claim 1, wherein activating the locations in the system cache correlating to at least the portion of the group of ways of the component cache within the group of sets indicated by the set shift trait and the set offset trait comprises reserving the locations in the system cache.
3. The method of claim 2, wherein reserving the locations in the system cache comprises setting reserved indicators in a component cache reserve table for the locations in the system cache.
4. The method of claim 3, further ...

More
Publication date: 15-01-2015

PREFETCHING FOR MULTIPLE PARENT CORES IN A MULTI-CORE CHIP

Number: US20150019819A1
Assignee:

Embodiments relate to a method and computer program product for prefetching data on a chip. The chip has at least one scout core, multiple parent cores that cooperate together to execute various tasks, and a shared cache that is common between the scout core and the multiple parent cores. An aspect of the embodiments includes monitoring the multiple parent cores by the at least one scout core through the shared cache for a shared cache access occurring in a base parent core. The method includes saving a fetch address by the at least one scout core based on the shared cache access occurring. The fetch address indicates a location of a specific line of cache requested by the base parent core.

1. A computer program product for prefetching data on a chip having at least one scout core, multiple parent cores that cooperate together to execute various tasks, and a shared cache that is common between the at least one scout core and the multiple parent cores, the computer program product comprising: a tangible storage medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising: monitoring the multiple parent cores by the at least one scout core through the shared cache for a shared cache access occurring in a base parent core; saving a fetch address by the at least one scout core based on the shared cache access occurring, the fetch address indicating a location of a specific line of cache requested by the base parent core; determining an existence of a specific pattern by the at least one scout core, the specific pattern based on the fetch address, the specific pattern indicating that a mirroring parent core has a cache miss pattern correlating to a shared cache access pattern of the base parent core; and sending a prefetch request by the at least one scout core on behalf of the mirroring parent core based on determining the existence of the specific pattern, the prefetch request for fetching at least one projected future missing line of cache. ...

More
Publication date: 15-01-2015

PREFETCHING FOR A PARENT CORE IN A MULTI-CORE CHIP

Number: US20150019820A1
Assignee:

Embodiments of the invention relate to prefetching data on a chip having at least one scout core, at least one parent core, and a shared cache that is common between the at least one scout core and the at least one parent core. A prefetch code is executed by the scout core for monitoring the parent core. The prefetch code executes independently from the parent core. The scout core determines that at least one specified data pattern has occurred in the parent core based on monitoring the parent core. A prefetch request is sent from the scout core to the shared cache. The prefetch request is sent based on the at least one specified pattern being detected by the scout core. A data set indicated by the prefetch request is sent to the parent core by the shared cache.

1. A computer program product for prefetching data on a chip having at least one scout core, at least one parent core, and a shared cache that is common between the at least one scout core and the at least one parent core, the computer program product comprising: a tangible storage medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising: executing a prefetch code by the at least one scout core for monitoring the at least one parent core, the prefetch code executing independently from the at least one parent core; determining by the at least one scout core that at least one specified data pattern has occurred in the at least one parent core based on monitoring the at least one parent core; sending a prefetch request from the at least one scout core to the shared cache, the sending based on the determining; and sending, by the shared cache, a data set indicated by the prefetch request to the at least one parent core.
2. The computer program product as claimed in claim 1, further comprising informing the at least one parent core that the prefetch request was made on behalf of the at least one parent core.
3. The computer ...

More
Publication date: 15-01-2015

SPECIFIC PREFETCH ALGORITHM FOR A CHIP HAVING A PARENT CORE AND A SCOUT CORE

Number: US20150019821A1
Assignee:

Embodiments relate to a method and computer program product for prefetching data on a chip having at least one scout core and a parent core. The method includes saving a prefetch code start address by the parent core. The prefetch code start address indicates where a prefetch code is stored. The prefetch code is specifically configured for monitoring the parent core based on a specific application being executed by the parent core. The method includes sending a broadcast interrupt signal by the parent core to the at least one scout core. The broadcast interrupt signal is sent based on the prefetch code start address being saved. The method includes monitoring the parent core by the prefetch code executed by the at least one scout core. The scout core executes the prefetch code based on receiving the broadcast interrupt signal.

1. A computer program product for prefetching data on a chip having at least one scout core and a parent core, the computer program product comprising: a tangible storage medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising: saving a prefetch code start address by the parent core, the prefetch code start address indicating where a prefetch code is stored, and the prefetch code specifically configured for monitoring the parent core based on a specific application being executed by the parent core; sending a broadcast interrupt signal by the parent core to the at least one scout core, the broadcast interrupt signal being sent based on the prefetch code start address being saved; and monitoring the parent core by the at least one scout core, the at least one scout core executing the prefetch code to monitor the parent core, the executing of the prefetch code based on receiving the broadcast interrupt signal.
2. The computer program product as claimed in claim 1, wherein the parent core saves the prefetch code start address based on a task swap occurring ...

More
Publication date: 18-01-2018

METHOD FOR INCREASING CACHE SIZE

Number: US20180018265A1
Assignee:

A method for increasing storage space in a system containing a block data storage device, a memory, and a processor is provided. Generally, the processor is configured by the memory to tag metadata of a data block of the block storage device indicating the block as free, used, or semifree. The free tag indicates the data block is available to the system for storing data when needed, the used tag indicates the data block contains application data, and the semifree tag indicates the data block contains cache data and is available to the system for storing application data if no blocks marked with the free tag are available to the system.

1. A method for using a resource by one or more applications, the resource comprising multiple resource components that are individually accessed and controlled by an operating system for being used by the one or more applications, each of the resource components is tagged using a first tag, a second tag, or a third tag, and each of the resource components is capable of being used by the one or more applications for a first purpose and a second purpose, for use with a request from an application by an operating system to use two resource components respectively for the first and second purposes, the method comprising the steps of: determining if a resource component associated with the first tag or with the second tag is available for use; responsive to the determining, notifying the application if no resource component in the resource is associated with the first tag or with the second tag; determining, by the operating system, if a first resource component associated with the first tag is available in the resource; if a first resource component associated with the first tag is available, then: selecting the first resource component associated with the first tag; using the selected first resource component by the application for the first purpose; and tagging the first resource component with the third tag; determining, by the ...
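
The three-tag allocation order implied by the abstract, sketched with invented helper names: application data prefers free blocks and falls back to semifree ones (reclaiming their cache contents), while cache data only ever takes free blocks, which then become semifree.

FREE, USED, SEMIFREE = "free", "used", "semifree"

def claim_block(blocks, purpose):
    # blocks: dict block_id -> tag; purpose: "application" or "cache"
    order = [FREE, SEMIFREE] if purpose == "application" else [FREE]
    for tag in order:
        for blk, t in blocks.items():
            if t == tag:
                blocks[blk] = USED if purpose == "application" else SEMIFREE
                return blk
    return None  # caller is notified that no block is available

blocks = {1: USED, 2: SEMIFREE, 3: FREE}
print(claim_block(blocks, "cache"))        # 3: cache data, block now SEMIFREE
print(claim_block(blocks, "application"))  # 2: no FREE left, falls back to SEMIFREE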

More
Publication date: 28-01-2016

STORAGE SYSTEM AND METHOD FOR MIGRATING THE SAME

Number: US20160026409A1
Assignee: Hitachi, Ltd.

The present invention provides a storage system capable of performing data migration without stopping the IO access from a host computer, and without having to copy data in an external connection storage subsystem to a migration destination storage subsystem. Configuration information of a migration source volume in a migration source storage subsystem is migrated to the migration destination storage subsystem, and during migration, cache memories of the respective subsystems are used to duplicate and store write data from the host computer, while the migration destination storage subsystem reflects the write data to the external connection storage subsystem. Reading of data is also performed by the migration destination storage subsystem.

1. A storage system comprising a migration source storage subsystem, a migration destination storage subsystem and an external storage subsystem, wherein: the storage system has a host computer and a management device connected thereto; the migration source storage subsystem includes a migration source cache memory, which is associated with an external volume composed of a storage device of an external storage subsystem and provided virtually to the host computer as a migration source volume; and the migration destination storage subsystem includes a migration destination cache memory; wherein, based on a data migration instruction output by the management device for migration from the migration source storage subsystem to the migration destination storage subsystem, the storage system: associates and allocates the migration source volume to the migration destination storage subsystem as a migration destination volume; associates the migration destination volume to the external volume and virtually provides the same to the host computer; based on a purge command to the migration source cache memory, deletes stored data after reflecting the stored data in the external volume; and after completing the purge command, restricts the access from the host computer to the ...

More
Publication date: 26-01-2017

PARALLEL COMPUTER, INITIALIZATION METHOD OF PARALLEL COMPUTER, AND NON-TRANSITORY MEDIUM FOR STORING A PROGRAM

Number: US20170024222A1
Author: MATSUMORI Hitoshi
Assignee: FUJITSU LIMITED

A parallel computer includes a first processor, a second processor, and a first storage device. The first processor outputs, in response to an instruction for starting up the parallel computer, a first read-out request causing the first storage device to transmit a command of an initialization process to the first processor. The first processor executes the initialization process of the first processor by using the command received from the first storage device. The second processor monitors, in response to the instruction for starting up the parallel computer, a signal transmitted between the first processor and the first storage device. The second processor detects, from the signal monitored, the command output from the first storage device. And, the second processor is configured to execute the initialization process of the second processor by using the detected command.

1. A parallel computer, comprising: a first processor configured to: output, in response to an instruction for starting up the parallel computer, a first read-out request to a first storage device, the first read-out request causing the first storage device to transmit a command of an initialization process to the first processor; and execute the initialization process of the first processor by using the command received from the first storage device; and a second processor configured to: monitor, in response to the instruction for starting up the parallel computer, a signal transmitted between the first processor and the first storage device; detect, from the signal transmitted between the first processor and the first storage device, the command output from the first storage device; and execute the initialization process of the second processor by using the detected command.
2. The parallel computer according to claim 1, wherein the second processor comprises a cache memory, and the first read-out request output from the first processor includes an address at which the command is stored in the first storage device, ...

More
Publication date: 26-01-2017

SYSTEMS AND METHODS FOR SCHEDULING TASKS IN A HETEROGENEOUS PROCESSOR CLUSTER ARCHITECTURE USING CACHE DEMAND MONITORING

Number: US20170024316A1
Assignee:

Systems, methods, and computer programs are disclosed for scheduling tasks in a heterogeneous processor cluster architecture in a portable computing device. One embodiment is a system comprising a first processor cluster and a second processor cluster. The first processor cluster comprises a first shared cache, and the second processor cluster comprises a second shared cache. The system further comprises a controller in communication with the first and second processor clusters for performing task migration between the first and second processor clusters. The controller initiates execution of a task on a first processor in the first processor cluster. The controller monitors a processor workload for the first processor and a cache demand associated with the first shared cache while the task is running on the first processor in the first processor cluster. The controller migrates the task to the second processor cluster based on the processor workload and the cache demand.

1. A method for scheduling tasks in a heterogeneous processor cluster architecture in a portable computing device, the method comprising: running a task on a first processor in a first processor cluster in a heterogeneous processor cluster architecture comprising the first processor cluster having a first shared cache and a second processor cluster having a second shared cache; while the task is running on the first processor in the first processor cluster, monitoring a processor workload for the first processor and a cache demand associated with the first shared cache; and migrating the task to the second processor cluster based on the processor workload and the cache demand.
2. The method of claim 1, wherein the first processor comprises a dedicated cache miss counter in communication with the first shared cache.
3. The method of claim 1, wherein each processor in the first and second processor clusters comprises a dedicated cache miss counter for receiving cache miss signals from the corresponding ...
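
A toy migration policy combining the two monitored signals, with invented thresholds: a high load or a high shared-cache miss rate on a little-cluster core argues for moving the task to the big cluster, while a task that is light on both signals can move back.

def should_migrate(load, miss_rate, on_big_cluster,
                   load_up=0.7, load_down=0.3, miss_hot=0.05):
    # load: CPU utilization 0..1; miss_rate: shared-cache misses per access.
    # All thresholds are illustrative, not from the patent.
    if not on_big_cluster and (load > load_up or miss_rate > miss_hot):
        return "migrate to big cluster"    # starved for cycles or cache
    if on_big_cluster and load < load_down and miss_rate < miss_hot:
        return "migrate to little cluster"  # a small core suffices
    return "stay"

print(should_migrate(load=0.9, miss_rate=0.01, on_big_cluster=False))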

More
Publication date: 28-01-2016

Using a decrementer interrupt to start long-running hardware operations before the end of a shared processor dispatch cycle

Number: US20160026573A1
Assignee: International Business Machines Corp

Method to perform an operation, the operation comprising processing a first logical partition on a shared processor for the duration of a dispatch cycle, issuing, by a hypervisor, at a predefined time prior to completion of the dispatch cycle, a lightweight hypervisor decrementer (HDEC) interrupt specifying a cache line address buffer location in a virtual processor, and responsive to the lightweight HDEC, writing, by the shared processor, a set of cache line addresses used by the first logical partition to the cache line address buffer location in the virtual processor.

More
Publication date: 28-01-2016

Using a decrementer interrupt to start long-running hardware operations before the end of a shared processor dispatch cycle

Number: US20160026586A1
Assignee: International Business Machines Corp

Systems, methods, and computer program products to perform an operation, the operation comprising processing a first logical partition on a shared processor for the duration of a dispatch cycle, issuing, by a hypervisor, at a predefined time prior to completion of the dispatch cycle, a lightweight hypervisor decrementer (HDEC) interrupt specifying a cache line address buffer location in a virtual processor, and responsive to the lightweight HDEC, writing, by the shared processor, a set of cache line addresses used by the first logical partition to the cache line address buffer location in the virtual processor.

More
Publication date: 25-01-2018

Modified query execution plans in hybrid memory systems for in-memory databases

Number: US20180024928A1
Author: Ahmad Hassan
Assignee: SAP SE

Implementations of the present disclosure include methods, systems, and computer-readable storage mediums for receiving a query from an application, processing a query execution plan (QEP) of the query using a cache simulator to simulate queries to an in-memory database in a hybrid memory system, providing a miss-curve based on the QEP, the miss-curve relating miss-ratios to memory sizes, and determining relative sizes of a first type of memory and a second type of memory in the hybrid memory system at least partially based on the miss-curve.
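
The sizing decision driven by a miss-curve can be sketched as a simple cost minimization; the cost weights below are invented, not from the paper: each candidate fast-tier (e.g., DRAM) size is charged for its capacity plus a penalty proportional to the miss ratio, since misses spill to the slower tier (e.g., NVM).

def split_memory(miss_curve, total, fast_cost=4.0, slow_cost=1.0):
    # miss_curve maps fast-tier size -> miss ratio (from the simulator);
    # returns the (fast_size, slow_size) split with the lowest cost.
    best = None
    for fast, miss in miss_curve.items():
        cost = fast * fast_cost + (total - fast) * slow_cost + miss * 1e3
        if best is None or cost < best[0]:
            best = (cost, fast, total - fast)
    return best[1:]

curve = {1: 0.60, 2: 0.25, 4: 0.08, 8: 0.02}  # GiB -> miss ratio
print(split_memory(curve, total=16))  # (8, 8) under these weights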

More
Publication date: 23-01-2020

NETWORK INTERFACE DEVICE AND HOST PROCESSING DEVICE

Number: US20200026557A1
Assignee: Solarflare Communications, Inc.

A network interface device has an input configured to receive data from a network. The data is for one of a plurality of different applications. The network interface device also has at least one processor configured to determine into which of a plurality of available different caches in a host system the data is to be injected by accessing a receive queue comprising at least one descriptor indicating a cache location in one of said plurality of caches to which data is to be injected, wherein said at least one descriptor, which indicates the cache location, has an effect on subsequent descriptors of said receive queue until a next descriptor indicates another cache location. The at least one processor is also configured to cause the data to be injected to the cache location in the host system.

1. A network interface device comprising: an input configured to receive data from a network, said data being for one of a plurality of different applications; and at least one processor configured to: determine into which of a plurality of available different caches in a host system said data is to be injected by accessing a receive queue comprising at least one descriptor indicating a cache location in one of said plurality of caches to which data is to be injected, wherein said at least one descriptor, which indicates the cache location, has an effect on subsequent descriptors of said receive queue until a next descriptor indicates another cache location; and cause said data to be injected to the cache location in said host system.
2. The network interface device as claimed in claim 1, wherein at least two of said plurality of caches are associated with different CPU cores and/or different physical dies.
3. The network interface device as claimed in claim 1, wherein said at least one processor is configured to determine which of said plurality of caches in a host system is to be injected in dependence on cache information provided by an application thread of said application.
4. The ...

More
Publication date: 23-01-2020

REGULATING HARDWARE SPECULATIVE PROCESSING AROUND A TRANSACTION

Number: US20200026558A1
Assignee:

A transaction is detected. The transaction has a begin-transaction indication and an end-transaction indication. If it is determined that the begin-transaction indication is not a no-speculation indication, then the transaction is processed.

1. A method comprising: determining, by one or more computer processors, that instructions preceding a transaction have not completed; prohibiting, by one or more computer processors, the transaction from being processed until a determination is made that indicates that all pending outside instructions are not, or are no longer, being processed in a speculative manner; and determining, by one or more computer processors, that an end-transaction indication associated with the transaction indicates an end to a period of no-speculation transaction processing.
2. The method of claim 1, wherein the transaction comprises two or more instructions to be processed atomically on a data structure in a memory.
3. The method of claim 1, the method comprising: determining, by one or more computer processors, that a begin-transaction indication associated with a transaction is a no-speculation indication, wherein the begin-transaction indication is selected from the group consisting of: a new instruction, a new prefix instruction, or a variant of an instruction in a current instruction set architecture.
4. The method of claim 1, the method comprising: determining, by one or more computer processors, that the instructions preceding the transaction have completed; and responsive to determining that the instructions preceding the transaction have completed, processing, by one or more computer processors, the transaction.
5. The method of claim 4, the method comprising: responsive to processing the transaction, determining, by one or more computer processors, whether an end-transaction indication associated with the transaction is a no-speculation indication; and responsive to determining that the end-transaction indication is the no-speculation ...

More
Publication date: 23-01-2020

PREFETCH PROTOCOL FOR TRANSACTIONAL MEMORY

Number: US20200026651A1
Assignee:

Providing control over processing of a prefetch request in response to conditions in a receiver of the prefetch request and to conditions in a source of the prefetch request. A processor generates a prefetch request and a tag that dictates processing the prefetch request. A processor sends the prefetch request and the tag to a second processor. A processor generates a conflict indication based on whether a concurrent processing of the prefetch request and an atomic transaction by the second processor would generate a conflict with a memory access that is associated with the atomic transaction. Based on an analysis of the conflict indication and the tag, a processor processes (i) either the prefetch request or the atomic transaction, or (ii) both the prefetch request and the atomic transaction.

1. A method to control processing of a prefetch request, the method comprising: generating, by a first processor in a multiprocessor system, a prefetch request and a tag that dictates processing the prefetch request; sending, by one or more processors in the multiprocessor system, the prefetch request and the tag to a second processor in the multiprocessor system; generating, by one or more processors in the multiprocessor system, a conflict indication based on whether a concurrent processing of the prefetch request and an atomic transaction by the second processor would generate a conflict with a memory access that is associated with the atomic transaction; and based on an analysis of the conflict indication and the tag, processing, by the second processor in the multiprocessor system, (i) either the prefetch request or the atomic transaction, or (ii) both the prefetch request and the atomic transaction.
2. The method of claim 1, further comprising: generating, by the first processor in the multiprocessor system, the tag of the prefetch request according to a prefetch protocol, wherein the prefetch request includes (a) a description of at least one prefetch request operation and (b) ...

More
Publication date: 28-01-2021

NETWORK INTERFACE DEVICE AND HOST PROCESSING DEVICE

Number: US20210026689A1
Assignee: XILINX, INC.

A network interface device has an input configured to receive data from a network. The data is for one of a plurality of different applications. The network interface device also has at least one processor configured to determine into which of a plurality of available different caches in a host system the data is to be injected by accessing a receive queue comprising at least one descriptor indicating a cache location in one of said plurality of caches to which data is to be injected, wherein said at least one descriptor, which indicates the cache location, has an effect on subsequent descriptors of said receive queue until a next descriptor indicates another cache location. The at least one processor is also configured to cause the data to be injected to the cache location in the host system.

1. A network interface device configured to interface a host system with a network, the network interface device comprising: an input configured to receive data from the network, said data being for one of a plurality of different applications; and circuitry configured to: determine at least one of a plurality of caches into which at least part of said data is to be injected; and cause the at least part of said data to be injected to the determined at least one of the caches, wherein each of the plurality of caches is associated with one or more of a plurality of processors, and wherein the network interface device comprises at least one of the plurality of processors and one or more of the determined at least one of the caches.
2. The network interface device as claimed in claim 1, wherein the network interface device comprises the plurality of processors and the plurality of caches.
3. The network interface device as claimed in claim 1, wherein the host system comprises at least one of the plurality of processors and a further one or more of the determined at least one of the caches.
4. The network interface device as claimed in claim 1, wherein the determined at least one of ...

More
Publication date: 28-01-2021

TRANSLATION SUPPORT FOR A VIRTUAL CACHE

Number: US20210026783A1
Assignee:

Disclosed herein is a virtual cache and method in a processor for supporting multiple threads on the same cache line. The processor is configured to support virtual memory and multiple threads. The virtual cache directory includes a plurality of directory entries, each entry associated with a cache line. Each cache line has a corresponding tag. The tag includes a logical address, an address space identifier, a real address bit indicator, and a per-thread validity bit for each thread that accesses the cache line. When a subsequent thread determines that the cache line is valid for that thread, the validity bit for that thread is set, while no validity bits for other threads are invalidated.

1. A method of operating a primary processor cache for a processor with virtual memory support and multiple threads, wherein a logically indexed and logically tagged cache directory is used, and wherein an entry in the directory contains an absolute memory address in addition to a corresponding logical memory address, each entry including a valid bit for each thread that accesses each entry, comprising: creating a new entry for the cache line in the primary cache in response to the cache line being in a secondary cache, and not in the primary cache; determining by a second thread that an entry for the cache line is present in the primary cache; in response to determining that the entry for the cache line is present in the primary cache, determining that the cache line is not valid for the second thread; executing a lookup to determine an address for the cache line in the primary cache; determining that the address for the cache line and the entry are the same cache line; in response to determining that the address and the entry are the same, setting the valid bit associated with the second thread to valid, and not invalidating the valid bit associated with other threads in the cache entry that have a valid bit in the cache entry, wherein the valid bit is independent of other ...
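
A behavioral sketch of the per-thread validity bits, assuming a translate() callback stands in for the second-level address lookup: a thread that re-verifies the line sets its own bit without clearing anyone else's, so later accesses by either thread hit without translation.

class VirtualCacheEntry:
    def __init__(self, num_threads, logical_addr, real_addr):
        self.logical_addr = logical_addr
        self.real_addr = real_addr
        self.valid = [False] * num_threads  # one bit per thread

    def lookup(self, thread, logical_addr, translate):
        if logical_addr != self.logical_addr:
            return False
        if self.valid[thread]:
            return True                      # fast hit, no translation
        if translate(logical_addr) == self.real_addr:
            self.valid[thread] = True        # other threads' bits untouched
            return True
        return False

e = VirtualCacheEntry(4, logical_addr=0x1000, real_addr=0x9000)
e.lookup(0, 0x1000, lambda la: 0x9000)
e.lookup(1, 0x1000, lambda la: 0x9000)
print(e.valid)  # [True, True, False, False]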

More
Publication date: 29-01-2015

ELECTRONIC DEVICES HAVING SEMICONDUCTOR MEMORY UNITS AND METHOD OF FABRICATING THE SAME

Number: US20150032960A1
Author: DONG Cha-Deok
Assignee: SK HYNIX INC.

Electronic devices have a semiconductor memory unit including a magnetization compensation layer in a contact plug. One implementation of the semiconductor memory unit includes a variable resistance element having a stacked structure of a first magnetic layer, a tunnel barrier layer, and a second magnetic layer, and a contact plug arranged in at least one side of the variable resistance element and comprising a magnetization compensation layer. Another implementation includes a variable resistance element having a stacked structure of a first magnetic layer having a variable magnetization, a tunnel barrier layer, and a second magnetic layer having a pinned magnetization; and a contact plug arranged at one side of and separated from the variable resistance element to include a magnetization compensation layer that produces a magnetic field to reduce an influence of a magnetic field of the second magnetic layer on the first magnetic layer.

1. An electronic device comprising a semiconductor memory unit that includes: a variable resistance element having a stacked structure of a first magnetic layer having a variable magnetization, a tunnel barrier layer, and a second magnetic layer having a pinned magnetization; and a contact plug arranged at one side of the variable resistance element and separated from the variable resistance element, the contact plug comprising a magnetization compensation layer that produces a magnetic field at the variable resistance element to reduce an influence of a magnetic field of the second magnetic layer on the first magnetic layer.
2. The electronic device of claim 1, wherein the magnetization compensation layer has a thickness greater than a critical dimension (CD) thereof.
3. The electronic device of claim 1, wherein the magnetization compensation layer comprises a conductive material having a horizontal magnetic property in that a magnetization of the magnetization compensation layer is in a plane of the magnetization compensation layer. ...

More
Publication date: 02-02-2017

NUMA SCHEDULING USING INTER-VCPU MEMORY ACCESS ESTIMATION

Number: US20170031819A1
Assignee:

In a system having non-uniform memory access architecture, with a plurality of nodes, memory access by entities such as virtual CPUs is estimated by invalidating a selected sub-set of memory units, and then detecting and compiling access statistics, for example by counting the page faults that arise when any virtual CPU accesses an invalidated memory unit. The entities, or pairs of entities, may then be migrated or otherwise co-located on the node for which they have greatest memory locality.

1. A method for managing memory in a system, said system including a plurality of software entities that each access the memory and having a non-uniform memory access architecture (NUMA) with a plurality of nodes, the method comprising: assigning a first software entity to a first node; computing a first metric measuring pair-wise memory sharing between pairs of the software entities; selecting a second software entity having the highest memory sharing with the first software entity based on the computed first metric; and assigning the second software entity to the first node.
2. The method of claim 1, wherein the software entities are virtual CPUs of virtual machines.
3. The method of claim 2, wherein the process of computing the first metric is performed for each pair of the virtual CPUs within a single virtual machine.
4. The method of claim 2, wherein the system includes a virtual machine.
5. The method of claim 1, further comprising: selecting a sample set of memory units; and invalidating the sample set of memory units and detecting accesses by any of the software entities to the invalidated memory units; wherein the process of computing the first metric is based at least in part on the detected accesses by any of the software entities to the invalidated memory units.
6. The method of claim 5, wherein the software entities are virtual CPUs of virtual machines, the method further comprising: invalidating the sample set of memory units by invalidating the sample set in a ...
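
The sampling-based estimate can be sketched in a few lines, assuming faults maps each vCPU to the set of sampled (invalidated) pages it touched during the window: pages in the sample set fault on first touch, so two vCPUs faulting on the same page are sharing it, and the pair with the highest overlap is the co-location candidate.

from itertools import combinations

def pairwise_sharing(faults):
    # faults: dict vcpu -> set of sampled pages it faulted on
    score = {}
    for a, b in combinations(faults, 2):
        score[(a, b)] = len(faults[a] & faults[b])
    return score

faults = {"vcpu0": {1, 2, 3}, "vcpu1": {2, 3, 4}, "vcpu2": {9}}
scores = pairwise_sharing(faults)
print(max(scores, key=scores.get))  # ('vcpu0', 'vcpu1'): co-locate them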

04-02-2016 publication date

Slice-Based Random Access Buffer for Data Interleaving

Number: US20160034393A1
Assignee: LSI Corporation

The disclosure is directed to a system and method for interleaving data utilizing a random access buffer that includes a plurality of independently accessible memory slots. The random access buffer is configured to store slices of incoming data sectors in free memory slots, where a free memory slot is identified by a status flag associated with a logical address of the free memory slot. Meanwhile, a label buffer is configured to store labels associated with the slices of the incoming data sectors in a sequence based upon an interleaving scheme. Media sectors including the interleaved data slices are read out from the memory slots of the random access buffer in order of the sequence of labels stored by the label buffer. As the media sectors are read out of the random access buffer, the corresponding memory slots are freed up for incoming slices of the next super-sector.

1. A system for interleaving data, comprising: a slice divider configured to receive incoming data sectors of a super-sector, the slice divider being further configured to divide the incoming data sectors into slices; a random access buffer including memory slots for storing data sector slices, the random access buffer being configured to store the slices of the incoming data sectors in free memory slots, wherein a free memory slot is identified by a status flag associated with a logical address of the free memory slot; a label buffer configured to store labels associated with the slices of the incoming data sectors in a sequence based upon an interleaving scheme; and a processor in communication with the random access buffer and the label buffer, the processor being configured to read out media sectors corresponding to the super-sector, wherein a media sector includes interleaved data slices read out from the memory slots of the random access buffer in order of the sequence of labels stored by the label buffer.
2. The system of claim 1, wherein the processor is configured to read out the media sectors ...
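A small C sketch of the slot-and-label mechanism described above, using a toy super-sector of two sectors with two slices each; the struct slot, store_slice, and the particular interleaving order pos[] are illustrative assumptions, not the patent's actual scheme.

#include <stdio.h>
#include <string.h>

#define NSLOTS 8
#define SLICE_LEN 4

/* One independently accessible memory slot of the random access buffer. */
struct slot {
    char data[SLICE_LEN + 1];
    int  in_use;   /* status flag: 0 = free, 1 = occupied */
};

static struct slot buf[NSLOTS];   /* random access buffer */
static int labels[NSLOTS];        /* label buffer: slot addresses in read-out order */

/* Store one incoming slice in the first free slot and return its
 * logical address; -1 if the buffer is full. */
static int store_slice(const char *slice)
{
    for (int i = 0; i < NSLOTS; i++) {
        if (!buf[i].in_use) {
            strncpy(buf[i].data, slice, SLICE_LEN);
            buf[i].data[SLICE_LEN] = '\0';
            buf[i].in_use = 1;
            return i;
        }
    }
    return -1;
}

int main(void)
{
    /* Two incoming data sectors A and B, two slices each. */
    const char *slices[] = {"A0", "A1", "B0", "B1"};
    /* Interleaving scheme: slice i is read out at sequence position pos[i]. */
    int pos[] = {0, 2, 1, 3};

    for (int i = 0; i < 4; i++)
        labels[pos[i]] = store_slice(slices[i]);

    /* Read out the media sector in label-sequence order, freeing each
     * slot for slices of the next super-sector. */
    for (int i = 0; i < 4; i++) {
        int addr = labels[i];
        printf("%s ", buf[addr].data);   /* prints: A0 B0 A1 B1 */
        buf[addr].in_use = 0;
    }
    printf("\n");
    return 0;
}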

04-02-2016 publication date

Method and Apparatus for Ensuring Data Cache Coherency

Number: US20160034395A1
Assignee: Imagination Technologies Ltd

A multithreaded processor can concurrently execute a plurality of threads in a processor core. The threads can access a shared main memory through a memory interface, generating read and write transactions against that shared memory. An incoherency detection module prevents incoherency by maintaining a record of outstanding global writes and detecting a global read that conflicts with one of them. A barrier is sequenced behind the conflicting global write, and the conflicting global read is allowed to proceed only after that write and its barrier have cleared. The sequence can be maintained by a separate queue for each thread of the plurality.
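A toy C model of this mechanism, simulating in software the per-thread queue of outstanding global writes; write_queue, post_write, and drain_through are invented names for illustration, not the patent's hardware interfaces.

#include <stdio.h>

#define QLEN 8

/* Per-thread record of outstanding (not yet completed) global writes. */
struct write_queue {
    unsigned addr[QLEN];
    int head, tail;   /* FIFO order preserves write/barrier sequence */
};

static int outstanding(const struct write_queue *q, unsigned addr)
{
    for (int i = q->head; i != q->tail; i = (i + 1) % QLEN)
        if (q->addr[i] == addr)
            return 1;
    return 0;
}

static void post_write(struct write_queue *q, unsigned addr)
{
    q->addr[q->tail] = addr;
    q->tail = (q->tail + 1) % QLEN;
}

/* Drain the queue up to and including `addr`: models waiting for the
 * conflicting write, and the barrier sequenced behind it, to clear. */
static void drain_through(struct write_queue *q, unsigned addr)
{
    while (q->head != q->tail) {
        unsigned done = q->addr[q->head];
        q->head = (q->head + 1) % QLEN;
        printf("  write to %#x completed\n", done);
        if (done == addr)
            break;   /* barrier point reached */
    }
}

int main(void)
{
    struct write_queue t0 = {{0}, 0, 0};   /* thread 0's queue */

    post_write(&t0, 0x100);                /* thread 0: global writes */
    post_write(&t0, 0x200);

    unsigned read_addr = 0x100;            /* thread 1: global read */
    if (outstanding(&t0, read_addr)) {
        printf("conflicting read of %#x: insert barrier, stall read\n",
               read_addr);
        drain_through(&t0, read_addr);     /* wait for write + barrier */
    }
    printf("read of %#x proceeds\n", read_addr);
    return 0;
}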

04-02-2016 publication date

Method and Apparatus for Processing Data and Computer System

Number: US20160034397A1
Assignee:

A method and an apparatus for processing data and a computer system are provided. The method includes copying a shared virtual memory page to which a first process requests access into off-chip memory of a computing node, and using the shared virtual memory page copied into the off-chip memory as a working page of the first process; and, before the first process performs a write operation on the working page, creating, in on-chip memory of the computing node, a backup page of the working page, so as to back up original data of the working page. Because the page data is backed up in on-chip memory before any write to the working page, data consistency is ensured when multiple processes operate on a shared virtual memory page, while off-chip memory is accessed as little as possible and program speed is improved.

1. A method for processing data, comprising: copying a shared virtual memory page to which a first process requests access into off-chip memory of a computing node, and using the shared virtual memory page copied into the off-chip memory as a working page of the first process, wherein the shared virtual memory page is a virtual memory page in shared virtual memory of an application program to which the first process belongs, and wherein the application program runs on the computing node; and creating a backup page of the working page, before the first process performs a write operation on the working page, and storing the created backup page into on-chip memory of the computing node, so as to back up original data of the working page.
2. The method according to claim 1, wherein a quantity of working pages of the first process is M, wherein M is a positive integer greater than or equal to 1; and wherein, before the storing of the created backup page into the on-chip memory of the computing node, the method further comprises determining whether remaining space of the on-chip memory is less than a first ...
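A minimal C sketch of the backup-before-write step, assuming the working page already resides in an off-chip buffer; working_page, backup_page, and write_page are illustrative names, and the real mechanism would operate on hardware pages rather than plain arrays.

#include <stdio.h>
#include <string.h>

#define PAGE_SIZE 16

/* Off-chip working copy of a shared virtual memory page. */
static char working_page[PAGE_SIZE] = "original data";

/* On-chip backup created before the first write, so the original
 * contents remain available for consistency handling. */
static char backup_page[PAGE_SIZE];
static int  backed_up = 0;

static void write_page(int off, char value)
{
    if (!backed_up) {   /* back up original data before the first write */
        memcpy(backup_page, working_page, PAGE_SIZE);
        backed_up = 1;
    }
    working_page[off] = value;
}

int main(void)
{
    write_page(0, 'X');
    printf("working: %s\n", working_page);   /* Xriginal data */
    printf("backup : %s\n", backup_page);    /* original data */
    return 0;
}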

01-02-2018 publication date

TECHNIQUES TO ALLOCATE REGIONS OF A MULTI-LEVEL, MULTI-TECHNOLOGY SYSTEM MEMORY TO APPROPRIATE MEMORY ACCESS INITIATORS

Number: US20180032429A1
Assignee:

A method is described. The method includes recognizing different latencies and/or bandwidths between different levels of a system memory and different memory access requestors of a computing system. The system memory includes the different levels and different technologies. The method also includes allocating each of the memory access requestors with a respective region of the system memory having an appropriate latency and/or bandwidth.

1. A method, comprising: recognizing different latencies and/or bandwidths between different levels of a system memory and different memory access requestors of a computing system, the system memory comprising the different levels and different technologies; and allocating each of the memory access requestors with a respective region of the system memory having an appropriate latency and/or bandwidth.
2. The method of claim 1, wherein the different technologies comprise DRAM and an emerging non-volatile memory technology.
3. The method of claim 2, wherein the emerging non-volatile memory technology comprises chalcogenide.
4. The method of claim 1, wherein the different latencies and/or bandwidths further comprise different latencies and/or bandwidths between a read operation and a write operation.
5. The method of claim 1, wherein the different levels of the system memory comprise a level that is integrated in a same semiconductor chip package as a processor having CPU cores.
6. The method of claim 1, wherein the recognizing further comprises analyzing attributes of the different levels of the system memory from a record kept in BIOS of the computing system.
7. The method of claim 6, wherein the attributes are compatible with any of the following standards: ACPI; NVDIMM.
8. A machine readable storage medium having contained thereon program code that when processed by a computing system causes the computing system to perform a method, comprising: recognizing different latencies and/or bandwidths between different levels of a system memory and different memory access requestors of a ...
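A short C sketch of the allocation idea, assuming latency attributes like those platform firmware might record for each memory level; the region and requestor tables, the numbers, and the "slowest region that still meets the need" rule are all illustrative assumptions, not the patent's policy.

#include <stdio.h>

struct region {                 /* one level of the multi-level memory */
    const char *name;
    int latency_ns;             /* lower is faster */
};

struct requestor {              /* a memory access initiator */
    const char *name;
    int max_latency_ns;         /* what the initiator can tolerate */
};

int main(void)
{
    struct region regions[] = {
        {"on-package DRAM",  40},
        {"DDR DRAM",         80},
        {"non-volatile far", 300},
    };
    struct requestor reqs[] = {
        {"CPU cores",        50},
        {"GPU",             100},
        {"network offload", 400},
    };

    /* Give each requestor the slowest region that still meets its
     * latency need, keeping faster levels free for demanding initiators. */
    for (int i = 0; i < 3; i++) {
        for (int j = 2; j >= 0; j--) {
            if (regions[j].latency_ns <= reqs[i].max_latency_ns) {
                printf("%s -> %s\n", reqs[i].name, regions[j].name);
                break;
            }
        }
    }
    return 0;
}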

31-01-2019 publication date

Transfer track format information for tracks in cache at a first processor node to a second processor node to which the first processor node is failing over

Number: US20190034303A1
Assignee: International Business Machines Corp

Provided are a computer program product, system, and method for managing failover from a first processor node including a first cache to a second processor node including a second cache. Storage areas assigned to the first processor node are reassigned to the second processor node. For each track indicated in a cache list of tracks in the first cache for the reassigned storage areas, the first processor node adds a track identifier of the track and track format information indicating a layout and format of data in the track to a cache transfer list. The first processor node transfers the cache transfer list to the second processor node. The second processor node uses the track format information transferred with the cache transfer list to process read and write requests to tracks in the reassigned storage areas staged into the second cache.
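A minimal C sketch of the cache transfer list itself, assuming the track format can be summarized in a single code; transfer_entry, add_track, and format_code are invented names for illustration, and a real implementation would carry richer layout metadata.

#include <stdio.h>

/* One entry of the cache transfer list: enough for the takeover node to
 * process reads/writes without rediscovering the track layout. */
struct transfer_entry {
    unsigned track_id;
    unsigned format_code;   /* encodes layout/format of data in the track */
};

#define MAX_ENTRIES 16

struct transfer_list {
    struct transfer_entry e[MAX_ENTRIES];
    int n;
};

/* First node: for each track in its cache belonging to a reassigned
 * storage area, record the track identifier and format information. */
static void add_track(struct transfer_list *l, unsigned id, unsigned fmt)
{
    if (l->n < MAX_ENTRIES) {
        l->e[l->n].track_id = id;
        l->e[l->n].format_code = fmt;
        l->n++;
    }
}

int main(void)
{
    struct transfer_list list = { .n = 0 };

    add_track(&list, 1001, 0x2);   /* failing-over node builds the list */
    add_track(&list, 1002, 0x5);

    /* Second node: use the transferred format info when staging these
     * tracks into its own cache for subsequent read/write requests. */
    for (int i = 0; i < list.n; i++)
        printf("stage track %u with format %#x\n",
               list.e[i].track_id, list.e[i].format_code);
    return 0;
}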
