15-06-2017 дата публикации
Номер: US20170169326A1
Принадлежит:
Baidu USA LLC
Systems and methods for a multi-core optimized Recurrent Neural Network (RNN) architecture are disclosed. The various architectures affect communication and synchronization operations according to the Multi-Bulk-Synchronous-Parallel (MBSP) model for a given processor. The resulting family of network architectures, referred to as MBSP-RNNs, perform similarly to a conventional RNNs having the same number of parameters, but are substantially more efficient when mapped onto a modern general purpose processor. Due to the large gain in computational efficiency, for a fixed computational budget, MBSP-RNNs outperform RNNs at applications such as end-to-end speech recognition. 1. A method to improve a computing performance of a computing device by mapping a Recurrent Neural Network (RNN) architecture to a processor's microarchitecture of the computing device , the method comprising:obtaining values associated with levels of memory based on a description of the processor's microarchitecture; and grouping neurons into modules, each module representing a logical unit in an RNN layer within the RNN architecture; and', 'arranging connections between the modules such that the modules satisfy predefined conditions of the RNN architecture, the predefined conditions of the RNN architecture being related to the at least two of memory capacity, number of processor cores, bandwidth, computational bandwidth, and latency., "for a lowest to a highest level of a hierarchy of the RNN architecture, each level associated with the processor's microarchitecture and being described by at least two of memory capacity, number of processor cores, bandwidth, computational bandwidth, and latency:"}2. The method according to claim 1 , wherein arranging connections comprises pruning bi-directional connections between the modules to balance the predefined conditions.3. The method according to claim 1 , wherein for each level of processor memory the predefined conditions comprise that parameters ...
Подробнее