METHOD AND DEVICE FOR PROCESSING AUDIO SIGNAL

08-12-2017 дата публикации
Номер:
KR1020170135611A
Принадлежит:
Контакты:
Номер заявки: 00-16-102067810
Дата заявки: 31-05-2016

[1]

The present invention refers to a signal processor device relates to effectively regenerate the audio signal search method and, more particularly HMD (Head Mounted Display) device including a portable device for binaural rendering audio signal processing method and device for implementing the immersive (immersive) are disclosed.

[2]

In immersive audio listening to a HMD binaural rendering techniques (binaural rendering) the inside of the pipe. The, amount of operation and number of power consumption in mobile device about outputs, rendering the operation amount and an increase in power consumption as well as the prediction of the target object or channels, small personal [...] HRTF-non-personalized number due to door number, insufficient number of HRTF set according to artifacts (lack of spatial resolution), head tracking lag according to number of performance degradation in vertical door has disclosed.

[3]

Cinema ticks to real sound environment less number of content such as VR left directions of sounds, such as HMD same virtual reality device for reproducing sound recording is through a series of process in 3 dimensional (3D sound recording) techniques and binaural rendering (binaural rendering) techniques disclosed. The, 3 dimensional space (sound object) in the direction of sound signal supplied entities located sound object in order to a plurality of microphones for acquiring individual sound object signal is required for the sound direction of object obtained by tracking which reproduces techniques disclosed. Therefore may be physically arranged in an omni-directional microphone difficult due to reverberation noise (ambient noise) and entities for providing accurate tracking position with door number difficult has disclosed. In addition recording signal post-processing operation for binaural rendering order sound object in proper position by a certain input sound mixing engineer 3 dimensional space considering the pin is molded with synchronization of the cursor the tax-related information.

[4]

The present invention refers to said door in order to solve in a certain number point, 3 dimensional coordinate based on spatial audio signal acquiring remote (far a-field) (spatial audio signal) local area (near a-field) acquiring sound object signal (sound object signal) tracks the location of the objects coordinates using the sound object signal sound reverberation noise signal optionally conversion and selecting signal processing 3 dimensional sound source space be relocated solve door number even valuable minerals.

[5]

According to an embodiment of the present invention, said number and number XXXX method and device is such as to solve for co 1308.

[6]

<< Key Ideas >>

[7]

1. 3 Dimensional coordinate based spatial audio signal (spatial audio signal) and sound object signal (sound object signal) same time acquires synchronization (time synchronization) and synchronization in 3 dimensional space recorded a method (spatial synchronization)

[8]

2. 3 Dimensional spatial audio signal included in the signal and reverberation noise spatial audio signal recorded sound object number is a stand-alone (ambient noise) number (de-a noising or de-a reverberation) method onto a sound object signal wetting ability

[9]

3. Mono (mono) (multichannel) 3 dimensional space of multi-channel audio signal to sound object signal to them via the plurality of signal comparison 1 1 1 or 3 dimensional coordinate based object velocity of sound (3D spatial positioning) indicate less data amount information metadata that can be effectively converting method

[10]

4. Said spatial audio signal and reverberation noise sound for reparing over said object signal number recorded together with data for evaluating power consumption method for transmitting encoded S800

[11]

5. The sequence of bits transmitted by said spatial audio signal and reverberation noise recorded number for reparing over sound object signal, and obtaining context information about the velocity of sound object method

[12]

6. The decoded signal and uses the metadata has particular application purpose or reproducing condition recorded in discharging of backup battery included in 3 dimensional space audio signal reproducing sound object signal sound level or position or a number of special spatial characteristics of the sound (sound object cancellation) optionally plower number method

[13]

According to an embodiment of the present invention, 3 dimensional space based on the location of the objects sounds in 3 dimensional space coordinates to track the location of the objects representing sound can be.

[14]

In addition 3 dimensional space emphasize audio signal object can be reduce or sound pressure signal.

[15]

Sounds in 3 dimensional space through a position in the listening space object signal can reconstruct the disclosed.

[16]

Figure 1 3 dimensional space and posing a system architecture of audio signal processing device converting sounds in tracking an object are disclosed. Figure 2 shows a 3 dimensional space also sounds in reconstructing an original object position location in general outline listening space same are disclosed. The present invention included in the audio signal and also Figure 3 shows a metadata encoding, decoding procedure according to a stereophonic signal indicating block layout of reproducing system module are disclosed.

[17]

The specification of a term in the present invention used in the ability of a typical possible while a general terms selected but, this it was found intended risks caused by the descriptors, which said scale can be depending on appearance of a new technique or the like. In addition applicant is arbitrarily selecting the specific cases terms may, in this case in corresponding description of the invention based on their meaning will. If a term used in specification, terms in the name of a term not having simple and substantially semantics and the specification should be interpreted as two tanks based on content found broadcast receiver.

[18]

Figure 1 shows a block representing the present invention also include audio signal acquisition and analysis process are disclosed. 1 Remote (far a-field) signal according to the sound object also located microphone array (spatial audio signal) until the acquired spatial audio signal (microphone array), 3 reflects a microphone according to the relative position with influence (3D relative position) and room environment influence of reflected acoustic properties (acoustic room condition) and finally space echo noise signal (ambient noise signal) is equal to lower. The additional short-range object sounds (near a-field) through sound objects located microphone signal (sound object signal) is obtained therefrom. The location of the objects or direction and room environment sound sound object signal for minimizing the impact of attached near the source are acquired through a microphone.

[19]

Each of microphone arrangement used for acquiring remote sounds in a microphone signal in all directions 3 dimensional space for acquiring an object disposed thereon and horizontal and vertical. Acquiring signal contains a 3 dimensional space through microphone array signal representing the reverberation noise sound source in any position of signal can be acquired. A sound source signal acquired by remote microphone array with one omni directional signal orthogonal coordinate axes representing the 3 dimensional space a plurality of vapor cooling (Ambisonic) (First order Ambisonics) HoA FoA [...] B-a format or (Higher order Ambisonics) signal or from outside. The user selects one of the microphone array microphone is supported 3 dimensional space reflecting property of encapsulation in a direction and microphone itself are disclosed.

[20]

Individual sound object used for acquiring local sounds in a microphone for acquiring only a source of sound acquires attached to or disposed on locations closest to the sound source. Through reverberation noise and other sound object is always relatively high level signal corresponding sound object can be acquired. In particular, when sound object is moved, using the sound object in accordance with the relative distance remote acquiring signal and second, acquiring signal without influenced according to changes in position of a local area to be coated.

[21]

The 3 dimensional space mono sound object signal is acquired multi-channel spatial audio signal are used to track the location of the objects sound on. First, local area acquiring coordinate axis of each signal converted into mono signal s is [...] B-a format or HoA [...] B-a format multi-channel signal Retrieving normalized expressions 1 through 1 to 1 on 2 also determines the cross correlation (Normalized Cross provided Correlation), , And time difference (Interchannel Time Difference, ITD) related to HRV.

[22]

[23]

(1 Expressions)

[24]

[25]

[26]

(2 Expressions)

[27]

[28]

[29]

(3 Expressions)

[30]

[31]

[32]

Physically the same [...] B-a format signal point signal in time must be reached since the sound object signal directly (direct sound) and, ideally the expressions can be compared with the 2 is obtained. In addition room space unless an echo signal acquiring spatial audio signal carrier space distance Assuming that the same energy can be acquired. The sound object spatial audio signal which affects the degree of correlation between the signals element 3 dimensional space or distance (distance) (direction) direction of object sounds in such as position information (position) are disclosed. I.e., 3 dimensional space spatial audio signal capturing points on different expressions 1 to the location of the objects according to the origin of the value of the sound on the collar, e.g. sound object spatial audio signal axis (axis) is located close to the X-axis direction, X axis signals and sound object signal the signal obtained high correlation than higher officer degree different from each other.

[33]

1 Expressions spatial audio signals obtained as a result of the interaction with the 3 dimensional space 4 the next higher officer degree sound object signal such as coordinate axes (spherical coordinates) value multiplied by a variable constant expressions in limited thereto. For the distance from the variable constant A thread number sound object signal directing characteristic (source directivity pattern) or spatial audio signal includes acquiring the radiation pattern of a microphone (microphone spherical pattern), spatial audio acquisition microphone sound object distance, determined according to the physical properties of the room space. Storing in 3 dimensional space coordinate axis direction signal denoted as acquire this value is small and this value is small and vice versa in a wider angular direction around the coordinate axis storing incoming signal more acquire with each other.

[34]

[35]

(4 Expressions)

[36]

[37]

[38]

4 Estimated 3 dimensional space coordinates of the expressions such as horizontal plane and in a vertical sound object (azimuth) and vertical (elevation) from outside to each search.

[39]

[40]

(5 Expressions)

[41]

[42]

[43]

(6 Expressions)

[44]

[45]

[46]

Figure 2 chamber number 3 dimensional stereo sound source environment is performed randomly in the location of the objects sounds in the reproduction environment of the virtual reality environment etc. show sound source. According to spatial audio signal contains a room environment also 1 remote acquiring acoustic properties as well as lower space echo signal to be coated. In addition local area acquiring sound object signal is equal to the discharge lower reverberation signal. The forward movement of the object based on the echo signal space using microphone sound acquisition, the user feels the sound object signal direction feeling well in a stand-alone is not capable of recording according to a predetermined number flow tides. 3 Dimensional stereo sound source signal for selecting the location of the objects sound recording signal acquired in addition or counter space for ectasia or reverberation signal to an emphasis to a message, a speech source signal and spatial reverberation signal effectively bent from an object from recorded signal needs to be disclosed.

[47]

The reverberation signal space according, respectively does not vary over time in the case of stationary noise, sound object (video) signal obtained from a microphone or microphone array result in free time interval can be obtained from immediately. However over time in the case of metallic non non-stationary noise or sound object if there is no video tape, sound object is comprehensively reverberation signal must present time interval space even number 2000. The expressions such as multi-channel spatial audio signal from a mono sound object signal number 7 next to a stand-alone, acquires echo signal space.

[48]

[49]

(7 Expressions)

[50]

[51]

[52]

In expressions 7 left protest signal Special number (de-a noising) and defines a space echo signal space audio signal, the original audio signal time delay constant value d is applied signal acquired space individual object sound signal gain constant value Multiplied by the signal obtained by subtracting. The time delay of each constant value gain constant value may be determined by value is 3 to 2 expressions and expressions used, this taste spatial audio signal obtained through an in Figure 1 such as sound object signal reflecting the relative position according to S. influence when applied. Through each coordinate axis is equal to the spatial echo signal is obtained.

[53]

[54]

Local area acquires the discharge can be mono signal sounds in object signal plus noise. For a stand-alone same number space determined by the reverberation signal object for reparing over the number 7 and expressions can be acquired reverberation signal. Each [...] B-a format signal left by obtaining a radiation pattern of a microphone in the effect of reverberation oppose amplifying effect of the ship position number space echo signal can be achieved. Expressions such as number 8 through sound object signal may be both spatially reverberation signal is equal to or higher to obtain a signal for reparing over.

[55]

[56]

(8 Expressions)

[57]

[58]

[59]

The present invention included in the audio signal and also Figure 3 shows a metadata encoding, decoding procedure according to a stereophonic signal indicating block layout of reproducing system module are disclosed. According to number 3 dimensional sound source environment after acquiring the reconstructed (Enhanced spatial audio encoder) also in chamber 3, with regard to each retrieving expressions 7 8 left audio signal and sound object signal and expressions obtained by 5 to 6 produce a heat signal bit including metadata. Also varies the spatial audio signal is provided to said bit string (Enhanced spatial audio decoder) receives the sound object signal and metadata decrypted. 3 Dimensional position coordinates direction information decoded sound object signal is decrypted chamber number recording environment based on audio signal can be converted into a 3 dimensional space or user space on any control by that the new coordinate directions from outside. Said spatial audio signal converted into decoded spatial audio signal such as sound object signal is caused by environment layout with rendered signal.

[60]

[61]

[What is claimed here]

[62]

1.

[63]

Microphone array standby unit acquires spatial audio signal (signal number 1),

[64]

Mono mike through local area sound acquisition (number 2 signal),

[65]

Number 1 number 2 number 1 signal includes a periodic cross-correlation signal outputted from 3 dimensional space comprised of number 2 on map

[66]

Number 1 number 2 signal based on said determined location signal mix

[67]

2.

[68]

Said mix of said space using references to the positions, number 1 number 2 number all or a portion of the signal components after addition on the output signal can be a stand-alone

[69]

3.

[70]

Characterized in that said space created with the meta data for the number 2 position signal

[71]

4.

[72]

Said mixed signal is again HoA signal selection multiplexer number 3 the bit line pair

[73]

Extracting cosmetologic number 3 for transmitting signal encoding

[74]

5.

[75]

Extracting said number 1 and number 2 encoded together with said generated metadata produced features

[76]

6.

[77]

Said generated number 3 signal format tailored to different target platform (e. G. HoA to FoA) characterized in that the encoded

[78]

7.

[79]

Said decoder decodes the signal transmitted in rendering

[80]

8.

[81]

Said decoder in particular metadata transmitted signal number 1 signal, the case number 2, number 1 through number 2 signal using metadata to mixed signal (sound scene) to create an improved sound scene features

[82]

[83]

In the embodiment described in the present invention is more possible to rapidly through but, if one skilled and the modified of the present invention recognize that wider, can be alterations. I.e., binaural rendering is described but the present invention refers to audio signal in the embodiment, the present invention refers to audio signal as well as various multimedia signal including video signal even force and expandable disclosed. In the embodiment detailed description of the invention and the present invention is provided to the party from the hereinafter for thermally processing the rights of the present invention can be interpreted accurately and in a range of 2000.



[1]

The present invention relates to a method and a device for processing a signal in order to effectively play an audio signal. More specifically, the present invention relates to a method and a device for processing an audio signal for realizing an immersive type binaural rendering for a portable device including a head mounted display (HMD) device. The method acquires a spatial audio signal and a sound object signal based on a three-dimensional coordinate, and performs time synchronization of the signals at a recorded time and spatial synchronization of the signals in a three-dimensional space.

[2]

COPYRIGHT KIPO 2018

[3]



Audio signal processing method and device.