MODEL-INDEPENDENT FACE LANDMARK RECOGNIZING DEVICE IN SPACE AUGMENTED REALITY

08-12-2017 дата публикации

Номер:

KR1020170135758A

Автор: WOO, WOON TACK, JANG, YOUNG KYOON, JUNG, EUN AH, KIM, SUNG SIL, YU, JEONG MIN

Принадлежит:

Контакты:

Номер заявки: 00-17-102067150

Дата заявки: 30-05-2017

[1]

The present invention refers to a landmark recognition device augmented reality spatial model at stand-alone face are disclosed.

[2]

[8] And IllumiRoom RoomAlive spatial number such as augmented reality environment virtual object projection (projection) (SAR, Spatial Augmented Reality) distance to seal has the microbalance substrate.

[3]

The spatial augmented reality environment the user to contact the wall of the projected virtual object interaction needs to be disclosed.

[4]

(Head mounted Display) HMD wearing face without measuring the position and direction of outputs, facial pose for detecting or measuring the land face moving natural interaction with various spatial augmented reality environment to a solution point number etched in a interactive (interactive). Here, user's line of sight direction face direction is big.

[5]

Furthermore, stand-alone model face landmark detection and tracking technique augmented reality environment can support various spatial diffuse to the user.

[6]

As a result, indoor environment model at the landmark detection and tracking technique based on stand-alone face and Oculus HoloLens without wearing heavy such as HMD, the user experience distance number [...] substrate.

[7]

However spatial number projected moving quickly contents in chamber space augmented reality environment according to the change of the position and direction of illumination variations of a room user utilizes an anisotropic in addition very in the rear face, in such situation existing RGB illumination variations not confirm that the characteristic point detection method based on robust and face rotation present.

[8]

1. Microsoft HoloLens. Https://www. Microsoft. Com/microsoft provided hololens /, accessed Sep. 25, 20152. Oculus VR. Https://www. Oculus. Com /, accessed Sep. 25, 20153. Ahonen, T. , Hadid, A. , Pietikainen, M. : Face description with local binary patterns: Application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28 (12), 2037 - 2041 (Dec2006)4. Cao, C. , Weng, Y. , Lin, S. , Zhou, K. : 3D shape regression for real-a time facial animation. ACM Trans. Graph. 32 (4), 41:1 - 41:10 (Jul 2013)5. Dalal, N. , Triggs, B. : Histograms of oriented gradients for human detection. In: Schmid, C. , Soatto, S. , Tomasi, C. International Conference on Computer Vision & Pattern Recognition (eds.). Vol. 2, Pp. 886 - 893. INRIA Rhone-a Alpes, ZIRST provided 655, av. De l'Europe, Montbonnot provided 38334 (June 2005), http://lear. Inrialpes. Fr/pubs/2005/DT056. Denning, P. J. : The locality principle. Commun. ACM 48 (7), 19 - 24 (Jul 2005)7. Jang, Y. , Woo, W. : Local feature descriptors for 3d object recognition in ubiquitous virtual reality. In: 2012 International Symposium on Ubiquitous Virtual Reality, Daejeon, Korea (South), August 22 - 25, 2012. Pp. 42 - 45 (2012)8. Jones, B. , Sodhi, R. , Murdock, M. , Mehra, R. , Benko, H. , Wilson, A. , Ofek, E. , MacIntyre, B. , Raghuvanshi, N. , Shapira, L. : Roomalive: Magical experiences enabled by scalable, adaptive projector provided camera units. In: Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology. Pp. 637 - 644. UIST ' 14, ACM, New York, NY, USA (2014)9. Jones, B. R. , Benko, H. , Ofek, E. , Wilson, A. D. : Illumiroom: peripheral projected illusions for interactive experiences. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Pp. 869 - 878. CHI ' 13, ACM, New York, NY, USA (2013), http://doi. Acm. Org/10. 1145/2470654. 246611210. Lowe, D. G. : Distinctive image features from scale-a invariant keypoints. Int. J. Comput. Vision 60 (2), 91 - 110 (Nov 2004)11. Lucas, B. D. , Kanade, T. : An iterative image registration technique with an application to stereo vision. In: Proceedings of the 7th International Joint Conference on Artificial IntelligenceVolume 2. Pp. 674 - 679. IJCAI ' 81, Morgan Kaufmann Publishers Inc. , San Francisco, CA, USA (1981)12. Shotton, J. , Girshick, R. , Fitzgibbon, A. , Sharp, T. , Cook, M. , Finocchio, M. , Moore, R. , Kohli, P. , Criminisi, A. , Kipman, A. , Blake, A. : Efficient human pose estimation from single depth images. Trans. PAMI (2012)13. Tang, D. , Chang, H. J. , Tejani, A. , Kim, T. K. : Latent regression forest: Structured estimation of 3D articulated hand posture. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2014)14. Viola, P. , Jones, M. : Rapid object detection using a boosted cascade of simple features. In: Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on. Vol. 1, Pp. I-a 511 a-I-a 518 vol. 1 (2001)15. Wang, H. , Klaser, A. , Schmid, C. , Liu, C. L. : Action recognition by dense trajectories. In: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition. Pp. 3169 - 3176. CVPR ' 11, IEEE Computer Society, Washington, DC, USA (2011)

[9]

In order to solve the present invention refers to said door such as number point, augmented reality (SAR, Spatial Augmented Reality) spatial object to be reliably detected and track of a person is to enable robust model at a stand-alone face land mark recognizing device number or a reversed F. [...] spatial augmented reality.

[10]

The one aspect of the present invention color Image and depth Image face including acquiring an Image Image acquisition section; said face region from face Image acquired in Image acquisition section detects, color Image topical calculated binary pattern, each pattern depth Image by computing a previous pattern combined with each pattern and local topical local face feature detector calculates the face feature of the facial Image; face and upper face while the lower two landmark set into a set of phase relationship tree face topology tree instances, said face topology of the tree root node including the plurality of facial Image matches ablaze with learning tub in each node topology tree including a subset of potential regression [...] landmark face along face portion to have learn landmark Image, said facial feature is received and face feature of the facial Image input value learned [...] which leaf node when a circle (Traverse) facilitating difference calculated results accumulated landmark mark then stored at the node landmarks detection portion of the invention.

[11]

In addition, one aspect of the present invention displayed color Image and depth of said Image acquisition can be RGB-a D camera in the color Image and depth images other.

[12]

In addition, one side of said facial feature detector of the present invention detect face obtained Image in said Image acquisition face detector; said subtracter face face color for each pixel from the Image of the number considering the brightness with local binary pattern calculator for calculating local binary pattern; depth of local surface normal vector (Local Surface Normal Vector) face said subtracter facial Image from an Image generating surface normal calculator for; said surface normal of each pixel is calculated by applying a local surface normal vector generated local principle (Locality Principle) local each pattern (LAP: Local Angular Pattern) each calculator for calculating local; and said local binary pattern calculator for calculated by said local binary pattern each calculator for each patterns forming local calculated by a local mixer comprises the face features.

[13]

In addition, one aspect of the present invention controls each of said local binary pattern LBP (Local Binary Pattern) techniques to determine the face Image by applying local binary pattern.

[14]

In addition, the specifications of the 3 dimensional object of one aspect of the present invention said surface normal information based on the calculated depth images from the local surface normal vector (Local Surface Normal Vector) when camera coordinate extracting local face coordinate matrix camera coordinate conversion between rotating for matrix of the substrate surface normal vector tangential surface of topical local face coordinate produce.

[15]

In addition, one aspect of the present invention each of said expressions such as local surface normal vector at each pixel below the calculated local 4 obtained with local surface normal vector dot-product (inner product) between pivot vi vpivot threshold angle less than 0 (threshold) through if, when assigning 1 large local each pattern is defined.

[16]

(4 Expressions)

[17]

[18]

Here, v_Pivot Pivot local surface normal vector whose, v_i The system of the normal vector and pixel i, thresholds for the threshold which is, LAP [x, y] has a coordinate (x, y) represents each pattern having local pixel i, lap to lap of each pattern indicate components local, lap is representative 0 to 7 quantizes i components is about.

[19]

In addition, one aspect of the present invention divided into an upper face of said land while the lower two landmark set generation unit a set of phase relationship tree face topology tree topology former face forming face; said root node of the tree include for face washing tub ablaze with learning including test face Image matches in each node including the subset of potential regression [...] face topology tree along face portion to have potential regression [...] landmark landmark Image learning system for studying; and said facial feature calculating unit face feature of the potential regression [...] interleave (Traverse) facilitating input value learned when stored and accumulated portion of leaf node node mark difference comprises calculating recognizer landmark results.

[20]

In addition, one side of the split (split) node of the present invention said potential regression [...] potential regression [...] learning machine, facilitating node and leaf nodes to the binary tree is defined.

[21]

In addition, the one side portion of the present invention performs filtering kernels [...] landmark landmark in density optical flow of a landmark for locating an landmark portion 37a by landmarks tracking unit further comprises.

[22]

In addition, the configuration of one aspect of the present invention said landmark landmark tracking during tracking error exceeds a threshold of around landmark from landmark (outlier landmark) whose one end for fixing the landmark of these neighbor configuration to follow-pull other.

[23]

In addition, occurrence of one aspect of the present invention said landmark landmarks are an error tracking, configuration and a direction value averages the distance of movement of neighboring land marks, and amplifiers landmarks error correcting substrate.

[24]

In the present invention number features include lighting environments and face rotation loose RGB provided D and robust derivative has a disadvantage of their color depth Image corrected.

[25]

This feature is augmented reality environment which is used repetitively number one spatial illumination variations of various indoor/outdoor environment are recovered later may be applied face detecting and tracking applications are disclosed.

[26]

In addition, in the present invention number through each face land mark detection method when the model-independent personal eye, nose, and mouth can be caring for virtual natural despite difference.

[27]

In addition [...]'s understanding of what is very rapid and the second embodiment is based on test when testing process from virtual makeup.

[28]

Virtual makeup can occur in later studies of developing robust landmark detecting face range of slat hand even when various face expression is needed despite performing classification is correct face land mark detection method for learning needs disclosed.

[29]

The personal face detection method embodiment is optimized landmarks such as virtual makeup as well as entertainment and entertainment content will applicable applied face character animation.

[30]

The present invention refers to the van without face model learning, robust tracking performance than existing KLT tracking technique and viscoelasticity.

[31]

Future, severe deformation input, robustness of the eye motion of tracking the detected landmark information or to use constant tracking error is higher than initialization, it became grudge topology defined number can be the same.

[32]

In addition, tracking results in computer graphics module can be integrated with a seal number based system to augmented reality.

[33]

In the embodiment according to Figure 1 shows a land mark recognizing device of the present invention also augmented reality spatial model at one of stand-alone face are disclosed. Figure 2 shows a detailed configuration of Figure 1 facial feature detector also are disclosed. Figure 3 shows a local coordinate converting also dominant landmarks based on the camera coordinate transformation matrix to obtain process is shown are disclosed. Figure 4 shows a local surface normal Image showing examples applying the principle also are disclosed. Figure 5 shows a detailed configuration of Figure 1 land generation unit also are disclosed. Figure 6 shows a topology model example used in the present invention also face landmarks are disclosed. Figure 7 shows a surface potential regression [...] 06 tree also are disclosed. Figure 8 shows a landmark value according the error also is shown are disclosed. (A) of Figure 9 does not use a learning model of Figure 9 (b) is using the present invention and as a result of use Sparse optical flow based KLT tracker that at or superior performance.

[34]

Semiconductor memory of the present invention embodiment of the present invention achieved by the present invention and the purpose of the present invention preferred embodiment members of example hereinafter with reference to describe it in a heat chamber.

[35]

First, a term used in a particular application only is used to account for in the embodiment, the present invention is defining which are not intending to be, it is apparent that a single representation of the differently in order not providing language translators, comprising plurality of representation can be. In addition in the application, the term "comprising" or "having disclosed" specification of articles feature, number, step, operation, components, parts or specify a combination not present included, another aspect of one or more moveable number, step, operation, component, component or a combination of these is understood to presence of or additionally pre-times those possibility should not number.

[36]

In describing the present invention, publicly known or a function of the specific subject matter of invention related description is the description if a haze can be decided to be dispensed to each other.

[37]

In the embodiment according to Figure 1 shows a land mark recognizing device of the present invention also augmented reality spatial model at one of stand-alone face are disclosed.

[38]

The reference also 1, special issue one in the embodiment according to of the present invention stand-alone face land mark recognizing device for imaging spatial model at reality acquiring unit (100), facial feature detector (200), land generation unit (300) a landmark tracking unit (400) made of disclosed.

[39]

Said Image acquisition section (100) includes a color (RGB: red-a green-a blue) Image and depth (Depth) camera RGB-a D camera in the color Image and depth can be acquiring an Image substrate.

[40]

Said color (RGB) Image capture the Image depth (Depth) for liquid water-depth camera RGB camera RGB-a D (Depth) or separate depth (Depth) camera can be configured.

[41]

General camera RGB Image obtained only but, in one embodiment of the present invention embodiment can obtain information used RGB-a D camera depth (Depth) the slide substrate.

[42]

Said depth (Depth) camera design condition for the SwissRanger 4000, PMD CamCube, D provided IMager, using Microsoft yarn Kinect disapproval.

[43]

Digital video acquisition section (100) during a predetermined time interval includes a plurality of input images each other.

[44]

On the other hand, Image acquisition section (100) includes a plurality of input images are analyzed each single channel can be obtained. For example, gray (Gray) scale the Image can be changing. Or an input channel when the multichannel Image 'RGB' same one channel value changing may be filled. The, respect to the input Image (Intensity) by values into one channel in coding, the color can be represent brightness distribution for hereinafter.

[45]

Then, facial feature detector (200) comprises a plurality of input Image from each detecting facial Image, Image is arranged around face detects a feature of other.

[46]

To this end facial feature detector (200) also includes a reference surface 2, face detector (210), local binary pattern calculator for (220), surface normal calculator for (230), each calculator for local (240) and a mixer (250) having a predetermined wavelength.

[47]

Said face detector (210) comprises a plurality of input Image from each detecting facial Image could be bonded each other. Face detector (210) includes a coarse face face after detection in each input images from certain components of the eyes, nose, mouth or the like to extract, the basis can be predetermined face area.

[48]

For example, if the state of the position is detected, determines the distance eyes can be. Face detector (210) distance between the eyes is facial model may be extracted face region is connected to an input, the input Image can be a defect to background or variation of human head style.

[49]

In addition, face detector (210) is the size of the extracted face region information face area capable of normalizing. Normalizing the size of face region by the distance between two eyes in a facial region, the distance between the eye and nose unique characteristics of the same scale level can be calculated.

[50]

Next, local binary pattern calculator for (220) is the number of facial Image with considering the brightness for each pixel to determine the local binary pattern.

[51]

Local binary pattern calculator for (220) includes a brightness LBP (Local Binary Pattern) by techniques can be calculated local binary pattern.

[52]

Thus, by applying LBP technique, face recognition etc. replacing (robust) illumination can be robust against external interference. Only, local binary pattern calculator for (220) includes illumination of an external interference insignificantly and may be omitted if disapproval. In addition, local binary pattern calculator for (220) local to respective facial Image binary pattern may be calculated, to calculate the local binary pattern formed on at least one of the predetermined face may be filled.

[53]

Said LBP technique and an illuminated visual effect of change in high durability and simple operation the Image recovery, biometric Image analysis, facial Image analysis and recognition method is applied to the intensity representative binary pattern conversion are disclosed. LBP neighboring pixel value of the pixel value difference is current position value of 0 and 1 exhibits.

[54]

LBP is greater as compared to a clockwise direction central pixel values 1, otherwise 0 value of the converting method, compared to 8 adjacent pixels because the represented by 8 bit.

[55]

On the other hand, surface normal calculator for (230) is 3 dimensional object information based on the specifications of the depth images from local surface normal vector (Local Surface Normal Vector) where it is extracting, camera and a local face coordinate conversion between rotating for matrix using a local surface normal vector Image consistency despite switching matrix face rotation point extracted.

[56]

Here, surface normal calculator for (230) also includes a reference surface 3, with reference to the camera coordinate face major landmarks (amount eyebrows, protrusion amount intermediate) local face coordinate system transformation matrix (Transformation Matrix) Tr inputs with each other.

[57]

Transformation matrix (Transformation Matrix) (rotation) value linear motion (translation) of rotation determined by the torso equations 1 through 3 dimensional coordinates camera coordinate system transformation of the face direction can be consistent.

[58]

(1 Expressions)

[59]

[60]

Here, (L_1X , L_1Y , L_1Z ) In local face coordinate right and landmarks of the eyebrows, (L_2X , L_2Y , L_2Z ) Land mark left in local face coordinate eyebrows, (L_3X , L_3Y , L_3Z ) In local face coordinate intermediate landmarks and right jaw, (L_4X , L_4Y , L_4Z ) Left in local face coordinate exhibits intermediate landmarks of protrusion.

[61]

And, (L'_1X , L'_1Y , L'_1Z ) In a camera coordinate right and landmarks of the eyebrows, (L'_2X , L'_2Y , L'_2Z ) Land mark left in camera coordinate eyebrows, (L'_3X , L'_3Y , L'_3Z ) In a camera coordinate intermediate landmarks and right jaw, (L'_4X , L'_4Y , L'_4Z ) Left in camera coordinate exhibits landmarks of the intermediate protrusion.

[62]

On the other hand, transformation matrix Tr 2 is arched as shown in the expressions (internal calibration matrix) on internal calibration matrix camera A, T R and transition vector (translation vector) diagonal matrix (rotation matrix) consisting disclosed.

[63]

(2 Expressions)

[64]

[65]

As described below 3 on expressions using camera coordinate conversion between a local face coordinate rotation of converting totally or matrix (R^-1 ) Surface of camera coordinate tangential vector Image by applying a local surface normal vector Image consistency despite face rotation produce a topical face coordinate conversion point.

[66]

(3 Expressions)

[67]

[68]

Here, (SN_Ix , SN_Iy , SN_Iz ) (Here, n is from 1 as the [...] represents i) surface of a camera coordinate and tangential vector, (SN '_Ix , SN '_Iy , SN '_Iz ) (Here, n is from 1 as the [...] represents i) face coordinate system of local tangential vector disclosed.

[69]

On the other hand, each calculator for local (240) is obtained local surface normal vector at each pixel (Locality Principle) local principle applying to each pattern to determine the local (LAP: Local Angular Pattern).

[70]

In residual same more particularly, each calculator for local (240) is pivotal (pivot) local surface normal vectors obtained in each pixel dot through local surface normal vector (inner product) between angle less than 0 when threshold (threshold), when taking into account the depth and direction 1 large considering local each pattern (LAP) is defined.

[71]

Each calculator for said local (240) is below 0 and 1 8 sites between 0 - 255 through expressions 4 consisting of a result local each pattern formed on the substrate.

[72]

(4 Expressions)

[73]

[74]

[75]

The definition is local (LAP) pixels each pattern generally (holistic) can be used for detecting and tracking face part (patch provided based) can be utilized even detecting a landmark face recognition by anti-inflammatory effect.

[76]

Next, mixer (250) includes a local binary pattern calculator for (220) calculated by a local binary pattern each calculator for local (240) calculated by the local each patterns face feature formed on the substrate.

[77]

Next, land generation unit (300) includes a tub portion of face Image extracted.

[78]

As shown in fig. 5 for land generation unit (300) includes a face topology former (310), potential regression [...] learning device (320) and recognizer (330) having a predetermined wavelength.

[79]

Said face topology former (310) in specific positions to the front and the landmarks are part of respective face face face topology tree design such as 6 are received in positional relationship to each other.

[80]

An upper face to the lower two landmark set into a set of tree produced face phase relationship tree structure, each leaf node in order to create a root node 10 landmark set of face washing tub (Leaf Node) less than 1000.

[81]

Then, potential regression [...] learning device (320) includes a root node of the tree including the plurality of tub ablaze matches said face include for facial Image (specifically said facial feature detector (220) test face Image detected in the face feature of entered by a learner) in each node topology tree having learning potential regression [...] subset including portion of face to have face along landmark landmark Image learning other.

[82]

As a result, potential regression [...] learning device (320) includes a washing tub including Image data include for root node matches ablaze after learning [...] has in each node of the subset has a landmark face along face topology including Image data.

[83]

Figure 7 shows a one in the embodiment according to of the present invention representing the potential regression [...] also are disclosed.

[84]

The split (split) node potential regression [...], facilitating the binary tree leaf node and nodes can be defined. Split node perform against the input test function, determine whether right or left child progress as children in the can.

[85]

Performing test function facilitating node does not. Instead, the split node facilitating arrival samples from two portions from the samples is defined. Single site of objects corresponding to the leaf node can output samples.

[86]

A general regression [...] potential regression [...] alternatively root node input samples can be divided into various portions. For example, root node of input predetermined potential tree model divided into various portions, divided portions along parallel tree leaf nodes different propagated to other. A test sample is divided facilitating root node input node, divided portions can be propagated to a leaf node. While, the general regression [...] root node of input does not dividing. Along one of input is entirely tree leaf node propagated to other.

[87]

Splitter node data comparing (Split) separating face feature in the substrate. When the largest dispersion value subset data separating can be tested several times between and separating into two sets (Split), for dispersion in difference current split node 50 to 95 with previously split node is large enough and not separating unit (Division) executable module. Separation (Division) topology data divided into independent of each tree node of child node corresponding to the node a selected partial region are disclosed. Child node corresponding to the node topology is preferable and facilitating node landmarks set of position value difference (Offset Vectors) for sparse subtrees.

[88]

Sides of the leaf node include for automatically change over only the set potential regression [...] leaf node to leaf node representing partial land position value difference overnight.

[89]

Recognizer (330) is said facial feature calculating unit (200) face feature of the input value learned when potential regression [...] interleave (Traverse) facilitating node and the leaf then stored at the node mark calculates a difference apparent landmark of results accumulated in face area.

[90]

On the other hand, M having face topology tree node, represents p (i) is preferable, on the children nodes is r l (i) (i) exhibit, here i∈M and belong to, i=0, 1, 2.. |M|are disclosed.

[91]

Here, p_i^l Each node i corresponding Image I with toe roll landmarks set from the central position of big.

[92]

Each potential regression [...] tree topology of each corresponding step out with each other. The former has an extent the root node topology tree i=0 in the model corresponding to the landmark face with each other.

[93]

Tree is grown by, landmark of leaf-node range on the children nodes until r l (i) (i) divided according. A learning data at each node in the subset of S S 2^l And S^r To separation function f_i On randomly chosen threshold τ_i Using apart from each other.

[94]

The learning process using the topology node i processing advances. Separation function f_i On subset defined as follows.

[95]

(5 Expressions)

[96]

[97]

(6 Expressions)

[98]

[99]

At this time, a device for imaging a pixel value I (,) I represents, on the normalized offset (offset) randomly u v exhibits. Wherein f_i The separation function are disclosed.

[100]

Separation function f_i In information and has the largest gain value, a gain value is not increased in previous node information, the learning process advances to a separate process. The following expressions i gain information in topology node is defined.

[101]

(7 Expressions)

[102]

[103]

Here, ∑_Im Χ logic value vector θ_m Sample covariance matrix of the set of are disclosed. 2 Subset of current center position used for offset vector disclosed.

[104]

In the center of the offset vector of a given facial Image learning data selected apart from each other by separation. The children nodes with corresponding learning data the server processing continues to a fine range. The offset vector stored in node separation.

[105]

For separating the topology node divided leaf node are repeated until one of the end landmark enter and reach.

[106]

In each leaf node, offset from the center of the landmarks of the preferable node is stored disclosed.

[107]

Next, landmark tracking unit (400) is detected t p landmarks of the frame Image^l_t (X^l_t , Y^l_t ) Based on (portion landmark), density optical flow (dense optical flow) G=(u_t , V_t ) (Convolution) (median filtering kernel) on a map of the landmark [...] 37a filtering kernel M p t + 1 frame^l_{T + 1} (Landmark landmark portion of) the position of a substrate.

[108]

(8 Expressions)

[109]

[110]

Here,, which is meaning [...], (x^-^l_t , Y^-^l_t ) Is (x^l_t , Y^l_t ) Round position are disclosed.

[111]

Then landmark tracking unit (400) is rapid motion or rotation during abrupt face configured landmark tracking of water to a landmark from around threshold error landmark (outlier landmark) whose one end, their neighboring configuration for fixing the landmark of track-pull other.

[112]

Specifically, landmark tracking unit (400) includes a first land marks (eye, nose, mouth) semantic space area neighboring configuration defined considering 2000.

[113]

The landmark tracking unit (400) includes a landmark error occurence, configuration and a direction value averages the distance of movement of neighboring land marks, and amplifiers landmarks error correcting substrate.

[114]

The same reference also 8, left eye [sep second configuration different configurations of land marks out of landmarks are around left eye [sep landmarks are error when it moving distance and direction of paired configuration land marks averaging amplifiers landmarks error correcting substrate.

[115]

The landmark tracking techniques the present invention according to RGB camera motion opens the front face by applying loose face number initial tracking result therefrom.

[116]

(A) of Figure 9 does not use a learning model of Figure 9 (b) is using the present invention and as a result of use Sparse optical flow based KLT tracker that at or superior performance.

[117]

In this way, the present invention refers to the van without face model learning, robust tracking performance than existing KLT tracking technique and viscoelasticity.

[118]

[119]

In addition, tracking results in computer graphics module can be integrated with a seal number based system to augmented reality.

[120]

100: Image acquisition section 200: facial feature detector 210: Face detector 220: local binary pattern calculator for 230: Surface normal calculator for 240: each calculator for local 250: Mixer 300: land generation unit 310: Face topology former 320: potential regression [...] learning device 330: Recognizer 400: landmark tracking unit

[1]

The present invention relates to a model-independent face landmark recognizing device in space augmented reality comprising: an image obtaining unit; a face feature detecting unit; and a landmark detecting unit. The image obtaining unit obtains a face image including a color image and a depth image. The face feature detecting unit detects a face area from the face image obtained in the image obtaining unit; measures a local binary pattern in the color image; and measures a face feature of the face image combined with a local before pattern and each local pattern by measuring each local pattern in the depth image. The landmark detecting unit forms a face topology tree which is a face topology relation tree by dividing an upper face landmark set into two lower groups; learns to have a partial landmark image including a face landmark partial set depending on the face topology tree in each node of a potential regression forest in a process of learning to have a plurality of test face images including an entire face landmark which coincides with a root node of the face topology tree; and calculates a result value of the partial landmark by accumulating a difference of position values stored in a division node and a leaf node by traversing the learned forest when the face feature of the face image detected in the face feature detecting unit is set as an input value. The present invention facilitates real-time virtual makeup.

[2]

[3]

[4]

(100) Image obtaining unit
(200) Face feature detecting unit
(300) Landmark detecting unit
(400) Landmark tracking unit

Color Image and depth Image face including acquiring an Image Image acquisition section; said face region from face Image acquired in Image acquisition section detects, color Image topical calculated binary pattern, each pattern depth Image by computing a previous pattern combined with each pattern and local topical local face feature detector calculates the face feature of the facial Image; face and upper face while the lower two landmark set into a set of phase relationship tree face topology tree instances, said face topology of the tree root node including the plurality of facial Image matches ablaze with learning tub in each node topology tree including a subset of potential regression [...] landmark face along face portion to have learn landmark Image, said facial feature is received and face feature of the facial Image input value learned [...] which leaf node when a circle (Traverse) facilitating difference calculated results accumulated landmark mark then stored at the node portion of the nests model augmented reality spatial landmarks detection including stand-alone face land mark recognizing device.

According to Claim 1, said Image acquisition sensitivity color Image and depth can be RGB-a D camera in the color Image and depth model at stand-alone face land mark recognizing device acquiring an Image spatial augmented reality.

According to Claim 1, said facial feature detector in said Image acquisition face obtained Image face region detector; said subtracter face face color for each pixel from the Image of the number considering the brightness with local binary pattern calculator for calculating local binary pattern; depth of local surface normal vector (Local Surface Normal Vector) face said subtracter facial Image from an Image generating surface normal calculator for; said surface normal of each pixel is calculated by applying a local surface normal vector generated local principle (Locality Principle) local each pattern (LAP: Local Angular Pattern) each calculator for calculating local; and said local binary pattern calculator for calculated by said local binary pattern each calculator for each patterns forming local calculated by a local mixer including stand-alone face augmented reality spatial model at face feature to landmark recognition device.

According to Claim 3, said local binary pattern LBP (Local Binary Pattern) controls each face Image calculating a local binary pattern applying technique augmented reality spatial model at stand-alone face land mark recognizing device.

According to Claim 3, said surface normal information based on the specifications of the local surface normal vector calculated from the depth Image 3 dimensional object extracting camera coordinate (Local Surface Normal Vector) when a local face coordinate matrix camera coordinate conversion between tangential surface of rotating for matrix of the substrate surface normal vector for a topical local face coordinate augmented reality spatial model at stand-alone face land mark recognizing device.

According to Claim 3, said expressions such as local surface normal vector at each pixel below the calculated local each 4 obtained with local surface normal vector dot-product (inner product) between angle pivot vi vpivot threshold (threshold) when through less than 0, 1 when allocating large local each creating a pattern in the landmark recognition device augmented reality spatial model at stand-alone face. (4 Expressions) Here, v_Pivot Pivot local surface normal vector whose, v_i The system of the normal vector and pixel i, thresholds for the threshold which is, LAP [x, y] has a coordinate (x, y) represents each pattern having local pixel i, lap to lap of each pattern indicate components local, lap is representative 0 to 7 quantizes i components is about.

According to Claim 1, said generation unit comprises an upper face to the lower two landmark set into land face while a set of phase relationship tree face topology tree topology former forming face; said root node of the tree include for face washing tub ablaze with learning including test face Image matches in each node including the subset of potential regression [...] face topology tree along face portion to have potential regression [...] landmark landmark Image learning system for studying; and said facial feature calculating unit face feature of the potential regression [...] interleave (Traverse) facilitating input value learned when stored and accumulated in leaf node node mark difference calculating recognizer landmark of results including stand-alone model augmented reality spatial portion face land mark recognizing device.

According to Claim 7, said split (split) node potential regression [...] potential regression [...] learning machine, facilitating the binary tree leaf node and nodes defined by augmented reality spatial model at stand-alone face land mark recognizing device.

According to Claim 1, performs said landmark landmark in density optical flow portion by portion for locating an 37a filtering kernels [...] landmark landmark of landmarks tracking device further including stand-alone face landmark recognition model at spatial augmented reality.

According to Claim 9, the configuration of said landmark landmark tracking tracking error exceeds a threshold during landmark from around whose configuration of these neighboring landmark landmark (outlier landmark) for fixing the track landmark recognition result correcting device augmented reality spatial model at stand-alone face.

According to Claim 10, an error tracking said landmark landmark occurence, configuration and a direction value averages the distance of movement of neighboring land marks, correcting the same error landmarks amplifiers augmented reality spatial model at stand-alone face land mark recognizing device.

CPC - классификация

G G0 G06 G06K G06K9 G06K9/G06K9/0 G06K9/00 G06K9/002 G06K9/0022 G06K9/00221 G06K9/0026 G06K9/00268 G06K9/4 G06K9/48 G06K9/481

IPC - классификация

G G0 G06 G06K G06K9 G06K9/G06K9/0 G06K9/00 G06K9/4 G06K9/48