Three-dimensional object e.g. human face, position determining method for creating key frame, involves determining position of object in image from position information associated to selected two-dimensional representation