全向摄像机与话筒阵列的集成设计
Background Technical Field The invention is directed to an integrated omnidirectional camera and the microphone array, especially can be used for recording of the Conference telephone Conference integrated omnidirectional camera and the microphone array. Background Art Television Conference system there are not many commercial success, there are many reasons for this, in particular, these systems usually have a plurality of technical defect. The camera viewpoint is not good and Image resolution is insufficient, it is difficult to see the participants speakers. And then the inaccurate speaker detection (equipped with panoramic an inclined a zoom camera especially local system), the speakers are not allowed the video camera. Furthermore, poor video compression technology, often lead to the television as the nature is bad and Image display "non-coherent". For telephone Conference the shooting device of the system, a few inclined to concern the television meetings and views the origin of the value, including video data, audio data and on the computer monitor of the electronic document or note is shown. Supposed to have a shared file of the note by various software method, in order to improve the method for capturing the audio/video data prasit concern. Capturing video data have three different method: panoramic/tilt/zoom (PTZ) camera. Based on the whole of the mirror to the camera and the camera array. Despite the great majority of the PTZ camera, however, there are two major limitations. First of all, they can only capture a limited field of view, if they zoom too near, room scene will be lost; if zoom too far, the expression is not on the person. Secondly, the control motor to spend the time of moving the camera, so the camera to the reaction of the session (such as speaker switching) is very slow. In fact, PTZ camera can not move too much too quickly, otherwise distract the attention of the people. It is assumed that these shortcomings and the current technology based on the mirror/prism in the omnidirectional vision sensor has improved, researchers have begun to re-consider the method of capture and analysis of the video, such as in entertainment, news and sports broadcast on the Internet, provides BeHere 360° Internet video technology. Using its interface, remote users can control personalized 360° camera angles to get "its charterhouse" experience, has nothing to do with other viewers. Although the method has overcome the limited facing PTZ camera and camera field of view of the two major problem of slow response, however, according to today's technology and market demand to measure, making such device is too expensive. Furthermore, these based on mirror/prism zone omnidirectional camera is still low (even a 1MP sensor) the problem of the defocus, the video quality is low. In another method, a plurality of inexpensive camera or video sensor assembly enabling to the camera array, such as a kind of known system is used 4 table the United States National television system Committee (NTSC) cameras to construct a panoramic view of a Conference room. However, this design has a number of shortcomings. First, NTSC cameras provide a relatively low quality video signal. Furthermore, in the signal analysis, before transmitting or recording, 4 should be used to video cameras 4 block video shooting board-to-signal digitized. To use 4 block the high video shooting the cost and complexity of such systems, even more difficult to manufacture and maintenance. In addition to the problem of video shooting, capturing in the chamber high quality audio is also a challenge. Audio capture system for removing various noise and reverberation, also shall adjust the gain of the different input signal level. General there are three method can meet these requirements. The most simple method is to use microphone (such as through the head mounted type microphone), but the user/speakers somewhat inconvenient. 2nd kind of method is a microphone placed on the table, so as to prevent a plurality of acoustic path, is the most commonly used currently record law of Conference audio. These systems use several (generally 3 only) microphone ultra heart shape provide omnidirectional characteristic. 3rd kind of method is set up on a table telephone Conference system, the PTZ camera pointing in the top of the speaker is provided with a single to the microphone, camera/microphone group is controlled by the computer, the computer using a group of independent microphone as a sound source location. However, the method to use two independent sets of the microphone. Content of the invention The present invention is directed to a can overcome the TV Conference with the Conference recording system in the system of all above-mentioned limitations of the processing, in particular for the system and method for integrated omnidirectional camera and microphone array accomplish this task. In the most general sense, the invention includes a cylindrical rod, they fine enough to in acoustics frequency range are not speech (50-4000Hz) see, and the camera array is connected to the microphone array, thereby eliminating the sound diffraction and acoustical shadow. The integrated camera and microphone array application 360° camera, designed to overcome the above-mentioned problems of various television Conference. 360° camera is located in the central table, and the typical television Conference system (one end of the camera in the Conference room) compared with, the camera viewpoint is very good to the participants. Camera is elevated from the table, the participants can be provided close to the front view. Furthermore, integrated camera and microphone array to provide sufficient resolution, to allow remote viewers to see the facial expression of the participants (such as in a working embodiment, resolution is 300 × 480). The camera can be either full rearward-facing, or application of camera array, or application with double-curved reflector of a single video sensor. Microphone array is a plane structure, the microphone in the microphone array are preferably arranged in the base, in order to eliminate as far as possible close to the desktop Conference table sound reflection. As mentioned above, fine column rod for video camera is connected to the microphone array base, to the human voice frequency range (i.e. about 50-4000Hz) speaking, the microphone array of thin column rod is not visible in acoustics. From the speakers to the array to form the path of all the microphone, the sound source can be better achieved positioning (measuring speakers position) and beam forming (through filters only if from speakers direction of sound and improve the speaker sound quality). Integrated microphone array used for implementing real-time sound source localization, the array of the video camera based on computer vision, together with detecting and tracking the person used for accurately detecting the position of the speakers in the Image. Based on an audio-video of the speaker detection can be used for automatic camera management, video compression is greatly improved (such as the facial regions than background application more multi-bit). Integrated camera and microphone array is preferably connected to the output of the PC, such application Image like sewing and compressed the same, positioning the sound source can be implemented, beam forming and camera management. Integrated camera and microphone array embodiment of a work, the 1394 bus the video is transmitted to the PC, and the audio frequency an analog cable is transmitted to the personal computer (PC). The application of this example 5 table IEEE1394 provide high-quality video camera, only a single 1394 card. Another embodiment of the camera and the microphone all application a single printed circuit board (PCB), so that all audio and video all through a single 1394 cable transmission. The 1394 cable also for electric power, between the camera and the PC only a single cable. Application of the microphone can be omni-directional or wherein unidirectional, but the orientation is preferred, because it can be provided to all interested by a response. The minimum requirement for the microphone 3 a, although the present invention has a preferred embodiment 8 only, in order to improve the positioning accuracy of the sound source, the entire audio system better beam forming and durability. The microphone is preferably around a circular planar microphone base circle of the four week resettlement side, although other configuration is. The microphone more use, full to the audio frequency, and the better the signal-to-noise ratio. However, taking into account the cost and the complexity of a large number of microphones. Furthermore, the microphone. Audio signal processing is more complex. In order to reduce the noise, the microphone can be in rubber box , microphone and the sound insulation body. lens screen camera can be applied to a block, is the normal mode of operation, privacy mode is downward. Or in the security mode, the shutter of the camera sensors can be turned off, or by electronic technology isolating the camera its cut-off. When the wake-up security mode, the microphone are also preferably closed. During recording, the light shining on the camera, lets the user know the video camera is in the work. When the camera is in the secure mode, light turned off. The integrated omnidirectional camera and microphone design can have various other embodiment, part of the reason is that the same modular characteristic of the system. If in one embodiment, omnidirectional camera used with a plurality of video sensors to achieve 360° camera coverage. In another embodiment of the present invention in, one of the omnidirectional camera, use of a video sensor and a capture 360° of light the lens panoramic covered double-curved surface. Moreover, their own has can be applied to the acoustic transparent column stick rise, any of such camera device, in order to offer a front view of the participants. Or they can be with the above-mentioned microphone array integrated together. Or, other video camera design column stick can also be used in combination with. Is connected with a video camera and microphone array for the rod is not cylindrical, as long as fine in 50-4000Hz diffraction not within the range of sound on the row. Similarly, as previously noted, in one embodiment, any of the speakers to realize the smooth path and the sound from the table and the minimum reflection, microphone array includes a microphone around the circumference of the edge distance so as to close to the desktop is set to be as far as possible. However, the use of acoustic transparent rod omnidirectional camera device integrated other microphone configuration is also feasible. Furthermore, the just discussed can be used to the microphone array used without any camera to achieve 360° acoustic covering. This kind of covering by reducing or eliminating the multi-path problem, therefore, is especially suitable for the sound source position and beam forming. The application of the invention with one embodiment of the microphone array, a computer optimization for the Image data and the audio signal. The output digital Image of the video camera and the microphone array audio output (analog/digital converter) is transmitted into the computer, the latter and in various functions of the Image with the audio input, such as panoramic Image filter the omnidirectional camera in the sensor pick-up Image stitching together. Furthermore, Image data is compressed, so that it is more suitable for broadcast on the Internet (such as the Internet), or preferably through the ready to send and/or recording of the audio and video output separation of the separator, is saved to a computer-readable media. Or, the Image data can also be input individual detector/tracker to improve camera management, such as an Image/video comprising the speaker part of the identified, associated with an audio signal, thereby in their speech, the display of the television Conference camera view to the speaker. Furthermore, the facial regions than background has greater resolution, the position of the speakers for improving video compression. Audio input can also be used for various functions, such as the audio is used for sound source localization, so that the speakers at any designated time optimizing audio frequency direction. Furthermore, beam forming can be used in the computer to improve the beam shape of the audio module, so as to improve the filtering of the audio to the designated direction. Noise can also be used for the automatic gain control module to improve the signal-to-noise ratio, its method is relatively indoor background noise, reduce the noise and gain adjusted, in order to better capture the audio signal from the speaker. These Image and audio processing module can be use alone or in combination, or simply not used. Regardless of whether or not the enhanced audio and video signals, to the other broadcast to a television Conference site or the Internet, can also be saved to a computer readable medium for later viewing. The above-mentioned integrated camera and microphone array is mainly used for television Conference and meeting recording. The microphone array to the camera integrate with the, between the audio and video significantly simplifying the necessary correction (precision manufacturing of the camera and the microphone array without correction), a single device can be used for obtaining the room audio-video information. Figure description The adoption of the following description, the accompanying claim and drawings, can be better special characteristic of the invention the geographical xie Ben , aspects and advantages, wherein: Figure 1 shows an example implementation of the invention a universal computing device system. Figure 2 shows the integrated camera and microphone array table better positioning. Figure 3A is a side view of integrated camera and microphone array one embodiment. Figure 3B is a top view of Figure 3A integrated shown in the embodiment of the camera and the microphone array. Figure 4 is a perspective view of integrated camera and microphone array another embodiment. Figure 5 is the audio-video data computer to enhance capture of the embodiment of a work. A preferred embodiment of the detailed description 1.0 exemplary operating environment In the following the present invention the preferred embodiment, to form a part of this invention with reference to the attached drawing, showing by way of example in the Figure can be implemented a special embodiment of the invention. Should be understood, can also be applied to other embodiments and any structural change without violating the scope of the invention. Figure 1 shows an example of implementation of the invention to a suitable computing system environment 100, the environment 100 is only one example of suitable computing environment, the application range of the invention does not to any restrictions or function. Computing environment 100 is not interpreted as having any dependency, is not interpreted as relative example working environment 100 of any element or elements is shown combined with any requirements. The invention and various other general purpose or special computing system environment or configuration work. The application of this invention the known computing system, environment and/or the configuration of the embodiment, including, but not limited to, a personal computer, server computer, hand-held or laptop devices, multi-processor system, the microprocessor-based system, the set-top box, programmable consumer electronics, network PC, mini-computer, a main frame computer, including the above-mentioned any system or device, such as distributed computing environment. The invention uses computer-executable instructions to describe the general range, such as the computer-implemented program module. Program module generally comprises performing a specific task or implement particular abstract data types of sub-routine, program, object, component, data structure, and the like. The invention also can be implemented in a distributed computing environment, wherein the communication network connected with the remote processing device to carry out the task. In a distributed computing environment, the program module can be set up in the local and remote computer storage media, including memory storage device. With reference to Figure 1, implementation of the example of this invention the system includes the computer 110 forms of universal computing device. Computer 110 components including, but not limited to, processing unit 120, system memory 130 and system bus 121, bus 121 comprises a system memory of the various system components pair to processing unit 120. System bus 121 may be any of several types of bus structure a, including a memory bus or memory controller, peripheral bus, and a bus use of any of local bus structure. For example, but will not be restricted, this kind of structure including industrial standard structure (ISA) bus, micro channel architecture (MCA) bus, enhanced ISA (EISA) bus, video electronics standards association (VESA) local bus and bus Mezzanine at the same time as the peripheral component interconnect (PCI) bus. Computer 110 typically includes a variety of computer-readable media, can be any kind of computer 110 can access the existing media, including volatile and nonvolatile media, removable and non-removable media. For example but will not be restricted, computer-readable media comprises computer storage media and communication media. Computer storage media includes any method and a technique to realize the surface of a volatile and nonvolatile, removable and non-removable media, is used for storing computer-readable instructions, data structures, program modules or other data. Computer-storage media including, but not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical disk storage, cassette tape, magnetic tape, magnetic disk storage or other magnetic storage device, or any other which can be used for storage and the needed information for computer 110 access of the media. Communication media general implementation of computer-readable instructions, data structures, program modules or data modulated carrier signal, or other transmission mechanism in the other data, including any information providing medium. "Modulated data signal" indicative of a has its one set or a plurality of sets of characteristic or with the coded signal way change of information in the signal. For example but will not be restricted, communication media includes wired or direct wire connection, wired media and acoustic, RF, infrared and other wireless media and other wireless media. A combination of any of the above are also included within the scope of the computer-readable media. System memory 130 includes volatile and/or in a non-volatile memory of the computer in the form of a storage medium, such as read-only memory (ROM) 131 and a random access memory (RAM) 132. The basic input/output system 133 (BIOS) stored in the ROM131 generally in, it includes basic sub-routines to assist the computer 110 in the information transmission between the unit, such as during start-up. RAM132 usually contains the data and/or program module, can be directly accessed and/or by the processing unit 120 to its role. For example but will not be restricted, Figure 1 has shown the operating system 134, application programs 135, other program modules 136, and program data 137. The computer 110 may also include other removable/non-removable, volatile/non-volatile computer storage media. For example only, Figure 1 shows 3 to the non-detachable non-volatile magnetic media read or write hard disk drive 141, the detachable non-volatile magnetic disk 152 reading or writing to disk driver 151, and CD-ROM, the detachable non-volatile optical disk 156 or other optical media for reading or writing optical disk drive 155. Working environment can be used for example other removable/non-removable, volatile/non-volatile computer storage media, including, but not limited to, a cassette tape, flash memory card, digital records common, digital videotapes, solid RAM, such as solid-state ROM. The hard disk drive 141 general through the interface 140 are not detachable memory interface connected to the system bus 121, and disk driver 151 and optical disk drive 155 usually rely on interface 150 and other removable memory interface connected to the system bus 121. The above-mentioned Figure 1 indicated by the driver and its relevant computer-storage media, the computer 110 memory of the computer-readable instructions, data structures, program modules and other data. Such as in Figure 1 in, the hard disk drive 141 is illustrated as storing operating system 144, application programs 145, other program modules 146, and program data 147. Note, these parts can be the operating system 134, application programs 135, other program modules 136, and program data 137 the same or different. Here the operating system 144, application programs 145, other program modules 146, and program data 147 giving different label, at least to indicate they are different copies. The user through the input device and the order information is input into the computer 110, such as a keyboard 162 and often called mouse, track ball or touch panel of the directional device 161. Other input device (not shown) including a microphone, joystick, game board, meditech astrolabes, scanners. Often through various input device coupled to the system bus 121 a user input interface 160 is connected to the processing unit 120, can also be used other interface and bus structure such as parallel mouth , or game port connection such as universal Serial bus (USB). Through the video interface 190 interfaces, will also monitor 191 or other type of display device is connected to the system bus 121. In addition to the monitor, the computer also comprises a loudspeaker 197 and the printer 196 and other peripheral output device, can be through an output peripheral interface 195 is connected. The present invention is of particular importance, also will be able to shoot a series of Image 164 of the camera 163 (such as a digital/electronic still or video camera, or film/photographic scanner) including the personal computer 110 of the input device. Moreover, although only shown one camera, but can include a plurality of used as a personal computer 110 of the input device of the camera. From one or a plurality of the images of the video camera 164 by the relevant camera interface 165 input computer 110, interface 165 is connected with the system bus 121, the RAM132 so that Image transmitted to and stored in the computer or a 110 associated other data storage device. However, to be pointed out that, from the above-mentioned Image data can also be input into the computer any computer-readable media 110, without the required use of the camera 163. By using the one or more remote computers such as remote computer 180 logical connection, computer 110 may be operating in a network environment. Remote computer 180 can be a human-computer, server, router, network PC, the same device or other public network node, generally include many or all of the above computer 110 unit described, although Figure 1 shows only a memory storage device 181. Figure 1 comprises a logical connection in the local area network (LAN) 171 and a wide area network (WAN) 173, but may also include other network. Such network environment in offices, enterprise computer network, intranet and is very common in the Internet. In the use of the LAN network environment, the computer 110 after net interface or adapter 170 is connected with the LAN171. WAN use in the network environment, the computer 110 generally comprises a WAN173 such as the Internet the establishment of a communication modem 172 or other device, and the modulation demodulator 172 can be internal or external, through the user input interface 160 or other appropriate mechanism is connected with the system bus 121. In the network environment, the computer 110 or part of a program module shown, may be stored in a remote memory storage device. For example but will not be restricted, Figure 1 shows the remote application logic 85 residing on memory device 181. The illustrated network connection is clearly illustrative, the computer can be used between other device for establishing a communication link. Now the work environment of the example discussed, the remaining part of the description will focus on describing the embodiment of the present invention program module. 3.0 integrated omnidirectional camera and microphone array Acoustic transparent rod discussed in this section is connected with the integrated full to the camera and the microphone array. 3.1 overview The invention solves the problem that the omnidirectional camera and microphone array integrated on the optimization of the. The design goal is to: 1. Microphone array design should provide a from the speaker to the microphone in the array of all smooth path, if it cannot be done, using any of the sound source location and the beam forming algorithm becomes extremely difficult, the deterioration result. 2. Microphone array design should be as far as possible from the microphone is adjacent to the desktop or other surface, in order to prevent sound from the surface reflection, otherwise any deterioration of a sound localization and beam forming results. 3. The camera array should be raised in order to provide near the front face of the the like. Camera array should be small not noticeable. 4. The geometric size of the microphone array (microphone the position and distance between the) should allow audio processing algorithm to realize good beam shapes, speakers directed in the working frequency band, to provide high-quality sound. 3.2 integrated design Integrated camera and microphone array application of the microphone base of the fuse is connected to the camera array. The audio frequency range in the er (50-400Hz), this stake is not visible in acoustics. As shown in Figure 2, integrated camera and microphone array 202 a predetermined the table 204 of the central. This design provides one from any of the designated speaker or sound source to all smooth path of the microphone, the microphone array is close to the desktop, to prevent sound from table reflection caused by multi-path problem. Furthermore, the design of the camera is elevated from the table, for all the participants are provided with positive video or near-like. Integrated camera and microphone array ensures good beam shapes, through the filter only the sound from one direction, the sound quality of the speaker can be improved. Furthermore, camera and microphone integrated characteristic does not require repeated correction, which is advantageous. Because the camera and microphone integrated can be a single device, so only need to a time of initial correction. And because the integrated camera and microphone is a small fixed design, therefore, the abstractor independent cable and the table and the space outside the separation of the two kinds with a camera and microphone, the more accessible. 3.3 system components Integrated omnidirectional camera and microphone array a instantiated in Figure 3A and 3B, the design of the components comprises an omnidirectional camera 302, cylinder 304, microphone base 306, microphone 308, microphone preamplifier 310 analog/digital (A/D) converter (not shown). 3.3.1 omni-directional camera Omnidirectional camera technology has a plurality of, a camera-type in which a plurality of video sensors closely assembled together in back-to-back manner. Another omnidirectional camera type application a single video sensor, of the double-curved surface lens capture 360° of light. The invention of the integrated camera and microphone array design can be applied to any such omnidirectional camera. Preferably, camera head 302 in a small Conference table or other surface without prejudice. If the multi-sensor camera configuration, can be applied to a plurality of camera or video sensing, preferably 8 a. These sensor preferably the back-to-back arranged, so that the sensor is separated from the projection center equiangular distance. For example, if the 8 only sensor, then each sensor is separated from the sensor with the adjacent 45°. However, if photographing different distance to the Image, can be applied to different lens and different camera placement, such as the table is rectangular or elliptical so. A longer distance, can be used for the lens of the narrow and long field-of-view, a relatively short distance, short of the wide field of view can be used for capturing an Image of the lens, at this moment, the camera sensor can be not-around the camera head, and the like. Wide field of view of the camera sensor can be from the narrow field of view of the camera sensor more far. Or, also can be applied to variable field of view of the camera (rotating and pushing and pulling the zoom lens and transferred to the designated state). Figure 4 the working embodiment of the invention, the design of the multi-sensor dense micro , wherein the a plurality of subminiature camera sensor 402 around a circular camera head 404 surrounding the back-to-back configuration. Camera 404 is provided with a sound-absorbing material preferably at the bottom, prevent the sound from the table to be reflected to the camera and microphone. Using this example 8 only camera sensor. This kind of dense micro design of the back-to-back the camera sensor are tightly packed, Image stitching is improved. In this example, camera 404 have a diameter of about 50 mm, the camera is connected to the base of the cylinder 406 the height of about 10 inches, the height of the camera 404 arranged on the front side of all the participants can shoot like. Or, the camera 404 can be lower than the height, the camera sensor 402 to slightly upwards, can also make the camera will not cause illness. 3.3.2 cylinder With reference to Figure 3A and 3B embodiment, omnidirectional camera 302 attachment better the hollow cylinder 304, the latter is attached to the microphone base 306. The cylinder diameter D1 should be fine enough to allow the cylinder 304 corresponding to the frequency of the predetermined application is transparent in acoustics. Preferably, the camera cable 304 laying do not cause any additional barrier. The cylinder height H1 should be high enough to to sit around the table for each person of the positive Image, but is not high such that the camera appears to be prominent. Column height can be adjusted can also be selected to be, such as in television Conference occasions, which can be used as the height of the table and participants high only. Re-reference picture 3A and 3B embodiment of the work of the, cylinder 304 of the diameter D1 to 2 cm or less than, the corresponding to the human voice about 50-4000Hz audio from either speaker through with substantially no interference to the base of the column in all microphone. In this example the column high-H1 to 14 cm. 3.3.3 microphone base Microphone base normally kept microphone, the microphone pre-amplifier and A/D converter, which is connected to the cylinder, is connected with outlet cables provide the video camera. Microphone base is low profile, in order to minimize the distance between the table top and the microphone. The base has the microphone and is connected with the passage. In Figure 3A and 3B integrated camera and microphone array of the working embodiment, microphone base 306 diameter D3 than the camera 302 diameter D2 wider, the integrated camera and microphone array has a stability, preventing easy to be overturned. Base height H3 is relatively small, the size is preferably low enough to make the embedded microphone 308 maintain close to the desktop and avoid multi-path problems, but high enough to allow the microphone 308 embedded base. In the working example, D3 is 16 cm, H3 to 1.5 cm, H2 to 6 cm, D2 is 10 cm. 3.3.4 microphone The use of the microphone can be omni-directional or wherein unidirectional, but preferably omnidirectional microphone, because they are to all concerned by a give of the response. The minimum required for microphone number is 3 a, but the embodiment of the invention, the 8 only, aimed at improving the positioning accuracy of the sound source, better beam forming and durability of the entire audio system. In order to reduce the noise, the microphone into rubber box , similarly the microphone sound insulation material are arranged. Re-reference picture 3A and 3B embodiment of the work, a microphone 308 around the planar microphone base 306 resettlement on the side of the circumference of the. Application of this example 8 only microphone 308. General, the more the microphone use, full to the audio frequency coverage and signal-to-noise ratio, the better, but we must consider a large number of the cost of the microphone and complexity. Furthermore, if the microphone more, the audio signal processing is more complex. In Figure 3A and 3B in the working embodiment, the cylinder 306 and each microphone 308 of the center distance of the D5 to 7 cm. 3.3.5 microphone pre-amplifier, A/D converter Microphone preamplifier 310 analog/digital (A/D) converter (not shown) preferably collected into the microphone base 306, as shown in Figure 3B illustrated. In this example, pre-amplifier 310 of the width D4 is 5.901 cm. Microphone pre-amplifier amplifies the signal from microphone, A/D converter to the normalized amplitude of the signal. A/D converter from the camera analog signal into a digital signal. In this example, the signal from the microphone in the mutual synchronous signal sample 1 USEC inner, in order to facilitate the sound source position and beam-forming. 3.4 secure mode Lens cover camera application, open in the normal mode of operation, in the privacy mode closed. In the closing or in the security mode of the sensor of the camera shutter, or in electronic technology into the camera shut-down. When the wake-up security mode, the microphone are also preferably closed. During recording, the camera dome lamp shine, lets the user know the video camera is in the work. When the secure mode is switched on, lamp off. 4.0 embodiment of other modular is characterized in Integrated omnidirectional camera and microphone designed various other embodiment is feasible, part of the reason is that the modular features of the system. For example, may be applied to various video camera embodiment. In one embodiment, use of the omnidirectional camera application 360° camera covering a plurality of the video sensor. In another embodiment of the present invention in, to the camera application of the use of a video sensor and a capture 360° panoramic coverage of light in order to realize the double-curved surface lens. Moreover, these camera can be used in the acoustic transparent column stick rises on its own, provide positive images to the participants. These camera can be integrated and microphone array. Or, other omnidirectional camera design can also be combined with the use of column stick and/or the microphone array. Similarly, the configuration of the microphone. In one embodiment, the microphone array comprises various microphones around the periphery of the edge distance is provided, close to the desktop as far as possible, any speaker box the smooth path. However, other microphone configuration method can be a transparent rod with acoustics integrate with the camera. Or, the just discussed can be applied to the microphone array, without camera to realize optimized 360° acoustic covering. This kind of covering the reduction or elimination of the multi-path problem, the sound source position and beam-forming is particularly useful. Integrated camera and microphone array in one embodiment, Image stitching in the PC as the compression. Another embodiment is the camera in order to field programmable gate array (FPGA) or other gate array implementation with a compressed Image stitching, the USB interface for this design, the camera and the PC interface, and to use more PC CPU cycle perform other tasks, such as an Image compression and recording and playing session. 5.0 exemplary working embodiment Figure 5 shows the application of the invention the camera 502 and the microphone array 504 embodiment of a, camera 502 Image output and a microphone array 504 audio output, analog/digital converter 506 is transmitted to the computer 508, the latter performing various functions in order to enhance and utilize the Image with the audio input, such as panoramic filter module 510 the omnidirectional camera 502 various sensors in the picked-up Image stitching together. Furthermore, the Image data is compression module 512 compressed, make it more suitable for on the Internet (such as the Internet) broadcast 514 or saved to computer readable media 516 (preferably through the separator 520). Or, Image data input can also be individual detector/tracker module 522, in order to improve camera management (524), such as an Image/video in part containing the speaker is identified, the associated with an audio signal, the speaker's camera/sensor to the photographed Image will be broadcast or keep on the disc. The audio input can also be used, such as the audio input sound source localization module 526, isolation from the speakers of the audio. The computer 508 can be used in beam forming module 528 to improve the beam shape of the audio. Noise reduction with the automatic gain control module 530 by reducing noise and gain adjusted, indoor background noise in order to better capture from the speakers of the audio signal, the signal-to-noise ratio can be improved. As mentioned above, can be broadcast to the other audio and video signals of a television Conference site or the Internet, can also be saved to a computer readable medium for later viewing. The above-mentioned of the invention is used for an example and a note, but not without omission, or the invention limited to the precise form disclosed, such as the above-mentioned integrated camera and microphone array system can be used for monitoring zhu Li. According to the above-mentioned explanation can be made in various modifications. Scope of the present invention is not limited by the detailed description, and as defined by the annexed claim. The patent refers to the field of 'pictorial communication,'. An omni-directional camera (a 360 degree camera) is proposed with an integrated microphone array. The primary application for such a camera is videoconferencing and meeting recording, and the device is designed to be placed on a meeting room table. The microphone array is in a planar configuration, and the microphones are located as close to the desktop as possible to eliminate sound reflections from the table. The camera is connected to the microphone array base with a thin cylindrical rod, which is acoustically invisible to the microphone array for the frequency range 50-4000��1 Hz. This provides a direct path from the person talking to all of the microphones in the array, and can therefore be used for sound source localization (determining the location of the talker) and beam-forming (improving the sound quality of the talker by filtering only sound from a particular direction). The camera array is elevated from the table to provide a near frontal viewpoint of the meeting participants. 1. A capture audio-video data system, which is characterized by comprising: One or more capturing a video Image data of the camera; The microphone array that captures audio signal; and The one or more cameras to the microphone array of the cylinder, the said cylinder is within the range of sound frequency er acoustic transparent. 2. System according to Claim 1, characterized in that said one or a plurality of cameras in the 360° shooting of the Image. 3. System according to Claim 1, characterized in that also comprises a microphone array base, wherein the microphone attached to the microphone array base. 4. System according to Claim 3, characterized in that the height of the microphone array base will be small enough the microphone array is positioned to minimize the array from any surface for picking up sound reflection. 5. System according to Claim 1, characterized in that the cylinder in the acoustics transparent frequency range is 50-4000Hz. 6. System according to Claim 1, characterized in that the audio signal input into a computer, said computer using said audio signals for sound source localization, said sound source localization determining the direction of the sound generating an audio signal. 7. System according to Claim 1, characterized in that the audio signal input into a computer, said computer using said audio signal does ties form, the beam is formed by filtering the sound from all other directions, from one direction to improve the sound of the tone quality. 8. System according to Claim 1, characterized in that the audio signal input into a computer, the computer personnel detection and tracking algorithm precision detection of the video Image data are captured in, and the person with the speakers of the video Image data is associated with the audio signal. 9. System according to Claim 1, characterized in that the video Image data input computer, said computer using said video Image data stitching Image. 10. System according to Claim 1, characterized in that the microphone in the microphone array are omni-directional. 11. System according to Claim 1, characterized in that the microphone in the microphone array are unidirectional. 12. System according to Claim 1, characterized in that the microphone array containing 3 only microphone. 13. System according to Claim 1, characterized in that the microphone array containing 8 only microphone. 14. System according to Claim 1, characterized in that only the microphone array around the plane containing the circular in the circumference of the base of the microphone, the microphone is arranged on the edge. 15. System according to Claim 14, characterized in that the at least one microphone into the rubber box to prevent in external sound reflection. 16. System according to Claim 14, characterized in that in at least one microphone and the sound insulation material, to prevent external sound reflection. 17. System according to Claim 1, characterized in that the camera also includes a lens cover, upward in the normal mode of operation, in the secure mode. 18. System according to Claim 17, characterized in that the microphone of the camera is in the wake is cut off when the privacy mode. 19. System according to Claim 1, characterized in that on-camera lamp illuminate the camera work. 20. System according to Claim 1, characterized in that an audio signal transmitted on the Internet. 21. System according to Claim 1, characterized in that the audio signal stored on the computer-readable media. 22. System according to Claim 1, characterized in that the video Image data transmitted on the Internet. 23. System according to Claim 1, characterized in that the video Image data is stored into computer readable media. 24. System according to Claim 1, characterized in that video signal with a 1394 bus is transmitted to the computer. 25. System according to Claim 1, characterized in that the cable with the analog audio signal is transmitted to the computer. 26. System according to Claim 1, characterized in that the camera is IEEE1394 camera. 27. A for Conference call and meeting recording capture audio/video processing, which is characterized by comprising the following process action: Omnidirectional camera for shooting participants Image, The microphone array for capturing the sound in the audio signal, the microphone array is mounted in the base, the latter is transparent with acoustics the stick meets to the omnidirectional camera. 28. Processing according to Claim 27, characterized in that the microphone array placed on a table, the participants on the seat around the table. 29. Processing according to Claim 28, characterized in that the microphone array as far as possible close to the desk, in order to minimize sound reflections from the table the array pick-up. 30. Processing according to Claim 27, characterized in that the microphone array comprises 3 or more microphone. 31. Processing according to Claim 30, characterized in that the microphone array comprises near the circumference of the base of the microphone in the microphone of equal intervals. 32. Processing according to Claim 27, characterized in that the sensor includes a plurality of video camera. 33. Processing according to Claim 32, characterized in that the video sensor around a circumference are arranged at equal intervals. 34. Processing according to Claim 27, characterized in that comprises a single omnidirectional camera is provided with a capture 360° light double-curved surface of the lens of the video sensor. 35. Processing according to Claim 27, characterized in that said acoustically transparent rod is hollow. 36. Processing according to Claim 27, characterized in that the acoustic transparent the stick attaches is connected with the microphone base, video camera through the acoustic transparent rod of the cable, so the cable does not cause the sound barrier. 37. Processing according to Claim 27, characterized in that the acoustic transparent rod to the outer diameter of 2 cm or less. 38. Processing according to Claim 27, characterized in that the diameter of the acoustic transparent rod from the speaker's 50-4000Hz audio bypass acoustic transparent rod reach almost interference-free all of the microphone in the microphone array. 39. Acoustic transparent rod according to Claim 27, characterized in that the stick height adjustable. 40. An integrated omnidirectional camera and the microphone array, which is characterized by comprising: Omnidirectional camera; Acoustic transparent rod, for efficiently optimize the camera and camera cover; and The microphone array. 41. Integrated omnidirectional camera and microphone array according to Claim 40, characterized in that the omnidirectional camera using a plurality of video sensors to achieve 360° camera coverage. 42. Integrated omnidirectional camera and microphone array according to Claim 40, characterized in that the omnidirectional camera application of a video sensor and a capture 360° panoramic coverage of light to realize double curved lens. 43. Integrated omnidirectional camera and microphone array according to Claim 40, characterized in that the acoustic transparent column stick camera efficiently, took to sit around tables provide positive Image. 44. Integrated omnidirectional camera and microphone array according to Claim 40, characterized in that only the microphone array comprises a plurality are arranged around the circumference of the edge distance, such as a microphone. 45. Integrated omnidirectional camera and microphone array according to Claim 40, characterized in that the surface of the microphone array, and to close to the surface in order to reduce the sound emission from the surface. 46. Integrated camera and microphone array according to Claim 41, characterized in that a narrow field-of-view for a video sensor of relatively long distance shooting the Image, for a relatively short wide field of view of the video sensor to shoot shorter distance Image. 47. An omnidirectional microphone array, which is characterized by comprising: The only around the plane of the circular in the circumference of the base of the microphone, the microphone is arranged on the edge; and wherein The microphone base low enough to the array positioning said microphone from the microphone base to reduce as far as possible any surface of the wall of pick-up of the array of acoustic reflection. 48. The microphone array according to Claim 47, characterized in that the plurality of microphone only the captured audio signal input into a computer, said computer using said audio signals for sound source localization, said sound source localization determining the direction of a sound of the audio signal. 49. The microphone array according to Claim 47, characterized in that the plurality of microphone only the captured audio signal input into a computer, said computer using said audio signal does ties form, the beam is formed by filtering the sound from all other direction from the one direction to improve the sound quality of the sound. 50. The microphone array according to Claim 47, characterized in that the microphone of the microphone array for the entire facing. 51. The microphone array according to Claim 47, characterized in that the microphone is in the microphone array wherein unidirectional. 52. The microphone array according to Claim 47, characterized in that the microphone array containing 8 only microphone. 53. The microphone array according to Claim 47, characterized in that the at least one microphone in rubber box to prevent external sound reflection. 54. The microphone array according to Claim 47, characterized in that in at least one microphone and the sound insulation material, to prevent external sound reflection.