Method and apparatus for retrieving multimedia data through spatio-temporal activity maps