Spatiotemporal semantic model for multimedia presentations, multimedia database searching, and multimedia browsing

Shu-Ching Chen, Purdue University

Abstract

As more information sources become available in multimedia systems, the development of abstract semantic models for video, audio, text, and image data becomes very important. An abstract semantic model has two requirements. First, it should be rich enough to provide a friendly interface of multimedia presentation synchronization schedules to the users. Second, it should be a good programming data structure for implementation to control multimedia playback. An abstract semantic model based on an augmented transition network (ATN) is presented. The inputs for ATNs are modeled by multimedia input strings. Multimedia input strings provide an efficient means for iconic indexing of the temporal/spatial relations of media streams and semantic objects. An ATN and its subnetworks are used to represent the appearing sequence of media streams and semantic objects. The arc label is a substring of a multimedia input string. In this design, a presentation is driven by a multimedia input string. Each subnetwork has its own multimedia input string. Database queries relative to text, image, and video can be answered via substring matching at subnetworks. Subnetworks also can be some existing multimedia presentations to be embedded in other presentations to make module design possible in a multimedia authoring environment. The conditions are checked to see whether certain criteria are satisfied. If they are, a set of corresponding actions are activated. Multimedia browsing allows users the flexibility to select any part of the presentation they prefer to see. This means that an ATN and its subnetworks can be included in a multimedia database system which is controlled by a database management system (DBMS). User interactions and loops are also provided in an ATN. Therefore, ATNs provide three major capabilities: multimedia presentations, temporal/spatial multimedia database searching, and multimedia browsing. A web temporal model (WTM) is constructed after the temporal relations are obtained. Another web spatial model (WSM) is also developed to capture the spatial relations of the semantic objects. Thus our model can integrate temporal and spatial relations, and content information into one framework.

Degree

Ph.D.

Advisors

Kashyap, Purdue University.

Subject Area

Computer science|Electrical engineering

Off-Campus Purdue Users:
To access this dissertation, please log in to our
proxy server
.

Share

COinS