Semantic Identification: Balancing between Complexity and Validity
EURASIP Journal on Advances in Signal Processing, volume 2006, Article number: 041716 (2006)
This work proposes an efficient scheme for identifying semantic entities within data sets such as multimedia documents, scenes, and signals. The expression of semantic entities in terms of syntactic properties is modelled with appropriately defined finite automata, which also model the identification procedure. Based on the structure and properties of these automata, the attained validity and certainty and the required complexity are formally defined as metrics of identification efficiency. The main contribution of the paper lies in organizing the identification and search procedure so that it maximizes validity for a bounded complexity budget and, conversely, minimizes computational complexity for a given required validity threshold. The associated optimization problem is solved using dynamic programming. Finally, a set of experiments provides insight into the introduced theoretical framework.
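The trade-off described above, maximizing validity under a bounded complexity budget, has the general shape of a knapsack problem. The following is a minimal sketch of that idea and not the paper's actual formulation: it assumes each candidate syntactic test has an integer complexity cost and contributes an additive validity gain, which is a simplification made here purely for illustration. The names `select_tests`, `costs`, `gains`, and `budget` are hypothetical.

```python
from typing import List, Tuple


def select_tests(costs: List[int], gains: List[float], budget: int) -> Tuple[float, List[int]]:
    """0/1 knapsack by dynamic programming.

    Assumption (not from the paper): validity gains are additive and
    complexity costs are non-negative integers.
    Returns the best total gain and the indices of the chosen tests.
    """
    n = len(costs)
    # best[i][b] = maximum total gain using only the first i tests with budget b
    best = [[0.0] * (budget + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost, gain = costs[i - 1], gains[i - 1]
        for b in range(budget + 1):
            best[i][b] = best[i - 1][b]  # option 1: skip test i-1
            if cost <= b and best[i - 1][b - cost] + gain > best[i][b]:
                best[i][b] = best[i - 1][b - cost] + gain  # option 2: take it

    # Walk the table backwards to recover which tests were taken.
    chosen: List[int] = []
    b = budget
    for i in range(n, 0, -1):
        if best[i][b] != best[i - 1][b]:
            chosen.append(i - 1)
            b -= costs[i - 1]
    chosen.reverse()
    return best[n][budget], chosen


if __name__ == "__main__":
    total, picked = select_tests(costs=[3, 4, 2], gains=[0.4, 0.5, 0.3], budget=5)
    print(total, picked)
```

Under the same assumptions, the converse problem (least complexity for a required validity threshold) could be answered by scanning the last row of the table for the smallest budget whose value meets the threshold.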