Semantics Extraction From Multimedia Data: An Ontology-Based Machine Learning Approach

Part of the book series: Springer Series in Cognitive and Neural Systems ((SSCNS))

1631 Accesses
2 Citations

Abstract

It is often the case that related pieces of information lie in adjacent but different types of data sources. Besides extracting such information from each particular type of source, an important issue raised is how to put together all the pieces of information extracted by each source, or, more generally, what is the optimal way to collectively extract information, considering all media sources together. This chapter presents a machine learning method for extracting complex semantics stemming from multimedia sources. The method is based on transforming the inference problem into a graph expansion problem, expressing graph expansion operators as a combination of elementary ones and optimally seeking elementary graph operators. The latter issue is then reduced to learn a set of soft classifiers, based on features each one corresponding to a unique graph path. The advantages of the method are demonstrated on an athletics web-pages corpus, comprising images and text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Exploiting Disagreement Through Open-Ended Tasks for Capturing Interpretation Spaces

Extracting semantic knowledge from web context for multimedia IR: a taxonomy, survey and challenges

Article 25 July 2017

Content semantic image analysis and storage method based on intelligent computing of machine learning annotation

Article 05 February 2020

References

P. Aarabi and B.V. Dasarathy. Robust speech processing using multi-sensor multi-source information fusion – an overview of the state of the art. Information Fusion, 5(2):77–80, 2004.
Article Google Scholar
A. Goshtasby and S. Nikolov. Image fusion: Advances in the state of the art. Information Fusion, 8(2):114–118, 2007.
Article Google Scholar
N. Friedman, L. Getoor, D. Koller, and A. Pfeffer. Learning probabilistic relational models. In International Joint Conference on Artificial Intelligence, volume 16, pages 1300–1309. Citeseer, 1999.
Google Scholar
K. Kersting, L. De Raedt, and T. Raiko. Logical hidden markov models. Journal of Artificial Intelligence Research, 25(1):425–456, 2006.
Google Scholar
B.D. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. In International joint conference on artificial intelligence, volume 3, pages 674–679. Citeseer, 1981.
Google Scholar
T. Lukasiewicz. Probabilistic description logic programs. International Journal of Approximate Reasoning, 45(2):288–307, 2007.
Article Google Scholar
A.V. Nefian, L. Liang, X. Pi, X. Liu, and K. Murphy. Dynamic Bayesian Networks for Audio-Visual Speech Recognition. EURASIP Journal on Applied Signal Processing, 2002(11): 1274–1288, 2002.
Article Google Scholar
J. Neville and D. Jensen. Relational dependency networks. The Journal of Machine Learning Research, 8:692, 2007.
Google Scholar
S.E. Peraldi, A. Kaya, S. Melzer, R. Moller, and M. Wessel. Multimedia interpretation as abduction. In Proc. DL-2007: International Workshop on Description Logics, 2007.
Google Scholar
S. Petridis and N. Tsapatsoulis. Semantics Extraction from Multimedia Content: The BOEMIE Architecture. In Proceeding of the first international conference on Semantics and digital Media Technology (SAMT 2006), pages 6–8, 2006.
Google Scholar
M. Richardson and P. Domingos. Markov logic networks. Machine Learning, 62(1):107–136, 2006.
Article Google Scholar
S. Rudolph, T. Tserendorj, and P. Hitzler. What Is Approximate Reasoning? In Proceedings of the 2nd International Conference on Web Reasoning and Rule Systems, pages 150–164. Springer, 2008.
Google Scholar
U. Straccia. Reasoning within Fuzzy Description Logics. Journal of Artificial Intelligence Research, 14:137–166, 2001.
Google Scholar
E. Zavitsanos, G. Paliouras, G.A. Vouros, and S. Petridis. Discovering subsumption hierarchies of ontology concepts from text corpora. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, pages 402–408. IEEE Computer Society Washington, DC, USA, 2007.
Google Scholar

Download references

Acknowledgements

This study is partly supported by the research projects “BOEMIE, Bootstrapping Ontology Evolution with Multimedia Information Extraction”. FP6-027538/STREP, 2006–2009, http://www.boemie.org and “CASAM, Computer-Aided Semantic Annotation of Multimedia”. ICT-217061/STREP, 2008, http://www.casam-project.eu/

Author information

Authors and Affiliations

Institute of Informatics and Telecommunications, NCSR “Demokritos”, Patriarchou Grigoriou and Neapoleos St. GR-15310, Aghia Paraskevi, Attiki, Greece
Sergios Petridis

Authors

Sergios Petridis
View author publications
Search author on:PubMed Google Scholar
Stavros J. Perantonis
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Sergios Petridis .

Editor information

Editors and Affiliations

Department of Psychology, Boston University, Boston, 02215, USA
Vassilis Cutsuridis
Dept. Computing Science, University of Stirling, Stirling, FK9 4LA, United Kingdom
Amir Hussain
King's College London, Dept. Mathematics, University of London, London, WC2R 2LS, United Kingdom
John G. Taylor

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Petridis, S., Perantonis, S.J. (2011). Semantics Extraction From Multimedia Data: An Ontology-Based Machine Learning Approach. In: Cutsuridis, V., Hussain, A., Taylor, J. (eds) Perception-Action Cycle. Springer Series in Cognitive and Neural Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-1452-1_12

Download citation

DOI: https://doi.org/10.1007/978-1-4419-1452-1_12
Published: 31 December 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-1451-4
Online ISBN: 978-1-4419-1452-1
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics