Image interpretation using large corpus: Wikipedia

Mandar Rahurkar, Shen Fu Tsai, Charlie Dagli, Thomas S. Huang

研究成果: 雜誌貢獻期刊論文同行評審

5 引文 斯高帕斯(Scopus)

摘要

Image is a powerful medium for expressing one's ideas and rightly confirms the adage, One picture is worth a thousand words. In this work, we explore the application of world knowledge in the form of Wikipedia to achieve this objectiveliterally. In the first part, we disambiguate and rank semantic concepts associated with ambiguous keywords by exploiting link structure of articles in Wikipedia. In the second part, we explore an image representation in terms of keywords which reflect the semantic content of an image. Our approach is inspired by the desire to augment low-level image representation with massive amounts of world knowledge, to facilitate computer vision tasks like image retrieval based on this information. We represent an image as a weighted mixture of a predetermined set of concrete concepts whose definition has been agreed upon by a wide variety of audience. To achieve this objective, we use concepts defined by Wikipedia articles, e.g., sky, building, or automobile. An important advantage of our approach is availability of vast amounts of highly organized human knowledge in Wikipedia. Wikipedia evolves rapidly steadily increasing its breadth and depth over time.

原文???core.languages.en_GB???
文章編號5484723
頁(從 - 到)1509-1525
頁數17
期刊Proceedings of the IEEE
98
發行號8
DOIs
出版狀態已出版 - 8月 2010

指紋

深入研究「Image interpretation using large corpus: Wikipedia」主題。共同形成了獨特的指紋。

引用此