Discovering irrelevance in the blogosphere through blog search

M. Atif Qureshi, Arjumand Younus, Nasir Touheed, M. Shahid Qureshi, Muhammad Saeed

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Web 2.0 technologies have given birth to the blogosphere, an information-sharing medium by users, for users. These technologies have also extended the search problem to a new form of search known as blog search. As with Web search, blog search is affected by spam, which degrades the quality of search results. This paper addresses the problem of blog relevance in the top search results for general topic queries. It studies irrelevant blogs appearing in the top search results of Google Blog Search for blogspot domains. We define metrics for irrelevant blogs by observing the qualitative relevance of their content and by analyzing their link structure. Our preliminary results show an overall recall of 0.875 with a precision of 1.0 for finding irrelevant blogs in the top 15 search results for six general topic queries on Google Blog Search.

Original language: English
Title of host publication: Proceedings - 2011 International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2011
Pages: 457-460
Number of pages: 4
Publication status: Published - 2011
Externally published: Yes
Event: 2011 International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2011 - Kaohsiung, Taiwan
Duration: 25 Jul 2011 to 27 Jul 2011

Publication series

Name: Proceedings - 2011 International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2011

Conference

Conference: 2011 International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2011
Country/Territory: Taiwan
City: Kaohsiung
Period: 25/07/11 to 27/07/11

Keywords

  • Blog search
  • Blogosphere
  • Content-based
  • Irrelevance
  • Link structure
  • Splogs
