Measuring Similarity Between News Items Using Link Analysis and Semantic Approach

dc.contributor.advisor: Getahun, Fekade (PhD)
dc.contributor.author: Seged, Yemane
dc.date.accessioned: 2018-06-26T07:12:16Z
dc.date.accessioned: 2023-11-29T04:05:52Z
dc.date.available: 2018-06-26T07:12:16Z
dc.date.available: 2023-11-29T04:05:52Z
dc.date.issued: 2012-08
dc.description.abstract: In recent years, the way people acquire information has changed completely. Reading hardcopy materials such as books, journals, and newspapers has declined sharply, and most people go online to find recent, up-to-date information. As a result, news feed technologies such as RSS and Atom were created to give news users frequently updated information. However, the number of news items downloaded to the aggregator becomes unmanageable as the number of providers grows. This is even more annoying when some of the news items are similar to items already read. One possible solution to this challenge is to measure similarity among news items. Measuring similarity between news items is a prerequisite for a number of application areas, including grouping, clustering, merging, and revision/version control. Since news feeds are XML files, they have several sub-elements such as title, description/summary, link, guid, etc. Previously, item/entry sub-elements such as title and description/summary have been used as input in measuring similarity. In this work, we propose to use link sub-element information to improve and supplement the similarity computation between two items. As a news page contains links to a set of related news pages, our new similarity approach uses these links in measuring similarity. We developed new similarity measures that consider the link sub-element and related news links together with their anchor text. To validate our approach, we developed a prototype implementing the link-based news feed similarity measure. Experimental results show that the link-based news feed similarity is more helpful in measuring similarity when combined with similarity computed only from the title and description sub-elements, compared to using SimRank and co-citation. Keywords: similarity measure, link analysis, news feed, semantic similarity (en_US)
dc.identifier.uri: http://etd.aau.edu.et/handle/123456789/3502
dc.language.iso: en (en_US)
dc.publisher: Addis Ababa University (en_US)
dc.subject: Similarity Measure (en_US)
dc.subject: Link Analysis (en_US)
dc.subject: News Feed (en_US)
dc.subject: Semantic Similarity (en_US)
dc.title: Measuring Similarity Between News Items Using Link Analysis and Semantic Approach (en_US)
dc.type: Thesis (en_US)

Files

Original bundle
Name: Yemane Seged.pdf
Size: 517.12 KB
Format: Adobe Portable Document Format