Blog Mining for the Fortune 500

Authors:
James Geller;Sapankumar Parikh;Sriram Krishnan
Affiliations:
College of Computing Sciences, Department of Computer Sciences, New Jersey Institute of Technology, Newark, NJ 07102,;College of Computing Sciences, Department of Computer Sciences, New Jersey Institute of Technology, Newark, NJ 07102,;College of Computing Sciences, Department of Computer Sciences, New Jersey Institute of Technology, Newark, NJ 07102,
Venue:
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Year:
2007

Citing 2
Cited 1

Taking the community's pulse: one blog at a time

ICWE '06 Proceedings of the 6th international conference on Web engineering
Latent Friend Mining from Blog Data

ICDM '06 Proceedings of the Sixth International Conference on Data Mining

Textual factors in online product reviews: a foundation for a more influential approach to opinion mining

Electronic Commerce Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years there has been a tremendous increase in the number of users maintaining online blogs on the Internet. Companies, in particular, have become aware of this medium of communication and have taken a keen interest in what is being said about them through such personal blogs. This has given rise to a new field of research directed towards mining useful information from a large amount of unformatted data present in online blogs and online forums. We discuss an implementation of such a blog mining application. The application is broadly divided into two parts, the indexing process and the search module. Blogs pertaining to different organizations are fetched from a particular blog domain on the Internet. After analyzing the textual content of these blogs they are assigned a sentiment rating. Specific data from such blogs along with their sentiment ratings are then indexed on the physical hard drive. The search module searches through these indexes at run time for the input organization name and produces a list of blogs conveying both positive and negative sentiments about the organization.