Machine learning in building a collection of computer science course syllabi

  • Authors:
  • Nakul Rathod;Lillian N. Cassel

  • Affiliations:
  • Department of Computing Sciences, Villanova University, Villanova, PA;Department of Computing Sciences, Villanova University, Villanova, PA

  • Venue:
  • TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Syllabi are rich educational resources. However, finding Computer Science syllabi on a generic search engine does not work well. Towards our goal of building a syllabus collection we have trained various Decision Tree, Naive-Bayes, Support Vector Machine and Feed-Forward Neural Network classifiers to recognize Computer Science syllabi from other web pages. We have also trained our classifiers to distinguish between Artificial Intelligence and Software Engineering syllabi. Our best classifiers are 95% accurate at both the tasks. We present an analysis of the various feature selection methods and classifiers we used hoping to help others developing their own collections.