Exploring educational standard alignment: in search of 'relevance'

Authors:
René Reitsma;Byron Marshall;Michael Dalton;Martha Cyr
Affiliations:
Oregon State University, Corvallis, OR, USA;Oregon State University, Corvallis, OR, USA;Oregon State University, Corvallis, OR, USA;Worcester Polytechnic Institute, Worcester, MA, USA
Venue:
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Year:
2008

Abstract

The growing availability of online K-12 curriculum is increasing the need for meaningful alignment of this curriculum with state-specific standards. Promising automated and semi-automated alignment tools have recently become available. Unfortunately, recent alignment evaluation studies report low inter-rater reliability, e.g., 32% with two raters and 35 documents. While these results are in line with studies in other domains, low reliability makes it difficult to accurately train automatic systems and complicates comparison of different services. We propose that inter-rater reliability of broadly defined, abstract concepts such as 'alignment' or 'relevance' must be expected to be low due to the real-world complexity of teaching and the multidimensional nature of the curricular documents. Hence, we suggest decomposing these concepts into less abstract, more precise measures anchored in the daily practice of teaching. This article reports on the integration of automatic alignment results into the interface of the TeachEngineering collection and on an evaluation methodology intended to produce more consistent document relevance ratings. Our results (based on 14 raters × 6 documents) show high inter-rater reliability (61–95%) on less abstract relevance dimensions, while scores on the overall 'relevance' concept are (as expected) lower (64%). Despite a relatively small sample size, regression analysis of our data resulted in an explanatory (R² = .75) and statistically stable (p-values