Libraries, University of Nebraska-Lincoln

Library Philosophy and Practice (e-journal)

A Comparison of Term Clusters for Tokenized Words Collected from Controlled Vocabularies, User Keyword Searches, and Online Documents

Elaine Maytag Nowick, University of Nebraska-Lincoln
Daryl Travnicek, University of Nebraska-Lincoln
Kent M. Eskridge, University of Nebraska-Lincoln
Stephen Stein, University of Nebraska-Lincoln

ORCID IDs

Kent M. Eskridge

Date of this Version

2010

Document Type

Article

Abstract

Tokenized word terms were collected from three sources, all dealing with issues in water quality: controlled vocabulary headings, user keyword searches, and HTML documents. Distances were calculated between word pairs using the Jaccard formula. Distances from the three sources were compared using Spearman rank correlations, and clusters were calculated on distances transformed for non-normality using the SAS pseudo-centroid method. Word pair distances from controlled vocabularies were more closely correlated to keyword searches than document distances were to users’ keywords. The mean distance of controlled vocabularies was also closer to that of users. Clusters produced from the three sources were most similar for word pairs with small distances.
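
As a rough illustration of the word-pair distance mentioned in the abstract, the sketch below computes a Jaccard distance between two words from the sets of records in which each word occurs. The abstract does not say how occurrences were counted, so the set-of-record-IDs representation, the toy data, and the names `jaccard_distance` and `occurrences` are assumptions made for this example, not the authors' procedure.

```python
def jaccard_distance(a: set, b: set) -> float:
    """Return 1 - |A & B| / |A | B|; treated as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


# Hypothetical toy data: each word maps to the IDs of the records
# (headings, searches, or documents) that contain it.
occurrences = {
    "water": {1, 2, 3, 4},
    "quality": {2, 3, 4, 5},
    "nitrate": {4, 6},
}

# Words that share most of their records are close together.
d_water_quality = jaccard_distance(occurrences["water"], occurrences["quality"])

# Words that rarely co-occur are far apart.
d_water_nitrate = jaccard_distance(occurrences["water"], occurrences["nitrate"])

print(d_water_quality, d_water_nitrate)
```

Under this definition a distance of 0 means two words always appear together and a distance of 1 means they never do, which is one way to read the abstract's "word pairs with small distances".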

Included in

Library and Information Science Commons
