PRODUCING HIGH-DIMENSIONAL SEMANTIC SPACES FROM LEXICAL COOCCURRENCE

Authors
Citation
K. Lund et C. Burgess, PRODUCING HIGH-DIMENSIONAL SEMANTIC SPACES FROM LEXICAL COOCCURRENCE, Behavior research methods, instruments, & computers, 28(2), 1996, pp. 203-208
Citations number
21
Categorie Soggetti
Psychology, Experimental","Psychologym Experimental
ISSN journal
07433808
Volume
28
Issue
2
Year of publication
1996
Pages
203 - 208
Database
ISI
SICI code
0743-3808(1996)28:2<203:PHSSFL>2.0.ZU;2-Y
Abstract
A procedure that processes a corpus of text and produces numeric vecto rs containing information about its meanings for each word is presente d. This procedure is applied to a large corpus of natural language tex t taken from Usenet, and the resulting vectors are examined to determi ne what information is contained within them. These vectors provide th e coordinates in a high-dimensional space in which word relationships can be analyzed. Analyses of both vector similarity and multidimension al scaling demonstrate that there is significant semantic information carried in the vectors. A comparison of vector similarity with human r eaction times in a single-word priming experiment is presented. These vectors provide the basis for a representational model of semantic mem ory, hyperspace analogue to language (HAL).