AUTOMATIC EXTRACTION OF CITATIONS FROM THE TEXT OF ENGLISH-LANGUAGE PATENTS - AN EXAMPLE OF TEMPLATE MINING

Citation
M. Lawson et al., AUTOMATIC EXTRACTION OF CITATIONS FROM THE TEXT OF ENGLISH-LANGUAGE PATENTS - AN EXAMPLE OF TEMPLATE MINING, Journal of information science, 22(6), 1996, pp. 423-436
Citations number
24
Categorie Soggetti
Information Science & Library Science","Information Science & Library Science","Computer Science Information Systems
ISSN journal
01655515
Volume
22
Issue
6
Year of publication
1996
Pages
423 - 436
Database
ISI
SICI code
0165-5515(1996)22:6<423:AEOCFT>2.0.ZU;2-D
Abstract
Methods for automatically isolating and extracting bibliographic refer ences from the full texts of patents are described and evaluated; thes e include citations both to patents and to other bibliographic sources . Patents are unusual as citing documents in that citations occur prin cipally in the text of the abstracts or description parts of tbe docum ents, rather than as footnotes or in separate sections. A template min ing approach has been developed for this purpose, to relieve patent ex aminers of the chore of doing this manually. The sub-languages of cita tions in patents are examined, and the development of templates for th e extraction of citations to patents, journal articles, books and othe r sources in English-language patents described, as well. as the evalu ation of the degree of success of the approach.