Expressing implicit semantic relations without supervision

From National Research Council Canada

Download	View accepted manuscript: Expressing implicit semantic relations without supervision (PDF, 263 KiB)
DOI	Resolve DOI: https://doi.org/10.3115/1220175.1220215
Author	Search for: Turney, Peter¹
Affiliation	National Research Council of Canada. NRC Institute for Information Technology
Format	Text, Article
Conference	21rst International Committee on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL 2006), July 17-21, 2006, Sydney, Australia
Abstract	We present an unsupervised learning algorithm that mines large text corpora for patterns that express implicit semantic relations. For a given input word pair X : Y with some unspecified semantic relations, the corresponding output list of patterns (P₁,…,Pₘ) is ranked according to how well each pattern Pᵢ expresses the relations between X and Y. For example, given X = ostrich and Y = bird, the two highest ranking output patterns are "X is the largest Y" and "Y such as the X". The output patterns are intended to be useful for finding further pairs with the same relations, to support the construction of lexicons, ontologies, and semantic networks. The patterns are sorted by pertinence, where the pertinence of a pattern Pᵢ for a word pair X : Y is the expected relational similarity between the given pair and typical pairs for Pᵢ. The algorithm is empirically evaluated on two tasks, solving multiple-choice SAT word analogy questions and classifying semantic relations in noun-modifier pairs. On both tasks, the algorithm achieves state-of- the-art results, performing significantly better than several alternative pattern ranking algorithms, based on tf-idf.
Publication date	2006-07
Publisher	Association for Computational Linguistics
In	Proceedings of the 21rst International Committee on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL 2006) (July 2006): 313–320.
Language	English
NRC number	NRCC 48761
NPARC number	8914077
Export citation	Export as RIS
Report a correction	Report a correction (opens in a new tab)
Record identifier	1eae05bc-a516-48b4-be8c-d48f7b722065
Record created	2009-04-22
Record modified	2024-11-04

Date modified:: 2025-05-11