You are here

The Colloquial WordNet: Extending Princeton WordNet with Neologisms

Authors: 

John McCrae, Ian Wood, Amanda Hicks

Publication Type: 
Refereed Conference Meeting Proceeding
Abstract: 
Princeton WordNet is one of the most important resources for natural language processing, but has not been updated for over ten years and is not suitable for analyzing the fast moving language as used on social media. We propose an extension to WordNet, with new terms that have been found from Twitter and Reddit, and cover language usage that is emergent or vulgar. In addition to our methodology for extraction, we analyze new terms to provide information about how new words are entering the English language. Finally, we discuss publishing this resource both as linguistic linked open data and as part of the Global WordNet Association’s Interlingual Index.
Conference Name: 
LDK 2017
Digital Object Identifer (DOI): 
https://doi.org/10.1007/978-3-319-59888-8
Publication Date: 
19/06/2017
Conference Location: 
Ireland
Research Group: 
Institution: 
National University of Ireland, Galway (NUIG)
Open access repository: 
No
Publication document: