How to get the WordNet synset given an offset ID?

As of NLTK 3.2.3, there's a public method for doing this:

wordnet.synset_from_pos_and_offset(pos, offset)

In earlier versions you can use:

wordnet._synset_from_pos_and_offset(pos, offset)

This returns a synset based on it's POS and offest ID. I think this method is only available in NLTK 3.0 but I'm not sure.

Example:

from nltk.corpus import wordnet as wnwn.synset_from_pos_and_offset('n',4543158)>> Synset('wagon.n.01')

python python-2.7 nlp nltk wordnet

For NTLK 3.2.3 or newer, please see donners45's answer.

For older versions of NLTK:

There is no built-in method in the NLTK but you could use this:

from nltk.corpus import wordnetsyns = list(wordnet.all_synsets())offsets_list = [(s.offset(), s) for s in syns]offsets_dict = dict(offsets_list)offsets_dict[14204095]>>> Synset('heatstroke.n.01')

You can then pickle the dictionary and load it whenever you need it.

For NLTK versions prior to 3.0, replace the line

offsets_list = [(s.offset(), s) for s in syns]

with

offsets_list = [(s.offset, s) for s in syns]

since prior to NLTK 3.0 offset was an attribute instead of a method.

python python-2.7 nlp nltk wordnet

You can use of2ss(), For example:

from nltk.corpus import wordnet as wnsyn = wn.of2ss('01580050a')

will return Synset('necessary.a.01')

CodeHunter

How to get the WordNet synset given an offset ID?

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last