Current Research Projects

Discovering Semantic Relations Between Words

Understanding language in any form requires understanding connections among words, concepts, phrases and thoughts. Many of the problems we face today in artificial intelligence depend in some way on understanding this network of relationships, which represent the facts that each of us knows about the world and how words relate to one another. When people communicate with each other, their conversation relies on many basic, unspoken assumptions, and they often learn the basis behind these assumptions long before they can write at all. Only traces of these assumptions are found in corpora.
Generative Lexicon and Common Sense Reasoning are related areas which are focused on learning such word meanings. This paradox of this vital unspoken information has lead to researchers in these fields to manually construct resources either using experts or volunteers from the internet. Such projects are major undertakings, using vast amounts of time and resources, but the resulting resources are invaluable to the field.
In my work, I hope to create a method, using principal component analysis, which can aid in the creation and bootstrapping of a large lexical resource by providing the human ontologist or volunteer contributer with a list of possible possible relations for each word in the system.

Top

The Common Sense Computing Initiative

In 1999, I began working on a project at the MIT Media Lab to collect common sense from volunteers on the internet. Since then, the Open Mind Common Sense project (OMCS) has expanded. The English site has from 700,000 sentences from over 15,000 contributers. There are OMCSes in Korean, Japanese, Portuguese, and Dutch.
With others in the Common Sense Computing Initative, I mantain the semantic network ConceptNet and work extensively on AnalogySpace. AnalogySpace makes rough conclusions about new common sense knowledge based on similarities and tendencies by forming the analogical closure of a semantic network through dimensionality reduction.

Top

The Brandeis Semantic Ontology

I'm also working on the Brandeis Semantic Ontology (BSO), a large generative lexicon ontology and lexical database. The BSO has been designed to allow for more widespread access to Generative Lexicon-based lexical resources and help researchers in a variety of computational tasks.

Top