Current Research Projects
Understanding language in any form requires understanding connections among words, concepts, phrases and thoughts. Many of the problems we face today in artificial intelligence depend
in some way on understanding this network of relationships, which represent the facts that each of
us knows about the world and how words relate to one another. When people communicate with each other, their conversation relies on many basic, unspoken assumptions, and they
often learn the basis behind these assumptions long before they can write at all. Only traces of
these assumptions are found in corpora.
Generative Lexicon and Common Sense Reasoning are related areas which are focused on learning such word meanings. This paradox of this vital unspoken information has lead to researchers in these fields to manually construct resources either using experts or volunteers from the internet. Such projects are major undertakings, using vast amounts of time and resources, but the resulting resources are invaluable to the field.
In my work, I hope to create a method, using principal component analysis, which can aid in the creation and bootstrapping of a large lexical resource by providing the human ontologist or volunteer contributer with a list of possible possible relations for each word in the system.
Top 
In 1999, I began working on a project at the MIT Media Lab to collect common sense from volunteers on the internet. Since then, the
Open Mind Common Sense project (OMCS) has expanded. The English site has from 700,000 sentences from over 15,000 contributers. There are OMCSes in Korean, Japanese, Portuguese, and Dutch.
With others in the Common Sense Computing Initative, I mantain the semantic network
ConceptNet and work extensively on
AnalogySpace. AnalogySpace makes rough conclusions about new common sense knowledge based on similarities and tendencies by forming the
analogical closure of a semantic network through dimensionality reduction.
Top 
I'm also working on the Brandeis Semantic Ontology (BSO), a large generative lexicon ontology and lexical database. The BSO has been designed to allow for more widespread access to Generative Lexicon-based lexical resources and help researchers in a variety of computational tasks.
Top