Thoughts before meeting:
- Do I need to determine semantic categories (Boston vs MIT)?
- If so, I need to expand matching to give some sort of score based on the closeness of the match (perhaps using previous WordNet code)
- How does syntax-based matching help? Isn't immediate context most relevant?
- Google n-grams approach is different to any currently published work
Discussed in meeting:
- Don't bother with sem. cats. yet, just assume exact matches.