•Currently, talks manually
entered through a web
form
interface
•Several things help: (i)
recognizing
entities, e.g.
people) already in the database and (ii) text classification
•Goal: become for research talks what NEC
CiteSeer is for research papers
–Focused search engine to collect talk
announcements in text or HTML or marked up in a partially understood
ontology.
–Information extraction using LMCO’s Aerotext to extract relevant talk parameters and enter
into database