14
Entering talks
•Currently, talks manually
 entered through a web
form interface
•Several things help: (i)
recognizing entities, e.g.
people) already in the database and (ii) text classification
•Goal: become for research talks what NEC CiteSeer is for research papers
–Focused search engine to collect talk announcements in text or HTML or marked up in a partially understood ontology.
–Information extraction using LMCO’s Aerotext to extract relevant talk parameters and enter into database