Provide a DAML representation of the CIA World Fact Book.
Demonstrates
Temporal data (many values are tagged by year).
Uncertainty (some values are tagged by "est.").
The ability to add semantic structure to a widely referenced
data source.
The basic data structure is fairly simple,
but we may be able to add value by extending the ontology to
allow reasoning based on geographic region, industry categories, etc.
Resources Available
The 1998 CD-ROM (most recent available from GPO) includes the source document
in SGML (library/books/fb98/fb98.sgm).
Countries begin with tags.
Facts are recorded as pairs.
An HTML representation of the World Fact Book is available at
http://www.cia.gov/cia/publications/factbook/.
PARKA triples encoding facts (in several ways) are available at
http://www.cs.umd.edu/projects/plus/Parka/parka-kbs.html.
Deb McGuiness has a World Fact Book ontology in Ontolingua.
Sample Queries
Notes
The CIA World Fact Book itself is in the public domain.
No Intellectual Property issues.
Resumes
Encode the resumes of a company's employees and job applicants in DAML.
Demonstrates
Temporal ranges for employee-of relations.
Using DAML for historical records.
Synonymy, as company names change over time.
Resources Available
Sample Queries
What existing employees may have worked with an interview candidate in the past? (based on matching company/dates and optionally city)
What employees have worked with both knowledge representation and Java?
Notes
Wall Street Journal
Imagine if the Wall Street Journal were published every day using DAML as well as text.
Demonstrates
The ability to mix and relate structured and unstructured content.
Ideally, the DAML facts would contain references to the
underlying text from the article.
Reification (DAML assertions are "as reported by the Wall Street Journal").
Temporal data (articles are dated).
Resources Available
Various corpora (including the Wall Street Journal) are available from the
Linguistic Data Consortium at http://www.ldc.upenn.edu/.
Sample Queries
Notify me with normal priority whenever a story appears about my company is mentioned.
Notify me with high priority whenever such a story portrays my company in a bad light.
Notify me whenever one of my competitors is mentioned
(e.g. based on Standard Industrial Classification code).
Notes
We could potentially use another newspaper source,
but the narrower domain focus and recognition of the WSJ are attractive.
This was demonstrated by BBN,
using New York Times articles from the MUC-7 conference,
at the DAML Kickoff Meeting in August 2000.
Center for Army Lessons Learned
Use DAML to encode the information stored by the Center for Army Lessons Learned.
Demonstrates
Resources Available
Some information is publically available at http://call.army.mil.
Sample Queries
Notes
CALL officials have already expressed interest in DAML,
and attended the DAML Kickoff meeting.
InterNet Worth
Various personal financial information is now available via the WWW
(bank accounts, stock portfolios, mutual funds,
retirement funds, credit card balances, etc.).
It would be possible to write an agent that polled these sources daily
to track one's "InterNet Worth" (assets - liabilities based on Internet-accessible data sources),
but this would currently require a considerable amount of code customized for each site.
How much easier would it be to develop such an agent
if all of this information were published using DAML?
Demonstrates
Increased development efficiency
(reduction in source lines of code required to build agents)
afforded by DAML.
Security and privacy issues.
Encoding efficiency
afforded by DAML.
Agents don't read ads!
Resources Available
Most major financial institutions now make account information
available via the Internet (in ad hoc, proprietary formats).
Sample Queries
What's my instantaneous net worth?
Notes
Mike Dean, 28 Sep 2000:
Teknowledge's TekPortal Financial Data Aggregation
(http://www.teknowledge.com/eCommerce/tekportal.htm) does just this,
using the XML-based Open Financial Exchange (EFX) protocol
or custom HTML screen scraping.
W3C Authorization
The W3C DAML proposal is available at http://www.w3.org/2000/01/sw/DevelopmentProposal.
It includes a number of ideas for applying DAML
to the daily operation of the World Wide Web Consortium.
In particular,
it suggests (in sections C and D)
the use of SWeLL to specify and check access control policies for www.w3.org.
Demonstrates
Logical inference
Security
Proof checking
Resources Available
Sample Queries
Notes
Mozilla Open Directory
The Mozilla Open Directory (dmoz) effort
is building a standard taxonomy of WWW sites
similar to that used by Yahoo, Netscape, etc.
Many of the major portals have already adopted
Open Directory.
Demonstrates
The ability to handle a very large, though simple,
ontology.
Resources Available
The Open Directory structure and contents are available in RDF form
from http://dmoz.org/rdf.html.
Sample Queries
Notes
Web-in-a-Box
Web-in-a-Box is a new DARPA ITO program under Jean Scholtz
that seeks to provide cached
versions of WWW pages that have been
identified as particularly relevant by staff at operational
military commands.
It would be desirable to include DAML content as part of this material.
Demonstrates
Resources Available
Mike Sullivan and other BBN staff involved in a Web-in-a-Box jumpstart
effort are currently
surveying operational sites to identify high priority unclassified WWW sources.
Sample Queries
Notes
The Web-in-a-Box BAA00-41 was just cancelled.
SEC EDGAR
The U.S. Security and Exchange Commission's EDGAR database of filings
for publically-traded companies have long been available on the Internet.
The submission are semi-structured SGML with lots of large text blocks.
The submissions contain a wealth of timely information about
financial performance, corporate officers, subsidiary relationships,
competition, etc.
Demonstrates
Digital signing of DAML assertions
(the EDGAR submissions are already signed,
although these signatures wouldn't match for transformed data).
ontology translation.
An EDGAR ontology is likely to be similar to,
but perhaps different from,
a Wall Street Journal ontology.
This could present an opportunity for ontology translation (and/or sharing)
in a restricted yet compelling domain.
Resources Available
See http://www.sec.gov/edgarhp.htm.
Sample Queries
Notes
The EDGAR data is in the public domain.
Rich Site Summary (RSS) News
...
Demonstrates
Resources Available
Sample Queries
Notes
suggested by Stefan Decker
Joint Tactics Techniques and Procedures for Coalition Search and Rescue
Use DAML and DAML-S to encode the tactics, techniques and procedures for joint and coalition search and rescue missions.
Demonstrates
The ability to encode process and agent related information in a form that can be used to provide task support and reasoning.
Resources Available
US Joint Tactics, Techniques, and Procedures (JTTP) for Combat Search and Rescue,
Joint Publication 3-50.21, 23 March 1998, Joint Chiefs of Staff, PDF Format - 1.85MB -
http://www.dtic.mil/doctrine/jel/new_pubs/3_50_21.pdf
SPAWAR, NWDC, AFRL and others have expressed interest in the SAR domain as an exemplar for semantic interoperability for multi-national and joint forces combing to carry our effective search and rescue missions. The DAML CoSAR-TS project is to investigate the use of DAML and DAML-S for SAR.