The Semantic Web should enable greater access not only to content but also to services on the Web. Users and software agents should be able to discover, invoke, compose, and monitor Web resources offering particular services and having particular properties, and should be able to do so with a high degree of automation if desired. Powerful tools should be enabled by service descriptions, across the Web service lifecycle. OWL-S (formerly DAML-S) is an ontology of services that makes these functionalities possible. In this document we describe the overall structure of the ontology and its three main parts: the service profile for advertising and discovering services; the process model, which gives a detailed description of a service's operation; and the grounding, which provides details on how to interoperate with a service, via messages.
Following the layered approach to markup language development, the current version of OWL-S builds on the Web Ontology Language (OWL) Recommendation produced by the Web-Ontology Working Group at the World Wide Web Consortium (W3C).
Efforts toward the creation of the Semantic Web are gaining momentum [1]. Soon it will be possible to access Web resources by content rather than just by keywords. A significant force in this movement is the development of a new generation of Web markup languages such as OWL [16] and its predecessor DAML+OIL [7,9]. These languages enable the creation of ontologies for any domain and the instantiation of these ontologies in the description of specific Web sites. These languages are also amenable to efficient reasoning procedures and thus reasoning applications can be built to automatically determine the logical consequences of the ontological statements.
Among the most important Web resources are those that provide services. By "service" we mean Web sites that do not merely provide static information but allow one to effect some action or change in the world, such as the sale of a product or the control of a physical device. The Semantic Web should enable users to locate, select, employ, compose, and monitor Web-based services automatically.
To make use of a Web service, a software agent needs a computer-interpretable description of the service, and the means by which it is accessed. An important goal for Semantic Web markup languages, then, is to establish a framework within which these descriptions are made and shared. Web sites should be able to employ a standard ontology, consisting of a set of basic classes and properties, for declaring and describing services, and the ontology structuring mechanisms of OWL provide an appropriate, Web-compatible representation language framework within which to do this.
This paper describes a collaborative effort by researchers at several organizations to define just such an ontology. We call this ontology OWL-S. In what follows, we will first motivate OWL-S in terms of some sample tasks that it is designed to support. In the central part of the paper we describe the upper ontology for services that we have developed, including its subontologies for profiles, processes, and groundings. The ontology is still evolving, and making connections to other development efforts, such as those building ontologies of time and resources. We will sometimes refer to the OWL-S ontology as a language for describing services, reflecting the fact that it provides a standard vocabulary that can be used together with the other aspects of the OWL description language to create service descriptions.
This paper reflects the authors' design consensus as of OWL-S version 1.1, which is available at [3]. Please note that, in addition to the OWL ontology files, the release site includes examples and additional forms of documentation, including, in particular, a code walk-through illustrative of many points in this document, additional explanatory material (in HTML) regarding the grounding and the use of profile-based class hierarchies, and information about the status of this work, including unresolved issues and future directions.
We will be considering both simple, or "atomic", services and complex, or "composite", services. Atomic services are ones where a single Web-accessible computer program, sensor, or device is invoked by a request message, performs its task and perhaps produces a single response to the requester. With atomic services there is no ongoing interaction between the user and the service. For example, a service that returns a postal code or the longitude and latitude when given an address would be in this category. In contrast, complex or "composite" services are composed of multiple more primitive services, and may require an extended interaction or conversation between the requester and the set of services that are being utilized. One's interaction with www.amazon.com to buy a book is like this; the user searches for books by various criteria, perhaps reads reviews, may or may not decide to buy, and gives credit card and mailing information. OWL-S is meant to support both categories of services, but complex services have motivated many of the ontology's elements. The following three task types will give the reader an idea of the kinds of tasks we expect OWL-S to enable [17,18].
Any Web-accessible program/sensor/device that is declared as a service will be regarded as a service. OWL-S does not preclude declaring simple, static Web pages to be services. But our primary motivation in defining OWL-S has been to support more complex tasks of the kinds described above.
Our structuring of the ontology of services is motivated by the need to provide three essential types of knowledge about a service (shown in Figure 1), each characterized by the question it answers:
The class Service provides an organizational point of reference for a declared Web service; one instance of Service will exist for each distinct published service. The properties presents, describedBy, and supports are properties of Service. The classes ServiceProfile, ServiceModel, and ServiceGrounding are the respective ranges of those properties. Each instance of Service will present a ServiceProfile description, be describedBy a ServiceModel description, and support a ServiceGrounding description. The details of profiles, models, and groundings may vary widely from one type of service to another--that is, from one instance of Service to another. But each of these three service perspectives provides an essential type of information about the service, as we explain below.
Generally speaking, the ServiceProfile provides the information needed for an agent to discover a service, while the ServiceModel and ServiceGrounding, taken together, provide enough information for an agent to make use of a service, once found.
The service profile tells "what the service does", in a way that is suitable for a service-seeking agent (or matchmaking agent acting on behalf of a service-seeking agent) to determine whether the service meets its needs. This form of representation includes a description of what is accomplished by the service, limitations on service applicability and quality of service, and requirements that the service requester must satisfy to use the service successfully.
The service model tells a client how to use the service, by detailing the semantic content of requests, the conditions under which particular outcomes will occur, and, where necessary, the step by step processes leading to those outcomes. That is, it describes how to ask for the service and what happens when the service is carried out. For nontrivial services (those composed of several steps over time), this description may be used by a service-seeking agent in at least four different ways: (1) to perform a more in-depth analysis of whether the service meets its needs; (2) to compose service descriptions from multiple services to perform a specific task; (3) during the course of the service enactment, to coordinate the activities of the different participants; and (4) to monitor the execution of the service.
A service grounding ("grounding" for short) specifies the details of how an agent can access a service. Typically a grounding will specify a communication protocol, message formats, and other service-specific details such as port numbers used in contacting the service. In addition, the grounding must specify, for each semantic type of input or output specified in the ServiceModel, an unambiguous way of exchanging data elements of that type with the service (that is, the serialization techniques employed).
The upper ontology for services specifies only two cardinality constraints: a service can be described by at most one service model, and a grounding must be associated with exactly one service. The upper ontology deliberately does not specify any minimum cardinality for the properties presents or describedBy. (Although, in principle, a service needs all three properties to be fully characterized, it is easy to imagine situations in which a partial characterization could be useful.) Nor does the upper ontology specify any maximum cardinality for presents or supports. (It will be extremely useful for some services to offer multiple profiles and/or multiple groundings.)
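The two cardinality constraints can be illustrated with a small check. The following Python sketch is only an illustration and not part of OWL-S: the class `ServiceDescription`, the function `violations`, and all instance names are hypothetical, and the check treats the descriptions as a closed world, which OWL itself does not. A real application would more likely rely on an OWL reasoner.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ServiceDescription:
    """Hypothetical in-memory stand-in for one instance of Service."""
    name: str
    presents: List[str] = field(default_factory=list)      # profiles: no min, no max
    described_by: List[str] = field(default_factory=list)  # service models: at most one
    supports: List[str] = field(default_factory=list)      # groundings: no max


def violations(services: List[ServiceDescription],
               groundings: List[str]) -> List[str]:
    """Report breaches of the two upper-ontology cardinality constraints."""
    problems: List[str] = []
    owners: Dict[str, int] = {g: 0 for g in groundings}
    for svc in services:
        # Constraint 1: a service is described by at most one service model.
        if len(svc.described_by) > 1:
            problems.append(f"{svc.name}: more than one service model")
        for g in svc.supports:
            owners[g] = owners.get(g, 0) + 1
    # Constraint 2: a grounding is associated with exactly one service.
    for g, count in owners.items():
        if count != 1:
            problems.append(f"{g}: associated with {count} services")
    return problems


seller = ServiceDescription("BookSeller", presents=["P1", "P2"],
                            described_by=["M1"], supports=["G1", "G2"])
print(violations([seller], ["G1", "G2"]))
```

Note that the example service offers two profiles and two groundings without triggering a violation, matching the absence of a maximum cardinality on presents and supports.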
Finally, it must be noted that while we define one particular upper ontology for profiles, one for service models, and one for groundings, nevertheless OWL-S allows for the construction of alternative approaches in each case. Our intent here is not to prescribe a single approach in each of the three areas, but rather to provide default approaches that will be useful for the majority of cases. In the following three sections we discuss the resulting service profile, service model, and service grounding in greater detail.
A transaction in a web services marketplace involves three parties: the service requester, the service provider, and infrastructure components [24,25]. The service requester, which can be broadly identified with the buyer, seeks a service to complete its work; the service provider, which can be broadly identified with the seller, provides a service sought by the requester. In an open environment such as the Internet, the requester may not know ahead of time of the existence of the provider, so the requester relies on infrastructure components that act like registries to find the appropriate provider. For instance, a requester may need a news service that reports stock quotes with no delay with respect to the market. The role of the registries is to match the request with the offers of service providers to identify which of them is the best match. Within the OWL-S framework, the Service Profile provides a way to describe the services offered by the providers, and the services needed by the requesters.
The Service Profile does not mandate any representation of services; rather, using the OWL subclassing it is possible to create specialized representations of services that can be used as service profiles. OWL-S provides one possible representation through the class Profile. An OWL-S Profile describes a service as a function of three basic types of information: what organization provides the service, what function the service computes, and a host of features that specify characteristics of the service. The three pieces of information are reviewed in order below.
The provider information consists of contact information that refers to the entity that provides the service. For instance, contact information may refer to the maintenance operator that is responsible for running the service, or to a customer representative that may provide additional information about the service.
The functional description of the service is expressed in terms of the transformation produced by the service. Specifically, it specifies the inputs required by the service and the outputs generated; furthermore, since a service may require external conditions to be satisfied, and it has the effect of changing such conditions, the profile describes the preconditions required by the service and the expected effects that result from the execution of the service. For example, a selling service may require as a precondition a valid credit card and as input the credit card number and expiration date. As output it generates a receipt, and as effect the card is charged.
Finally, the profile allows the description of a host of properties that are used to describe features of the service. The first type of information specifies the category of a given service, for example, the category of the service within the UNSPSC classification system. The second type of information is the quality rating of the service: some services may be very good, reliable, and quick to respond; others may be unreliable, sluggish, or even malevolent. Before using a service, a requester may want to check what kind of service it is dealing with; therefore, a service may want to publish its rating within a specified rating system, to showcase the quality of service it provides. It is up to the service requester to use this information, to verify that it is indeed correct, and to decide what to do with it. The last type of information is an unbounded list of service parameters that can contain any type of information. The OWL-S Profile provides a mechanism for representing such parameters, which might range from an estimate of the maximum response time to the geographic availability of a service.
The Profile of a service provides a concise description of the service to a registry, but once the service has been selected the Profile is of no further use; rather, the client will use the Process Model to control the interaction with the service. Although the Profile and the Process Model play different roles during the transaction between Web services, they are two different representations of the same service, so it is natural to expect that the inputs, outputs, preconditions, and effects (hereafter IOPEs) of one are reflected in the IOPEs of the other.
OWL-S does not dictate any constraint between Profiles and Process Models, so the two descriptions may be inconsistent without affecting the validity of the OWL expression. Still, if the Profile represents a service that is not consistent with the service represented in the Process Model, the interaction will break at some point. As an extreme example, imagine a service that advertises as a travel agent, but adopts the process model of a book selling agent; it will be selected to make travel reservations, but it will fail to do that, asking instead for book titles and ISBN numbers. On the other hand, it will never be selected by services that want to buy books, so it will never sell a book either.
The selection of the IOPEs to specify in the Profile is quite a tricky process. It should avoid misrepresentation of the service, so ideally it would require all the IOPEs used in the Process Model. On the other hand, some of those IOPEs may be so general that they do not describe the service. Another thing to consider is the registry's algorithm for matching requests with providers. Furthermore, the Profile implicitly specifies the intended purpose of the service: it advertises those functionalities that the service wants to provide, while it may hide (not declare publicly) other functionalities. As an example, consider a book-selling service that may involve two functionalities: the first one allows other services to browse its site to find books of interest, and the second one allows users to buy the books they found. The book seller has the choice of advertising just the book-buying functionality or both the browsing functionality and the buying functionality. In the latter case, the service makes public the fact that it can provide browsing services, and it allows everybody to browse its registry without buying a book. In contrast, by advertising only the book-selling functionality, but not the browsing, the agent discourages browsing by requesters who do not intend to buy. The decision as to which functionalities to advertise determines how the service will be used: a requester who intends to browse but not to buy would select a service that advertises both buying and browsing capabilities, but not one that advertises buying only.
In the description so far, we tacitly assumed a registry model in which service capabilities are advertised, and then matched against requests for service. This is the model adopted by registries like UDDI. While this is the most likely model to be adopted by Web services, other forms of registry are also possible. For example, when the demand for a service is higher than the supply, advertising needs for service is more efficient than advertising offered services, since a provider can select the next request as soon as it is free; furthermore, in a pure P2P architecture there would be no registry at all. Indeed the types of registry may vary widely and as many as 28 different types have been identified [25,4]. By using a declarative representation of Web services, the service profile is not committed to any form of registry, but it can be used in all of them. Since the service profile represents both offers of services and needs of services, it can be used in a reverse registry that records needs and queries on offers. Indeed, the Service Profile can be used in all 28 types of registry.
In the following we describe in detail the main parts of the profile model; we classify them into four sections: the first one (4.2.1) describes the properties that link the Service Profile class with the Service class and Process Model class; the second section (4.2.2) describes the form of contact information and the Description of the profile -- this is information usually intended for human consumption; in the third section (4.2.3), we discuss the functional representation in terms of IOPEs; finally, in the last section (4.2.4), we describe the attributes of the Profile.
The class ServiceProfile provides a superclass of every type of high-level description of the service. ServiceProfile does not mandate any representation of services, but it mandates the basic information to link any instance of profile with an instance of service.
There is a two-way relation between a service and a profile, so that a service can be related to a profile and a profile to a service. These relations are expressed by the properties presents and presentedBy.
Some properties of the profile provide human-readable information that is unlikely to be automatically processed. These properties include serviceName, textDescription and contactInformation. A profile may have at most one service name and text description, but as many items of contact information as the provider wants to offer.
An essential component of the profile is the specification of what functionality the service provides and the specification of the conditions that must be satisfied for a successful result. In addition, the profile specifies what conditions result from the service, including the expected and unexpected results of the service activity. The OWL-S Profile represents two aspects of the functionality of the service: the information transformation (represented by inputs and outputs) and the state change produced by the execution of the service (represented by preconditions and effects). For example, to complete the sale, a book-selling service requires as input a credit card number and expiration date, but also the precondition that the credit card actually exists and is not overdrawn. The result of the sale is the output of a receipt that confirms the proper execution of the transaction, and as effect the transfer of ownership and the physical transfer of the book from the warehouse of the seller to the address of the buyer.
The Profile ontology does not provide a schema to describe IOPE instances. However, such a schema exists in the Process ontology, as discussed in the next section. Ideally, we envision that the IOPEs published by the Profile are a subset of those published by the Process. Therefore, the Process part of a description will create all the IOPE instances and the Profile instance can simply point to these instances. In this case a single instance is created for any IOPE, unlike in previous versions of OWL-S, in which, for a certain IOPE, an instance was created both in the Profile and Process part of the OWL-S description. However, if the IOPEs of the Profile are different from those of the Process, the Profile can still create its own IOPE instances using the schema offered by the Process ontology.
The Profile ontology defines the following properties of the Profile class for pointing to IOPEs:
See Figure 2, which shows selected classes and properties of the Profile.
In the previous section we introduced the functional description of services, but there are other aspects of services of which users should be aware. These additional attributes include the quality guarantees that are provided by the service, possible classification of the service, and additional parameters that the service may want to specify.
ServiceCategory describes categories of services on the basis of some classification that may be outside OWL-S and possibly outside OWL. In the latter case, they may require some specialized reasoner if any inference has to be done with them.
The two properties, serviceClassification and serviceProduct, are used to specify the type of service provided and the products that are handled by the service. The values of the two properties are instances of classes specified in OWL ontologies of services and products. The properties serviceClassification and serviceProduct are similar to serviceCategory described above, but they differ in that the values of the properties are OWL instances rather than strings referring to some non-OWL business taxonomy.
To give a detailed perspective on how to interact with a service, it can be viewed as a process. Specifically, OWL-S 1.1 defines a subclass of ServiceModel, Process, which draws upon well-established work in a variety of fields, including work in AI on standardizations of planning languages [6], work in programming languages and distributed systems [20,19], emerging standards in process modeling and workflow technology such as the NIST's Process Specification Language (PSL) [22] and the Workflow Management Coalition effort (http://www.aiim.org/wfmc), work on modeling verb semantics and event structure [21], previous work on action-inspired Web service markup [18], work in AI on modeling complex actions [13], and work in agent communication languages [15,5].
It is important to understand that a process is not a program to be executed. It is a specification of the ways a client may interact with a service. An atomic process is a description of a service that expects one (possibly complex) message and returns one (possibly complex) message in response. A composite process is one that maintains some state; each message the client sends advances it through the process.
A process can have two sorts of purpose. First, it can generate and return some new information based on information it is given and the world state. Information production is described by the inputs and outputs of the process. Second, it can produce a change in the world. This transition is described by the preconditions and effects of the process.
A process can have any number of inputs (including zero), representing the information that is, under some conditions, required for the performance of the process. It can have any number of outputs, the information that the process provides to the requester. There can be any number of preconditions, which must all hold in order for the process to be successfully invoked. Finally, the process can have any number of effects. Outputs and effects can depend on conditions that hold true of the world state at the time the process is performed. (We use the term perform instead of execute to de-emphasize the traditional picture of a single agent being responsible for the occurrence of the process.)
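The shape of a process description given above can be sketched as a simple data structure. The following Python fragment is a minimal illustration under stated assumptions, not the OWL-S representation: `ProcessSketch`, `can_be_performed`, the world-state dictionary, and all names in the example are hypothetical. Preconditions are modelled here as predicates over a world state, whereas OWL-S represents them as logical formulas.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical world state: a plain mapping from fact names to values.
WorldState = Dict[str, object]


@dataclass
class ProcessSketch:
    """Illustrative stand-in for a process and its IOPEs."""
    name: str
    inputs: List[str] = field(default_factory=list)    # any number, including zero
    outputs: List[str] = field(default_factory=list)
    preconditions: List[Callable[[WorldState], bool]] = field(default_factory=list)
    effects: List[str] = field(default_factory=list)

    def can_be_performed(self, state: WorldState) -> bool:
        # All preconditions must hold; with no preconditions this is True.
        return all(check(state) for check in self.preconditions)


sell_book = ProcessSketch(
    name="SellBook",
    inputs=["cardNumber", "expirationDate"],
    outputs=["receipt"],
    preconditions=[lambda s: s.get("cardValid") is True],
    effects=["cardCharged"],
)

print(sell_book.can_be_performed({"cardValid": True}))
print(sell_book.can_be_performed({"cardValid": False}))
```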
Before we can go into the details of how processes work, it's necessary to explain how inputs, outputs, preconditions, and effects (colloquially known as IOPEs) work, because fitting them into the OWL framework requires bending the rules somewhat.
Inputs and outputs are subclasses of a general class called Parameter. It's convenient to identify parameters with what are called variables in SWRL, the language for expressing OWL Rules.
<owl:Class rdf:about="#Parameter"> <rdfs:subClassOf rdf:resource="&swrl;#Variable"/> </owl:Class>
Every parameter has a type, specified using a URI. This is not the OWL class the parameter belongs to, but a specification of the class (or datatype) that values of the parameter belong to.
<owl:DatatypeProperty rdf:ID="parameterType"> <rdfs:domain rdf:resource="#Parameter"/> <rdfs:range rdf:resource="&xsd;#anyURI"/> </owl:DatatypeProperty> <owl:Class rdf:ID="Parameter"> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#parameterType" /> <owl:minCardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:minCardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
Inputs and outputs are subclasses of parameter:
<owl:Class rdf:ID="Input"> <rdfs:subClassOf rdf:resource="#Parameter"/> </owl:Class> <owl:Class rdf:ID="Output"> <rdfs:subClassOf rdf:resource="#Parameter"/> </owl:Class>
Two other subclasses of Parameter, Local and ResultVar, are described below.
Modeling variables as global, named individuals, as is done for OWL Rules, can be misleading. RDF has no notion of the "scope" of a variable, because an RDF document is nothing but a pile of triples. A variable is named with a URI, like any other resource, and so has global scope, or, more accurately, no notion of scope at all. In spite of this lack of structure, we often use RDF to encode hierarchical entities such as formulas and control structures. Wrapping variable references inside literals allows us to sneak in and impose our own scoping rules. We discuss the OWL-S rules in section 5.2.
A process will not execute properly unless its preconditions are true. If and when it does execute, it has various effects. For example, an agent can order 1000 bolts from a web service only if it can get the web service to accept its promise to pay. One effect of placing the order is the transfer of ownership of the bolts from the service to the agent (or the legal person for which it is a proxy).
Preconditions and effects are represented as logical formulas. Getting logical formulas into RDF has not been easy, but it is now reasonably clear how to proceed. There are actually several possible approaches, depending on how close to RDF/OWL one wants to remain. Usually having lots of choices for such a crucial job is a bad idea, but in this case most of the differences are superficial; it is fairly easy to translate between alternative notations.
The key idea underpinning our approach is to treat expressions as literals, either string literals or XML literals. The latter case is used for languages whose standard encoding is in XML, such as SWRL [8] or RDF [11]. The former case is for other languages such as KIF [10] and PDDL [6]. The ontology [http://www.daml.org/services/owl-s/1.1/generic/Expression.owl] defines Expressions and their properties.
<owl:Class rdf:ID="Expression"> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#expressionLanguage"/> <owl:cardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#expressionBody"/> <owl:cardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
We annotate expressions with the language they are expressed in:
<owl:ObjectProperty rdf:ID="expressionLanguage"> <rdfs:domain rdf:resource="&expr;#Expression"/> <rdfs:range rdf:resource="&expr;#LogicLanguage"/> </owl:ObjectProperty>
The expressionBody property gives the actual expression:
<owl:DatatypeProperty rdf:ID="expressionBody"> <rdfs:domain rdf:resource="#Expression"/> </owl:DatatypeProperty>
As an example, we might state that in order to send the number of a certain credit card to a web agent, one must know what its number is:
<Description rdf:about="#process2"> <hasPrecondition> <expr:KIF-Expression> <expr:expressionBody> (!agnt:know_val_is (!ecom:credit_card_num ?cc) ?num) </expr:expressionBody> </expr:KIF-Expression> </hasPrecondition> </Description>
(where the notation "!ecom:" for namespaces is taken from Lassila's WILBUR system [12]). (The declaration of the variable ?cc is not shown. We'll come back to that point shortly.)
In cases where an XML encoding is used, we would declare an expression to be an XML literal. Here's the same example using DRS as the expression language:
<Description rdf:about="#process2"> <hasPrecondition> <Expression expressionLanguage="&drs;#DRS"> <process:expressionBody> <drs:Atomic_formula> <rdf:predicate rdf:resource="&agnt;#Know_val_is"/> <rdf:subject> <drs:Functional_term> <drs:function rdf:resource="&ecom;credit_card_num"/> <drs:term_args rdf:parseType="Collection"> <swrl:Variable rdf:resource="#CC"/> </drs:term_args> </drs:Functional_term> </rdf:subject> <rdf:object rdf:resource="#Num"/> </drs:Atomic_formula> </process:expressionBody> </Expression> </hasPrecondition> </Description>
The references to #CC and #Num in the DRS example are to parameters, the same ones written as ?cc and ?num in the KIF example. We haven't yet provided a mechanism for declaring the scopes of variables (see section 5.2) and how they acquire values. Roughly speaking, variables and parameters are scoped to the process where they are used. We will describe this in more detail later. In the example, #CC is an input parameter to the process, that is, supplied by the client, but #Num is supposed to be set in the process of reasoning about this very precondition. Verifying that the process is feasible requires retrieving the credit-card's 16-digit number, which is then associated with the variable #Num. We call such a parameter a local parameter.
<owl:Class rdf:ID="Local"> <rdfs:subClassOf rdf:resource="#Parameter"/> </owl:Class>
The three types of parameter are disjoint:
<rdf:Description rdf:about="#Input"> <owl:disjointWith rdf:resource="#Output"/> <owl:disjointWith rdf:resource="#Local"/> </rdf:Description> <rdf:Description rdf:about="#Output"> <owl:disjointWith rdf:resource="#Local"/> </rdf:Description>
Of course, flagging bits of RDF as "Literals" means that an RDF parser should ignore them. (If it did not, then it might turn the RDF into a set of triples with a simple declarative meaning, which is not appropriate.) The trick is to have the OWL-S parser extract the ignored stuff and interpret it appropriately for its context, treating it as ordinary RDF after transformations such as replacing occurrences of variables with their values. In the example above, the occurrences of #Num and #CC are interpreted as the values of these variables, not the variables themselves. In the KIF example, the expressions ?num and ?cc must be similarly interpreted. It is usually not too difficult to do this sort of "field engineering" to interface an assertional language to RDF.
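As a rough illustration of the transformation just described, the following Python sketch replaces variable occurrences in a KIF-style expression body with their values. It is an assumption-laden sketch rather than a description of any actual OWL-S parser: the function `substitute`, the regular expression used to recognize `?name` variables, and the value `card42` are all hypothetical.

```python
import re
from typing import Dict


def substitute(expression_body: str, bindings: Dict[str, str]) -> str:
    """Replace each ?name that has a binding with its value.

    Variables without a binding are left untouched, so that they can be
    bound later (for example, while reasoning about a precondition).
    """
    def replace(match: "re.Match[str]") -> str:
        name = match.group(1)
        return bindings.get(name, match.group(0))

    # Assumed variable syntax: a question mark followed by an identifier.
    return re.sub(r"\?([A-Za-z_][A-Za-z0-9_]*)", replace, expression_body)


body = "(!agnt:know_val_is (!ecom:credit_card_num ?cc) ?num)"
print(substitute(body, {"cc": "card42"}))
# prints: (!agnt:know_val_is (!ecom:credit_card_num card42) ?num)
```

Here the input variable ?cc receives the value supplied by the client, while ?num remains unbound, mirroring the role of a local parameter whose value is established only when the precondition is evaluated.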
There are two special cases of Expression: Condition and Effect. Because they are implemented as literals, there is no way to declare what this difference is, but it's a useful distinction for a human reader of the ontology.
<owl:Class rdf:ID="Condition"> <rdfs:subClassOf rdf:resource="&expr;#Expression"/> </owl:Class> <owl:Class rdf:ID="Effect"> <rdfs:subClassOf rdf:resource="&expr;#Expression"/> </owl:Class>
We connect processes to their "IOPEs" using the properties shown in this table:
Property | Range | Kind |
---|---|---|
hasParticipant | Participant | Thing |
hasInput | Input | Parameter |
hasOutput | Output | Parameter |
hasLocal | Local | Parameter |
hasPrecondition | Condition | Expression |
hasResult | Result | (see below) |
As promised above, the links from a process to its parameters implicitly give them scope. Participant, input, output, and local parameters have as scope the entire process they occur in. We introduce result vars below, which have a narrower scope.
In the rest of this section, we will discuss the entries in this table.
A process involves two or more agents. One is TheClient, the agent from whose point of view the process is described. Another is TheServer, the principal element of the service that the client deals with. If there are others, they are listed using the property hasParticipant.
<owl:ObjectProperty rdf:ID="hasParticipant"> <rdfs:domain rdf:resource="#Process"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="hasClient"> <rdfs:subPropertyOf rdf:resource="#hasParticipant"/> </owl:ObjectProperty> <process:Parameter rdf:ID="TheClient"/> <process:Parameter rdf:ID="TheServer"/>
Inputs and outputs specify the data transformation produced by the process. Inputs specify the information that the process requires for its execution. For atomic processes, the information must come from the client. For the pieces of a composite process, some inputs come directly from the client, but others come from previous steps of the process.
We said above that an atomic process corresponds to a one-step service that expects one message, so it might appear to be contradictory to allow an atomic process to have multiple inputs. The contradiction is resolved by distinguishing between the inputs and the message sent to a process. There is just one message, but it can bundle as many inputs as required. The bundling is specified by the grounding of the process model; see section 6. Similarly, the outputs produced by the invocation of an atomic process flow back to the client as a single message, the format of which is specified by the grounding. (Here we refer to the WSDL-based grounding, which is the only style of grounding fully developed to date.)
The following example shows the definition of hasParameter, and its subproperties hasInput, hasOutput, and hasLocal:
<owl:ObjectProperty rdf:ID="hasParameter"> <rdfs:domain rdf:resource="#Process"/> <rdfs:range rdf:resource="#Parameter"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="hasInput"> <rdfs:subPropertyOf rdf:resource="#hasParameter"/> <rdfs:range rdf:resource="#Input"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="hasOutput"> <rdfs:subPropertyOf rdf:resource="#hasParameter"/> <rdfs:range rdf:resource="#Output"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="hasLocal"> <rdfs:subPropertyOf rdf:resource="#hasParameter"/> <rdfs:range rdf:resource="#Local"/> </owl:ObjectProperty>
If a process has a precondition, then the process cannot be performed successfully unless the precondition is true.
<owl:ObjectProperty rdf:ID="hasPrecondition"> <rdfs:domain rdf:resource="#Process"/> <rdfs:range rdf:resource="&expr;#Condition"/> </owl:ObjectProperty>
Please be sure to distinguish between a condition's being true and having various other properties, such as being believed to be true, being known to be true, being represented in a database as true, etc. In OWL-S, if a process's precondition is false, the consequences of performing or initiating the process are undefined.
The performance of a process may result in changes of the state of the world (effects), and the acquisition of information by the client agent performing it (returned to it as outputs). However, we don't link processes directly to effects and outputs, because process modelers often want to model the dependence of these on context. For example, if a process contains a step to buy an item, there are two possible outcomes: either the purchase succeeds or it fails. In the former case, the effect is that ownership is transferred and the output is, say, a confirmation number. In the latter case, there is no effect, and the output is a failure message.
We use the term result to refer to a coupled output and effect.
<owl:Class rdf:ID="Result"> <rdfs:label>Result</rdfs:label> </owl:Class> <owl:ObjectProperty rdf:ID="hasResult"> <rdfs:label>hasResult</rdfs:label> <rdfs:domain rdf:resource="#Process"/> <rdfs:range rdf:resource="#Result"/> </owl:ObjectProperty>
Having declared a result, a process model can then describe it in terms of four properties:
<owl:ObjectProperty rdf:ID="inCondition"> <rdfs:label>inCondition</rdfs:label> <rdfs:domain rdf:resource="#Result"/> <rdfs:range rdf:resource="&expr;#Condition"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="hasResultVar"> <rdfs:label>hasResultVar</rdfs:label> <rdfs:domain rdf:resource="#Result"/> <rdfs:range rdf:resource="#ResultVar"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="withOutput"> <rdfs:label>withOutput</rdfs:label> <rdfs:domain rdf:resource="#Result"/> <rdfs:range rdf:resource="#OutputBinding"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="hasEffect"> <rdfs:label>hasEffect</rdfs:label> <rdfs:domain rdf:resource="#Result"/> <rdfs:range rdf:resource="&expr;#Expression"/> </owl:ObjectProperty>
The inCondition property specifies the condition under which this result (and not another) occurs. The withOutput and hasEffect properties then state what ensues when the condition is true. The hasResultVar property declares variables that are bound in the inCondition. These variables, called ResultVars, are analogous to Locals, and serve a similar purpose. Whereas Locals are variables to be bound in preconditions and then used in specifying result conditions, outputs, and effects, ResultVars are scoped to a particular result, are bound in the result's condition, and are used to describe the outputs and effects associated with that condition. For example, if a process were to validate a credit card, then one could have the ResultVar CardAccepted contain the result of that query process, which could then be returned as an output.
<owl:Class rdf:ID="ResultVar"> <rdfs:subClassOf rdf:resource="#Parameter"/> <owl:disjointWith rdf:resource="#Input"/> <owl:disjointWith rdf:resource="#Output"/> <owl:disjointWith rdf:resource="#Local"/> </owl:Class>
Another typical example is a process that charges a credit card. The charge goes through if the card is not overdrawn. If it is overdrawn, the only output is a failure notification. So the description of the process must include the description of two Results, possibly in this form:
<process:AtomicProcess rdf:ID="Purchase"> <process:hasInput> <process:Input rdf:ID="ObjectPurchased"/> </process:hasInput> <process:hasInput> <process:Input rdf:ID="PurchaseAmt"/> </process:hasInput> <process:hasInput> <process:Input rdf:ID="CreditCard"/> </process:hasInput> <process:hasOutput> <process:Output rdf:ID="ConfirmationNum"/> </process:hasOutput> <process:hasResult> <process:Result> <process:hasResultVar> <process:ResultVar rdf:ID="CreditLimH"> <process:parameterType rdf:resource="&ecom;#Dollars"/> </process:ResultVar> </process:hasResultVar> <process:inCondition> <expr:KIF-Condition> <expr:expressionBody> (and (current-value (credit-limit ?CreditCard) ?CreditLimH) (>= ?CreditLimH ?PurchaseAmt)) </expr:expressionBody> </expr:KIF-Condition> </process:inCondition> <process:withOutput> <process:OutputBinding> <process:toParam rdf:resource="#ConfirmationNum"/> <process:valueFunction rdf:parseType="Literal"> <cc:ConfirmationNum xsd:datatype="&xsd;#string"/> </process:valueFunction> </process:OutputBinding> </process:withOutput> <process:hasEffect> <expr:KIF-Condition> <expr:expressionBody> (and (confirmed (purchase ?PurchaseAmt) ?ConfirmationNum) (own ?ObjectPurchased) (decrease (credit-limit ?CreditCard) ?PurchaseAmt)) </expr:expressionBody> </expr:KIF-Condition> </process:hasEffect> </process:Result> </process:hasResult> <process:hasResult> <process:Result> <process:hasResultVar> <process:ResultVar rdf:ID="CreditLimL"> <process:parameterType rdf:resource="&ecom;#Dollars"/> </process:ResultVar> </process:hasResultVar> <process:inCondition> <expr:KIF-Condition> <expr:expressionBody> (and (current-value (credit-limit ?CreditCard) ?CreditLimL) (&lt; ?CreditLimL ?PurchaseAmt)) </expr:expressionBody> </expr:KIF-Condition> </process:inCondition> <process:withOutput> <process:OutputBinding> <process:toParam rdf:resource="#ConfirmationNum"/> <process:valueData rdf:parseType="Literal"> <drs:Literal> <drs:litdefn xsd:datatype="&xsd;#string">00000000</drs:litdefn> </drs:Literal> </process:valueData> </process:OutputBinding> </process:withOutput> </process:Result> </process:hasResult> </process:AtomicProcess>
As a result of the execution of the process, a credit card is charged and the money in the account reduced. Note, once again, that there is a fundamental difference between effects and outputs. Effects describe conditions in the world, while outputs describe information. In a more realistic version of this example, the service may send a notification, or an invoice, that it charged the credit card account. This output is just a datum of one type or another. The effect describes the actual event that the output reports: the amount of money in the credit card account has been reduced, and the client now owns the object it intended to purchase.
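To make the coupling of condition, output, and effect concrete, the following sketch models the two Results of the purchase example in Python. It is only an illustration under stated assumptions: the Result dataclass, the select_result function, and the keys of the state dictionary are hypothetical names, and effects are represented as plain descriptive strings rather than logical expressions.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Result:
    """One Result: a condition plus the output and effect that hold when it is true."""
    in_condition: Callable[[dict], bool]
    with_output: Callable[[dict], str]
    has_effect: Optional[str]  # description of the change in the world, if any


def select_result(results: list[Result], state: dict) -> Result:
    """Return the first Result whose inCondition holds in the given state."""
    for result in results:
        if result.in_condition(state):
            return result
    raise ValueError("no Result condition holds")


purchase_results = [
    # Sufficient credit: output a confirmation number, with an effect.
    Result(
        in_condition=lambda s: s["credit_limit"] >= s["purchase_amt"],
        with_output=lambda s: s["confirmation_num"],
        has_effect="ownership transferred; credit limit decreased",
    ),
    # Insufficient credit: output a failure value, with no effect.
    Result(
        in_condition=lambda s: s["credit_limit"] < s["purchase_amt"],
        with_output=lambda s: "00000000",
        has_effect=None,
    ),
]

# Hypothetical state in which the card cannot cover the purchase.
state = {"credit_limit": 50, "purchase_amt": 80, "confirmation_num": "A1B2C3"}
chosen = select_result(purchase_results, state)
print(chosen.with_output(state), chosen.has_effect)
```

Because the two conditions are mutually exclusive, exactly one Result applies in any state, which is what lets a client tell success from failure.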
Finally, there is another output descriptor, called resultForm. This is not attached to a variable, but to a Result:
<owl:DatatypeProperty rdf:ID="resultForm"> <rdfs:label>resultForm</rdfs:label> <rdfs:domain rdf:resource="#Result"/> <rdfs:range rdf:resource="&rdf;#XMLLiteral"/> </owl:DatatypeProperty>
The purpose of resultForm is to provide an abstract XML template for outputs sent back to the client. The reasons we need such a template are subtle, and do not always apply. Normally the grounding suffices to express how the components of a message are bundled, i.e., how inputs are put together to make a message to a service, and how replies are disassembled into the intended outputs. In essence, all we can or need to do is build up and tear down record structures (in XML Schema terminology, ComplexTypes). But in the case of a process with multiple Results, it can be extremely useful to specify other features of an output message that indicate which result actually occurred, sparing us the chore of providing output fields to encode that information, or making the client infer it from the form of the other fields. That's what resultForm is for.
In our example of a credit-card transaction, we had two Results, one for the case where there was a sufficient balance to pay the bill, and one for when there wasn't. We could augment each result with a further binding, such as this one for the failure case:
<process:Result> <process:hasResultVar> <process:ResultVar rdf:ID="CreditLimL"> <process:parameterType rdf:resource="&ecom;#Dollars"/> </process:ResultVar> </process:hasResultVar> <process:inCondition> <expr:KIF-Condition> <expr:expressionBody> (and (current-value (credit-limit ?CreditCard) ?CreditLimL) (&lt; ?CreditLimL ?PurchaseAmt)) </expr:expressionBody> </expr:KIF-Condition> </process:inCondition> <process:resultForm rdf:parseType="Literal"> <ecom:CreditExceededFailure> <ecom:gap expressionLanguage="&expr;#KIF" rdf:datatype="&xsd;#string"> (- ?PurchaseAmt ?CreditLimL) </ecom:gap> </ecom:CreditExceededFailure> </process:resultForm> <process:withOutput> ... </process:withOutput> </process:Result>
We are now ready to formalize the classes of processes: atomic, composite, and, not mentioned before, ``simple.''
<owl:Class rdf:ID="Process"> <rdfs:comment> The most general class of processes </rdfs:comment> <owl:unionOf rdf:parseType="Collection"> <owl:Class rdf:about="#AtomicProcess"/> <owl:Class rdf:about="#SimpleProcess"/> <owl:Class rdf:about="#CompositeProcess"/> </owl:unionOf> </owl:Class>
See Figure 3, which shows selected classes and properties of the process model.
Atomic processes correspond to the actions a service can perform by engaging it in a single interaction; composite processes correspond to actions that require multi-step protocols and/or multiple server actions; finally, simple processes provide an abstraction mechanism to provide multiple views of the same process. We discuss atomics and simples here, reserving composites for the next subsection.
Atomic processes are directly invocable (by passing them the appropriate messages). Atomic processes have no subprocesses and execute in a single step, as far as the service requester is concerned. That is, they take an input message, do something, and then return their output message. For each atomic process, there must be provided a grounding that enables a service requester to construct messages to the process from its inputs and deconstruct replies, as explained in Section 6.
<owl:Class rdf:ID="AtomicProcess"> <rdfs:subClassOf rdf:resource="#Process"/> </owl:Class>
Simple processes are not invocable and are not associated with a grounding, but, like atomic processes, they are conceived of as having single-step executions. Simple processes are used as elements of abstraction; a simple process may be used either to provide a view of (a specialized way of using) some atomic process, or a simplified representation of some composite process (for purposes of planning and reasoning). In the former case, the simple process is realizedBy the atomic process; in the latter case, the simple process expandsTo the composite process (see below).
<owl:Class rdf:ID="SimpleProcess"> <rdfs:subClassOf rdf:resource="#Process"/> <owl:disjointWith rdf:resource="#AtomicProcess"/> </owl:Class> <owl:ObjectProperty rdf:ID="realizedBy"> <rdfs:domain rdf:resource="#SimpleProcess"/> <rdfs:range rdf:resource="#AtomicProcess"/> <owl:inverseOf rdf:resource="#realizes"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="realizes"> <rdfs:domain rdf:resource="#AtomicProcess"/> <rdfs:range rdf:resource="#SimpleProcess"/> <owl:inverseOf rdf:resource="#realizedBy"/> </owl:ObjectProperty>
Finally, for an atomic process, there are always only two participants, TheClient and TheServer:
<owl:Class rdf:about="#AtomicProcess"> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#hasClient"/> <owl:hasValue rdf:resource="#TheClient"/> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#performedBy"/> <owl:hasValue rdf:resource="#TheServer"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
Composite processes are decomposable into other (non-composite or composite) processes; their decomposition can be specified by using control constructs such as Sequence and If-Then-Else, which are discussed below. Because many of the control constructs have names reminiscent of control structures in programming languages, it is easy to lose sight of a fundamental difference: a composite process is not a behavior a service will do, but a behavior (or set of behaviors) the client can perform by sending and receiving a series of messages. If the composite process has an overall effect, then the client must perform the entire process in order to achieve that effect. We have not yet given a precise specification of what it means to perform a process, but all we mean is that, e.g., if a composite is a Sequence, then the client sends a series of messages that invoke every step in order.
One crucial feature of a composite process is its specification of how its inputs are accepted by particular subprocesses, and how its various outputs are produced by particular subprocesses. We discuss this topic in section 5.5.
<owl:Class rdf:ID="CompositeProcess"> <rdfs:subClassOf rdf:resource="#Process"/> <owl:disjointWith rdf:resource="#AtomicProcess"/> <owl:disjointWith rdf:resource="#SimpleProcess"/> <rdfs:comment> A CompositeProcess must have exactly 1 composedOf property. </rdfs:comment> <owl:intersectionOf rdf:parseType="Collection"> <owl:Class rdf:about="#Process"/> <owl:Restriction> <owl:onProperty rdf:resource="#composedOf"/> <owl:cardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:cardinality> </owl:Restriction> </owl:intersectionOf> </owl:Class>
A CompositeProcess must have a composedOf property, which indicates the control structure of the composite by means of a ControlConstruct.
<owl:ObjectProperty rdf:ID="composedOf"> <rdfs:domain rdf:resource="#CompositeProcess"/> <rdfs:range rdf:resource="#ControlConstruct"/> </owl:ObjectProperty> <owl:Class rdf:ID="ControlConstruct"> </owl:Class>
Each control construct, in turn, is associated with an additional property called components to indicate the nested control constructs from which it is composed, and, in some cases, their ordering.
<owl:ObjectProperty rdf:ID="components"> <rdfs:domain rdf:resource="#ControlConstruct"/> </owl:ObjectProperty>
For instance, any instance of the control construct Sequence has a components property that ranges over a ControlConstructList (a list of control constructs). We give a complete table of composite control constructs below.
Any composite process can be considered a tree whose nonterminal nodes are labeled with control constructs, each of which has children specified using components. The leaves of the tree are invocations of other processes, indicated as instances of class Perform.
<owl:Class rdf:ID="Perform"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#process"/> <owl:cardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
The process property of a perform indicates the process to be performed:
<owl:ObjectProperty rdf:ID="process"> <rdfs:domain rdf:resource="#Perform"/> <rdfs:range rdf:resource="#Process"/> </owl:ObjectProperty>
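The tree view of a composite process can be sketched in Python as follows. This is an illustrative model only: the two dataclasses and the leaf_performs function are hypothetical, and for brevity Perform is modeled as a separate leaf type rather than as a subclass of ControlConstruct, as it is in the ontology.

```python
from dataclasses import dataclass, field


@dataclass
class Perform:
    """Leaf node: an invocation of another process, named by its process property."""
    process: str


@dataclass
class ControlConstruct:
    """Nonterminal node: a named construct with child constructs (its components)."""
    kind: str
    components: list = field(default_factory=list)


def leaf_performs(node) -> list[str]:
    """Collect the processes named at the leaves, left to right."""
    if isinstance(node, Perform):
        return [node.process]
    found = []
    for child in node.components:
        found.extend(leaf_performs(child))
    return found


# Hypothetical composite: S1, followed by S2 and S3 performed concurrently.
tree = ControlConstruct("Sequence", [
    Perform("S1"),
    ControlConstruct("Split-Join", [Perform("S2"), Perform("S3")]),
])
print(leaf_performs(tree))
```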
When a process is performed as a step in a larger process, there must be a description of where the inputs to the performed process come from and where the outputs go. This issue we defer to section 5.5.
A process can often be viewed at different levels of granularity, either as a primitive, undecomposable process or as a composite process. These are sometimes referred to as ``black box'' and ``glass box'' views, respectively. Either perspective may be the more useful in some given context. When a composite process is viewed as a black box, a simple process can be used to represent this. In this case, the relationship between the simple and composite is represented using the expandsTo property, and its inverse, the collapsesTo property.
<owl:ObjectProperty rdf:ID="expandsTo"> <rdfs:domain rdf:resource="#SimpleProcess"/> <rdfs:range rdf:resource="#CompositeProcess"/> <owl:inverseOf rdf:resource="#collapsesTo"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="collapsesTo"> <rdfs:domain rdf:resource="#CompositeProcess"/> <rdfs:range rdf:resource="#SimpleProcess"/> <owl:inverseOf rdf:resource="#expandsTo"/> </owl:ObjectProperty>
We conclude this section with an overview of the OWL-S control constructs: Sequence, Split, Split + Join, Choice, Any-Order, Condition, If-Then-Else, Iterate, Repeat-While, and Repeat-Until.
<owl:Class rdf:ID="Sequence"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructList"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
<owl:Class rdf:ID="ControlConstructList"> <rdfs:comment> A list of control constructs </rdfs:comment> <rdfs:subClassOf rdf:resource="&shadow-rdf;#List"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="&shadow-rdf;#first"/> <owl:allValuesFrom rdf:resource="#ControlConstruct"/> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="&shadow-rdf;#rest"/> <owl:allValuesFrom rdf:resource="#ControlConstructList"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
<owl:Class rdf:ID="Split"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
<owl:Class rdf:ID="ControlConstructBag"> <rdfs:comment> A multiset of control constructs </rdfs:comment> <rdfs:subClassOf rdf:resource="&shadow-rdf;#List"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="&shadow-rdf;#first"/> <owl:allValuesFrom rdf:resource="#ControlConstruct"/> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="&shadow-rdf;#rest"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
In Split+Join, the process consists of concurrent execution of a bunch of process components with barrier synchronization. That is, Split+Join completes when all of its component processes have completed. With Split and Split+Join, we can define processes that have partial synchronization (e.g., split all and join some sub-bag).
<owl:Class rdf:ID="Split-Join"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
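Barrier synchronization can be illustrated with a short Python sketch. OWL-S does not prescribe how a client achieves concurrency; the use of threads here, and the split_join function itself, are assumptions made only for illustration.

```python
from concurrent.futures import ALL_COMPLETED, ThreadPoolExecutor, wait


def split_join(components):
    """Run all components concurrently and return only once every one has completed."""
    with ThreadPoolExecutor(max_workers=max(1, len(components))) as pool:
        futures = [pool.submit(component) for component in components]
        wait(futures, return_when=ALL_COMPLETED)  # the barrier
        return [future.result() for future in futures]


# Hypothetical components; each stands in for performing some process.
results = split_join([lambda: "a", lambda: "b", lambda: "c"])
print(sorted(results))
```

A plain Split would correspond to submitting the components without waiting at the barrier.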
<owl:Class rdf:ID="Any-Order"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
<owl:Class rdf:ID="Choice"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
<owl:Class rdf:ID="If-Then-Else"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class> <owl:ObjectProperty rdf:ID="ifCondition"> <rdfs:comment> The if condition of an if-then-else</rdfs:comment> <rdfs:domain rdf:resource="#If-Then-Else"/> <rdfs:range rdf:resource="&expr;#Condition"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="then"> <rdfs:domain rdf:resource="#If-Then-Else"/> <rdfs:range rdf:resource="#ControlConstruct"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="else"> <rdfs:domain rdf:resource="#If-Then-Else"/> <rdfs:range rdf:resource="#ControlConstruct"/> </owl:ObjectProperty>
Iterate is an "abstract" class, in the sense that it's not detailed enough to be instantiated in a process model. It's defined to serve as the common superclass of Repeat-While, Repeat-Until, and potentially other specific iteration constructs that might be needed in the future.
<owl:Class rdf:ID="Iterate"> <rdfs:subClassOf rdf:resource="#ControlConstruct"/> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#components"/> <owl:allValuesFrom rdf:resource="#ControlConstructBag"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class>
<owl:ObjectProperty rdf:ID="whileCondition"> <rdfs:domain rdf:resource="#Repeat-While"/> <rdfs:range rdf:resource="&expr;#Condition"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="whileProcess"> <rdfs:domain rdf:resource="#Repeat-While"/> <rdfs:range rdf:resource="#ControlConstruct"/> </owl:ObjectProperty> <owl:Class rdf:ID="Repeat-While"> <rdfs:comment> The repeat while construct</rdfs:comment> <rdfs:subClassOf rdf:resource="#Iterate"/> </owl:Class>
<owl:ObjectProperty rdf:ID="untilCondition"> <rdfs:domain rdf:resource="#Repeat-Until"/> <rdfs:range rdf:resource="&expr;#Condition"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="untilProcess"> <rdfs:domain rdf:resource="#Repeat-Until"/> <rdfs:range rdf:resource="#ControlConstruct"/> </owl:ObjectProperty> <owl:Class rdf:ID="Repeat-Until"> <rdfs:comment> The repeat until process</rdfs:comment> <rdfs:subClassOf rdf:resource="#Iterate"/> </owl:Class>
When defining processes using OWL-S, there are many places where the input to one process component is obtained as one of the outputs of a preceding step, short-circuiting the normal transmission of data from service to client and back. This is one type of data flow from one step of a process to another. There are also other patterns; in particular, the outputs of a composite process may be derived from outputs of some of its components, and specifying which component's output becomes output of the composite is also a data-flow specification.
We adopt the convention that the source of a datum is identified when the user of the datum is declared. If step 1 feeds step 3, we specify this fact in the description of step 3 rather than the description of step 1. We call this a consumer-pull convention, as opposed to a producer-push alternative. We implement this convention by providing a notation for arbitrary terms as the values of input or output parameters of a process step, plus a notation for subterms denoting the output or input parameters of prior process steps.
Consider the following tableau:
Composite process CP, with input I1 and output O1,
composed of Step 1: Perform S1 ⇒ Step 2: Perform S2,
where S1 has inputs I11 and I12 and output O11, and S2 has input I21 and output O21.
The right-arrow here indicates that step 2 is meant to follow step 1, but not necessarily immediately.
Suppose that we want a straightforward data flow: Input I1 of the overall process CP is used as input I11 of S1, after adding 1. Input I12 of S1 is a constant, the string "Academic". Output O11 of S1 is used as input I21 of S2. The maximum of 0 and output O21 of S2, times π, is used as output O1 of CP. Using a consumer-pull convention, we simply declare the parameters I1, O11, and O21, but for parameters I11, I12, I21, and O1 we provide, in addition to a declaration, bindings that specify that
I11(Step1) comes from incr(I1(CP))
I12(Step1) = "Academic"
I21(Step2) comes from O11(Step1)
O1(CP) comes from π × max(0, O21(Step2))
Each of these equalities is represented in OWL-S as a Binding, an abstract object with two properties: toParam, the name of the parameter (e.g., I21(S2)), and valueSpecifier, a description of its value. In an effort to provide value specifications in as concise a manner as possible in a variety of situations, we provide four different types: valueSource, valueType, valueData, and valueFunction.
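The four bindings of the tableau can be worked through in a few lines of Python. The bodies of s1 and s2 are invented stand-ins, since the tableau does not say what S1 and S2 compute; only the wiring between parameters follows the text.

```python
import math


def s1(i11: int, i12: str) -> int:
    """Hypothetical stand-in for process S1; returns its output O11."""
    return i11 * 2 if i12 == "Academic" else i11


def s2(i21: int) -> int:
    """Hypothetical stand-in for process S2; returns its output O21."""
    return i21 - 10


def perform_cp(i1: int) -> float:
    """Perform composite CP, with each consumer pulling its inputs from earlier values."""
    # Step 1: I11 comes from incr(I1(CP)); I12 is the constant "Academic".
    o11 = s1(i11=i1 + 1, i12="Academic")
    # Step 2: I21 comes from O11(Step1).
    o21 = s2(i21=o11)
    # O1(CP) comes from pi * max(0, O21(Step2)).
    return math.pi * max(0, o21)


print(perform_cp(6))
```

Each binding appears at the point where the value is consumed, which is the consumer-pull convention: nothing in s1 mentions where its output goes.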
We declare the toParam property in the usual way:
<owl:Class rdf:ID="Binding"> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#toParam"/> <owl:cardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class> <owl:ObjectProperty rdf:ID="toParam"> <rdfs:domain rdf:resource="#Binding"/> <rdfs:range rdf:resource="#Parameter"/> </owl:ObjectProperty>
The simplest sort of dataflow spec is valueSource:
<owl:ObjectProperty rdf:ID="valueSource"> <rdfs:label>valueSource</rdfs:label> <rdfs:domain rdf:resource="#Binding"/> <rdfs:range rdf:resource="#ValueOf"/> <rdfs:subPropertyOf rdf:resource="#valueSpecifier"/> </owl:ObjectProperty>
The range of valueSource is a simple object of class ValueOf, specified entirely by its properties theVar and fromProcess. If a binding with toParam = p has a valueSource with properties theVar = v and fromProcess = R, that means that parameter p of this process = parameter v of R.
<owl:Class rdf:ID="ValueOf"> <rdfs:label>ValueOf</rdfs:label> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#theVar"/> <owl:cardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:cardinality> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#fromProcess"/> <owl:maxCardinality rdf:datatype="&xsd;#nonNegativeInteger"> 1</owl:maxCardinality> </owl:Restriction> </rdfs:subClassOf> </owl:Class> <owl:ObjectProperty rdf:ID="theVar"> <rdfs:domain rdf:resource="#ValueOf"/> <rdfs:range rdf:resource="#Parameter"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="fromProcess"> <rdfs:domain rdf:resource="#ValueOf"/> <rdfs:range rdf:resource="#Perform"/> </owl:ObjectProperty>
For example, our simple tableau has one place where a valueSource expression can be used. Here is how that part of the tableau would be expressed:
<process:Sequence rdf:ID="CP"> <process:components rdf:parseType="Collection"> <process:Perform rdf:ID="Step1"> <process:process rdf:resource="&aux;#S1"/> ... </process:Perform> <process:Perform rdf:ID="Step2"> <process:process rdf:resource="&aux;#S2"/> <process:hasDataFrom> <!-- see below --> <process:Binding> <process:toParam rdf:resource="&aux;#I21"/> <process:valueSource> <process:ValueOf> <process:theVar rdf:resource="#O11"/> <process:fromProcess rdf:resource="#Step1"/> </process:ValueOf> </process:valueSource> </process:Binding> </process:hasDataFrom> </process:Perform> </process:components> ... </process:Sequence>
In this example, we used the hasDataFrom property, which is used to link inputs of Performs to bindings:
<owl:ObjectProperty rdf:ID="hasDataFrom"> <rdfs:domain rdf:resource="#ProcessComponent"/> <rdfs:range rdf:resource="#Binding"/> </owl:ObjectProperty>
The complete tableau appears below, after further discussion of bindings.
There is a subtle issue that arises from the fact that the range of fromProcess is Perform. In our informal tableau above, we used expressions such as I1(CP) to mean the input I1 of the overall process CP. But we cannot actually write a Binding whose valueSource is a ValueOf with fromProcess = CP, because CP is not a Perform, but a process. If you think about it, it makes no sense to refer to a value extracted from a process, because every time the process is invoked different values will be involved. We need an expression that refers to the ``current value'' of a parameter, that is, its value during an actual Perform. We want to say: During any Perform P of CP, the value of the input I11 of step S1 is computed from the value of the input I1 of P.
Hence we introduce a standard variable to play the role of P, and give it the name TheParentPerform.
<swrl:Variable rdf:ID="TheParentPerform"> <rdfs:comment> A special-purpose variable, used to refer, at runtime, to the execution instance of the enclosing composite process definition. </rdfs:comment> </swrl:Variable>
We will show how this device is used in our example tableau shortly.
valueFunction and valueData use the XML Literal trick to encode arbitrary expressions.
<owl:DatatypeProperty rdf:ID="valueFunction"> <rdfs:label>valueFunction</rdfs:label> <rdfs:subPropertyOf rdf:resource="#valueSpecifier"/> <rdfs:domain rdf:resource="#Binding"/> <rdfs:range rdf:resource="&rdf;#XMLLiteral"/> </owl:DatatypeProperty> <owl:DatatypeProperty rdf:ID="valueData"> <rdfs:label>valueData</rdfs:label> <rdfs:domain rdf:resource="#Binding"/> </owl:DatatypeProperty>
The valueFunction of a Binding is an XML literal to be read as a functional term. Some of its subterms may be ValueOfs specifying outputs of previous steps. As with conditions and effects, the denotation of the valueFunction expression cannot be known until variable values are plugged in. The valueData of a Binding is an XML literal to be interpreted as constant data.
The valueType of a Binding is a URI referring to an OWL class definition C. An instance of valueType asserts that the value of the given parameter will belong to C. C must be a subclass of the parameter's overall type, declared using parameterType.
<owl:DatatypeProperty rdf:ID="valueType"> <rdfs:label>valueType</rdfs:label> <rdfs:subPropertyOf rdf:resource="#valueSpecifier"/> <rdfs:domain rdf:resource="#Binding"/> <rdfs:range rdf:resource="&xsd;#anyURI"/> </owl:DatatypeProperty>
Here is our complete example tableau, with all data flows expressed using one of the three data-source specs.
<process:CompositeProcess rdf:ID="CP">
  <process:hasInput rdf:ID="I1"/>
  <process:hasOutput rdf:ID="O1"/>
  <process:composedOf>
    <process:Sequence>
      <process:components rdf:parseType="Collection">
        <process:Perform rdf:ID="Step1">
          <process:process rdf:resource="&aux;#S1"/>
          <process:hasDataFrom>
            <process:InputBinding>
              <process:theParam rdf:resource="&aux;#I11"/>
              <process:valueFunction expressionLanguage="&drs;" rdf:parseType="Literal">
                <drs:Functional_term>
                  <drs:term_function rdf:resource="&arith;#incr"/>
                  <drs:term_args rdf:parseType="Collection">
                    <process:ValueOf>
                      <process:theParam rdf:resource="#I1"/>
                      <process:fromProcess rdf:resource="#TheParentPerform"/>
                    </process:ValueOf>
                  </drs:term_args>
                </drs:Functional_term>
              </process:valueFunction>
            </process:InputBinding>
          </process:hasDataFrom>
          <process:hasDataFrom>
            <process:InputBinding>
              <process:theParam rdf:resource="&aux;#I12"/>
              <process:valueData rdf:datatype="&xsd;#string">Academic</process:valueData>
            </process:InputBinding>
          </process:hasDataFrom>
        </process:Perform>
        <process:Perform rdf:ID="Step2">
          <process:process rdf:resource="&aux;#S2"/>
          <process:hasDataFrom>
            <process:Binding>
              <process:theParam rdf:resource="&aux;#I21"/>
              <process:valueSource>
                <process:ValueOf>
                  <process:theParam rdf:resource="#O11"/>
                  <process:fromProcess rdf:resource="#Step1"/>
                </process:ValueOf>
              </process:valueSource>
            </process:Binding>
          </process:hasDataFrom>
        </process:Perform>
        <process:Produce>
          <process:producedBinding>
            <process:OutputBinding>
              <process:theParam rdf:resource="#O1"/>
              <process:valueFunction expressionLanguage="&drs;" rdf:parseType="Literal">
                <drs:Functional_term>
                  <drs:term_function rdf:resource="&arith;#times"/>
                  <drs:term_args rdf:parseType="Collection">
                    <xsd:Float rdf:datatype="&xsd;#Float">3.14159</xsd:Float>
                    <drs:Functional_term>
                      <drs:term_function rdf:resource="&arith;#max"/>
                      <drs:term_args rdf:parseType="Collection">
                        <xsd:Integer rdf:datatype="&xsd;#Integer">0</xsd:Integer>
                        <process:ValueOf>
                          <process:theParam rdf:resource="#O21"/>
                          <process:fromProcess rdf:resource="#Step2"/>
                        </process:ValueOf>
                      </drs:term_args>
                    </drs:Functional_term>
                  </drs:term_args>
                </drs:Functional_term>
              </process:valueFunction>
            </process:OutputBinding>
          </process:producedBinding>
        </process:Produce>
      </process:components>
    </process:Sequence>
  </process:composedOf>
</process:CompositeProcess>
Produce is a new class used to capture dataflows to the outputs of the ParentPerform. The outputs can't be declared once and for all, because in the presence of IfThenElses those outputs will depend on which branch of the conditional the agent takes. So at the point in a branch where the data required to compute the output are known, we insert a Produce pseudo-step to say what the output will be.
<owl:Class rdf:ID="Produce">
  <rdfs:subClassOf rdf:resource="#ControlConstruct"/>
</owl:Class>
<owl:ObjectProperty rdf:ID="producedBinding">
  <rdfs:domain rdf:resource="#Produce"/>
  <rdfs:range rdf:resource="#OutputBinding"/>
</owl:ObjectProperty>
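The data flow of the example tableau, including the Produce pseudo-step, can be traced in ordinary code. The following is a minimal sketch in Python. The bodies of `perform_s1` and `perform_s2` are invented placeholders, since the behavior of S1 and S2 is not specified here, and the dictionary `the_parent_perform` is only an assumed way of modeling TheParentPerform.

```python
from typing import Any, Dict

def perform_s1(i11: int, i12: str) -> Dict[str, Any]:
    """Hypothetical stand-in for atomic process S1 (real behavior not given)."""
    return {"O11": i11 + len(i12)}

def perform_s2(i21: int) -> Dict[str, Any]:
    """Hypothetical stand-in for atomic process S2 (real behavior not given)."""
    return {"O21": i21 - 10}

def perform_cp(i1: int) -> Dict[str, Any]:
    """Run one Perform of CP and return the outputs set by its Produce step."""
    the_parent_perform = {"I1": i1}  # inputs of this particular Perform of CP
    produced: Dict[str, Any] = {}

    # Step1: I11 <- incr(I1 of TheParentPerform); I12 <- constant data "Academic"
    step1 = perform_s1(i11=the_parent_perform["I1"] + 1, i12="Academic")

    # Step2: I21 <- O11 of Step1
    step2 = perform_s2(i21=step1["O11"])

    # Produce: O1 <- times(3.14159, max(0, O21 of Step2))
    produced["O1"] = 3.14159 * max(0, step2["O21"])
    return produced
```

In a process containing a conditional, each branch would carry its own assignment to `produced`, which mirrors the reason Produce is a step in the control flow rather than a one-time declaration.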
The grounding of a service specifies the details of how to access the service - details having mainly to do with protocol and message formats, serialization, transport, and addressing. A grounding can be thought of as a mapping from an abstract to a concrete specification of those service description elements that are required for interacting with the service - in particular, for our purposes, the inputs and outputs of atomic processes. Note that in OWL-S, both the ServiceProfile and the ServiceModel are thought of as abstract representations; only the ServiceGrounding deals with the concrete level of specification.
OWL-S does not include an abstract construct for explicitly describing messages. Rather, the abstract content of a message is specified, implicitly, by the input or output properties of some atomic process. Thus, atomic processes, in addition to specifying the basic actions from which larger processes are composed, can also be thought of as the communication primitives of an (abstract) process specification.
Concrete messages, however, are specified explicitly in a grounding. The central function of an OWL-S grounding is to show how the (abstract) inputs and outputs of an atomic process are to be realized concretely as messages, which carry those inputs and outputs in some specific transmittable format. Due to the existence of a significant body of work in the area of concrete message specification, which is already well along in terms of industry adoption, we have chosen to make use of the Web Services Description Language (WSDL) in crafting an initial grounding mechanism for OWL-S. As mentioned above, our intent here is not to prescribe the only possible grounding approach to be used with all services, but rather to provide a general, canonical and broadly applicable approach that will be useful for the great majority of cases.
Web Services Description Language (WSDL) ``is an XML format for describing network services as a set of endpoints operating on messages containing either document-oriented or procedure-oriented information. The operations and messages are described abstractly, and then bound to a concrete network protocol and message format to define an endpoint. Related concrete endpoints are combined into abstract endpoints (services). WSDL is extensible to allow description of endpoints and their messages regardless of what message formats or network protocols are used to communicate'' [2].
It may readily be observed that OWL-S' concept of grounding is generally consistent with WSDL's concept of binding. Indeed, by using the extensibility elements already provided by WSDL, along with one new extensibility element proposed here, it is an easy matter to ground an OWL-S atomic process. Here, we show how this may be done, relying on the WSDL 1.1 specification. (After WSDL 2.0 has been finalized, we expect to update the OWL-S Grounding, and this document, appropriately.)
The approach described here allows a service developer, who is going to provide service descriptions for use by potential clients, to take advantage of the complementary strengths of these two specification languages. On the one hand (the abstract side of a service specification), the developer benefits by making use of OWL-S' process model, and the expressiveness of OWL's class typing mechanisms, relative to what XML Schema Definition (XSD) provides. On the other hand (the concrete side), the developer benefits from the opportunity to reuse the extensive work done in WSDL (and related languages such as SOAP), and software support for message exchanges based on these declarations, as defined to date for various protocols and transport mechanisms.
We emphasize that an OWL-S/WSDL grounding involves a complementary use of the two languages, in a way that is in accord with the intentions of the authors of WSDL. Both languages are required for the full specification of a grounding, because the two languages do not cover the same conceptual space. As indicated by Figure 4, the two languages do overlap in providing for the specification of what WSDL calls ``abstract types'', which in turn are used to characterize the inputs and outputs of services. WSDL, by default, specifies abstract types using XML Schema, whereas OWL-S allows for the definition of abstract types as (description logic-based) OWL classes. However, WSDL/XSD is unable to express the semantics of an OWL class. Similarly, OWL-S has no means, as currently defined, to express the binding information that WSDL captures. Thus, it is natural that an OWL-S/WSDL grounding uses OWL classes as the abstract types of message parts declared in WSDL, and then relies on WSDL binding constructs to specify the formatting of the messages.
An OWL-S/WSDL grounding is based upon the following three correspondences between OWL-S and WSDL. Figure 4 shows the first two of these.
Note that an OWL-S grounding doesn't mandate a one-to-one correspondence between an atomic process and a single WSDL operation (although that is the most common case). To accommodate the WSDL-supported practice of providing multiple definitions (within different port types) of the same operation, OWL-S allows for a one-to-many correspondence between an atomic process and multiple WSDL operations. It is also possible, in these situations, to maintain a one-to-one correspondence, by using multiple (differently named) atomic processes.
To construct an OWL-S/WSDL grounding one must first identify, in WSDL, the messages and operations by which an atomic process may be accessed, and then specify correspondences (1) - (3).
Prior to OWL-S version 0.9, correspondences (2) and (3) were required to be direct correspondences. That is, each OWL-S input or output had to directly match up with a particular WSDL message part; and each input/output type had to literally serve as the type specified in WSDL. Starting with version 0.9, this limitation no longer exists. Version 0.9 allows for the specification of XSLT transformations to show how each WSDL input is derived from (one or more) OWL-S input properties, and how each OWL-S output is derived from (one or more) WSDL output message parts.
Although it is not logically necessary to do so, we believe it will be useful to specify these correspondences both in WSDL and in OWL-S. Thus, as indicated in the following, we allow for constructs in both languages for this purpose.
Because OWL-S is an XML-based language, and its atomic process declarations and input and output types already fit nicely with WSDL, it is easy to extend existing WSDL bindings for use with OWL-S, such as the SOAP binding. Here, we indicate briefly how an arbitrary atomic process, specified in OWL-S, can be given a grounding using WSDL and SOAP, with the assumption of HTTP as the chosen transport mechanism.
Grounding OWL-S with WSDL and SOAP involves the construction of a WSDL (1.1) service description with all the usual parts (types, message, operation, port type, binding, and service constructs).
With respect to the types of the WSDL message parts, it is useful to distinguish two cases: those in which the type is an OWL type (that is, the WSDL service is a ``native speaker'' of that OWL type); and all others. In the first case, the OWL class can either be defined within the WSDL types section, or defined in a separate document and referred to from within the WSDL description, using owl-s-parameter, as explained below -- in which case its definition can be omitted from the WSDL types section.
OWL-S extensions are introduced as follows:
Note that WSDL already allows for the use of arbitrary new attributes in message part elements, and for the use of arbitrary values for the encodingStyle attribute. Thus, extension (3) above is the only point on which a modification to the current WSDL 1.1 specification is called for.
Thus far, we have only shown how WSDL definitions may refer to the corresponding OWL-S declarations. It remains to establish a mechanism by which the relevant WSDL constructs may be referenced in OWL-S. The OWL-S WsdlGrounding class, a subclass of Grounding, serves this purpose. Each WsdlGrounding instance, in turn, contains a list of WsdlAtomicProcessGrounding instances.
A WsdlAtomicProcessGrounding instance refers to specific elements within the WSDL specification, using the following properties:
Additional explanation and examples of how to specify groundings are given in an online document [14], as part of the OWL-S release site on www.daml.org.
OWL-S is an ontology, within the OWL-based framework of the Semantic Web, for describing Web services. It will enable users and software agents to automatically discover, invoke, compose, and monitor Web resources offering services, under specified constraints.
We hope to enhance OWL-S in the future in selected ways that we have indicated in this technical overview and elsewhere, and in response to users' experience with it.
We believe OWL-S will help make the Semantic Web a place where people can not only find information but also get things done.
In general, two processes are type compatible if for each output of one that flows to the input of the other, the parameterType of the output is a subtype of the parameterType of the input. It is quite common, and often encouraged, to use OWL classes as the type of parameters in OWL-S descriptions. In such cases, the URI given as the value of the parameterType property denotes an OWL class, that is, a set of individuals that satisfy a description encoded in OWL. An OWL reasoner can determine type compatibility in such cases by testing that the relevant output type is subsumed by the input type (of course, equivalent classes subsume each other). In the case where the types are XML Schema types (or elements), the subtype relation is determined by the XML Schema specification (and can be tested by an XML Schema validator).
XML Schema types and OWL classes are type incompatible. If the service uses XML Schema (or similar) types in WSDL declarations, and a service modeler wishes to use OWL at the process or profile level, the modeler should provide for translations from the XML Schema to OWL.
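The subsumption test behind type compatibility can be illustrated with a deliberately simplified sketch in Python. It only walks asserted subclass links, whereas a real OWL reasoner also derives subsumptions from class descriptions. The class names and the `SUBCLASS_OF` table are hypothetical.

```python
from typing import Dict, Set

# Hypothetical asserted subclass links: class -> direct superclasses.
SUBCLASS_OF: Dict[str, Set[str]] = {
    "EconomyFlight": {"Flight"},
    "Flight": {"Trip"},
    "Trip": set(),
}

def is_subsumed_by(sub: str, sup: str) -> bool:
    """True if `sub` equals `sup` or reaches it through asserted subclass links."""
    seen: Set[str] = set()
    stack = [sub]
    while stack:
        cls = stack.pop()
        if cls == sup:
            return True
        if cls in seen:
            continue  # guard against cycles (equivalent classes)
        seen.add(cls)
        stack.extend(SUBCLASS_OF.get(cls, ()))
    return False

def type_compatible(output_type: str, input_type: str) -> bool:
    """An output may flow to an input if its type is subsumed by the input's type."""
    return is_subsumed_by(output_type, input_type)
```

Under these assumptions an output typed `EconomyFlight` may feed an input typed `Trip`, but not the other way round. A type that does not appear in the table at all, such as an XML Schema type, is compatible only with itself.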
In OWL DL, the RDF List vocabulary (rdf:List, rdf:first, rdf:rest, rdf:nil) can only be used in conjunction with built-in predicates such as owl:unionOf, owl:intersectionOf, etc. Using RDF lists to denote a list of individuals is only permitted in OWL Full. More precisely, the list vocabulary (and constructs that generate it, such as rdf:parseType="Collection") can only be used as syntax in OWL DL, and not for domain modeling.
To make OWL-S ontologies accessible to existing OWL DL reasoners, the OWL-S Coalition has decided to obey this restriction. Unfortunately, the control constructs in the Process ontology made heavy use of the collection vocabulary. In OWL-S 1.0 such constructs were modeled as RDF lists, and the rdf:parseType="Collection" attribute was used to write the list, for example:
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:process-1.0="http://www.daml.org/services/owl-s/1.0/Process.owl#">
  <process-1.0:CompositeProcess rdf:ID="BravoAir_Process">
    <process-1.0:composedOf>
      <process-1.0:Sequence>
        <process-1.0:components rdf:parseType="Collection">
          <process-1.0:AtomicProcess rdf:about="#GetDesiredFlightDetails"/>
          <process-1.0:AtomicProcess rdf:about="#SelectAvailableFlight"/>
          <process-1.0:CompositeProcess rdf:about="#BookFlight"/>
        </process-1.0:components>
      </process-1.0:Sequence>
    </process-1.0:composedOf>
  </process-1.0:CompositeProcess>
</rdf:RDF>
This is exactly equivalent to the following verbose representation:
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:process-1.0="http://www.daml.org/services/owl-s/1.0/Process.owl#">
  <process-1.0:CompositeProcess rdf:ID="BravoAir_Process">
    <process-1.0:composedOf>
      <process-1.0:Sequence>
        <process-1.0:components>
          <rdf:Description>
            <rdf:first>
              <process-1.0:AtomicProcess rdf:about="#GetDesiredFlightDetails"/>
            </rdf:first>
            <rdf:rest>
              <rdf:Description>
                <rdf:first>
                  <process-1.0:AtomicProcess rdf:about="#SelectAvailableFlight"/>
                </rdf:first>
                <rdf:rest>
                  <rdf:Description>
                    <rdf:first>
                      <process-1.0:CompositeProcess rdf:about="#BookFlight"/>
                    </rdf:first>
                    <rdf:rest rdf:resource="&rdf;#nil"/>
                  </rdf:Description>
                </rdf:rest>
              </rdf:Description>
            </rdf:rest>
          </rdf:Description>
        </process-1.0:components>
      </process-1.0:Sequence>
    </process-1.0:composedOf>
  </process-1.0:CompositeProcess>
</rdf:RDF>
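The equivalence between the collection shorthand and the verbose first/rest chain can be made concrete with a short Python sketch that generates the list triples for a sequence of items. The function name `expand_collection` and the blank node labels are assumptions made for illustration. An RDF parser generates its own blank node identifiers.

```python
from typing import List, Tuple

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
Triple = Tuple[str, str, str]

def expand_collection(items: List[str], prefix: str = "_:l") -> Tuple[str, List[Triple]]:
    """Return (head node, triples) for an RDF collection holding `items`.

    Blank node labels are built from `prefix`, an arbitrary choice.
    An empty collection is represented by rdf:nil and no triples.
    """
    triples: List[Triple] = []
    head = RDF + "nil"
    # Build from the tail so each cell can point at the already-built rest.
    for index in range(len(items) - 1, -1, -1):
        cell = f"{prefix}{index}"
        triples.append((cell, RDF + "first", items[index]))
        triples.append((cell, RDF + "rest", head))
        head = cell
    return head, triples

head, triples = expand_collection(
    ["#GetDesiredFlightDetails", "#SelectAvailableFlight", "#BookFlight"]
)
```

For the three components of the example, this yields three list cells and six triples, with the last cell's rest pointing at rdf:nil, which is the structure spelled out in the verbose listing.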
In OWL-S 1.1, we stay within OWL DL by using a "shadow" vocabulary that redefines the List, first, rest and nil concepts in a different namespace. The shadow list ontology is located at http://www.daml.org/services/owl-s/1.1/generic/ObjectList.owl. The use of this vocabulary in OWL-S 1.1 is very similar to the above example:
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:shadow-rdf="http://www.daml.org/services/owl-s/1.1/generic/ObjectList.owl#"
    xmlns:process-1.1="http://www.daml.org/services/owl-s/1.1/Process.owl#">
  <process-1.1:CompositeProcess rdf:ID="BravoAir_Process">
    <process-1.1:composedOf>
      <process-1.1:Sequence>
        <process-1.1:components>
          <rdf:Description>
            <shadow-rdf:first>
              <process-1.1:AtomicProcess rdf:about="#GetDesiredFlightDetails"/>
            </shadow-rdf:first>
            <shadow-rdf:rest>
              <rdf:Description>
                <shadow-rdf:first>
                  <process-1.1:AtomicProcess rdf:about="#SelectAvailableFlight"/>
                </shadow-rdf:first>
                <shadow-rdf:rest>
                  <rdf:Description>
                    <shadow-rdf:first>
                      <process-1.1:CompositeProcess rdf:about="#BookFlight"/>
                    </shadow-rdf:first>
                    <shadow-rdf:rest rdf:resource="&shadow-rdf;#nil"/>
                  </rdf:Description>
                </shadow-rdf:rest>
              </rdf:Description>
            </shadow-rdf:rest>
          </rdf:Description>
        </process-1.1:components>
      </process-1.1:Sequence>
    </process-1.1:composedOf>
  </process-1.1:CompositeProcess>
</rdf:RDF>
Note that in this case we cannot use the shorthand notation rdf:parseType="Collection", since we are using a completely different set of properties to denote lists. The current shadow list vocabulary is a rather shallow representation of lists, but hardly more so than the original RDF vocabulary. In OWL Full, there is no restriction on the use of the RDF list vocabulary, and it is also possible to define equivalence relations between the built-in RDF list vocabulary and the shadow list vocabulary. The following set of assertions is one way to establish this equivalence:
<rdf:Description rdf:about="&rdf;#List">
  <owl:equivalentClass rdf:resource="&shadow-rdf;#List"/>
</rdf:Description>
<rdf:Property rdf:about="&rdf;#first">
  <owl:equivalentProperty rdf:resource="&shadow-rdf;#first"/>
</rdf:Property>
<rdf:Property rdf:about="&rdf;#rest">
  <owl:equivalentProperty rdf:resource="&shadow-rdf;#rest"/>
</rdf:Property>
<rdf:Description rdf:about="&rdf;#nil">
  <owl:sameAs rdf:resource="&shadow-rdf;#nil"/>
</rdf:Description>
A service provider can simply import an ontology that contains these assertions and continue to use the built-in RDF vocabulary (and short-hand rdf:parseType="Collection" notation). Such service descriptions are still consistent with OWL-S ontologies but certainly fall into OWL Full. It is more difficult to go the other way, since one must maintain the distinction between syntax uses of the RDF vocabulary and domain modeling uses. It almost certainly requires an a priori pass of an OWL parser such as the OWL API to remove all the syntax triples first; then the remaining triples may be replaced with the shadow vocabulary.
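The harder direction, from the built-in RDF list vocabulary to the shadow vocabulary, might be approached along the following lines. This Python sketch works on plain (subject, predicate, object) tuples rather than on the output of a real OWL parser, and instead of removing the syntax triples it marks them and leaves them untouched. The `SHADOW` namespace is the ObjectList.owl location given above with an assumed trailing `#`, and the function names are hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
OWL = "http://www.w3.org/2002/07/owl#"
# Assumption: ObjectList.owl location plus a trailing '#'.
SHADOW = "http://www.daml.org/services/owl-s/1.1/generic/ObjectList.owl#"
Triple = Tuple[str, str, str]

# Built-in predicates whose list-valued objects are syntax, not domain data.
# Only the two named in the text are listed; extend as needed.
SYNTAX_PREDICATES = {OWL + "unionOf", OWL + "intersectionOf"}

def syntax_cells(triples: List[Triple]) -> Set[str]:
    """Collect every list cell reachable from a syntax predicate."""
    rest: Dict[str, str] = {s: o for s, p, o in triples if p == RDF + "rest"}
    cells: Set[str] = set()
    for _, p, o in triples:
        if p in SYNTAX_PREDICATES:
            node = o
            while node in rest and node not in cells:
                cells.add(node)
                node = rest[node]
    return cells

def to_shadow(triples: Iterable[Triple]) -> List[Triple]:
    """Rewrite domain-modeling uses of the RDF list vocabulary to the shadow one."""
    triples = list(triples)
    keep = syntax_cells(triples)
    out: List[Triple] = []
    for s, p, o in triples:
        if s not in keep and p not in SYNTAX_PREDICATES:
            if p in (RDF + "first", RDF + "rest"):
                p = SHADOW + p[len(RDF):]
            if o == RDF + "nil":
                o = SHADOW + "nil"
            elif p == RDF + "type" and o == RDF + "List":
                o = SHADOW + "List"
        out.append((s, p, o))
    return out
```

Lists hanging off `owl:unionOf` or `owl:intersectionOf` keep the RDF vocabulary, while lists used for domain modeling, such as the components of a Sequence, are moved to the shadow vocabulary.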
This document was produced as part of the DARPA DAML Program, which provided funding to many of the authors. We very much appreciate the ongoing support, interest, and patience of DAML program managers Jim Hendler, Murray Burke, and Mark Greaves.
We thank the W3C for permitting and supporting the use of the public-sws-ig mailing list for discussion of OWL-S issues, and Carine Bournez at W3C for helping to make this possible.
We would like to give a special thanks to the following individuals who have made significant contributions to this work:
In addition, we would like to express our gratitude to the many interested researchers and users who have employed OWL-S in their projects and publications, and to those who have taken the trouble to share their suggestions and questions in the discussions of OWL-S on the public-sws-ig list.
This document has benefited from input from members of the Semantic Web Services Initiative (SWSI).