Jump to: navigation, search

Difference between revisions of "Mistral/DSLv2"

(YAML example)
(YAML example)
Line 24: Line 24:
  
 
===== YAML example =====
 
===== YAML example =====
 +
---
 
  version: '2.0'
 
  version: '2.0'
 
   
 
   

Revision as of 20:54, 24 September 2014

Mistral DSL version 2 specification

Introduction

Current document fully describes Domain Specific Language (DSL) version 2 of Mistral Workflow Service. Since version 1 issued in May 2014 Mistral team completely reworked the language pursuing with the goal in mind to make it easier to understand while more consistent and flexible.

Unlike Mistral DSL v1 this second version of DSL assumes that all entities that Mistral works with like workflows, actions and triggers are completely independent in terms of how they're referenced and accessed through API (and also Python Client API and CLI). Workbooks, the entity that can combine combine workflows/actions/triggers still exist in the language but only for namespacing and convenience purposes. See Workbooks section for more details.

All DSL consists of the following main object(entity) types that will be described in details next:

Prerequisites

Mistral DSL is fully based on YAML and knowledge of YAML is a plus for better understanding of the material in this specification. It also takes advantage of YAQL query language to define expressions in workflow, action and trigger definitions.

Workflows

Workflow is the main building block of Mistral DSL, the reason why the project exists. Workflow represents a process that can be described in a various number of ways and that can do some job interesting to the end user. Each workflow consists of tasks (at least one) describing what exact steps should be made during workflow execution.

YAML example
---
version: '2.0'

create_vm:
  type: direct
  input:
    - vm_name
    - image_ref
    - flavor_ref
  output:
    vm_id: $.vm_id

  tasks:
    create_server:
      action: nova.servers_create name={$.vm_name} image={$.image_ref} flavor={$.flavor_ref}
      publish:
        vm_id: $.id
      on-success:
        - wait_for_instance

    wait_for_instance:
      action: nova.servers_find id={$.vm_id} status='ACTIVE'
      policies:
        retry:
          delay: 5
          count: 15

Workflow Types

Mistral DSL v2 introduces different workflow types and the structure of each workflow type varies according to its semantics. Currently, Mistral provides two workflow types:

See corresponding sections for details.

Common Workflow Attributes

  • type - Workflow type. Either 'direct' or 'reverse'. Required.
  • description - Arbitrary text containing workflow description. Optional.
  • input - List of required input parameter names. Optional.
  • output - Any data structure arbitrarily containing containing YAQL expressions that defines workflow output. Optional.
  • task-defaults - Default values for some of task attributes defined at workflow level. Optional. Corresponding attribute defined for a specific task always takes precedence. Specific task attributes that could be defined in task-defaults are the following:
    • on-error
    • on-success
    • on-complete
    • policies
  • tasks - Dictionary containing workflow tasks. See below for more details. Required.

Tasks

Task is what a workflow consists of. It defines a specific computational step in the workflow. Each task can optionally take input data and produce output. In Mistral DSL v2 task can be associated with an action or with calling a workflow. In the example below there are two tasks of different types:

action_based_task:
  action: std.http url='openstack.org'

workflow_based_task:
  workflow: backup_vm_workflow vm_id={$.vm_id}

Actions will be explained below in a individual paragraph but looking ahead it's worth saying that Mistral provides a lot of actions out of the box (including actions for most of the core OpenStack services) and it's also easy to plug new actions into Mistral.

Common Attributes

All Mistral tasks regardless of workflow type have the following common attributes:

  • description - Arbitrary text containing task description. Optional.
  • action - Name of the action associated with the task. Required but mutually exclusive with workflow.
  • workflow - Name of the workflow associated with the task. Mutually exclusive with action.
  • input - Actual parameters of the task. Optional. Value of each parameter is a JSON-compliant type such as number, string etc, dictionary or list. It can also be a YAQL expression to retrieve value from task context or any of the mentioned types containing inline YAQL expressions (for example, string "{$.movie_name} is a cool movie!")
  • publish - Any JSON-compatible data structure optionally containing YAQL expression defining what needs to be published to a workflow context. Published results will be accessible for downstream tasks via using YAQL expressions. Optional.
  • policies - Dictionary-like section defining task policies that influence how Mistral Engine runs tasks. Policies are explained in a separate paragraph. Optional.
Policies

Any Mistral task regardless of what its workflow type can optionally have configured policies.

YAML example
my_task:
  ...
  policies:
    wait-before: 2
    wait-after: 4
    timeout: 30
    retry:
      count: 10
      delay: 20
      break-on: $.my_var = true
'wait-before'

Defines a delay in seconds that Mistral Engine should wait before starting a task.

'wait-after'

Defines a delay in seconds that Mistral Engine should wait after a task has completed before starting next tasks defined in 'on-success', 'on-error' or 'on-complete'.

'timeout'

Defines a period of time in seconds after which a task will be failed automatically by engine if hasn't completed.

'retry'

Defines a pattern how task should be repeated in case of an error.

  • count - Defines a maximum number of times that a task can be repeated.
  • delay - Defines a delay in seconds between subsequent task iterations.
  • break-on - Defines a YAQL expression that will break iteration loop if it evaluates to 'true'. If it fires then the task is considered successful.
Simplified Input Syntax

When describing a workflow task it's possible to specify its input parameters in two ways:

Full syntax:

my_task:
  action: std.http
  input:
    url: http://mywebsite.org
    method: GET

Simplified syntax:

 my_task:
   action: std.http url="http://mywebsite.org" method="GET"

The same rules apply to tasks associated with workflows.

Full syntax:

my_task:
  workflow: some_nested_workflow
  input:
    param1: val1
    param2: val2

Simplified syntax:

 my_task:
   workflow: some_nested_workflow param1='val1' param2='val2'

Note: It's also possible to merge these two approaches and specify a part of parameters using simplified key-value pairs syntax and using keyword 'input'. In this case all the parameters will be effectively merged. If the same parameter is specified in both ways then the one under 'input' keyword takes precedence.

Direct Workflow

Direct workflow consists of tasks combined in a graph where every next task starts after another one depending on produced result. So direct workflow has a notion of transition. Direct workflow is considered to be completed if there aren't any transitions left that could be used to jump to next tasks.

Figure 1. Mistral Direct Workflow.


YAML example
version: '2.0'

create_vm_and_send_email:
  type: direct
  input:
    - vm_name
    - image_id
    - flavor_id
  output:
    result: $.vm_id

  tasks:
    create_vm:
      action: nova.servers_create name={$.vm_name} image={$.image_id} flavor={$.flavor_id}
      publish:
        vm_id: $.id
      on-error:
        - send_error_email
      on-success:
        - send_success_email

    send_error_email:
      action: send_email to='admin@mysite.org' body='Failed to create a VM'
      on_complete:
        - fail

    send_success_email:
      action: send_email to='admin@mysite.org' body='Vm is successfully created and its id: {$.vm_id}'
Transitions with YAQL expressions

Task transitions can be determined by success/error/completeness of the previous tasks and also by additional YAQL guard expressions that can access any data produced by upstream tasks. So in the example above task 'create_vm' could also have a YAQL expression on transition to task 'send_success_email' as follows:

create_vm:
  ...
  on-success:
    - send_success_email: $.vm_id != null

And this would tell Mistral to run 'send_success_email' task only if 'vm_id' variable published by task 'create_vm' is not empty. YAQL expressions can also be applied to 'on-error' and 'on-complete'.

Direct Workflow Task Attributes
  • on-success - List of tasks which will run after the task has completed successfully. Optional.
  • on-error - List of tasks which will run after the task has completed with an error. Optional.
  • on-complete - List of tasks which will run after the task has completed regardless of whether it is successful or not. Optional.

Reverse Workflow

In reverse workflow all relationships in workflow task graph are dependencies. In order to run this type of workflow we need to specify a task that needs to be completed, it can be conventionally called 'target task'. When Mistral Engine starts a workflow it recursively identifies all the dependencies that need to be completed first.

Figure 2. Mistral Reverse Workflow.


Figure 2 explains how reverse workflow works. In the example, task T1 is chosen a target task. So when the workflow starts Mistral will run only tasks T7, T8, T5, T6, T2 and T1 in the specified order (starting from tasks that have no dependencies). Tasks T3 and T4 won't be a part of this workflow because there's no route in the directed graph from T1 to T3 or T4.

YAML example
version: '2.0'

create_vm_and_send_email:
  type: reverse
  input:
    - vm_name
    - image_id
    - flavor_id
  output:
    result: $.vm_id

  tasks:
    create_vm:
      action: nova.servers_create name={$.vm_name} image={$.image_id} flavor={$.flavor_id}
      publish:
        vm_id: $.id

    search_for_ip:
      action: nova.floating_ips_findall instance_id=null
      publish:
        vm_ip: $[0].ip

    associate_ip:
      action: nova.servers_add_floating_ip server={$.vm_id} address={$.vm_ip}
      requires: [search_for_ip]

    send_email:
      action: send_email to='admin@mysite.org' body='Vm is created and id {$.vm_id} and ip address {$.vm_ip}'
      requires: [create_vm, associate_ip]
Reverse Workflow Task Attributes
  • requires - List of tasks which should be execute before this tasks. Optional.

Actions

Action defines what exactly needs to be done when task starts. Action is similar to a regular function in general purpose programming language like Python. It has a name and parameters. Mistral distinguishes 'system actions' and 'Ad-hoc actions'.

System Actions

System actions are provided by Mistral out of the box and can be used by anyone. It is also possible to add system actions for specific Mistral installation via a special plugin mechanism. Currently, built-in system actions are:

std.http

Sends an HTTP request.

Input parameters:

  • url - URL for the HTTP request. Required.
  • method - method for the HTTP request. Optional. Default is 'GET'.
  • params - Dictionary or bytes to be sent in the query string for the HTTP request. Optional.
  • body - Dictionary, bytes, or file-like object to send in the body of the HTTP request. Optional.
  • headers - Dictionary of HTTP Headers to send with the HTTP request. Optional.
  • cookies - Dictionary of HTTP Cookies to send with the HTTP request. Optional.
  • auth - Auth to enable Basic/Digest/Custom HTTP Auth. Optional.
  • timeout - Float describing the timeout of the request in seconds. Optional.
  • allow_redirects - Boolean. Set to True if POST/PUT/DELETE redirect following is allowed. Optional.
  • proxies - Dictionary mapping protocol to the URL of the proxy. Optional.


Example:

http_task:
  action: std.http url='google.com'
std.mistral_http

This actions works just like 'std.http' with the only exception: when sending a request it inserts the following HTTP headers:

  • Mistral-Execution-Id - Identifier of the workflow execution this action is associated with.
  • Mistral-Task-Id - Identifier of the task instance this action is associated with.

Using this action makes it possible to do any work in asynchronous manner triggered via HTTP protocol. That means that Mistral can send a request using 'std.mistral_http' and then any time later whatever system that received this request can notify Mistral back (using its public API) with the result of this action. Header Mistral-Task-Id is required for this operation because it is used a key to find corresponding task in Mistral to attach the result to.

std.email

Sends an email message via SMTP protocol.

  • params - Dictionary containing the following keys:
    • to - Comma separated list of recipients. Required.
    • subject - Subject of the message. Required.
    • body - Text containing message body. Required.
  • settings - Dictionary containing the following keys:
    • from - Sender email address. Required.
    • smtp_server - SMTP server host name. Required.
    • password - SMTP server password. Required.


Example:

http_task:
  action: std.email
  input:
    params:
      to: admin@mywebsite.org
      subject: Hello from Mistral :)
      body: |
        Cheers! (:_:)

        -- Thanks, Mistral Team.
    settings:
      from: mistral@openstack.org
      smtp_server: smtp.google.com
      password: SECRET 

The syntax of 'std.emal' actions is pretty verbose. However, it can be significantly simplified using Ad-hoc actions. More about them below.

std.ssh

Runs Secure Shell command.

Input parameters:

  • cmd - String containing a shell command that needs to be executed. Required.
  • host - Host name that the command needs to be executed on. Required.
  • username - User name to authenticate on the host.
  • password - User password to to authenticate on the host.

Note: Authentication using key pairs is currently not supported.

std.echo

Simple action mostly needed for testing purposes that returns a predefined result.

Input parameters:

  • output - Value of any type that needs to be returned as a result of the action. Required.

Ad-hoc Actions

Ad-hoc action is a special type of action that can be created by user. Ad-hoc action is always created as a wrapper around any other existing system action and its main goal is to simplify using same actions many times with similar pattern.

Note: Nested ad-hoc actions currently are not supported (i.e. ad-hoc action around another ad-hoc action).

YAML example
---
version: '2.0'

error_email:
  input:
    - execution_id

  base: std.email
  base-input:
    params:
      to: admin@mywebsite.org
      subject: Something went wrong with your Mistral workflow :(
      body: |
          Please take a look at Mistral Dashboard to find out what's wrong
          with your workflow execution {$.execution_id}.

          Everything's going to be alright!

          -- Sincerely, Mistral Team.
      settings:
        from: mistral@openstack.org
        smtp_server: smtp.google.com
        password: SECRET 

Once this action is uploaded to Mistral any workflow will be able to use it as follows:

my_workflow:
  tasks:
    ...
    send_error_email
      action: error_email execution_id={$.__execution.id}
Attributes
  • name - action name (string without space, mandatory attribute).
  • base - name of base action that this action is built on top of.
  • base-input - dictionary whose structure is defined by action class. For example, for 'std.http' action it contains 'url', 'method', 'body' and 'headers' according to HTTP protocol specification.
  • input - list containing parameter names which should or could be specified in task. This attribute is optional and used only for documenting purposes.
  • output - any data structure defining how to transform the output of base action into the output of this action. I can optionally have YAQL expressions to access properties of base action output.

Triggers [coming in version 0.2.0]

NOTE: Triggers are not yet implemented as part of version 0.1.0, they will go into 0.2.0

Using triggers it is possible to run workflows according to specific rules: periodically setting a cron (http://en.wikipedia.org/wiki/Cron) pattern or on external events like ceilometer alarm.

YAML example

TODO

Attributes

TODO

Workbooks

TODO

YAML example

TODO

Attributes

TODO