Jump to: navigation, search

Difference between revisions of "Meteos/ExampleKmeans"

(Classify a User Preferences by using Meteos)
(4. Parse a raw data)
Line 228: Line 228:
 
| user_id    | 46b1440477394fa3a1cbe64a9c50b192                |
 
| user_id    | 46b1440477394fa3a1cbe64a9c50b192                |
 
+-------------+-------------------------------------------------+
 
+-------------+-------------------------------------------------+
 +
</pre>
 +
 +
=== 5. Create a prediction model ===
 +
 +
In this example, User creates a Kmeans Model from parsed dataset.
 +
 +
Parsed dataset has been already distributed in hdfs of experiment environment.
 +
 +
So, you spefity the internal url (internal://<dataset-id>) in source_dataset_url parameter.
 +
 +
And you can spefity a numClasses parameter in model_params which shows how many classes you want to classify.
 +
 +
<pre>
 +
$ vim sample/json/model_kmeans.json
 +
</pre>
 +
<pre>
 +
$ cat sample/json/model_kmeans.json
 +
{
 +
    "display_name": "sample-kmeans-model",
 +
    "display_description": "KMeans Model",
 +
    "source_dataset_url": "internal://cfd6f339-5c94-4e1b-a718-11973fe36d95",
 +
    "model_type": "KMeans",
 +
    "model_params": "{'numClasses': 5}",
 +
    "experiment_id": "91504a65-01cf-428f-81aa-596be7ca8619"
 +
}
 +
</pre>
 +
<pre>
 +
$ meteos model-create --json sample/json/model_kmeans.json
 +
+-------------+--------------------------------------+
 +
| Property    | Value                                |
 +
+-------------+--------------------------------------+
 +
| created_at  | 2016-12-05T04:57:32.000000          |
 +
| description | KMeans Model                        |
 +
| id          | cb829823-a57d-45f1-ae38-658677be085f |
 +
| name        | sample-kmeans-model                  |
 +
| params      | eydudW1DbGFzc2VzJzogNX0=            |
 +
| project_id  | a48633f159924753838dc062e070c539    |
 +
| status      | creating                            |
 +
| stderr      | None                                |
 +
| stdout      | None                                |
 +
| type        | KMeans                              |
 +
| user_id    | 46b1440477394fa3a1cbe64a9c50b192    |
 +
+-------------+--------------------------------------+
 +
</pre>
 +
<pre>
 +
$ meteos model-list
 +
+--------------------------------------+---------------------+--------------+-----------+--------+-------------------------------------------------+----------------------------+
 +
| id                                  | name                | description  | status    | type  | source_dataset_url                              | created_at                |
 +
+--------------------------------------+---------------------+--------------+-----------+--------+-------------------------------------------------+----------------------------+
 +
| cb829823-a57d-45f1-ae38-658677be085f | sample-kmeans-model | KMeans Model | available | KMeans | internal://cfd6f339-5c94-4e1b-a718-11973fe36d95 | 2016-12-05T04:57:32.000000 |
 +
+--------------------------------------+---------------------+--------------+-----------+--------+-------------------------------------------------+----------------------------+
 
</pre>
 
</pre>

Revision as of 05:00, 5 December 2016

Classify a User Preferences by using Meteos

In this example, you create a prediction model which classify a user by there skill using KMeans Model.

KMeans.png

1. Create a experiment template

Create template of experiment. Experiment is a workspace of Machine Learning.

You have to confirm a glance image id of meteos image, and a neutron network id before creating a template.

You can use a format located in python-meteosclient/sample/json/template.json

$ glance image-list | grep meteos
| 45de4bbd-8419-40ff-8ed7-fc065c05e34f | meteos                          |
$ neutron net-list | grep public
| 84c13e76-ced9-4142-a885-280784f1f7a3 | public  | a14de1c5-b8d4-434b-a056-9b0049b93402             |
$ vim sample/json/template.json
$ cat sample/json/template.json
{
    "display_name": "example-template",
    "display_description": "This is a sample template of experiment",
    "image_id" : "45de4bbd-8419-40ff-8ed7-fc065c05e34f",
    "master_nodes_num": 1,
    "master_flavor_id": "4",
    "worker_nodes_num": 2,
    "worker_flavor_id": "2",
    "spark_version": "1.6.0",
    "floating_ip_pool": "84c13e76-ced9-4142-a885-280784f1f7a3"
}
$ meteos template-create --json sample/json/template.json
+---------------+-----------------------------------------+
| Property      | Value                                   |
+---------------+-----------------------------------------+
| cluster_id    | None                                    |
| created_at    | 2016-12-04T07:16:29.000000              |
| description   | This is a sample template of experiment |
| id            | 8b7b9b89-f119-4b9b-b9b0-31598f819f1a    |
| master_flavor | 4                                       |
| master_nodes  | 1                                       |
| name          | example-template                        |
| project_id    | 67401cca74c2409b939e944bc6c8fcbe        |
| spark_version | 1.6.0                                   |
| status        | available                               |
| user_id       | 181b1caa9d5b470393ca66b9e511d5b0        |
| worker_flavor | 2                                       |
| worker_nodes  | 2                                       |
+---------------+-----------------------------------------+


2. Create a experiment from template

Create a experiment by using template created in the above step. You have to confirm a neutron private network id and create keypair before creating a template.

You can use a format located in python-meteosclient/sample/json/experiment.json

$ nova keypair-add key1 > ~/key1.pem && chmod 600 ~/key1.pem
$ neutron net-list | grep private
| 8abc626e-2b06-4c67-9b2c-0231f0cef5b8 | private | cb58940f-859b-48c6-b92a-3861470f1fc1 20.0.0.0/26 |
$ vim sample/json/experiment.json
$ cat sample/json/experiment.json
{
    "display_name": "example-experiment",
    "display_description": "This is a sample experiment",
    "key_name": "key1",
    "neutron_management_network": "8abc626e-2b06-4c67-9b2c-0231f0cef5b8",
    "template_id": "8b7b9b89-f119-4b9b-b9b0-31598f819f1a"
}
$ meteos experiment-create --json sample/json/experiment.json
+--------------------+--------------------------------------+
| Property           | Value                                |
+--------------------+--------------------------------------+
| created_at         | 2016-12-04T07:20:11.000000           |
| description        | This is a sample experiment          |
| id                 | 91504a65-01cf-428f-81aa-596be7ca8619 |
| key_name           | key1                                 |
| management_network | 8abc626e-2b06-4c67-9b2c-0231f0cef5b8 |
| name               | example-experiment                   |
| project_id         | 67401cca74c2409b939e944bc6c8fcbe     |
| status             | creating                             |
| user_id            | 181b1caa9d5b470393ca66b9e511d5b0     |
+--------------------+--------------------------------------+

Meteos creates a experiment using OpenStack Sahara spark plugin.

You can see a sahara cluster and nova VMs created by Meteos as below.

$ openstack dataprocessing cluster list (or sahara cluster-list)
+------------------+--------------------------------------+-------------+----------------+----------+
| Name             | Id                                   | Plugin name | Plugin version | Status   |
+------------------+--------------------------------------+-------------+----------------+----------+
| cluster-91504a65 | 13418fd9-5d2a-4ee6-b384-cb250b7e7714 | spark       | 1.6.0          | Spawning |
+------------------+--------------------------------------+-------------+----------------+----------+
$ openstack server list (or nova list)
+--------------------------------------+----------------------------+--------+----------+------------+
| ID                                   | Name                       | Status | Networks | Image Name |
+--------------------------------------+----------------------------+--------+----------+------------+
| 58818eb5-ade7-407c-8c76-9fd9809632b4 | cluster-91504a65-workers-1 | BUILD  |          | meteos     |
| a151dbd9-de51-43ca-afb8-1fdeecce2891 | cluster-91504a65-workers-0 | BUILD  |          | meteos     |
| d02d85c5-0960-4b7e-880c-26b73c5dd8ad | cluster-91504a65-master-0  | BUILD  |          | meteos     |
+--------------------------------------+----------------------------+--------+----------+------------+

3. Upload a raw data

Upload a raw data (in this example user skill data) to OpenStack Swift.

You can use a sample data located in python-meteosclient/sample/data/kmeans_data.txt

Raw data shows "UserID", "Rank of Skill A", "Rank of Skill A", ... from left.

$ cd sample/data/
/sample/data$ head kmeans_data.txt
1,1.0,1.0,0.0,0.0,0.0
2,3.0,3.0,2.0,3.0,3.0
3,5.0,5.0,5.0,4.0,5.0
4,7.0,7.0,6.0,6.0,7.0
5,7.0,8.0,7.0,8.0,8.0
6,0.0,1.0,1.0,0.0,1.0
7,3.0,2.0,2.0,2.0,3.0
8,4.0,4.0,4.0,5.0,5.0
9,6.0,6.0,6.0,7.0,6.0
10,8.0,8.0,8.0,7.0,7.0
/sample/data$ swift upload meteos kmeans_data.txt
kmeans_data.txt

4. Parse a raw data

Parse a raw data to enable Prediction Model to handle it.

KMeans model requires only parameters to classify, so you have to elminate UserID from raw data using map method.

You can use a sample format located in python-meteosclient/sample/json/dataset_parse.json

You can see the head data of parsed dataset by executing "meteos dataset-show <dataset-uuid>" command.

$ vim ../python-meteosclient/sample/json/dataset_parse.json
$ cat ../python-meteosclient/sample/json/dataset_parse.json
{
    "source_dataset_url": "swift://meteos/kmeans_data.txt",
    "display_name": "sample-data",
    "display_description": "user skill dataset",
    "method": "parse",
    "params": [{"method": "map", "args": "lambda l: l.split(',',1)[1]"}],
    "experiment_id": "91504a65-01cf-428f-81aa-596be7ca8619",
    "swift_tenant": "demo",
    "swift_username": "demo",
    "swift_password": "nova"
}
$ meteos dataset-create --json sample/json/dataset_parse.json
+-------------+--------------------------------------+
| Property    | Value                                |
+-------------+--------------------------------------+
| created_at  | 2016-12-05T04:38:01.000000           |
| description | user skill dataset                   |
| head        | None                                 |
| id          | cfd6f339-5c94-4e1b-a718-11973fe36d95 |
| name        | sample-data                          |
| project_id  | a48633f159924753838dc062e070c539     |
| status      | creating                             |
| stderr      | None                                 |
| user_id     | 46b1440477394fa3a1cbe64a9c50b192     |
+-------------+--------------------------------------+
$ meteos dataset-list
+--------------------------------------+-------------+--------------------+-----------+--------------------------------+----------------------------+
| id                                   | name        | description        | status    | source_dataset_url             | created_at                 |
+--------------------------------------+-------------+--------------------+-----------+--------------------------------+----------------------------+
| cfd6f339-5c94-4e1b-a718-11973fe36d95 | sample-data | user skill dataset | available | swift://meteos/kmeans_data.txt | 2016-12-05T04:38:01.000000 |
+--------------------------------------+-------------+--------------------+-----------+--------------------------------+----------------------------+
$ meteos dataset-show cfd6f339-5c94-4e1b-a718-11973fe36d95
+-------------+-------------------------------------------------+
| Property    | Value                                           |
+-------------+-------------------------------------------------+
| created_at  | 2016-12-05T04:38:01.000000                      |
| description | user skill dataset                              |
| head        | [u'1.0,1.0,0.0,0.0,0.0',                        |
|             | u'3.0,3.0,2.0,3.0,3.0', u'5.0,5.0,5.0,4.0,5.0', |
|             | u'7.0,7.0,6.0,6.0,7.0', u'7.0,8.0,7.0,8.0,8.0', |
|             | u'0.0,1.0,1.0,0.0,1.0', u'3.0,2.0,2.0,2.0,3.0', |
|             | u'4.0,4.0,4.0,5.0,5.0', u'6.0,6.0,6.0,7.0,6.0', |
|             | u'8.0,8.0,8.0,7.0,7.0']                         |
| id          | cfd6f339-5c94-4e1b-a718-11973fe36d95            |
| name        | sample-data                                     |
| project_id  | a48633f159924753838dc062e070c539                |
| status      | available                                       |
| stderr      |                                                 |
| user_id     | 46b1440477394fa3a1cbe64a9c50b192                |
+-------------+-------------------------------------------------+

5. Create a prediction model

In this example, User creates a Kmeans Model from parsed dataset.

Parsed dataset has been already distributed in hdfs of experiment environment.

So, you spefity the internal url (internal://<dataset-id>) in source_dataset_url parameter.

And you can spefity a numClasses parameter in model_params which shows how many classes you want to classify.

$ vim sample/json/model_kmeans.json
$ cat sample/json/model_kmeans.json
{
    "display_name": "sample-kmeans-model",
    "display_description": "KMeans Model",
    "source_dataset_url": "internal://cfd6f339-5c94-4e1b-a718-11973fe36d95",
    "model_type": "KMeans",
    "model_params": "{'numClasses': 5}",
    "experiment_id": "91504a65-01cf-428f-81aa-596be7ca8619"
}
$ meteos model-create --json sample/json/model_kmeans.json
+-------------+--------------------------------------+
| Property    | Value                                |
+-------------+--------------------------------------+
| created_at  | 2016-12-05T04:57:32.000000           |
| description | KMeans Model                         |
| id          | cb829823-a57d-45f1-ae38-658677be085f |
| name        | sample-kmeans-model                  |
| params      | eydudW1DbGFzc2VzJzogNX0=             |
| project_id  | a48633f159924753838dc062e070c539     |
| status      | creating                             |
| stderr      | None                                 |
| stdout      | None                                 |
| type        | KMeans                               |
| user_id     | 46b1440477394fa3a1cbe64a9c50b192     |
+-------------+--------------------------------------+
$ meteos model-list
+--------------------------------------+---------------------+--------------+-----------+--------+-------------------------------------------------+----------------------------+
| id                                   | name                | description  | status    | type   | source_dataset_url                              | created_at                 |
+--------------------------------------+---------------------+--------------+-----------+--------+-------------------------------------------------+----------------------------+
| cb829823-a57d-45f1-ae38-658677be085f | sample-kmeans-model | KMeans Model | available | KMeans | internal://cfd6f339-5c94-4e1b-a718-11973fe36d95 | 2016-12-05T04:57:32.000000 |
+--------------------------------------+---------------------+--------------+-----------+--------+-------------------------------------------------+----------------------------+