wiki:Internal/newTutorials/0-2

Tutorial 0-2: Using a testbed

Most interactions with nodes are done via "OMF commands". These are a simple set of commands used to perform basic functions such as checking node status, turning nodes on/off, loading/saving disk images, or executing experiment scripts (which will be addressed in future tutorials). These OMF commands are performed on the console of the testbed you will be using. As such, the first step is to connect to the testbed console after the time slot for your approved reservation begins.


Connecting to a testbed console

All access to a testbed is done through the testbed's console machine. You can connect to the console at any time during your approved reservation time slot.

For example, connecting via SSH to yourUserName@sb7.orbit-lab.org will connect you to the console machine of "Sandbox 7".


Checking the status of a node

To check the status of node1-1, on the console run:

omf stat -t node1-1

Which will give output that looks similar to the following

DEBUG FQDN:console.sb7.orbit-lab.org:
 INFO NodeHandler: OMF Experiment Controller 5.4 (git 861d645)
 INFO NodeHandler: Reading configuration file /etc/omf-expctl-5.4/services.yaml
 INFO NodeHandler: Add domain http - http://internal1.orbit-lab.org:5054/
 INFO NodeHandler: Add domain http - http://repository1.orbit-lab.org:5054/
 INFO NodeHandler: Add domain http - http://external1.orbit-lab.org:5054/
 INFO NodeHandler: Slice ID: default_slice (default)
 INFO NodeHandler: Experiment ID: default_slice-2018-01-24t11.49.44.767-05.00
 INFO NodeHandler: Message authentication is disabled
 INFO Experiment: load system:exp:stdlib
 INFO property.resetDelay: resetDelay = 230 (Fixnum)
 INFO property.resetTries: resetTries = 1 (Fixnum)
 INFO Experiment: load system:exp:eventlib
 INFO Experiment: load system:exp:winlib
 INFO Experiment: load system:exp:stat
 INFO property.nodes: nodes = "node1-1" (String)
 INFO property.summary: summary = false (FalseClass)

Talking to the CMC service, please wait
-----------------------------------------------
 Node: node1-1.sb7.orbit-lab.org         State: POWERON
-----------------------------------------------

 INFO EXPERIMENT_DONE: Event triggered. Starting the associated tasks.
 INFO NodeHandler:
 INFO NodeHandler: Shutting down experiment, please wait...
 INFO NodeHandler:
 INFO run: Experiment default_slice-2018-01-24t11.49.44.767-05.00 finished after 0:5

You can see the state of node1-1 in this case as "POWERON", meaning that the node is powered on. Other states may say "POWEROFF" to indicate a node is powered off or "NOT REGISTERED" indicating the node was unavailable for use at this time.


Loading a disk image onto a node

Example: Suppose we want to load the baseline disk image onto node1-1 and expand the partition to the full size of the disk in node1-1.

On the console run:

omf load -i baseline.ndz -r -t node1-1

For the "topology" listed after -t, you can specify a single node such as "node1-1", a set of nodes such as "node1-1,node1-3,node1-4", or even a range of nodes such as "[1..2,7..9]" which in this case is equivalent to "node1-7,node1-8,node1-9,node2-7,node2-8,node2-9".

After imaging completes successfully, the nodes will be turned off. They must be turned on before you can connect to them.


Turning nodes on/off

To turn on node1-1:

omf tell -a on -t node1-1

Just like a normal computer, it can take several minutes for a node to finish booting up. Please wait before trying to SSH into it.

A clever little shortcut to turning on the nodes that you just imaged is specifying the topology system:topo:imaged which is automatically regenerated every time you perform an imaging operation on a node or set of nodes.


To turn off node1-1 as if you held down the power button (ie. hard off):

omf tell -a offh -t node1-1


To reset node1-1 as if your pressed the reset button:

omf tell -a reset -t node1-1


Accessing a node

On the console:

ssh root@node1-1

If you see an error or warning message while trying to SSH into a node, please read the message. Often times the message has to do with host keys on the node having changed from what the console expects (this is typically normal) because every image has a different host key, therefore it will change depending on what node and image you happen to be using. Follow the instructions in the message to resolve this.


Saving a node's disk image

If you wish to save a node's disk image for future use, it is important to do so well BEFORE your reservation is over because it takes several minutes to complete and there is no guarantee what the next user will do to the node.

Example: Suppose we want to save the disk image of node1-1 on in Sandbox 7.

First SSH into node1-1 and run:

/root/prepare.sh

If you do not run prepare.sh on the node, the disk image WILL NOT WORK when loaded onto another node.

Then on the console run:

omf save -n node1-1.sb7.orbit-lab.org

It will take several minutes to complete the save. Once finished, take note of the filename the disk image was saved to (the format of which is username-node-nodename-date-time.ndz) so that you can use it later when loading the disk image again.

Last modified 7 years ago Last modified on Jan 24, 2018, 4:51:49 PM
Note: See TracWiki for help on using the wiki.