Scripting

Materials and notes for practicals in Masaryk University course DSIB01 Introduction to Bioinformatics taught in the fall semester 2021/2022


Scripting

Script is a document telling computer what to do. A script is just a plain text file and it may have any name you like. You create them the same way you would any other text file, with just a plain old text editor. Anything you can run on the command line you may place into a script and they will behave exactly the same. Vice versa, anything you can put into a script, you may run on the command line and again it will perform exactly the same.

Not only will writing shell scripts make your work faster — you won’t have to retype the same commands over and over again — it will also make it more accurate (fewer chances for typos) and more reproducible. If you come back to your work later (or if someone else finds your work and wants to build on it) you will be able to reproduce the same results simply by running your script, rather than having to remember or retype a long list of commands.

Your first script

This script will print a message to the screen (using a program called echo) then give us a listing of what is in our current directory. Linux is an extensionless system so it is not required for scripts to have a .sh extension. It is common to put them on however to make them easy to identify.

Terminal

$ cd ~/test_dir

$ nano myscript.sh

The very first line of a script should always be this line. This line identifies which interpreter should be used. The first two characters are referred to as a shebang. After that (important, no spaces) is the path to the interpreter. Anything following a # is a comment. The interpreter will not run this, it is just here for our benefit. It is good practice to include your name, and the date you wrote the script as well as a one line quick description of what it does at the top of the script. We’ll use a program called echo It will merely print whatever you place after it, as command line arguments, to the screen. Useful for printing messages. ls is a command that is writing to command line. Even when used in script, it will still write to command line.

Nano

#!/bin/bash

# A simple demonstration script

# Gresova 1.10.2021

echo Here are the files in your current directory:

ls

To run our script, we need to specify path to it. We can use relative path ./myscritp.sh or absobute path /home/student/test_dir/myscript.sh.

Terminal

$ ./myscript.sh

Here are the files in your current directory:

foo myscript.sh

Variables

A variable is a container for a simple piece of data. They are useful if we need to work out a particular thing and then use it later on. Variables are easy to set and refer to but they have a specific syntax that must be followed exactly for them to work.

  • When we set a variable, we specify it’s name, followed directly by an equals sign (=) followed directly by the value. (So, no spaces on either side of the = sign.)
  • When we refer to a variable, we must place a dollar sign ($) before the variable name.

Terminal

$ nano variable.sh

Nano

#!/bin/bash

# A simple demonstration of variables

# Gresova 1.10.2021

name='Katarina'

echo Hello $name

Terminal

$ ./variable.sh

Hello Katarina

Command line arguments

When we run a script, there are several variables that get set automatically for us. Here are some of them:

  • $0 - The name of the script.
  • $1 - $9 - Any command line arguments given to the script. $1 is the first argument, $2 the second and so on.
  • $# - How many command line arguments were given to the script.
  • $* - All of the command line arguments.
  • There are other variables but these should be enough to get you going for now.

Terminal

$ nano morevariables.sh

Nano

#!/bin/bash

# A simple demonstration of command line arguments

# Gresova 1.10.2021

echo My name is $0 and I have been given $# command line arguments

echo Here they are: $*

echo And the 2nd command line argument is $2

Terminal

$ ./morevariables.sh bob fred sally

My name is morevariables.sh and I have been given 3 command line arguments

Here they are: bob fred sally

And the 2nd command line argument is fred


Previous page Home next page