Chapter 15: Functions | The Engineering Corner

Chapter 15: Functions


Now that we have learnt the basic building blocks of Bash, we are going to add another layer of abstraction: functions.

In Bash scripting, functions serve as essential building blocks that allow for the modularization and organization of code. A function in Bash is a self-contained block of code that performs a specific task, and it can be invoked or called from anywhere within the script. This modular approach not only enhances code readability but also promotes code reuse, making scripts more efficient and maintainable.

To declare a function in Bash, you can use either the “function” keyword followed by the function name, or the shorthand syntax in which the function name is followed by parentheses. In both cases the function body is enclosed in curly braces. Functions may or may not receive arguments, allowing for flexibility in handling input parameters. Additionally, they can report an exit status to the calling code and write output that the caller can capture, contributing to the versatility of Bash scripts.

Functions in Bash enable the creation of structured and organized scripts by encapsulating specific functionalities. As we go deeper into Bash scripting, we will explore the syntax, parameters, return values, and best practices for utilizing functions. Understanding how to leverage functions enhances the efficiency and readability of Bash scripts, contributing to the development of robust and maintainable automation solutions.

Declaration

In Bash scripting, declaring a function involves specifying the function’s name, defining its behavior or code block, and optionally providing parameters that the function can accept. The syntax for declaring a function is straightforward and can be done using either the “function” keyword or a concise shorthand notation.

Next, let’s see a couple of examples of how to declare a function.

The first example is by using the “function” keyword.

#!/usr/bin/env bash
# Script: function-0001.sh
function my_function() {
   echo "Hello from inside the function"
}
my_function

When you run the previous script you get the following output.

$ ./function-0001.sh
Hello from inside the function

And the second example is by declaring the function without the “function” keyword, like the following script.

#!/usr/bin/env bash
# Script: function-0002.sh
my_function() {
   echo "Hello from inside the function"
}
my_function

If you run the last script you will see that it produces the same output as the “function-0001.sh” script.

Although both approaches are equivalent in Bash, the second one (without the “function” keyword) is the form defined by POSIX, so it also works in other POSIX-compatible shells. For this portability reason, it is the approach that we will use in this book.

Something to notice is that a function can NEVER have an empty body. For example, consider the following script.

#!/usr/bin/env bash
# Script: function-0003.sh
error_fn_1() {
}
error_fn_2() {
   echo "Some commands"
}

If you try to run the previous script you will get the following error.

$ ./function-0003.sh
./function-0003.sh: line 4: syntax error near unexpected token `}'
./function-0003.sh: line 4: `}'

The body of a function must contain at least one command or statement, and it can be any combination of the ones we learnt before (if, case, for loop, while loop, …). As with any other code, keep the body clean so that you can understand it when you come back to it.
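When a function is meant to be filled in later, one common way around the empty-body restriction is the no-op builtin “:” (colon), which does nothing and succeeds. The following is a minimal sketch; the name “todo_fn” is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical placeholder function: ':' is the shell's no-op builtin,
# so the body is not empty and the script parses correctly.
todo_fn() {
   :
}
todo_fn
echo "Exit status of todo_fn: $?"
```

Because “:” always succeeds, the script should report an exit status of 0.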

In the next sections we will dive into different details of functions. We will start by comparing declaration vs call of a function.

Declaration vs Call

In Bash scripting you cannot call a function unless it has been previously declared. Bear in mind that declaring and calling a function are two different things. You can declare functions that call other functions/commands and it will work as long as at the moment of the execution the commands/functions are available to the script.

Let’s see what happens when you try to invoke a function that has not been declared before the call.

#!/usr/bin/env bash
# Script: function-0004.sh
my_function # not declared before execution
my_function() {
   echo "My function"
}

When you run the previous script you will receive the following error.

$ ./function-0004.sh
./function-0004.sh: line 3: my_function: command not found

Now if you swap the order having the function declared before the call of the function, everything works.

#!/usr/bin/env bash
# Script: function-0005.sh
my_function() {
   echo "My function"
}
my_function

When you execute the last script you will get a successful execution.

$ ./function-0005.sh
My function

In the following example you will see that there are two functions being declared which are “my_function_1” and “my_function_2”. You will also notice that “my_function_1” invokes “my_function_2”, which is declared after “my_function_1”.

#!/usr/bin/env bash
# Script: function-0006.sh
my_function_1() {
   echo "Inside my_function_1"
   my_function_2
}
my_function_2() {
   echo "Inside my_function_2"
}
my_function_1

This is OK because by the time Bash executes “my_function_1” all the information needed by it to execute successfully (in our case, declaration of “my_function_2”) is available in memory/context.

If you execute the previous script you will get the following output.

$ ./function-0006.sh
Inside my_function_1
Inside my_function_2

In the same way that a Bash script can have variables, a function can have its own variables, which are called “local variables”. In the next section we will talk about local variables.

Local variables

What is the “scope” of a variable? The scope of a variable is the context in which it has meaning, in which it has a value that can be referenced. For example, the scope of a local variable is the function in which it is declared (plus any function called from it), while the scope of a global variable is the entire script in which it appears. A block of code ({...}) does not create a new scope, and a subshell (we will talk about it later in the book) receives a copy of the variables, so changes made inside it are not seen by the rest of the script.

A variable declared with the “local” builtin, which can only be used inside a function, is visible only within that function and the functions it calls. If a variable within a function is not declared as local, it has global scope by default.

Before a function is called, all variables assigned within the function are invisible outside the body of the function, not just those explicitly declared as “local”. They only come into existence when the function runs.

Let’s see how it works with the following example script.

#!/usr/bin/env bash
# Script: function-0007.sh
custom1() {
   local localVar=324
   globalVar=123
   echo "localVar: $localVar"
   echo "Done with $FUNCNAME"
}
echo "Local variable before function: $localVar"
echo "Global variable before function: $globalVar"
custom1
echo "Local variable after function: $localVar"
echo "Global variable after function: $globalVar"

When you execute the previous script you will have the following output in the terminal window.

$ ./function-0007.sh
Local variable before function:
Global variable before function:
localVar: 324
Done with custom1
Local variable after function:
Global variable after function: 123

So, what is happening in the execution of this script? When the script reaches lines 9 and 10 it just prints the strings without the content of the variables. This is because neither variable has been assigned at this point, so both expand to an empty string.

Then, on line 11 the function “custom1” is executed. Inside the function there are 2 variables. The first variable is a local variable named “localVar” whose scope is the function itself and it will not be available outside the function. The second variable, named “globalVar”, is a global variable (as the “local” keyword was not used, global scope is the default one) that, once the function is executed, will be available to the rest of the script.

Once the execution of the function is done you see that only the global variable is present in the output.
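A related case is a local variable that has the same name as an existing global one. The following sketch, in which the names “counter” and “shadow_counter” are purely illustrative, suggests that the local declaration hides the global value only while the function runs.

```shell
#!/usr/bin/env bash
# Hypothetical names: 'counter' and 'shadow_counter' are for illustration only.
counter=1
shadow_counter() {
   local counter=99   # shadows the global only while the function runs
   echo "Inside the function: $counter"
}
shadow_counter
echo "After the function: $counter"
```

The function prints 99, while the line after the call still prints 1, because the global variable was never modified.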

Overriding functions and commands

In Bash you can override the declaration of a function (or a command) by declaring a new function with the same exact name.

The way it works is that, once a function (or command) is available inside the script you are working on, you can declare a function with the same name as the one you want to override. From that moment on, the new function is the one that runs when the name is used.

To summarize, the latest declaration wins.

Let’s see how it works with a couple of examples to show how we can override functions (or commands).

#!/usr/bin/env bash
# Script: function-0008.sh
my_function_1() {
   echo "Inside my_function_1 - 1"
}
my_function_1() { # Will override the previous declaration
   echo "Inside the override of my_function_1"
}
my_function_1

As you can see in the previous script, the function is declared twice. As we mentioned before, the second declaration (the latest one) is the one that will be used.

If you execute the previous script you get the following result.

$ ./function-0008.sh
Inside the override of my_function_1

As you can see from the execution the second declaration of the function “my_function_1” is the one that got executed.

Now that we know how to override functions, let’s try to do the same with commands.

In the following example we are going to override the command “ls”[1].

#!/usr/bin/env bash
#Script: function-0009.sh
echo "Before overriding"
echo "##########"
ls  # standard command 'ls'. Will print directory content
echo ""
ls() { # Overriding command 'ls'
   echo "Nothing to see here"
}
echo "After overriding"
echo "##########"
ls   # Will print "Nothing to see here"
echo ""

In the previous script, on line 5, the actual “ls” command is used to list the contents of the current folder. Later, between lines 7 and 9, we declare a function with the name “ls” to override the command “ls”.

When you run the previous script you will get the following result in the terminal window.

$ ./function-0009.sh
Before overriding
##########
function-0001.sh  function-0003.sh  function-0005.sh  function-0007.sh  function-0009.sh
function-0002.sh  function-0004.sh  function-0006.sh  function-0008.sh

After overriding
##########
Nothing to see here

As you can see in the execution of the last script, before line 7 of the script the actual “ls” command is used. After line 7, the function is the one used.
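An override does not have to replace the command completely. The sketch below assumes you want a wrapper that adds a message and then runs the real “ls”. It relies on two builtins: “command”, which skips function lookup, and “unset -f”, which removes a function. The scratch directory and file name are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: a wrapper named 'ls' that still reaches the real command.
# 'command' is a builtin that skips function lookup, which avoids
# the wrapper calling itself forever.
ls() {
   echo "Listing through the wrapper"
   command ls "$@"
}
demo_dir=$(mktemp -d)          # hypothetical scratch directory
touch "$demo_dir/example.txt"  # hypothetical file name
ls "$demo_dir"                 # runs the wrapper, then the real 'ls'
unset -f ls                    # removes the function; 'ls' is the command again
rm -r "$demo_dir"
```

Without “command”, the call to “ls” inside the function would invoke the function again.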

Variable `$FUNCNAME` associated to a function

The variable “FUNCNAME” is an array containing the names of all shell functions currently in the execution call stack. The element with index 0 is the name of any currently-executing shell function. The bottom-most element (the one with the highest index) is “main”[2]. This variable exists only when a shell function is executing. Assignments to “FUNCNAME” have no effect. If “FUNCNAME” is unset, it loses its special properties, even if it is subsequently reset.

Let’s see how it works with the following example script.

#!/usr/bin/env bash
#Script: function-0010.sh
my_custom_function() {
   echo "We are inside the function '$FUNCNAME'"
   echo "Array: ${FUNCNAME[@]}"
   my_custom_function_2
}
my_custom_function_2() {
   echo "Array: ${FUNCNAME[@]}"
   my_custom_function_3
}
my_custom_function_3() {
   echo "Array: ${FUNCNAME[@]}"
}
my_custom_function
echo "End"

If you run the previous script you will see the following output in the terminal window.

$ ./function-0010.sh
We are inside the function 'my_custom_function'
Array: my_custom_function main
Array: my_custom_function_2 my_custom_function main
Array: my_custom_function_3 my_custom_function_2 my_custom_function main
End
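One practical use of the call stack is a logging helper that reports who called it. The following is a minimal sketch, with “log” and “backup_files” as hypothetical names: element 0 of “FUNCNAME” is the helper itself, so element 1 is its caller.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prefixes a message with the name of the caller.
# FUNCNAME[0] is 'log' itself, FUNCNAME[1] is whoever called it.
log() {
   echo "[${FUNCNAME[1]}] $*"
}
backup_files() {
   log "starting"
}
backup_files
log "called from the top level"
```

The first message is prefixed with “[backup_files]”. When the file is run as a script, the second one should be prefixed with “[main]”, as described above.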

Positional parameters

This section is going to be very useful because what we will learn here is applicable to both functions and scripts.

Until now we have learnt how to write functions and scripts that execute a task without receiving anything from the caller. Now we are going to learn how to pass arguments to a function/script so that they can be used as parameters.

What is the difference between arguments and parameters? To be on the same page we are going to use the definitions that appear in the MDN Web Docs glossary[3].

Note the difference between parameters and arguments:

Function parameters are the names listed in the function’s definition.
Function arguments are the real values passed to the function.
- An argument is a value passed as input to a function.
Parameters are initialized to the values of the arguments supplied.
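Bash functions do not list named parameters in their definition. The arguments supplied by the caller are read through the positional parameters ($1, $2, and so on), and “$#” holds how many were received. A common habit is to copy them into named local variables. The following is a minimal sketch, where “greet” and its variable names are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical function: 'greet' and its variables are for illustration only.
greet() {
   local greeting=$1   # first argument supplied by the caller
   local name=$2       # second argument supplied by the caller
   echo "$greeting, $name ($# arguments received)"
}
greet "Hello" "World"
```

Here “Hello” and “World” are the arguments, while “greeting” and “name” play the role of the parameters.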

All the information we can have regarding positional parameters comes in the following variables:

$0, $1, $2, etc: Positional parameters, passed from command line to script or passed to a function.
- $0 is a “special value” and it’s ALWAYS going to be the name of the script being executed in the way you wrote it (relative path, absolute path, etc), even inside a function
- Parameters beyond the ninth need curly braces: ${10}, ${11}, etc
$#: Number of command-line arguments or positional parameters
$*: All of the positional parameters. When quoted (“$*”), they are seen as a single word, joined by the first character of IFS (a space by default)
$@: All of the positional parameters. When quoted (“$@”), each parameter is seen as a separate word and is passed on intact, without further interpretation or expansion. Unquoted, $* and $@ behave the same: each parameter is subject to word splitting and filename expansion.
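The difference between the quoted forms of $* and $@ is easier to see by counting words. In the sketch below, “count_args” and “show_difference” are hypothetical helper names: the same two arguments are forwarded once as “$*” and once as “$@”.

```shell
#!/usr/bin/env bash
# Hypothetical helper that reports how many words it received.
count_args() {
   echo "$# argument(s)"
}
show_difference() {
   count_args "$*"   # all parameters joined into one single word
   count_args "$@"   # each parameter kept as its own word
}
show_difference "first arg" "second arg"
```

The first call reports 1 argument and the second reports 2, even though both arguments contain spaces. This is why the quoted “$@” form is the usual choice when forwarding arguments to another command.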

`shift` built-in command

The “shift” command is one of the Bourne shell built-ins that comes with Bash. This command takes one optional argument, a number. The positional parameters are shifted to the left by this number, N. The positional parameters from N+1 to $# are renamed to variable names from $1 to $#-N.

Say you have a command that takes 10 arguments, and N is 4, then $5 becomes $1, $6 becomes $2 and so on. ${10} becomes $6 and the original $1, $2, $3 and $4 are thrown away.

If N is zero or greater than $#, the positional parameters are not changed and the command has no effect. If N is not present, it is assumed to be 1. The return status is zero unless N is greater than $# or less than zero, in which case it is non-zero.

Let’s see how the “shift” command works with the following example.

#!/usr/bin/env bash
#Script: function-0011.sh
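# Copy the positional parameters into an array (quoted, so each one stays intact)
args=("$@")
echo "Printing original list of arguments"
# ${!index} is an indirect expansion: it reads the positional parameter number $index
for ((index = 0; index <= ${#args[@]}; index++)); do
   echo "Arg[$index]: ${!index}"
done
shift 4
args=("$@")
echo "Printing list after shifting"
for ((index = 0; index <= ${#args[@]}; index++)); do
   echo "Arg[$index]: ${!index}"
done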

When you execute the previous script with the numbers from 1 to 9 you will get the following output.

$ ./function-0011.sh 1 2 3 4 5 6 7 8 9
Printing original list of arguments
Arg[0]: ./function-0011.sh
Arg[1]: 1
Arg[2]: 2
Arg[3]: 3
Arg[4]: 4
Arg[5]: 5
Arg[6]: 6
Arg[7]: 7
Arg[8]: 8
Arg[9]: 9
Printing list after shifting
Arg[0]: ./function-0011.sh
Arg[1]: 5
Arg[2]: 6
Arg[3]: 7
Arg[4]: 8
Arg[5]: 9

Pay attention to a few things:

Parameter $0, as we mentioned previously, is always the name of the script
Arguments from index 1 to index 4 were discarded
Arguments from index 5 to index 9 were moved to indices 1 to 5
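A frequent use of “shift” is to consume the arguments one at a time inside a loop. The following is a minimal sketch, where the function name “print_all” is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical function: consumes its arguments one at a time with 'shift'.
print_all() {
   while [ $# -gt 0 ]; do
      echo "Current: $1 (remaining: $#)"
      shift            # same as 'shift 1': drops $1 and renumbers the rest
   done
}
print_all a b c
```

Each iteration reads $1 and then discards it, so the loop ends when “$#” reaches zero.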

Return status (`$?`) and `return`

In Bash, every function and script “returns” a value, which is an integer known as its exit status. In a function, the “return” builtin is used for that (a script uses “exit” instead).

Once the function finishes, the result is stored in the variable “$?”, which always contains the return value (integer) of the last statement, function or script executed.

Let’s see how it works with an example.

#!/usr/bin/env bash
#Script: function-0012.sh
# Declaring a function
my_ok_function() {
   echo "This function returns zero"
   return 0 
}
# Invoking the function
my_ok_function
# Printing the result of the function
echo "Result: $?"

When you run the previous script you will have the following output in the terminal window.

$ ./function-0012.sh
This function returns zero
Result: 0

As you already saw the script printed “Result: 0” to the output. Something to be aware of is that the “return” keyword only accepts an integer in the range [0-255]. If an integer beyond this range is specified its binary value will be truncated to what 8 bits allow. For example:

If 256 is specified, the actual value will be zero
If 257 is specified, the actual value will be 1
If -1 is specified, the actual value will be 255
If -2 is specified, the actual value will be 254
And so on.
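The wrap-around can be checked with a small sketch. The helper “return_status” is hypothetical, and only values above 255 are tried here.

```shell
#!/usr/bin/env bash
# Hypothetical function that returns whatever status it is given.
return_status() {
   return "$1"
}
return_status 256
echo "return 256 -> $?"
return_status 257
echo "return 257 -> $?"
```

The two lines printed should show 0 and 1 respectively, matching the first two cases in the list above.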

If a function does not execute “return”, the value returned will be the return value of the last command executed in the function.

You can see “return” as the way to signal the exit status of a function.
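Because the status is what “if” evaluates, a function can act as a yes/no test, where zero means success. The following is a minimal sketch with a hypothetical predicate named “is_even”. It has no explicit “return”, so the status of its last command becomes the status of the function.

```shell
#!/usr/bin/env bash
# Hypothetical predicate: status 0 means "true", non-zero means "false".
is_even() {
   [ $(( $1 % 2 )) -eq 0 ]   # the status of this test becomes the function's status
}
if is_even 4; then
   echo "4 is even"
fi
if ! is_even 7; then
   echo "7 is odd"
fi
```

Both messages are printed: the first call succeeds with status 0, and the second fails with a non-zero status that “!” negates.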

Returning non-integer values

As we saw before, “return” is used to terminate the execution of the current function with a specific status code [0-255]. It cannot be used to return other values apart from integers in the specified range. In order to “return” other kinds of values, the function writes them to standard output with another builtin command we learnt already, “echo”, and the caller captures that output with command substitution, $(...).

Let’s see how it works with the following example.

$ ./function-0013.sh
Result is 'NON_INTEGER_VALUE'

The previous script prints “Result is ‘NON_INTEGER_VALUE’” to the screen. But you could be more creative by creating JSON strings, XML strings and so much more!
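The pattern itself can be sketched in a few lines. This is an illustration consistent with the output shown above, not necessarily the contents of “function-0013.sh”; the function name “get_value” is an assumption.

```shell
#!/usr/bin/env bash
# Sketch only: 'get_value' is a hypothetical name, not necessarily the one
# used in function-0013.sh.
get_value() {
   echo "NON_INTEGER_VALUE"   # the "returned" string goes to standard output
}
result=$(get_value)           # command substitution captures that output
echo "Result is '$result'"
```

Keep in mind that everything the function writes to standard output becomes part of the captured value, so diagnostic messages are better sent elsewhere, for example to standard error.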

Recursion

In computer science, recursion is a programming technique in which a function or an algorithm calls itself one or more times until a specified condition is met. At that point, the pending calls are completed in reverse order, from the last one called to the first.

Let’s see how it works with the following example script that implements the Fibonacci[4] function.

#!/usr/bin/env bash
#Script: function-0014.sh
# Declaring the Fibonacci function
fibonacci() {
   local nthTerm=$1 # local, so the variable does not leak into the script
   if [ "$nthTerm" -eq 0 ]; then # F(0)
       echo 0
   elif [ "$nthTerm" -eq 1 ]; then # F(1)
       echo 1
   else # F(N-1) + F(N-2)
       local n1=$((nthTerm - 1))
       local fn1=$(fibonacci "$n1")
       local n2=$((nthTerm - 2))
       local fn2=$(fibonacci "$n2")
       echo $((fn1 + fn2))
   fi
}
# Calling the Fibonacci function with the number 10
fibonacci 10

When you run the previous script you will see the following in the terminal window.

$ ./function-0014.sh
55

Just for the record, recursion is not specific to functions. It’s a concept that can also be used at script level.

The previous function could be rewritten as follows so that the recursion is applied to the script itself.

#!/usr/bin/env bash
#Script: function-0015.sh
nthTerm=$1
if [ "$nthTerm" -eq 0 ]; then # F(0)
   echo 0
elif [ "$nthTerm" -eq 1 ]; then # F(1)
   echo 1
else # F(N-1) + F(N-2)
   n1=$((nthTerm - 1))
   fn1=$("$0" "$n1") # script calling itself
   n2=$((nthTerm - 2))
   fn2=$("$0" "$n2") # script calling itself
   echo $((fn1 + fn2))
fi

When you run the previous script providing 10 as input it will generate the same output as the previous script.

$ ./function-0015.sh 10
55

You will notice that it takes a bit longer for the script to be executed because every recursive call creates a new process[5].
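For comparison, the same value can be computed without creating any extra process. The sketch below swaps recursion for a plain loop, so it is a different technique from the two scripts above; the name “fibonacci_iterative” is hypothetical.

```shell
#!/usr/bin/env bash
# Swapped-in technique: iteration instead of recursion, so no extra process
# is created. 'fibonacci_iterative' is a hypothetical name.
fibonacci_iterative() {
   local n=$1 previous=0 current=1 next i
   for ((i = 0; i < n; i++)); do
      next=$((previous + current))
      previous=$current
      current=$next
   done
   echo "$previous"   # after n steps, 'previous' holds F(n)
}
fibonacci_iterative 10
```

Called with 10, it prints 55, the same result as the recursive versions.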

Summary

In this electrifying chapter, we dove headfirst into one of the most powerful tools in a Bash scripter’s arsenal: functions! Functions allow us to streamline our scripts, making them more efficient, reusable, and easy to maintain. We explored how functions are declared and discovered that simply declaring them isn’t enough — they only spring into action when explicitly called. This distinction is crucial for building more complex scripts, where we can define logic once and call it as many times as needed!

We also uncovered the beauty of local variables inside functions, which keep our code clean and isolated, preventing conflicts with global variables. This not only improves readability but also ensures that our functions don’t unintentionally mess up other parts of the script. Then came the mind-blowing revelation: overriding functions and even commands! That’s right — with a little creativity, you can redefine how certain commands work in your script, but with great power comes great responsibility!

One of the most intriguing topics covered was the $FUNCNAME variable, a hidden gem that helps you track the function call stack. It provides a look under the hood when debugging or working with nested functions. To round things off, we dove into positional parameters and the game-changing shift built-in command, which lets us control how arguments are passed and managed within functions. Mastering these concepts opens the door to writing flexible, adaptable scripts that can handle any input thrown their way. This chapter was a true exploration of the versatility and power of functions in Bash!

References

1. The “ls” command is used to list the content of the folders that you pass as arguments, or the current folder if you do not provide any argument.

2. "main" represents the global (non-function) execution context of the Bash script.

3. https://developer.mozilla.org/en-US/docs/Glossary/Parameter

4. https://en.wikipedia.org/wiki/Fibonacci_sequence

5. We will speak about processes in a later chapter.