What is "declare" in Bash?

The output of help declare is quite terse. A clearer explanation can be be found in man bash or info bash — the latter being the source for what follows.

First, some definitions. About variables and attributes:

A parameter is an entity that stores values. ... A variable is a parameter denoted by a name. A variable has a value and zero or more attributes. Attributes are assigned using the declare builtin command ...

And about the declare builtin:

declare

declare [-aAfFgilnrtux] [-p] [name[=value] …]

Declare variables and give them attributes. If no names are given, then display the values of variables instead.

...

-n
Give each name the nameref attribute, making it a name reference to another variable. That other variable is defined by the value of name. All references, assignments, and attribute modifications to name, except for those using or changing the -n attribute itself, are performed on the variable referenced by name’s value. ...

Note that name reference variables are only available in Bash 4.3 or later1.

Also, for a useful introduction to declare and variable attributes in Bash I would point you to this answer to "What do declare name and declare -g do? " (which mainly focuses on variables' scope, though).


Basically2, declare name=[value] is equivalent to the assignment name=[value] you are probably familiar with. In both cases, name is assigned the null value if value is missing.

Note that the slightly different declare name, instead, does not set the variable name3:

$ declare name

## With the -p option, declare is used to display
## attributes and values of variables
$ declare -p name
declare -- name            ## "name" exists

## Parameter expansion can be used to reveal if a variable is set:
## "isunset" is substituted to "name" only if unset 
$ echo "${name-isunset}"
isunset

Thus, the variable name can be:

  • declared and unset, after declare name;
  • declared and set with null as value, after name= or declare name=;
  • declared, set and with a non null value after name=value or declare name=value.

More generally, declare [options] name=value

  1. creates the variable name — which is a parameter with a name, which in turn is just a portion of memory you can use to store information4;
  2. assigns the value value to it;
  3. optionally sets name's attributes, which define both the kind of value it can store (not in terms of a type, strictly speaking, since Bash's language is not typed) and the ways it can be manipulated.

Attributes are probably easier to explain with an example: using declare -i name will set the "integer" attribute of name, letting it be treated as an integer; quoting the manual, "arithmetic evaluation will be performed when the variable is assigned a value":

## Let's compare an ordinary variable with an integer
$ declare var
$ declare -i int
$ var="1+1"
$ int="1+1"
$ echo "$var"
1+1                 ## The literal "1+1"
$ echo "$int"
2                   ## The result of the evaluation of 1+1

In light of the above, what is happening in ilkkachu's code is that:

  1. A variable named ref is declared, with the "nameref" attribute set, and the content of $1 (the first positional argument) is assigned to it:

    declare -n ref="$1"
    

    The aim of a name reference variable such as ref is to hold the name of another variable, which would generally not be known in advance, possibly because we want it to be dynamically defined (e.g. because we want to reuse a piece of code and have it applied to several variables), and to provide a convenient way for referring to (and manipulating) it. (Not the only one, though: indirection is an alternative; see Shell Parameter Expansion).

  2. When the value of the variable tmp1 is assigned to ref:

    ref=$tmp1
    

    an additional variable, whose name is the value of ref, is implicitly declared. The value of tmp1 is also indirectly assigned to the implicitly declared variable by means of this explicit assignment to ref.

In the context of your linked question, calling read_and_verify as

read_and_verify domain "Prompt text here..."

will declare the variable domain and assign it the value of tmp1 (i.e. the user's input). It is exactly designed to reuse the code that interacts with the user and leverage a nameref variable to declare domain and a few other variables.

To take a closer look at the implicit part we can reproduce the process step by step:

## Assign a value to the first positional argument
$ set -- "domain"

## Declare the same "tmp1" variable as in your code
$ tmp1="value for domain"

## Declare a "ref" variable with the nameref attribute set and
## assign the value "domain" to it
$ declare -n ref="$1"

## Note that there is no "domain" variable yet
$ declare -p domain
bash: declare: domain: not found

## Assign a value to "ref" and, indirectly, to the "domain" variable
## that is implicitly declared  
$ ref=$tmp1

## Verify that a variable named "domain" now exists, and that
## its value is that of "tmp1"
$ declare -p domain
declare -- domain="value for domain"

## Verify that "ref" is actually a reference to "domain"
$ domain="new value"
$ echo "$domain"
new value
$ declare -p ref
declare -n ref="domain"
$ echo "$ref"
new value

1 Reference: CHANGES file, section "3. New Features in Bash", point "w".
This may be relevant: for instance, CentOS Linux 7.6 (currently the latest version) is shipped with Bash 4.2.

2 As usual with shell builtins, an exhaustive and concise explanation is elusive since they perform various, possibly heterogeneous actions. I will focus on declaring, assigning and setting attributes only, and I will consider listing, scoping and removing attributes as out of the scope of this answer.

3 This behavior of declare -p has been introduced in Bash 4.4. Reference: CHANGES file, section "3. New Features in Bash", point "f".
As G-Man pointed out in comments, in Bash 4.3 declare name; declare -p name yields an error. But you can still check that name exists with declare -p | grep 'declare -- name'.

4 FullBashGuide, Parameters on mywiki.wooledge.org


In most cases it is enough with an implicit declaration in bash

asdf="some text"

But, sometimes you want a variable's value to only be integer (so in case it would later change, even automatically, it could only be changed to an integer, defaults to zero in some cases), and can use:

declare -i num

or

declare -i num=15

Sometimes you want arrays, and then you need declare

declare -a asdf   # indexed type

or

declare -A asdf   # associative type

You can find good tutorials about arrays in bash when you browse the internet with the search string 'bash array tutorial' (without quotes), for example

linuxconfig.org/how-to-use-arrays-in-bash-script


I think these are the most common cases when you declare variables.


Please notice also, that

  • in a function, declare makes the variable local (in the function)
  • without any name, it lists all variables (in the active shell)

    declare
    

Finally, you get a brief summary of the features of the shell built-in command declare in bash with the command

help declare

I’ll have my go at trying and explain this, but forgive me if I won’t follow the example you provided. I’ll rather try to guide you along my own, different, approach.

You say you already understand concepts such as “variables” and “expanding them”, etc. so I’ll just skim over some background knowledge that would otherwise require deeper focus.

So I’ll start by saying that, at its most basic level, the declare command is just a way for you to tell Bash that you need a variable value (i.e. a value the might change during script execution), and that you will refer to that value using a specific name, precisely the name you indicate next to the declare command itself.

That is:

declare foo="bar"

tells Bash that you want the variable named foo have the value bar.

But.. hold on a minute.. we can do that without using declare at all, can’t we. As in:

foo="bar"

Very true.

Well, it so happens that the above simple assignment is actually an implicit way for.. in fact.. declaring a variable.

(It also so happens that the above is one of a few ways to change the value of the variable named foo; indeed it is precisely the most direct, concise, evident, straight-forward way.. but it’s not the only one.. .. I’ll come back to this later..).

But then, if it is so well possible to declare a “name that will tag variable values” (just “variable” from hereafter, for the sake of brevity) without using declare at all, why would you ever want to use this pompous “declare” command ?

The answer lies in the fact that the above implicit way to declare a variable (foo="bar"), it.. implicitly.. makes Bash consider that variable of being of the type that is most commonly used in the typical usage scenario for a shell.

Such type is the string type, i.e. a sequence of characters with no particular meaning. Therefore a string is what you get when you use the implicit declaration.

But you, as the programmer, sometimes need to rather consider a variable as, e.g., a number.. on which you need to do arithmetic operations.. and using an implicit declaration like foo=5+6 won’t make Bash assign value 11 to foo as you might expect. It will rather assign to foo the sequence of the three characters 5 + 6.

So.. you need a way to tell Bash that you want foo to be considered a number, not a string.. and that’s what an explicit declare comes useful for.

Just say:

declare -i foo=5+6  # <<- note the '-i' option: it means 'integer'

and Bash will happily do the math for you, and assign the numeric value 11 to variable foo.

That is: by saying declare -i foo you give to variable foo the attribute of being an integer number.

Declaring numbers (precisely integers, because Bash still does not understand decimals, floating points, and all that) may be the first reason for using declare, but it’s not the only reason. As you have already understood, there are a few other attributes you can give to variables. For instance you can have Bash to always make a variable’s value uppercase no matter what: if you say declare -u foo, then from then on when you say foo=bar Bash actually assigns the string BAR to the variable foo.

In order to give any of these attributes to a variable, you must use the declare command, there’s no other choice.


Now, one other of the attributes you can give through declare is the infamous “name-ref” one, the -n attribute. (And now I'm going to resume the concept I put on hold earlier).

The name-ref attribute, basically, allows Bash programmers for another way to change the value of a variable. It more precisely gives an indirect way to do that.

Here’s how it works:

You declare a variable having the -n attribute, and it is very recommended (though not strictly required, but it makes things simpler) that you also give a value to this very variable on the same declare command. Like this:

declare -n baz="foo"

This tells Bash that, from then on, each time you will use, or change, the value of the variable named baz, it shall actually use, or change, the value of the variable named foo.

Which means that, from then on, you can say something like baz=10+3 to make foo get the value of 13. Assuming of course that foo was previously declared as integer (declare -i) like we did just one minute ago, otherwise it’ll get the sequence of the four characters 1 0 + 3.

Also: if you change foo’s value directly, as in foo=15, you will see 15 also by saying echo “${baz}”. This is because variable baz declared as name-ref of foo always reflects foo’s value.

The above declare -n command is said a “name-reference” because it makes variable baz refer to the name of another variable. In fact we have declared baz has having value "foo" which, because of the -n option, is handled by Bash as the name for another variable.

Now, why on Earth would you ever want to do that ?

Well.. it’s worth saying that this is a feature for quite advanced needs.

In fact so advanced that when a programmer faces a problem that would really require a name-ref, it is also likely that such problem should rather be addressed to by using a proper programming language instead of Bash.

One of those advanced needs is, for instance, when you, as the programmer, cannot know during development which variable you will have to use in a specific point of a script, but it will be fully known dynamically at run-time. And given that there’s no way for any programmer to intervene at run-time, the only option is to make provision beforehand for such situation in the script, and a “name-ref” can be the only viable way. As a widely known use case of this advanced need, think of plug-ins, for instance. The programmer of a “plugin-able” program needs to make generic provision for future (and possibly third-party) plug-ins beforehand. Therefore the programmer will need to use facilities like a name-ref in Bash.

One other advanced need is when you have to deal with huge amount of data in RAM and you also need to pass that data around functions of your script that also have to modify that data along the way. In such case you could certainly copy that data from one function to another (like Bash does when you do dest_var="${src_var}" or when you invoke functions like in myfunc "${src_var}"), but being that data a huge amount it would make for a huge waste of RAM and for a very inefficient operation. So The Solution if such situations arise is to use not a copy of the data, but a reference to that data. In Bash, a name-ref. This use case is really the norm in any modern programming language, but it is quite exceptional when it comes to Bash, because Bash is mostly designed for short simple scripts that mostly deal with files and external commands and thus Bash scripts seldom have to pass huge amount of data between functions. And when a script’s functions do need to share some data (access it and modify it) this is usually achieved by just using a global variable, which is quite the norm in Bash scripts as much as it is very deprecated in proper programming languages.

Then, there may be a notable use case for name-refs in Bash, and (maybe ironically) it is associated to when you use yet other types of variables:

  1. variables that are declared as “indexed arrays” (declare -a)
  2. variables that are declared as “associative arrays” (declare -A).

These are a type of variables that may be more easily (as well as more efficiently) passed along functions by using name-refs instead of by normal copying, even when they don’t carry huge amounts of data.

If all these examples sound weird, and still incomprehensible, it’s only because name-refs are indeed an advanced topic, and a rare need for the typical usage scenario of Bash.

I could tell you about occasions on which I for one have found use for name-refs in Bash, but so far they have been mostly for quite “esoteric” and complicated needs, and I’m afraid that if I described them I would only complicate things for you at this point of your learning. Just to mention the least complex (and possibly not esoteric): returning values from functions. Bash does not really support this functionality, so I obtained the same by using name-refs. This, incidentally, is exactly what your example code does.


Besides this, a small personal advice, which would actually be better suited for a comment but I haven't been able to condense it enough to fit into StackExchange's comment's limits.

I think that the most you should do at the moment is to just experiment with name-refs by using the simple examples I showed and maybe with the example code you provided, disregarding for the moment the “why on earth” part and focusing only on the “how it works” part. By experimenting a bit, the “how” part may sink better into your mind, so that the “why” part will come clear to you in due time when (or if) you’ll have a real practical problem for which a name-ref would truly come in handy.