Skip to main content

Section 3.1 Introduction to Vectors

Previously, we considered linear systems as strictly from the perspective of relationships between variables. Next, we will consider it from the perspective of vectors. Vectors are at the heart of linear algebra.

Subsection 3.1.1 Vectors

The term vector is used widely in mathematics, physics, and computer science. There are a variety of definitions, ranging from specific and applied to mathematically abstract. For now, a vector will simply mean an ordered list of real numbers. In linear algebra, we represent vectors most commonly as a column matrix, a matrix with dimensions \(1 \times n\text{,}\) i.e. a matrix with 1 row and \(n\) “columns”. Similarly, a row vector is a \(m \times 1\) matrix, a matrix which has 1 columns and \(m\) “rows”.

Subsection 3.1.2 Vectors in \(\mathbb{R}^2\)

First, we will consider vectors that contain two numbers, of the form,
\begin{equation*} \vec{v} = \begin{bmatrix} x \\ y \end{bmatrix} \end{equation*}
The physics perspective on vectors views them as directed line segments. That is, line segments with a direction, typically represented by an arrow. The point where the vector arrow “starts” is called its tail (or initial point), and the point where it ends the head (or terminal point).
With this perspective, vectors don't change if they are moved around in the plane, that is, they are unaffected by translation. This geometric perspective isn't very computational. To bridge this difference, we make the convention of viewing vectors with their tails fixed at the origin \((0,0)\text{.}\)

Definition 3.1.1.

A vector is in standard position if its initial point is at the origin.
The advantage to this convention is that a vector in standard position is completely determined by its terminal point. In particular, if the terminal point of a vector is \((x,y)\text{,}\) then this ordered pair can be used to specific the vector.

Definition 3.1.2.

The vector from the origin to a point \((x,y)\) is called a position vector for the point \((x,y)\text{.}\)
Further, every point in the plane \((x,y)\) can be represented by the orederd list of numbers, \(\begin{bmatrix} x \\ y \end{bmatrix}\text{.}\)
In this way, there is a one-to-one correspondence between geometric vectors, ordered pairs, and vectors. We identify a column vector with an ordered tuple of numbers. That is, we treat them as equivalent. For example,
\begin{equation*} \begin{bmatrix} 1 \\ -3 \end{bmatrix} \quad \text{is equivalent to} \quad (1,-3) \end{equation*}
This correspondence allows us to write vectors horizontally, which is sometimes convenient because it takes up less space. On the other hand, for technical reasons, it will be necessary to keep the vertical convention.
The numbers contained in a vectors are called its entries, or components, or sometimes its coordinates.
The set of all vectors with two entries is denoted by \(\mathbb{R}^2\) (read “r-squared” or simply “r-two”). Again, this can be thought of as the set of all two-dimensional vectors (geometrically), the set of all ordered pairs \((a,b)\text{,}\) or the set of all column matrices of the form \(\begin{bmatrix} a \\ b \end{bmatrix}\text{.}\)

Definition 3.1.3.

Two vectors in \(\mathbb{R}^2\) are equal if their corresponding entries are equal.
Next, we define the fundamental operations for combining vectors: addition and scalar multiplication.

Definition 3.1.4. Addition of vectors.

Let \(\vec{u}, \vec{v}\) be vectors in \(\mathbb{R}^2\text{.}\) Their sum \(\vec{u} + \vec{v}\) is the vector obtained by adding the corresponding entries of \(\vec{u}\) and \(\vec{v}\text{.}\) That is, for \(\vec{u} = \begin{bmatrix} u_1 \\ u_2 \end{bmatrix}, \vec{v} = \begin{bmatrix} v_1 \\ v_2 \end{bmatrix}\text{,}\)
\begin{equation*} \boxed{\vec{u} + \vec{v} = \begin{bmatrix} u_1 \\ u_2 \end{bmatrix} + \begin{bmatrix} v_1 \\ v_2 \end{bmatrix} = \begin{bmatrix} u_1 + v_1 \\ u_2 + v_2 \end{bmatrix}} \end{equation*}
Geometrically, the sum \(\vec{u} + \vec{v}\) is obtained by aligning the tail of \(\vec{v}\) with the head of \(\vec{u}\text{.}\) Then, the vector \(\vec{u} + \vec{v}\) is the arrow from the tail of \(\vec{u}\) to the head of \(\vec{v}\text{.}\) This is called the parallelogram rule for addition, or the tail-to-tip method.
Alternatively, if the tails of \(\vec{u}\) and \(\vec{v}\) are aligned together, and we form the parallelogram with sides \(\vec{u}\) and \(\vec{v}\text{,}\) then \(\vec{u} + \vec{v}\) is the vector formed by an arrow from their tails to the opposite vertex of the paralellogram.
Intuitively, if the vectors are thought of as representing motion, then \(\vec{u} + \vec{v}\) is the resulting motion after movement corresponding to \(\vec{u}\) and then \(\vec{v}\text{.}\)

Definition 3.1.5. Scalar multiplication of vectors.

Let \(\vec{u}\) be a vector, \(c \in \mathbb{R}\text{.}\) Then, the scalar multiple of \(\vec{u}\) by \(c\text{,}\) \(c\vec{u}\text{,}\) is the vector obtained by multiply each entry of \(\vec{u}\) by \(c\text{.}\)
The number \(c \in \mathbb{R}\) is called a scalar. This is because, literally, multiplying a vector by a number \(c\) scales the length of the vector. Geometrically, if \(c > 0\text{,}\) then \(c\vec{u}\) is the vector with the same direction as \(\vec{u}\text{,}\) but scaled to be \(c\) times as long. If \(c \le 0\text{,}\) then \(c\vec{u}\) has the opposite direction as \(\vec{u}\text{,}\) and is also \(c\) times as long as \(\vec{u}\text{.}\)
In fact, this is just an extension of addition of real numbers, represented on a number line. This is because real numbers \(a \in \mathbb{R}\) can be thought of as 1-dimensional vectors \(\begin{bmatrix} a \end{bmatrix}\text{.}\)
We can also define vector subtraction in the natural way,
\begin{equation*} \vec{u} - \vec{v} = \begin{bmatrix} u_1 \\ u_2 \end{bmatrix} - \begin{bmatrix} v_1 \\ v_2 \end{bmatrix} = \begin{bmatrix} u_1 - v_1 \\ u_2 - v_2 \end{bmatrix} \end{equation*}
However, \(\vec{u} - \vec{v}\) is just \(\vec{u} + (-1) \vec{v}\text{,}\) that is, the sum of \(\vec{u}\) and the scalar multiple \(-1\vec{v}\text{.}\) In other words, subtraction is not really a fundamental operation, because it is defined in terms of the other operations addition and scalar multiplication.
For simplicity, the scalar multiple \(-1\vec{u}\) is written as \(-\vec{u}\text{.}\)

Definition 3.1.6.

The zero vector is the unique vector with no magnitude and no direction, and is denoted by \(\vec{0}\text{.}\)

Subsection 3.1.3 Magnitude of a Vector

The physics definition of a vector is a quantity with magnitude and direction. The magnitude of a vector \(\vec{v}\) is denoted by \(\abs{\vec{v}}\text{,}\) or \(\norm{\vec{v}}\text{.}\) The first notation surrounds \(\vec{v}\) by “absolute value” bars, intuitively denoting magnitude.
Geometrically, the magnitude of a vector is the length of its directed line segment in the plane. That is, the distance from initial point to its terminal point. Consider a vector \(\vec{v}\) in the plane in standard position, with terminal point \((a,b)\text{.}\) The magnitude of \(\vec{v}\) is the distance from the origin \((0,0)\) to \((a,b)\text{,}\) given by the distance formula in the plane.

Definition 3.1.7.

Let \(\vec{v} \in \mathbb{R}^2, \vec{v} = (a, b)\text{.}\) The magnitude (or length) of \(\vec{v}\) is defined by,
\begin{equation*} \boxed{\abs{\vec{v}} = \sqrt{a^2 + b^2}} \end{equation*}
It follows directly from this definition that \(\abs{c \vec{v}} = \abs{c} \cdot \abs{\vec{v}}\) for a scalar \(c\text{.}\)

Subsection 3.1.4 Unit Vectors

A vector is called a unit vector if it has magnitude equal to 1. Given a vector \(\vec{v}\text{,}\) there is a unit vector which is parallel to \(\vec{v}\text{,}\) namely,
\begin{equation*} \vec{u} = \frac{\vec{v}}{\abs{\vec{v}}} \end{equation*}
Also, this vector is unique, because if a vector \(\vec{u}\) is a unit vector (\(\abs{u} = 1\)) and is in the direction of \(\vec{v}\) (\(\vec{u} = c \vec{v}\) for some \(c > 0\)), then it follows that \(\vec{u}\) is given by the above formula. This is because \(\abs{\vec{u}} = 1\text{,}\) but also \(\abs{\vec{u}} = \abs{c \vec{v}} = \abs{c} \abs{\vec{v}} = c \abs{\vec{v}}\text{.}\) Thus, \(1 = c \abs{\vec{v}}\text{,}\) and so \(c = \frac{1}{\abs{\vec{v}}}\text{.}\)

Subsection 3.1.5 The Standard Basis Vectors

In \(\mathbb{R}^2\text{,}\) there are two special vectors, typically denoted by \(\ihat\) and \(\jhat\text{.}\) They are defined by,
\begin{equation*} \ihat = \begin{bmatrix} 1 \\ 0 \end{bmatrix} \qquad \jhat = \begin{bmatrix} 0 \\ 1 \end{bmatrix} \end{equation*}
They are called the standard basis vectors in the plane. Then, any vector \(\vec{v} = (x,y)\) can be written in terms of a linear combination of these basis vectors,
\begin{align*} \begin{bmatrix} x \\ y \end{bmatrix} \amp = \begin{bmatrix} x \\ 0 \end{bmatrix} + \begin{bmatrix} 0 \\ y \end{bmatrix}\\ \amp = x \begin{bmatrix} 1 \\ 0 \end{bmatrix} + y \begin{bmatrix} 0 \\ 1 \end{bmatrix}\\ \amp = x \ihat + y \jhat \end{align*}
Intuitively, “basis” means “underlying support or foundation”. The standard basis vectors provide a simple foundation from which all vectors can be represented in terms of.

Subsection 3.1.6 Vectors in \(\mathbb{R}^3\)

The algebra and geometry of vectors in the plane \(\mathbb{R}^2\) extends to 3D space. The set of vectors with three entries is denoted by \(\mathbb{R}^3\text{.}\)
Vectors in \(\mathbb{R}^3\) are \(3 \times 1\) column matrices, of the form,
\begin{equation*} \begin{bmatrix} x \\ y \\ z \end{bmatrix} \end{equation*}
Again, these vectors can be represented geometrically as directed line segments, and as points in space \((x,y,z)\text{.}\)
Similarly, we define addition and scalar multiplication in an analogous way.

Definition 3.1.8. Addition of vectors.

Let \(\vec{u} = (u_1, u_2, u_3), \vec{v} = (v_1, v_2, v_3)\) be vectors in \(\mathbb{R}^3\text{.}\) Their sum \(\vec{u} + \vec{v}\) is given by,
\begin{equation*} \boxed{\vec{u} + \vec{v} = \begin{bmatrix} u_1 \\ u_2 \\ u_3 \end{bmatrix} + \begin{bmatrix} v_1 \\ v_2 \\ v_3 \end{bmatrix} = \begin{bmatrix} u_1 + v_1 \\ u_2 + v_2 \\ u_3 + v_3 \end{bmatrix}} \end{equation*}

Definition 3.1.9. Scalar multiplication of vectors.

Let \(\vec{v} = (v_1, v_2, v_3)\) be a vector in \(\mathbb{R}^3\text{,}\) \(c \in \mathbb{R}\text{.}\) Then, the scalar multiple of \(\vec{v}\) by \(c\text{,}\) \(c\vec{v}\text{,}\) is given by,
\begin{equation*} \boxed{c\vec{v} = c \begin{bmatrix} v_1 \\ v_2 \\ v_3 \end{bmatrix} = \begin{bmatrix} cv_1 \\ cv_2 \\ cv_3 \end{bmatrix}} \end{equation*}
The magnitude of a vector in \(\mathbb{R}^3\) is again the distance from the origin \((0,0,0)\) to its terminal point \((x,y,z)\text{.}\)

Definition 3.1.10.

Let \(\vec{v} = (x,y,z) \in \mathbb{R}^3\text{.}\) The magnitude (or length) of \(\vec{v}\) is defined by,
\begin{equation*} \boxed{\abs{\vec{v}} = \sqrt{x^2 + y^2 + z^2}} \end{equation*}
The standard basis vectors in \(\mathbb{R}^3\) are denoted by \(\ihat, \jhat, \khat\text{,}\) given by,
\begin{equation*} \ihat = \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix} \quad \jhat = \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix} \quad \khat = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} \end{equation*}
Similarly, any vector \(\vec{v} = (x,y,z)\) can be represented as a linear combination of these basis vectors, as,
\begin{equation*} \begin{bmatrix} x \\ y \\ z \end{bmatrix} = x \ihat + y \jhat + z \khat \end{equation*}

Subsection 3.1.7 Vectors in \(\mathbb{R}^n\)

The algebra of vectors can be generalized to vectors with \(n\) elements, of the form,
\begin{equation*} \vec{v} = \begin{bmatrix} v_1 \\ v_2 \\ \vdots \\ v_n \end{bmatrix} \end{equation*}
Here, for \(n > 3\text{,}\) it is more difficult to consider vectors geomtrically as directed line segments. However, the algebra and mathematics are not lost.
The set of all ordered \(n\)-tuples of \(n\) real numbers is denoted by \(\mathbb{R}^n\) (read as “r-n”). Intuitively, this can be thought of as “\(n\)-dimensional space”.
Again, we have the convention that,
\begin{equation*} (b_1, \dots, b_n) \quad \text{is equivalent to} \quad \begin{bmatrix} b_1 \\ \vdots \\ b_n \end{bmatrix} \end{equation*}
Note that \((b_1, \dots, b_n)\) is not a row vector.
The definitions of equality, sum, and scalar multiple are analogous to that for \(\mathbb{R}^2\) and \(\mathbb{R}^3\text{,}\)
  • Vectors \(\vec{u}, \vec{v}\) in \(\mathbb{R}^n\) are equal if all of their corresponding entries are equal.
  • The sum \(\vec{u} + \vec{v}\) is obtained by adding entries componentwise.
  • The scalar multiple \(c \vec{u}\) is obtained by multiplying every entry of \(\vec{u}\) by \(c\text{.}\)
  • The zero vector, denoted by \(\vec{0}\text{,}\) is the unique vector with entries which are all zeros.
  • The magnitude of \(\vec{v} = (v_1, \dots, v_n)\) is given by,
    \begin{equation*} \abs{\vec{v}} = \sqrt{v_1^2 + v_2^2 + \dots + v_n^2} \end{equation*}
In \(\mathbb{R}^n\text{,}\) the standard basis vectors are given by,
\begin{align*} \vec{e}_1 \amp = (1,0,0\dots,0)\\ \vec{e}_2 \amp = (0,1,0,\dots,0)\\ \vdots \amp\\ \vec{e}_n \amp = (0,0,0\dots,0,1) \end{align*}
Then, every vector \(\vec{x} = (x_1, \dots, x_n)\) in \(\mathbb{R}^n\) can be written as a linear combination of these basis vectors, as,
\begin{equation*} \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix} = x_1 \vec{e}_1 + x_2 \vec{e}_2 + \dots + x_n \vec{e}_n \end{equation*}

Subsection 3.1.8 Properties of Vector Arithmetic

The arithmetic of vectors satisfies many of the familiar laws of arithmetic with numbers.
These properties all essentially follow from the corresponding properties of real numbers.

Subsection 3.1.9 Linear Combinations of Vectors

The fundamental operations combine to form linear combinations. Consider two vectors \(\vec{u}, \vec{v}\text{.}\) Adding them gives \(\vec{u} + \vec{v}\text{.}\) Multiplying them by (possibly different) scalars gives \(a\vec{u}\) and \(b\vec{v}\text{.}\) Combining these two operations, we get a linear combination,
\begin{equation*} a\vec{u} + b \vec{v} \end{equation*}

Definition 3.1.12.

Let \(\vec{u}, \vec{v}\) be vectors, \(a, b \in \mathbb{R}\) be scalars. Then, the vector given by,
\begin{equation*} a\vec{u} + b\vec{v} \end{equation*}
is called a linear combination of \(\vec{u}\) and \(\vec{v}\text{,}\) with weights \(a, b\text{.}\)

Definition 3.1.13.

Let \(\vec{v}_1, \vec{v}_2, \dots, \vec{v}_n \in \mathbb{R}^n\) be vectors, \(a_1, a_2, \dots, a_n \in \mathbb{R}\) be scalars. Then, the vector \(\vec{v}\) given by,
\begin{equation*} \vec{v} = a_1 \vec{v}_1 + a_2 \vec{v}_2 + \dots + a_n \vec{v}_n = \sum_{i=1}^n a_i \vec{v}_i \end{equation*}
is called a linear combination of \(\vec{v}_1, \vec{v}_2, \dots, \vec{v}_n \in \mathbb{R}^n\) with weights \(a_1, a_2, \dots, a_n\text{.}\)
Given any collection of vectors, the trivial linear combination is that with all weights being 0, i.e.,
\begin{equation*} 0 \vec{v}_1 + 0 \vec{v}_2 + \dots + 0 \vec{v}_n = \vec{0} \end{equation*}