Section 4.3 Solution Sets of Linear Equations
Subsection 4.3.1 Homogeneous Systems
First, it turns out that in order to study solutions to \(A\vec{x} = \vec{b}\text{,}\) it is helpful to first study the more simple case where \(\vec{b} = \vec{0}\text{,}\) i.e. the linear system \(A\vec{x} = \vec{0}\text{.}\) These systems are called homogeneous.
Homogeneous systems always have at least one solution (that is, they are always consistent), in particular the solution \(\vec{x} = \vec{0}\text{,}\) often called the trivial solution. Then, we want to consider whether \(A\vec{x} = \vec{0}\) has any non-trivial solutions, that is, solutions \(\vec{x}\) which are not the zero vector.
Recall that if a system is consistent, then it has a unique solution if the REF of its augmented matrix has no free variables, and it has infinitely many solution if there at least one free variable. Then, for homogeneous systems,
Theorem 4.3.2.
The homogeneous system \(A\vec{x} = \vec{0}\) has a non-trivial solution if and only if the system has at least one free variable.
Example 4.3.3.
Consider the system,
\begin{equation*}
\begin{array}{rrrrr}
3x_1 \amp + 5x_2 \amp -4 x_3 \amp = \amp 0 \\
-3x_1 \amp -2x_2 \amp + 4x_3 \amp = \amp 0 \\
6x_1 \amp + x_2 \amp -8 x_3 \amp = \amp 0
\end{array}
\end{equation*}
This is a homogeneous system. In matrix form,
\begin{equation*}
\begin{bmatrix} A \mid \vec{0} \end{bmatrix} = \left[\begin{array}{ccc|c} 3 \amp 5 \amp -4 \amp 0 \\ -3 \amp -2 \amp 4 \amp 0 \\ 6 \amp 1 \amp -8 \amp 0 \end{array} \right]
\end{equation*}
Converting to REF,
\begin{equation*}
\left[\begin{array}{ccc|c} 3 \amp 5 \amp -4 \amp 0 \\ 0 \amp 3 \amp 0 \amp 0 \\ 0 \amp 0 \amp 0 \amp 0 \end{array} \right]
\end{equation*}
Since \(x_3\) is a free variable, the homogeneous system \(A\vec{x} = \vec{0}\) has a non-trivial solution. Converting to RREF,
\begin{equation*}
\left[\begin{array}{ccc|c} 1 \amp 0 \amp -\frac{4}{3} \amp 0 \\ 0 \amp 1 \amp 0 \amp 0 \\ 0 \amp 0 \amp 0 \amp 0 \end{array} \right]
\end{equation*}
Then, solving for the basic variables \(x_1\) and \(x_2\) in terms of the free variables gives \(x_1 = \frac{4}{3} x_3, x_2 = 0\text{,}\) and \(x_3\) free. Then, in vector form,
\begin{align*}
\vec{x} \amp = \begin{bmatrix} \frac{4}{3} x_3 \\ 0 \\ x_3 \end{bmatrix}\\
\amp = x_3 \begin{bmatrix} \frac{4}{3} \\ 0 \\ 1 \end{bmatrix}
\end{align*}
Every solution to this system is a scalar multiple of the vector \(\vec{v} = (4/3, 0, 1)\text{.}\) In this way, the solution set is precisely the span of this vector,
\begin{equation*}
\vec{x} = \Span{\begin{bmatrix} \frac{4}{3} \\ 0 \\ 1 \end{bmatrix}}
\end{equation*}
Geometrically, this is a line in \(\mathbb{R}^3\) through the origin.
Example 4.3.4. One equation in two variables.
Consider the very simple homogeneous linear “system” which only consists of one equation,
\begin{equation*}
x + 2y = 0
\end{equation*}
which has augmented matrix,
\begin{equation*}
\left[\begin{array}{cc|c} 1 \amp 2 \amp 0 \end{array} \right]
\end{equation*}
This matrix is already in RREF (the only leading entry is 1, and all of the entries above it are 0). Then, \(y\) is a free variable. Letting \(y = t\text{,}\) we get that \(x = -2t\text{.}\) Then, the solution is of the form,
\begin{align*}
\begin{bmatrix} x \\ y \end{bmatrix} \amp = \begin{bmatrix} - 2t \\ t \end{bmatrix}\\
\amp = t \begin{bmatrix} -2 \\ 1 \end{bmatrix}
\end{align*}
Notice that this is precisely the parametric equation of a line in 2D, through the origin \(\vec{r}_0 = \vec{0}\text{,}\) and with direction vector \((-2,1)\text{.}\) Also, the solution set is precisely \(\Span{\begin{bmatrix} -2 \\ 1 \end{bmatrix}}\text{.}\)
Example 4.3.5. One equation in three variables.
Consider the homogeneous system with the single equation,
\begin{equation*}
x - y + 3z = 0
\end{equation*}
with augmented matrix,
\begin{equation*}
\left[\begin{array}{ccc|c} 1 \amp -1 \amp 3 \amp 0 \end{array} \right]
\end{equation*}
Again, this is already in RREF, with free variables \(y\) and \(z\text{.}\) Then, letting \(y = t, z = s\) be parameters, we get that \(x = t - 3s\text{.}\) Then, solutions are of the form,
\begin{align*}
\begin{bmatrix} x \\ y \\ z \end{bmatrix} \amp = \begin{bmatrix} t - 3s \\ t \\ s \end{bmatrix}\\
\amp = t \begin{bmatrix} 1 \\ 1 \\ 0 \end{bmatrix} + s \begin{bmatrix} -3 \\ 0 \\ 1 \end{bmatrix}
\end{align*}
Note that this is precisely the parametric equation of a plane in 3D, which passes through \(\vec{r}_0 = \vec{0}\text{,}\) and with direction vectors \((1,1,0)\) and \((-3,0,1)\text{.}\) Also, the solution set is precisely,
\begin{equation*}
\Span{\begin{bmatrix} 1 \\ 1 \\ 0 \end{bmatrix}, \begin{bmatrix} -3 \\ 0 \\ 1 \end{bmatrix}}
\end{equation*}
In general, the solution set of a homogeneous system can always be written as the span of some collection of vectors. Also, the number of vectors in the span correpsonds to the number of free variables. If the system has no non-trivial solutions, then the solution set is \(\Span{\vec{0}} = \vec{0}\text{.}\)
A solution set described in terms of parameters and vectors is called parametric vector form.
Subsection 4.3.2 Non-homogeneous Systems
Then, it turns out that to solve a non-homogeneous system, it just requires adding a bit more information onto the solution its corresponding homogeneous system.
When solving \(A\vec{x} = \vec{b}\) using row reduction, the solution in parametric vector form will be a particular vector \(\vec{v}_p\text{,}\) along with a linear combination of other vectors.
Geometrically, the vector addition \(\vec{v}_p\) acts as a translation onto vectors which are a solution of the associated homogeneous system. For example, if the solution set to homogeneous equation is a line \(L\text{,}\) then the solution set to \(A\vec{x} = \vec{b}\) is the line \(L\) translated by the vector \(\vec{v}_p\text{,}\) forming a line which is parallel to \(L\text{.}\) In other words, the solution set of \(A\vec{x} = \vec{b}\) is a line through \(\vec{v}_p\) parallel to the solution set of \(A\vec{x} = \vec{0}\text{.}\)
Similarly, if the solution set to \(A\vec{x} = \vec{0}\) is a plane, then the solution set to \(A\vec{x} \vec{b}\) is a plane through \(\vec{v}_p\) which is parallel to the solution set of \(A\vec{x} = \vec{0}\) (recall for planes, parallel means parallel normal vectors).
Theorem 4.3.6.
Let \(A\vec{x} = \vec{b}\) be a linear system, which is consistent with solution \(\vec{v}_p\text{.}\) Then, the solution set of \(A\vec{x} = \vec{b}\) is the set of all vectors of the form \(\vec{v} = \vec{v}_p + \vec{v}_h\text{,}\) where \(\vec{v}_h\) is any solution of the associated homogeneous system.
Proof.
If \(\vec{v}_p\) is a solution of \(A\vec{x} = \vec{b}\text{,}\) and \(\vec{v}_h\) is a solution of the associated homogeneous system, then \(A\vec{v}_p = \vec{b}\) and \(A\vec{v}_h = \vec{0}\text{.}\) Then,
\begin{align*}
A(\vec{v}_p + \vec{v}_h) \amp = \underbrace{A\vec{v}_p}_{=\vec{b}} + \underbrace{A\vec{v}_h}_{=\vec{0}}\\
\amp = \vec{b} + \vec{0}\\
\amp = \vec{b}
\end{align*}
Note that if \(A \vec{x} = \vec{b}\) is not consistent, then the solution set is the empty set.
Subsection 4.3.3 Solving a Consistent System in Parametric Vector Form
In summary, to solve \(A\vec{x} = \vec{b}\text{,}\)
- Use row reduction to convert the augmented matrix to RREF.
- Solve for each basic variable in terms of any free variables.
- Write a typical solution \(\vec{x}\) as a vector which depends on any free variables.
- Decompose \(\vec{x}\) into a linear combination of vectors with only numerical entries, and use free variables as parameters.
Subsection 4.3.4 Intersection of Two Planes
In most cases, two planes will intersect to form a line in \(\mathbb{R}^3\text{.}\) For the planes,
\begin{equation*}
\begin{cases} a_1 x + b_1 y + c_1 z = d_1 \\
a_2 x + b_2 y + c_2 z = d_2 \end{cases}
\end{equation*}
The intersection of these planes is, by definition, the set of all points which are on both planes. A point is on each plane if and only if it satisfies its equation, so finding the intersection of planes is equivalent to solving the system of two equations for \((x,y,z)\text{.}\)
Otherwise, if the two planes have parallel normal vectors, then the planes themselves will be parallel, and not intersect. Finally, if the planes are parallel, it is possible that they coincide (overlap).
Example 4.3.7.
Consider two planes, with equations,
\begin{equation*}
\begin{array}{rrrrr}
x \amp + 2y \amp -z \amp = \amp -3 \\
3x \amp + 7y \amp + 2z \amp = \amp 1
\end{array}
\end{equation*}
The augmented matrix of this system is,
\begin{equation*}
\begin{bmatrix} 1 \amp 2 \amp -1 \amp -3 \\ 3 \amp 7 \amp 2 \amp 1 \end{bmatrix}
\end{equation*}
Converting to RREF using row reduction,
\begin{equation*}
\begin{bmatrix} 1 \amp 0 \amp -11 \amp -23 \\ 0 \amp 1 \amp 5 \amp 10 \end{bmatrix}
\end{equation*}
Then, \(x_3\) is a free variable, and solving for the basic variables \(x_1\) and \(x_2\) in terms of \(x_3\text{,}\) \(x_1 = -23 + 11 x_3\) and \(x_2 = 10 - 5x_3\text{.}\) Then, in vector form,
\begin{equation*}
\vec{x} = \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} = \begin{bmatrix} -23 + 11x_3 \\ 10 - 5x_3 \\ x_3 \end{bmatrix} = \begin{bmatrix} -23 \\ 10 \\ 0 \end{bmatrix} + x_3 \begin{bmatrix} 11 \\ -5 \\ 1 \end{bmatrix}
\end{equation*}
This is the parametric equation of a line (in \(\mathbb{R}^3\)), which passes through the point \((-23,10,0)\) and with direction vector \(\vec{v} = (11,-5,1)\text{.}\)
Subsection 4.3.5 Equation of Line as Intersection of Two Planes
Subsection 4.3.6 Intersection of Three Planes
In most cases, three planes will intersect at a single point. Again, this corresponds to solving the system of equations for the three planes.