We have talked a lot about linear systems, how many solutions they can have,
what their matrices represent, etc., but we never talked about how to
systematically find the solution to a linear system of equations.
In case it wasn’t clear - because I already did this in the lesson about
linear dependence -
a linear system with an m×n matrix can be rewritten as a linear system of
equations with m equations and n variables.
You should already be familiar with some method to solve a linear system of equations from
earlier in your studies. We will learn more advanced and mechanical methods later in the
course, but for now we will stick to what we already know.
I already mentioned the visual intuition for a linear system of equations in my lesson about
linear dependence.
There, I talked about the possibility of thinking of a linear system with a 2×2
matrix as two lines in a 2D-space. Their intersection (if it exists) is the solution to
the linear system. There are two ways of thinking of the solution to a linear system.
This is one of them, and it’s called the row picture. There’s also the so-called
column picture, and we’ll look into both versions now.
We take the linear system
[1−121][x1x2]=[4−1]
as an example for the explanation.
Row Picture
You can perform the
matrix-by-vector product
between A and x, and then separate the entire equation into rows.
This is what we did above. You basically rewrite the system into a system of equations.
So we multiply A by x, and our linear system becomes
A similar idea holds for 3D-Systems as well. In those cases, instead of lines, we’re
dealing with planes intersecting each other. Two planes intersecting form a line, and
if you add a third plane, you get a point.
So we saw that one way to think of the solution of a linear system is by splitting it
up into rows and plotting the individual equations.
Column Picture
Another way of thinking about the solution to a linear system is by rewriting it as
a linear combination of the columns. This remains closer to the
matrix-by-vector product
between A and x, where we think of the left side of Ax=b as
a linear combination of the columns of A.
We use the same linear system we had above:
[1−121][x1x2]=[4−1].
The multiplication between A and x can be rewritten as
x1[1−1]+x2[21]=[4−1].
If we now try the solution we found before (x1=2, x2=1), it will still work:
What we’re asking now is “what linear combination of the columns of A gets us to the
vector b?”. This can be represented like this:
Where a1 and a2 are the first and second columns of A respectively.
To reach b, we had to
add2a1 and a2.
Conclusion
So in conclusion, the row picture focuses on the equations themselves.
We want to find a solution x1,x2,⋯,xn to solve all equations
simultaneously.
The column picture, on the other hand, focuses on the columns of A.
We see the solution as trying to find the coefficients
x1,x2,⋯,xn such that the linear combination
x1a1+x2a2+⋯+xnan results in b.
Both viewpoints really describe the same problem, but from different perspectives.