Complex Rotations
In mathematics there are multiple ways to represent the same underlying principles or ideas. There are different ways to represent two dimensional spaces — the cartesian, polar, and other coordinate systems — and multiple ways to morph that space. Complex numbers are a way to represent two dimensional spaces, and are an even more elegant way to represent two dimensional rotations.
Real Numbers
This a real number line:
It can be thought of as a one dimensional space — where any real number x
is a point on the space with distance x
from the origin point, zero. Positive numbers are on the right of the origin, and negative numbers on the left.
Since any real number is a point on this space, the output of any operation that results in a real number will also be a point on the one dimensional space — both the input and output can be mapped on the real number line. The operation can then be thought of as a transformation of the number line.
Consider addition: If a point or number is taken, and an addend d
is added to it — then the sum would lie d
distance to the right or left from the original point on the number line, depending on the sign of the addend. Since it is true for all points, the entire number line can be thought of as being shifted by distance d
.
The numbers on the transformed number line (below) is the result of the addition operation when d
: 0.00Transforming number lines
d
is added to the numbers on the old numbers line (above). A single point (zero) is highlighted to make it easier to keep track of how the new number line numbers above is transformed.
Just as addition morphed the number line by shifting it, multiplication morphs the number by scaling, or stretching it — by the amount specified by the multiplier. Multiplication by a negative value stretches and also flips the number line around zero.
m
: 1.00
The visual transformation analogues on a number line for the square function is however much more complicated than a simple shift or stretch.
It is complicated to describe how all the numbers behave using a single transform, but breaking it down into multiple transformations for each number makes it much easier to comprehend. Since squaring a number simply involves multiplying it by itself — it can be represented as stretching (and possibly flipping) the number line twice by the same multiplier.
Square of 1.00 = 1.00
Visualizing the square root function to a number line transformation is tricky as well. But the square root function can alternatively defined using the square function — the root shall be defined as the number which, when squared, yields the operand.
Finding a square root using this transformation involves specifying a number, and then using trial and error to find another number that squares to the number selected earlier — to get the square root.
Square root of 1.00 is 1.00Finding square roots
There is a problem however. It is impossible to find the square root of negative numbers, since the output of the square function is always mapped to a non-negative number.
Square root of 0.00 is not 1.00
This is not a limitation of the visualization, but rather a limitation arising from the current definition of the function, and real numbers itself — it is impossible to find the square root of negative numbers from the existing definition of the number system.
Unless the definition itself is changed.
Imaginary Numbers
Suppose numbers which square to any negative number do exist. The natural questions that might then arise are: What is the points of making these numbers up? Or how should the numbers be organized, or how should the numbers behave? This is where it helps to make certain assumptions.
- Assumption I: Since the square root of negative numbers could not be mapped on the real number line, they must lie on a completely different number line. Since these numbers do not lie on the real number line, they will also be referred to as imaginary numbers from now on.
Assumption II: The square root of any negative numbers (imaginary numbers) can derived using the square root of negative one and scaling it with a real number. Square root of negative one shall be denoted as
i
. So an imaginary number can be represented asb·i
, whereb
is a real number. The real and imaginary numbers together shall be described as a suma+bi
, wherea
andb
are real numbers andi
is the square root of negative one.Assumption III: Addition is defined as
(a+bi) + (c+di)
=(a+c) + (b+d)i
; ie. both operate on their own respective real and imaginary components. Multiplication is defined as(a+bi) · (c+di)
=a·c + a·i + bi·c + bi·di
or simplified,(ac-bd) + (ad+bc)i
— ie. multiplication is distributed over addition.
It can be inferred from the assumptions:
- Addition affects their own component and has no effect on the other entity.
- Multiplication of any real number with another real number multiplier scales the original number by the multiplier.
- Since any imaginary number can be thought of as the root of negative one scaled up by a real number, they can be decomposed into its real factors and
i
. This implies multiplying a real with any imaginary number results in a (scaled) imaginary number as well — ie.kb·i
=k · bi
, sok · bi
=kb·i
. - So, multiplication with a real number results scales a number by the real multiplier, irrespective of whether the multiplicand is real or imaginary.
- Multiplying an imaginary number with an imaginary number involves multiplying their scalars and multiplying their roots of negative one. But squaring the root of negative one is just negative one, so multiplying two imaginary numbers results in a (scaled) negative real number. That is,
bi · di
can be rewritten asb·d · i·i
, which is justbd · -1
=-bd
. - Multiplying two numbers with both non-zero real and imaginary components results in a number that is the product of the two numbers following the rules of distributive multiplication over addition.
It is difficult to visualize how the multiplication of numbers with both non-zero real and imaginary components transform the number lines. But addition and multiplication with only a real component or an the root of negative one is relatively easier to visualize. Geometrically:
- Addition with real numbers shifts only the real number line.
- Addition with imaginary numbers shifts only the imaginary number line.
- Multiplication with real numbers stretches the real number line as well as the imaginary number lines.
- Multiplication with
i
morphs number lines such that:- If the number (line) was on the real number line, it now lies on the imaginary number line.
- If the number (line) was on the imaginary number line, it now lies on the flipped real number line.
Transformed Number: 0.00+0.00i
Add by real: 0.00
Add by imaginary: 0.00
Multiply by real: 1.00
But how does the multiplication of numbers with both non-zero real and imaginary components transform the number lines? And why these assumptions specifically? All of this might seem arbitrary at first, but the motivation behind it starts making more sense when the second dimension is taken into account.
Complex Plane
Take any point on a cartesian plane, and keep track of its coordinates as well as the x
and y
axes when it is rotated counter-clockwise by 90°:
Coordinates: 0.00, 0.00
X: 0.00
Y: 0.00
The x
and y
axes get swapped, and the x
axis also gets flipped. But this behavior is the same as the behavior of real and imaginary number lines when multiplied by i
:
So, if the real and imaginary number lines are perpendicular to each other, and a point (a,b) is represented by a+bi
— then multiplication with i
, represents a 90° counter clockwise rotation of the point.
Transformed Number: 0.00 + 0.00i
Add by Real (a): 0.00
Add by Imaginary (b): 0.00
Multiply by Real: 1.00
The perpendicular real and imaginary number lines together form a plane, where a point (a,b) on the plane can be represented as a number a+bi
. The only defining feature of this plane is that multiplying a point on it by i
rotates it counter-clockwise by 90°. And that begs the question — is it any useful if it can only represent quarter turn rotations? Or can it be generalized for other angles too?
Looking from a Different Angle
Multiplying a number by i
rotates it by a quarter turn, and multiplying a number twice by i
would rotate it by half a turn. But multiplying twice by i
is the same as multiplying by negative one — since i·i
is just -1
. So multiplying a point by negative one should have the effect of rotating it by a half turn. And it does.
Transformed Number: 0.00 + 0.00i
Add by Real (a): 0.00
Add by Imaginary (b): 0.00
Multiply by Real: 1.00
Alternatively framed, multiplying by -1
rotates a point and the plane by a half turn, and multiplying by i
, or √-1
, rotates it by a quarter turn. It is then not totally implausible to assume that multiplying by √i
should result in a one-eight turn, or 45° rotation.
But √-1
was defined as i
as an assumption. Does √i
need to be defined as another new number — that results in a 45° rotation? Before defining new numbers, it’s useful to see if √i
can be defined using the existing number system. Finding √i
is much easier analytically — compared to deducing it from its geometric properties.
The above equation also has another solution: Let a+bi = √i
=> (a+bi)^2 = i
=> a^2 - b^2 + 2abi = i
=> a^2 - b^2 = 0, 2ab = 1
=> a = 1/√2, b = 1/√2
=> √i = 1/√2 + i/√2
Alternate solutions
a = -1/√2, b = -1/√2
. However for simplicity, it has been excluded for now.
Analytically, √i
is equivalent to 1/√2 + i/√2
, and multiplying a number by 1/√2 + i/√2
rotates it by 45° — somehow the analytically derived value of √i
is consistent with its geometric properties which was only assumed to be true.
Transformed Number: 0.00 + 0.00i
Add by Real (a): 0.00
Add by Imaginary (b): 0.00
Multiply by Real: 1.00
The number 1/√2 + i/√2
interestingly lies halfway between 1
and i
. It also lies on unit circle, forming a 45° angle with the positive real number axis. Other points lying on the unit circle such as 1/2 + √3i/2
and √3/2 + i/2
form 30° and 60° angles with the positive real axis, respectively — and multiplying a point with the mentioned numbers again rotates it by 30° and 60° accordingly.
In fact multiplying a number by another number that lies on the unit circle rotates it — by the angle formed by the number on the unit circle and the positive real axis. For example, the multiplier here takes values of numbers lying on the unit circle:
(0.00+0.00i)(1.00i+0.00i) = 0.00 + 0.00i
Add by Real (a): 0.00
Add by Imaginary (b): 0.00
Multiply by Number: 0.00
If a number is multiplied by another number that is not on the unit circle, it again gets rotated by the angle formed by the multiplier and positive real axis. But now, it also gets scaled. The point lies farther or closer from the origin depending on the multiplier — the distance scales linearly with the distance of the multiplier from the origin.
(0.00+0.00i)(1.00i+0.00i) = 0.00 + 0.00i
Add by Real (a): 0.00
Add by Imaginary (b): 0.00
Multiply by Number: 0.00
Multiply by Number: 0.00
It may seem that this what the transformation of the plane looks like when multiplied by real and imaginary numbers, but it is actually wrong. Until now, the transformation of only a single number was tracked — the rest of the plane was simply a translation with respect to that transformed number. To know the actual transformation, all the points of the plane must be tracked instead of a single point.
For example, multiplying all the integers points on this plane with a number reveals that transformation of the plane is a rotation. Using more points reinforces the idea that it applies for all points in the plane. The aforementioned translation transformation of the plane also still occurs, but for addition, not multiplication. Here below, all the points in the plane are operated upon, but a single point can still be tracked for reference.
Z[i] · (0.00+0.00i)(1.00i+0.00i)
(0.00+0.00i)(1.00i+0.00i) = 0.00 + 0.00i
Add by Real (a): 0.00
Add by Imaginary (b): 0.00
Multiply by Number: 0.00
Multiply by Number: 0.00
This answers the original question of how multiplication of numbers with non-zero real and imaginary components translates to a geometric transformation: A stretching and rotation of the entire plane itself.
Definitions
The plane and the number system consisting of real and imaginary numbers can be used to represent rotations. But to be formally used in mathematical context, it is necessary to establish certain formal definitions to describe ‘rotation’ and ‘stretching’ — and to prove it holds true for all points in the plane.
Specific terms can be defined to help describe the properties of the new plane and the number system — which will aid in formally describing and quantifying the rotational and stretching effect of multiplication:
- Numbers of the form
a+bi
wherea
andb
are real numbers, andi
is√-1
are defined as complex numbers — and the plane formed by the perpendicular real and imaginary number lines is defined as the complex plane. - The argument of a complex number is defined as the angle between the line joining the origin and the complex number, and the positive real axis. The argument of a complex number
z
is usually denoted byarg(z)
. - The absolute value of a complex number is defined as its magnitude — its distance from the origin. The absolute value of a complex number
z
is denoted by|z|
.
While having formal definitions to work with is helpful, it is mostly useless if it is used to describe the properties of an idea whose underlying assumption about the premise itself is flawed, or based only on experimental observation of a few numbers — instead of a solid logical proof that applies generally to all numbers.
So there needs to be a proof that shows multiplication rotates and stretches the entire complex plane. Or more formally, it needs to be proved that the argument of the product of two complex numbers is the sum of the argument of the individual numbers, while the magnitude of the product is the product of the magnitude of its individual numbers.
Interpretation
Before going through a proof, it first helps to understand how multiplication has the effect of rotating numbers — by decomposing how the real and imaginary components of the product change with respect to its operands.
The product of two complex numbers a+bi
and c+di
is ac-bd + (ad+bc)i
. The real component of the product is ac-bd
and the imaginary component is ad+bc
. For the real component, ac
signifies that the real components of the product scales with the magnitude of the real components of its factors.
However, it also gets subtracted by bd
— the product of the imaginary components. One way to visualize it is if the imaginary components increase, their arguments increase as well, shrinking the projection of the product on the real number line.
a: 0.00 b: 0.00
c: 0.00 d: 0.00
The imaginary component of the product is ad+bc
. To keep things simpler, b
and c
can be kept constant. Then, a
can be viewed as determining the magnitude of the first number, while d
as the argument of the other number — suggesting that the magnitude of the first number gets roughly ‘rotated’ by the argument of the second number when multiplied. As either of the magnitude or argument gets bigger, the product’s projection on the imaginary axis gets bigger.
A similar argument can be made for bc
, keeping ad
constant — the magnitude of the second number gets rotated by the argument of the first number.
a: 0.00 b: 0.00
c: 0.00 d: 0.00
These are only useful to build an initial intuition, and are in no way complete, and are not at all exact.
Trigonometric Proof
To be more rigorous, some knowledge about trigonometry is required. Let us assume the same two complex numbers, but this time represented using the magnitudes |z1|
and |z2|
, with arguments α
and β
respectively. The components can then alternatively be represented as:
a = |z1|·cos(α)
b = |z1|·sin(α)
c = |z2|·cos(β)
d = |z2|·sin(β)
Multiplying the complex numbers a+bi
and c+di
would then result in:
+ |z1|·|z2|·cos(α)·cos(β)
- |z1|·|z2|·sin(α)·sin(β)
+ |z1|·|z2|·cos(α)·sin(β)·i
+ |z1|·|z2|·sin(α)·cos(β)·i
If the magnitudes are factored out, the result is:
cos(α)·cos(β) - sin(α)·sin(β)
as the real component,
cos(α)·sin(β) + sin(α)·cos(β)
as the imaginary component.
Using basic trigonometric identities, it can be deduced that resultant product will have a real component cos(α+β)
and imaginary component sin(α+β)
. It is evident that the argument of the resultant number will be α+β
.
The final result is |z1|·|z2|·(cos(α+β) + sin(α+β)i)
, after the magnitudes are factored back in. Clearly, the magnitude of the product is |z1|·|z2|
— the product of the individual magnitudes of the operands.
Since a
, b
, c
, and d
are any real numbers, the proof applies generally for all points in the plane. Thus, it can be said that multiplying with a complex number has the effect of stretching the complex plane by its magnitude, as well as rotating it by its argument.
Purpose
Complex numbers can thus be used to model two dimensional rotations. However it can be argued that these rotations can also be described using vectors and rotational matrices, and that those are considerably more powerful since they are not restricted to only two dimensions. So this means that complex numbers are not strictly ‘necessary’.
Although not absolutely necessary (in applied math), complex numbers are nonetheless very handy since they simplify descriptions of rotations — it is a simple multiplication. In fact, this simplicity is embodied in the Euler’s formula, which provides an elegant and concise way to describe periodic functions using rotations — and forms the basis of the Fourier and Laplace transforms.
The Fourier transform transforms a function to another function describing the intensity of various frequencies present in the original function. The Laplace transform is a more generalized form of the Fourier transform which also describes the rate of decay of a function.The Fourier and Laplace transform
The necessity of complex numbers is even more significant in pure mathematics — they form the algebraically closed field for real numbers.
In mathematics, an algebraic structure is a set of elements and operations, that satisfies certain rules. A field is an algebraic structure that satisfies the properties of addition, subtraction, multiplication, and division. Real numbers are fields. An algebraically closed field is a field that must have roots for a non-constant polynomial in that field — if the coefficients of the polynomial are elements of that field. For example, Algebraically Closed Fields
x^2 + 1 = 0
does not have its roots defined in real numbers even though its coefficients are real — so real numbers are not algebraically closed. However, the roots are defined for all complex numbers, and thus the set of complex numbers form an algebraically closed field.
So now the question arises: Does it warrant redefining the number system to describe rotations? Absolutely. But does it make sense that a number system that is intuitive, and is used to describe reality are called imaginary? Absolutely not. Would anyone describe negative numbers as ‘unnatural’ because they are not a part of natural numbers?