Today, we'll have an introduction to some of the essentials from discrete math that we'll build on in this course.

This course is not a math class, and most assignments will involve programming, not written answers. But we'll need shared language around some discrete math – -specifically set operations and the foundations of logic – -to analyze interesting systems.

Spectrum of correctness

We can think of the correctness of our program as existing along a wide spectrum.

On the far left, near 0% certainty of correctness, we have code that we wrote without every running.

On the far right, near (but typically not at) 100% certainty of correctness, we have implementations that have been proven (with machine-checked proofs) to meet a formal specification. Even in that case, we ultimately have to trust some layers of the proof infrastructure, so there is still a vanishingly small chance that there are correctness issues with our program.

0%                                                                            approaching 100%
            <------------------------------------------------------------------------------------->
            Never run the code!
              Make sure it compiles
                 Make sure the code runs one input, manually
                    Manually check several cases
                        Write a few tests
                             Write a LOT of tests
                                                     | automatically
                                                     generating tests


                                                                                    Write a formal specification
                                                                                    Formally prove that your code
                                                                                    implements your specification.
                                                                                    "interactive theorem proving"
                                                                                    lives here

                                            -----------------------------------
                                                        This course!
                                            Sometimes called "lightwight formal methods".
                                            we'll to the nice spot in the middle by
                                            using MODELS of our system, instead of
                                            proving the actual implementation of
                                            every component against a formal
                                            specification.

Preliminaries

The notation we use here is not an important focus of this class, but I'll try to show a few different notations that you may see outside of this class. The goal of notation is to help us be precise, I'll try to not use more than necessary.

Sets

Set definition

A set is a unordered collection of elements. Set elements are distinct – -an element is either contained in the set or not (in a programmer's view, there are no duplicates).

There are many valid ways to define a set.

Often, we'll define sets by using curly brackets ({}). We can define a small (finite) set by listing elements inside the brackets.

For example, this is the set containing the integers 1, 2, and 3:

${1, 2, 3}$ is a set

The following is not a set.

${1, 2, 1, 2}$ is not a set (not a canonical description of one).

Why not? (Think, then click!)

Because it has duplicates! It should be written as $\{1, 2 \}$

Pattern notation

We use ellipses $\dots$ to show "this pattern in the set continues".

${a, b, c, \dots, z}$ is the set of lower case letters, its size is 26.

${0, 1, 2, 3, \dots}$ is the set of natural numbers, its size is infinity.

Containment/element of

To show that an element is contained in a set, we'll use the $\in$ notation (pronounced "in").

(Aside: in class, we voted on the emoji to indicate "the programmer's view").

programmer's view on containment: $\in$ is like Python's in keyword.

$1 \in {1, 2, 3}$ is $t r u e$ .

$4 \in {1, 2, 3}$ is $f a l s e$ .

$4 \notin {1, 2, 3}$ is $t r u e$ .

Set builder/set comprehension notation

We can also define sets by specifying, in math notation, which elements are contained in the set. This is called set builder or set comprehension notation.

programmer's view: This is like Python's list or dictionary comprehension, for example: {x: x+1 for x in [1,2,3]} .

This notation still uses curly brackets, but has two parts: a variable name on the left, then the specification for which variables are in the set on the right:

$S = {x | < s o m e t h i n g >}$

The vertical bar in the middle, $|$ , separates the two parts, and is often pronounced "such that".

For example, we can define a $S$ as the set of evens by writing:

$S = {x | x % 2 = 0}$ is the set of evens.

We would pronounce this as: "The set of integers x such that x mod two equals zero".

Does any set defined by a set comprehension need to be infinite?

(Click for answer)

No, for example, the set $S$ here has size 1:

$A = {3}$
$S = {x | x \in A}$

We'll use set comprehension notation frequently in the Alloy section of this course.

Special sets

There are a few special sets in math that have their own names/symbols.

$\emptyset$ : the empty set (also could be written as ${}$ ). We'll use this often in CS340.
$R$ : the set of reals (the number line). This comes up in other areas of math often, less so in CS340.
$Z$ : the set of integers ${\dots, - 2, - 1, 0, 1, 2, \dots}$ .
$N$ : the set of natural numbers ${0, 1, 2, \dots}$ . Note that some definitions define the natural numbers as not containing 0. Computer scientists, at least in my experience, lean more toward including it!

Set operations

Now that we've defined sets, we can do interesting things with them using set operations.

On sets $S$ and $T$ :

Subset: $\subseteq$

$S \subseteq T$ is pronounced " $S$ is a subset of $T$ " and means every element of $S$ is also an element of $T$ .

This matches our intuitive English definition of a subset.

${1, 2} \subseteq {1, 2, 3}$ is $t r u e$ .

${1, 4} \subseteq {1, 2, 3}$ is $f a l s e$ .

We say it's a strict subset, $S \subset T$ , if there is at least one element in $T$ that is not in $S$ (that is, if $S$ is a subset of $T$ but is strictly smaller than $T$ ). You can remember this by comparing it to $\leq$ vs. $<$ on numbers.

Question: do people also write the subset the other direction, like $\geq$ and $>$ ?

This notation ( $\supseteq$ ), does denote the superset when used in that direction: $S \supseteq T$ means every element of $T$ is an element of $S$ . This does occasionally come up.

An important aside: subset and containment are different!

$1 \in {1, 2, 3}$ is $t r u e$ .

$1 \subseteq {1, 2, 3}$ is $f a l s e$ .

${1} \subseteq {1, 2, 3}$ is $t r u e$ .

programmers view: the types are different. Subset works on things of the same type, contains needs an element type. For example, a subset of Set<Integer> would have type Set<Integer> . An element of Set<Integer> would have type Integer .

Equality: $=$

$S = T$ is pronounced " $S$ equals $T$ " and means the sets have the same elements.

${1, 2} = {2, 1}$ is $t r u e$ .

Exercise: define equality using only the $\subseteq$ operator.

(Click for answer)

$S = T$ if and only if $S \subseteq T$ and $T \subseteq S$ .

Union: $\cup$

$S \cup T = {x | x \in S or x \in T}$
$S \cup T = {x | x \in S \lor x \in T}$

Note: $\lor$ is just or . In CS240 notation: $+$ .

$S \cup T$ is pronounced " $S$ union $T$ " (or sometimes " $S$ or $T$ ")and the result is a new set that contains all of the elements that are in either $S$ or $T$ .

${1, 2} \cup {2, 3} = {1, 2, 3}$

To remember: the word "union" sounds like bringing people together! In my CS240 intuitive framing, union is the more "permissive" operation.

Intersection: $\cap$

$S \cap T = {x | x \in S and x \in T}$
$S \cap T = {x | x \in S \land x \in T}$

Note: $\land$ is just and . In CS240 notation: $\cdot$ .

$S \cup T$ is pronounced " $S$ intersection $T$ " (or sometimes " $S$ and $T$ ") and the result is a new set that contains all of the elements that are in both $S$ or $T$ .

${1, 2} \cap {2, 3} = {2}$

To remember: street "intersections" are the parts of the road where both streets meet. In my CS240 intuitive framing, intersection is the more "strict" operation.

Complement: $S^{c}$ , $\overset{―}{S}$ , $S^{'}$

$S^{c}$ is pronounced "the complement of $S$ " or "not $S$ " and means all of the elements that are not in S.

This could be an infinite number of elements if we are considering the entire universe to be our domain. Often, we'll combine the complement operator with an intersection operator to talk about smaller, more useful sets.

Exercise:

$S = {1, 2}$
$T = {2, 3}$

Find $T \cap S^{c}$ .

(Click for answer)

$T \cap S^{c} = {3}$

Set difference: $∖$

$T ∖ S$ is pronounced " $T$ minus $S$ " or " $T$ set difference $S$ " and means the elements in $T$ and not in $S$ .

Note that this is equivalent to the previous exercise: $T ∖ S = T \cap S^{c}$ . Another way to write this would be ${x | x \in T, x \notin S}$

Note that when we use "," inside a set comprehension, this means "and". It would be equivalent to write ${x | x \in T \land x \notin S}$ .

Ordered tuples/pairs

We can express ordered pairs or tuples using parenthesis.

programmer's view: this is like Python tuples.

Aside: Like Python tuples, sets can be heterogenous (that is, have elements of seemingly different types)

So:

$W = {1, (3, 4), (5, 6, 7)}$ is a valid set.

How many elements does $W$ have?

(Click for answer)

$W$ has size 3.

Relations

Relations are a useful special type of set.

Binary relations

Binary relations are sets of ordered tuples.

${(1, 2), (3, 4)}$ is a binary relation with two elements.

$(1, 2) \in {(1, 2), (3, 4)}$ is $t r u e$ .

"Student of" is a binary relation.

( you , me ) $\in$ "student of" is $t r u e$ .

We can define relation "infix" by putting the name of the relation between the things in the pair:

you "student of" me is $t r u e$ .

General relations

A general relation has elements that are $n$ -tuples, where $n$ is fixed for a given relation.

{(1, 2, 3), (4, 5, 6)} is a relation on 3-tuples.

{(1, 2, 3), (4, 5)} is not a relation (but it is a set).

Back to the programmer's view: we can make analogies to other data representations.

Discuss: is a relation more similar to a Python dictionary or an Excel spreadsheet (/CSV file)? Why?

(Click for answer)

A relation is more like a Python dictionary in these ways:

A Python dictionary's elements are unordered.
A binary relation is more like a Python dictionary in that they both can only have elements with two ordered components.

A relation is more like a spreadsheet in these ways:

A general relation is more like a spreadsheet in that they both can have more than two ordered components per element.
Both a general relation and a (well-formed) spreadsheet should have the same size of tuple per element.
Relations that are not functions (more on that in a moment), such as ${(1, 1), (1, 2)}$ , can be represented by a spreadsheet but not represented by a Python dictionary, because Python dictionaries cannot map the same key to multiple values. For this reason, I tend to like the spreadsheet analogy better for my own mental model.

Note that one weakness of the spreadsheet mental model is that the order of rows could matter in a spreadsheet (and there could be a row that is totally a duplicate), whereas a relation is a set with distinct and unordered elements.

Logic

We'll cover logic more in the next lecture, but for now, a quick note on logical implication.

Implication: $\Rightarrow$

In short, this is the truth table for implication:

                A B    A => B
                -------------
                T T    T
                T F    F
                F T    T
                F F    T

For example, consider if I said:

"If pigs fly tomorrow, I will give you all an A in CS340 without coming to class"

Which we can also write as:

"Pigs flying tomorrow implies I will give you all an A in CS340 without coming to class"

These statement are both $t r u e$ !

The statements themselves are not a lie, since the first part of the conditional is never true. In logic, false implies anything . More on this on Monday.

This is the truth table for equality/bidirectional implication, which might feel more intuitive:

                A B    A <=> B
                -------------
                T T    T
                T F    F
                F T    F
                F F    T

To be continued next week!

Spectrum of correctness

Preliminaries

Sets

Set definition

Pattern notation

Containment/element of

Set builder/set comprehension notation

Special sets

Set operations

Subset: ⊆

Equality: =

Union: ∪

Intersection: ∩

Complement: S c , S ― , S ′

Set difference: ∖

Ordered tuples/pairs

Relations

Binary relations

General relations

Logic

Implication: ⇒

Subset: $\subseteq$

Equality: $=$

Union: $\cup$

Intersection: $\cap$

Complement: $S^{c}$ , $\overset{―}{S}$ , $S^{'}$

Set difference: $∖$

Implication: $\Rightarrow$