Today, we'll:

Discuss the 2 readings: PBT in Practice and Evaluating the Small Scope Hypothesis
Introduce the tool we'll use for the next ~5 weeks: the Alloy language and Analyzer.

Paper discussion: PBT in Practice and Small Scope Hypothesis

Reminder: this class does have readings!

Here is the summary of notes from our paper discussion:

Friday's: PBT in Practice
Today's: Evaluating the “Small Scope Hypothesis”

For everyone:
- Warmup: did you do anything outside this weekend?
    - Hackathon, new dog park!
- In what context did this paper study PBT? How is this similar/different from how we are using PBT in the assignment?
    - In a single workplace, software engineering, Jane Street, OCaml
    - Similar: check for properties, random input generation, compare to output
    - Different: assignment: *you* are doing input generation yourself, the paper: uses QuickCheck
- What is the "small scope hypothesis"? How does it relate to PBT?
    - Define: most bugs can be found by inputs small (ish) in size
    - Relate PBT: both "agree" that we don't need to exhaustively enumerate
        - Difference: small scope: everything up to a size.
- What questions do you have about either reading?
    - Maybe come back to this :)

Per group
    Answer your group's question first, then if time answer the other questions (in whatever order).

1. What did the study participants find to be most beneficial about PBT?
    - It makes their software more robust, reliable. Makes them more confident in their code, other people to build on libraries that have been tested by PBT
    - It makes documentation more clear, across a big company like Jane Street. 
    - Easier to find edge cases: though about these during code, PBT, cases they have not thought about. 
2. What are the main barriers to adoption of PBT?
    - Time constraints have a big range. 
    - Tools don't always convey data on how they are working.
    - Too much time thinking about the properties stage. 
    - Passing property does not mean to code 100% works, might just not generate the right input. 
3. What is “shrinking?”? How does it impact the use of PBT?
    - Also called "test case reduction" 
    - Shrinking makes the input that triggers a bug more readable. 
    - Shrinking makes it easier to understand bugs. 
4. How can teams make it easier for developers to use PBT?
    - Time is a large barrier. Writing the properties, shrinking. 
    - Input generation: generating inputs that satisfy pre-conditions. Some type-based solutions. 
    - Easier if it took less time. 
    - Design up front with PBT in mind.

Introduction to Alloy

Alloy

The Small Scope Hypothesis is the motivation behind the design of Alloy.

The core idea behind Alloy is to model the design of software, independently from (and perhaps even before) the implementation. In practice, Alloy has been used to find bugs in decades-old, lauded distributed systems protocols and find new security vulnerabilities that could rival spectre and meltdown.

Real world context: Alloy for security vulnerabilities

We spent some time in class discussing the Spectre and Meltdown software vulnerabilities, which rocked the computing world when they were discovered by several research teams in 2018.

From the above website:

While programs are typically not permitted to read data from other programs, a malicious program can exploit Meltdown and Spectre to get hold of secrets stored in the memory of other running programs. This might include your passwords stored in a password manager or browser, your personal photos, emails, instant messages and even business-critical documents.

In short, Spectre and Meltdown are both "timing-channel" vulnerabilities that allow an attacker to infer secret program values by observing the time it takes programs to execute.

While these programs were discovered independently, researchers at Princeton used Alloy to discover previously unknown, similar attacks. You can read the full paper here.

While Alloy is not necessarily the most popular system analysis tool, this case study is just one example of its substantial real-world impact. We'll also start with Alloy as our first modeling tool because it is (relatively) easy to learn, with a simple syntax and a visual interface.

Models as imperfect representations

As the statisticians "say", "all models are wrong, but some models are useful".

A model essentially by definition leaves off some of the details of the real system being modeled!

In physics, what sort of factors do we typically ignore? For example: when asked to predict the outcome of a block sliding down a ramp.

(Think, then click!)

In basic mechanics, we ignore:

Friction.
Air resistance.

Physicists ignore these details because they are negligible to the overall system behavior in basic mechanics.

Rather than trying to model every part of a computer system exhaustively, we'll model things on an "as needed" basis. Our models will start as very general, then we will progressively add details.

Modeling vs. programming

Modeling is distinct from "normal" programming.

An empty program does nothing.
An empty model allows all behavior.

(Credit to Daniel Jackson, the principal creator of Alloy, via Tim Nelson, the professor who first taught me Alloy).

Writing models in Alloy will feel quite different from writing programs!

In traditional, imperative programming, you provide specific instructions to be carried out in a designated order. Writing models in Alloy is a example of declarative programming, which describes rules, or constraints, that determine the behavior of the model. The order of these constraints typically does not matter!

Example: our malloc mystery (mystery 2)

Recall the memory allocator implementation (malloc) mystery that we discussed on the first day of class:

All blocks are "alloc" or "free". 
Blocks have at most one "predecessor"
                             
       HEAP        
   +-----------+ 
b0 |alloc 0b11 +
   +-----------+ 
b1 |    ???    |  
   +-----------+ 
b2 |free  0b?0 +
   +-----------+ 

The question: given that we don't know the allocation status of  ???, 
is it possible for us to know if the following is true?
    "There is some free block whose predecessor is allocated"

What do we need to model to answer this question?

(Think, then click!)

We'll start with the basics:

Blocks
Allocation status
The predecessor field

We'll use Alloy to perform this analysis for us (to some extent) by creating a new model.

Note that we'll use Alloy 6 in this course.

Sigs

To model a basic concept in Alloy, we'll use a sig. sigs are conceptually like classes in an object-oriented language like Java.

We'll start with a sig for Block:

sig Block {
}

run {}

We'll also add an empty run {} command, which tells the Alloy Analyzer to find us some examples of the model we have defined. When we click Execute, Alloy will pop up a window to view the example. We can view examples as a graph-based visualization, in plain texts, or as a tree of nested data.

With just a Block sig defined, we might see a single Block as the first instance found. Alloy has a univ built-in sig that contains everything in the model's universe, in this case, our single Block.

Model vs. instance, sig vs atom

A model is the Alloy code we write that defines a universe and some constraints on that universe. An instance is a specific example the Alloy Analyzer finds for that model (the Analyzer usually finds many instances for a given model).

A sig defines a concept in the model (like a class in Java). We'll call a specific instance of a sig in a model an atom (like an instantiated object in Java). Block is the sig, Block$0 is an atom.

Scope

Alloy allows you to write infinite models, but the analyzer will only check finite subsets of behavior. Analyzer commands must be scoped (by default, if you leave the scope off, Alloy runs with scope 3).

Alloy includes an integer type in the universe as well by default. We can run for up to 5 blocks (not exactly 5) and no integers with

run {} for 5 but 0 Int

Fields

Like in Java or Python, we can add fields to our concept definition (under the hood, these will work very differently in Alloy, but the syntax is about the same).

Let's define a new Status sig and add it as a field.

sig Status{}

sig Block {
  status: Status,
}

When we run this, Alloy finds instances with a whole bunch of different statuses, which we don't want. We want two specific ones: alloc and free.

How might we model the allocation status in an object-oriented language?

(Think, then click!)

One answer: a boolean.

Alloy does have a boolean type in the standard library, but let's imagine we might want to extend the status type in the future.

In OOP, we might use an enum or subclass for that.

In Alloy, we'll use an inheritance relationship to describe child sigs.

sig Status {}
one sig Alloc extends Status{}
one sig Free extends Status{}

However, this model still allows plain Status atoms, which we don't want.

How might we model a superclass that we never instantiate in an oriented language?

(Think, then click!)

Making it an abstract class (e.g., in Java).

We can do that in Alloy too:

abstract sig Status {}
one sig Alloc extends Status{}
one sig Free extends Status{}

The one modifier tells Alloy to only allow one atom each of Alloc and Free, since we do not need separate atoms for these concepts.

Fields as relations

Now, we can add another field that points to the predecessor block.

sig Status{}

sig Block {
  status: Status,
  prede: Block,
}

(pred is a keyword in Alloy, so we'll use prede instead).

When we run this, we now see that prede looks something like this in the text representation:

prede = {Block$0->Block$1, Block$1->Block$2}

Does this look familiar, from last week? prede is a binary relation relating each block to its predecessor! In Alloy this is explicit: fields are relations between the sig type you are defining and the sig type of the field.

Alloy's modeling

We'll come to understand this incrementally, but: everything in Alloy is a relation.

Instances of objects make up sets (which you can think of a unary relations).

Single instances, or scalars, are singleton, unary relations (e.g., the integer 1 under the hood in alloy is the set

{1}

Fields are relations between the defining sig type and the listed type of the field.

Note: relations in Alloy are flat: nothing is truly nested. We can picture each relation as a flat spreadsheet with distinct rows (if you print relations in the Alloy Evaluator for a specific instance, it will show in a format like a spreadsheet).

Default multiplicity on fields

If we run the previous model, we see that each Block atom always has a predecessor, even though in our diagram the first block has no predecessor.

sig Block {
  status: Status,
  prede: Block,
}

Here, because the fields do not specify a multiplicity (size of set) on the field, Alloy defaults to one. Instead, we will use the lone keyword on the prede field to indicate each Block can have zero or one predecessor.

We'll update this segment to:

sig Block {
  status: Status,
  prede: lone Block,
}

And with that, we now have a model for our overall (simplified) heap structure. We could imagine restricting the heap further, to force a linear order of blocks. For now, we'll keep the overall heap definition simple, and switch to modeling the specific question around predecessors.

Predicates in Alloy

The question we want to answer is: given our starting conditions on three blocks, is there some free block whose predecessor is allocated?

To model this in Alloy, we'll constrain our starting condition. In Alloy, a predicate that can be true or false is defined with a pred block, that looks almost like a function. Each statement within the block adds a constraint.

pred init(b0, b1, b2: Block) {
  b1.prede = b0
  b2.prede = b1
	
  b0.status = Alloc
  b2.status = Free
  no b0.predecessor -- the first block has no prede
}

Note that we do not need semicolons to separate lines. Each line is a new constraint on the model (so we could equivalently link them with and operators; the order does not matter).

In the final line, we use the no multiplicity to indicate that the block b0 has no predecessors.

Predicates in Alloy are like predicates in logic: they can be either true or false. Importantly: just writing the predicate does not constrain the model to have the predicate be true, unless we invoke it (directly or indirectly) in an analyzer command.

Now when we run this model, we see the initial setup we expect. In viewing the instances, we can color-code block by changing the Theme.

Checking an implication

Now, we want to check if the full scenario in our mystery: must some free block have a predecessor that is allocated?

We'll start by writing a new predicate:

pred someFreeWithAllocPred{
  some b: Block | 
    b.status = Free and b.prede.status = Alloc
}

The | is pronounced "such that", just like in logic.

some is like the existential quantifier: it says there exists some Block where the following lines after the | are true.

We can run the following predicate and command to see examples where someFreeWithAllocPred is true:

pred show {
    some disj b1, b2, b3: Block | init[b1, b2, b3] 
    someFreeWithAllocPred
}
    
run show for 1 but 3 Block, 0 Int

When we click through the different instances the Analyzer finds, our property is true in each. BUT, so far, we have only asked the Analyzer to find examples, rather than to actually reason about whether the statement must always be true.

Asserts and checks

Can we instead ask Alloy if there is every a counterexample to this statement?

Yes! We can use an assert block with a check command to check for counterexamples to the mystery, written as an implication. We'll use the general setup of: initial setup implies final condition. In Alloy:

assert show {
	all b1, b2, b3: Block | init[b1, b2, b3] => someFreeWithAllocPred
}

check show for 1 but 3 Block, but 0 Int

The check command, which invokes a named assert or an unnamed block, tries to find instances (within the given) scope where the assertion is NOT true. That is, check attempts to find counterexamples that would invalidate the assertion. See the comments below for a comparison of run and check.





























































































































/*

All blocks are "alloc" or "free". 
Blocks have at most one "predecessor"
                             
       HEAP        
   +-----------+ 
b0 |alloc 0b11 +
   +-----------+ 
b1 |    ???    |  
   +-----------+ 
b2 |free  0b?0 +
   +-----------+ 

The question: given that we don't know the allocation status of  ???, 
is it possible for us to know if the following is true?
    "There is some free block whose predecessor is allocated"
*/


// Descriptions of objects: "sig"
// abstract: don't have any just Status, have the sub-sigs
abstract sig Status {} 

// extend works like in an object-oriented language. 
// Alloc is a subset of Status. Free is a subset of Status.
// "one" here means there will only be one atom of each.
one sig Alloc extends Status {}
one sig Free extends Status {}

// prede: relation Block -> Block 
// succ: relation Block -> Block

// prede and succ are each a subset of Block x Block

sig Block {
  status: Status, // syntax to define a field: "one Status"
  prede: lone Block, // lone: "either zero or one"
  succ: lone Block
}

// Note" "pred" is a keyword in Alloy, so we'll use "prede"
// instead. 

// "fact"s in Alloy are constraints that are always true. 
fact {
  // A more verbose way to write the following statement:
  //all a, b : Block | a->b in prede <=> b->a in succ

  // Relational operator: ~ is transpose (switch order)
  ~prede = succ
}
 
// "pred"s in Alloy are predicates that can be true or 
// false for a given model (or parts of a model). 
// "pred"s don't apply unless you invoke them!

// In this pred, set up the specific problem.
pred init(b0, b1, b2: Block) {
  no b0.prede
  b1.prede = b0
  b2.prede = b1
  b0.status = Alloc
  b2.status = Free
}

// In this pred, describe the question we are solving.
pred someFreeWithAllocPred {
  some b: Block |
    b.status = Free and b.prede.status = Alloc
}

// To actually analyze the problem, relate these 
// two "pred"s as, conceptually,
// "init" implies "someFreeWithAllocPred"


// The following is equivalent to the above, if
// someFreeWithAllocPred is edited to takes 
// an argument (then the analyzer will show us
// which block the pred is true for).
// pred someFreeWithAllocPred(b : Block) {
//   b.status = Free and b.prede.status = Alloc
// }

//run {
//  some disj b1, b2, b3 : Block | init[b1, b2, b3]
//  someFreeWithAllocPred
//  some Block
//} for 5 but 0 Int, exactly 3 Block

assert showProperty {
  all b1, b2, b3 : Block | init[b1, b2, b3] => someFreeWithAllocPred
  // The following is equivalent to the above, if
  // someFreeWithAllocPred is edited to takes 
  // an argument (then the analyzer will show us
  // which block the pred is true for).
  // 
  // all b1, b2, b3 : Block |
  //   some  b: Block | 
  //     init[b1, b2, b3] => someFreeWithAllocPred[b]
      
  // Here is an example of an assertion that *does*
  // have a counterexample:
  // some  b: Block | someFreeWithAllocPred[b]
}

check showProperty for 5 but exactly 10 Block

/*
Run: predicate (or block of statements {...}),
ask the Analyzer to show instances where the 
block is True.

If our model is correct and there is not a bug in
the system, we expect "run" to show examples.

Check: assertion  (or block of statements {...}), ask the Analyzer to show instances where the 
block is False. In other words, asks the Analyzer 
to find a counter example. 

If our model is correct and there is not a bug 
in the system, we expect "check" to show "no 
counterexamples found".
*/

Summary of useful Alloy Syntax

See the Resources on the website for more!

Some of these we did not get to today, but will see in-class examples of soon.

Built-in sets:

univ everything in the model
none the empty set

Quantifiers/Multiplicities

some >= 1, equivalent to "there exists"

\exists

all every atom, equivalent to "for all"

\forall

no 0, equivalent to "there does not exist"

\neg \exists

one exactly 1
lone 0 or 1

Disjointedness/Distinctness

disj: used to specify that we mean distinct atoms

Set operators:

+ union
& intersection
- difference
in subset
= equality

Relation operators:

-> cartesian product
. join
[] join, flipped syntax
~ transpose

Logical operators:

&&, and, ||, or, !: same as Java/Python
=> implies
<=> iff

Sig relationships
S extends T

abstract S: no atoms other than those contained in its subsets