Architecture
Assignment: Architecture
- Assign: Thursday 5 October
- Due: Monday 16 October
- Policy: Individual graded synthesis assignment
- Submit: Upload a PDF written on the cs240-arch-worksheet-f23.pdf submission sheet
- Reference:
Contents
Exercises
Please write your answers on the cs240-arch-worksheet-f23.pdf submission sheet to streamline the grading process. Submit your work by scanning a PDF of all worksheet pages (in order) and uploading it to Gradescope. Remember that this course uses your Gradescope account attached to your Wellesley email address (the variety that matches your username). You can merge accounts across multiple email addresses if helpful.
Notes:
- This assignment is worth 60 points.
1. A Loopy Program [14 points]
In this exercise you will simulate the execution of an HW ISA program with a loop.
Background
In the solution slides for the HW ISA architecure, there are examples (slides 8-11) of using an execution table to simulate the execution of HW ISA programs, one of which contains a loop.
Here we consider a different program that has a loop to remind
you of the conventions used in an execution table.
Suppose that register R2 has been initialized with
the signed decimal number 3 and register R3 has been initialized
with the signed decimal number 7. Also suppose that
the program P0 has the following Instruction Memory (IM):
IM
| Address | Contents |
| 0x0–0x1 | BEQ R2, R3, 3 |
| 0x2–0x3 | ADD R2, R1, R2 |
| 0x4–0x5 | SUB R3, R1, R3 |
| 0x6–0x7 | JMP 0 |
| 0x8–0x9 | HALT |
Then here is an execution table that shows the step-by-step execution of the
program P0.
Note that the way you should read the meaning of BEQ R1, R2, offset is
that the change to the PC, PC <- PC + 2 + offset*2 replaces the
non-branch change to the PC (PC <- PC + 2), as we designed in the
HW ARCH slides. Specifically, note in the following example that the PC
goes from 0 to 8 (not 10) based on the instruction BEQ R2, R3, 3, because
PC + 2 + offset*2 = 0 + 2 + 3*2 = 8.
Execution table
| PC | Instruction | State Changes |
| 0x0 | BEQ R2, R3, 3 | PC ← PC+2 = 0+2 = 2 (because 3 = R[2] != R[3] = 7) |
| 0x2 | ADD R2, R1, R2 | R[2] ← R[2]+R[1] = 3+1 = 4; PC ← PC+2 = 2+2 = 4 |
| 0x4 | SUB R3, R1, R3 | R[3] ← R[3]-R[1] = 7-1 = 6; PC ← PC+2 = 4+2 = 6 |
| 0x6 | JMP 0 | PC ← 2*0 = 0 |
| 0x0 | BEQ R2, R3, 3 | PC ← PC+2 = 0+2 = 2 (because 4 = R[2] != R[3] = 6) |
| 0x2 | ADD R2, R1, R2 | R[2] ← R[2]+R[1] = 4+1 = 5; PC ← PC+2 = 2+2 = 4 |
| 0x4 | SUB R3, R1, R3 | R[3] ← R[3]-R[1] = 6-1 = 5; PC ← PC+2 = 4+2 = 6 |
| 0x6 | JMP 0 | PC ← 2*0 = 0 |
| 0x0 | BEQ R2, R3, 3 | PC ← PC+2+(2*3) = 0+2+6 = 8 (because 5 = R[2] == R[3] = 5) |
| 0x8 | HALT | program stops executing |
Your Task
This task considers a HW ISA program P1 with the following Instruction Memory IM:
IM
| Address | Contents |
| 0x0–0x1 | ADD R0, R0, R2 |
| 0x2–0x3 | ADD R1, R1, R3 |
| 0x4–0x5 | ADD R3, R3, R4 |
| 0x6–0x7 | BEQ R3, R4, 3 |
| 0x8–0x9 | ADD R2, R4, R2 |
| 0xA–0xB | SUB R4, R1, R4 |
| 0xC–0xD | JMP 3 |
| 0xE–0xF | HALT |
-
[8 points] In the worksheet, fill in the execution table for program
P1. Use the same notational conventions used in the example execution table forP0above. Any numbers beginning with0xwill be interpreted as hexidecimal; any numbers not beginning with0xwill be interpreted as decimal. -
[3 points] Show the final values of the registers
R2,R3, andR4when the program execution halts. Again, any numbers beginning with0xwill be interpreted as hexidecimal; any numbers not beginning with0xwill be interpreted as decimal. -
[3 points] Alice wants to translate program
P1(except for the finalHALTinstruction) into a sequence of statements in a version of C with a 16-bitinttype. Fill in the rest of her program with assignments to variables namedR2,R3, andR4that represent the registers with these names inP1and awhileloop. (We will not be picky on C syntax, as your long your statements are in the right order you will receive full credit).int R0 = 0; int R1 = 1; int R2 = R0 + R0; // ...
2. Taking Control (8 points)
In this exercise you will design the Control Unit for the HW microarchitecture we designed in class. (Base your design on the figure below labeled “HW microarchitecture datapath”.) Our design focused on connecting the datapath components (e.g., the register file, ALU, memory, etc.), but we left the Control Unit as black box.
Flesh out a truth table with 4-bit HW instruction opcodes as inputs, plus
one output for each of the control lines, outputs of the Control Unit
that are control inputs to other components. As shown below, the table should have
one row for each of the seven instructions (one row for each opcode)
except JMP from the HW ISA Instructions table
here.
| Instruction | Opcode3:0 | Reg Write | ALU Op3:0 | Mem Store | Mem | Branch |
| LW | 0000 | |||||
| … | … | … | … | … | … | … |
The control lines we need are:
- Reg Write (1 bit): controls the write-enable of the register file. If an instruction should both read and write to the register file, the write-enable should be 1 (that is, the read data lines still produce the contents of their respective registers even when the write-enable is 1).
- ALU Op (4 bits): controls the ALU (with the same 4 ALU control lines
we developed earlier; see the end of the Arithmetic slides).
- From left to right: Invert A, Negate B, Op1, Op0.
- Mem Store (1 bit): controls the write-enable of the data memory.
- Mem (1 bit): controls three multiplexers deciding which input
to provide for:
- the register file’s write address;
- the register file’s write data; and
- the ALU’s second operand.
- Branch (1 bit, added during class): controls whether to choose the next PC based on a branch target and test.
Please note that Mem Store, Mem, and Branch are all active high (0 = False, 1 = True).
3. Instruction Not Missing (10 points)
The HW ISA introduced in class lacks an instruction for bitwise
not. A logical choice would be NOT Rs,Rd, which takes the
bitwise not of the value in source register Rs and stores
into destination register Rd. You will design two implementations
of the proposed NOT instruction.
-
[4 points] One way to implement
NOTis to replace any occurrence of aNOTinstruction by a sequence of existing HW instructions that has the same effect asNOT.Given source register identifier
Rsand destination register identifierRd, write a sequence of one or more HW instructions that has the same effect asNOT Rs,Rd. The replacement instructions may use values from any registers, but must store values only inRd. Instructions may not store values in any other register or data memory location. The sequence may use only the instructions defined in class. (See the instruction table here.)Hint: Think about the relation of bitwise not, arithmetic, and two’s complement.
-
[4 points] A second way to implement
NOTis to add a new instruction to the ISA, complete with an unique encoding, and to extend the microarchitecture to implement it. First, implement a related instruction (NAND) in this way.- [3] Define a 16-bit encoding of the
NANDinstruction as in the instruction table here. Choose any unused opcode for the encoding. - [1] Add a row to the Control Unit Table from the previous problem
to indicate how to control the datapath to implement
NAND. Do not add extra logic to the datapath diagram.
- [3] Define a 16-bit encoding of the
-
[2 points] Consider how to implement
NOTusingNAND. Define an encoding for theNOTinstruction directly in the ISA. Choose the encoding carefully to minimize additions to the Control Unit and the datapath. Assuming you have completed (b), an ideal encoding requires no new rows or columns in the Control Unit table.Hint: Think of how to translate the
NOTinstruction to aNANDinstruction and use this intuition when designing the bit encoding ofNOT.
4. Jumping into the Unknown [15 points]
-
[10 points]
Add logic to the HW microarchitecture depicted in the figure above to implement the
JMPinstruction, which we did not implement in class.JMPsets the PC to the absolute instruction address given by its argument (a number) multiplied by 2. For example,JMP 3sets the PC to 0x6, causing the instruction stored at address 0x6 (i.e., the 3rd instruction in the program) to be executed next.-
[7 points] In the wiring diagram for the microarchitecture on the worksheet, you will add:
-
One new Jump control line from the Control Unit (in Problem 4), carrying 1 if the current instruction is
JMPand 0 otherwise. -
Wiring and logic using the Jump control line and the offset bits from the
JMPinstruction encoding to store theJMPinstructions’s target address to the PC if and only if the current instruction isJMP. You will need to “cut” one existing wire to splice in some new logic.
Notes:
-
The
JMPhardware will need to use a Zero Extend unit that is different from the existing Sign Extend unit. -
Do not attempt to reuse the existing Shift Left by 1 unit; if you need such a shift, use a separate copy of this unit.
-
-
[1 point] In the Control Unit Truth Table from the previous problem, flesh out the contents of the the Jump control line in the final column.
-
[2 points] In the Control Unit Truth Table from the previous problem, flesh out the contents of the row for the
JMPopcode.
-
-
[5 points] Assume
R2andR3hold unknown input values when the following program starts.R4will hold an output when the program stops. The address of each instruction in the instruction memory is shown at left. The (new)HALTinstruction stops the computer.# R2 and R3 hold program inputs here. 0: AND R2, R2, R4 2: AND R3, R3, R5 4: BEQ R5, R0, 3 6: SUB R5, R1, R5 8: ADD R4, R4, R4 A: JMP 2 C: HALT # Stops execution. # What value is in R4?- [3 points] Execute the program assuming
R2holds5andR3holds3. What are the final values in the registersR2,R3,R4, andR5when the program reachesHALT? You may write your answers in either decimal or hex, but must use a0xprefix for hex. - [2 points] Try a few more examples with (small) positive numbers in
R2andR3. What does this code do withR2andR3to computeR4? Answer with one simple line of valid C code of the form:R4 =exp;, where exp is a simple C expression using the the variablesR2andR3(corresponding to the HW registers) and any simple C operations that accomplish the same effect as these combined HW instructions. exp must not include C statements like loops or conditionals.
- [3 points] Execute the program assuming
5. Reconstructing Memories (RAM Puzzles) [13 points]
In these exercises, you will show how one or more provided RAMs (random access memories) can be used to implement a RAM with different dimensions that has the same number of stored bits as the provided RAMs.
Provided RAMs
In an A×B RAM, A is the number of addressable locations (cells) in the memory and B is the width of each location (cell) in bits. Recall that a RAM is much like a register file, but the underlying storage technology is not based on flip-flops. Memory addresses are analogous to register numbers in a register file; memory locations (cells) are analogous to registers in a register file. Unlike the register file we designed, these RAMs support reading only one location at a time.
The provided RAMs have the following form:
- Each RAM has a Data In port, a Data Out port, an Address input and a Write Enable input.
- Each RAM is always reading from the location given by the current Address, expressing its contents on Data Out.
- Each RAM also stores the value on Data In into the location given by the current Address if and only if the Write Enable input carries a 1.
- Each RAM is indivisible and must be used as a “black box” (that is, you cannot modify the working of components inside each RAM).
Rules
You will design new RAMs with the same kind of inputs, outputs, and externally observable behavior, but with different dimensions. For each:
- On the worksheet, connect inputs and outputs of the larger box with those of the smaller boxes representing RAMs, possibly adding combinational logic building blocks (e.g., muxes, decoders, gates) that make the logic correct.
-
For each bus that contains a multi-bit value (e.g., Address); use hatch marks and width annotations to mark how many bits are carried by that bus. You must specify the number of bits (width of bus) carried by each bus (that is, all lines will be interpreted a single bit values unless they are annotated with a width).
- When a larger bus splits into or is joined from smaller buses, you must show which bit ranges from the larger bus correspond to the width of each smaller bus, as shown in the examples below:
Exercises
-
[5 points] Design a 256×8 RAM by adding minimal wiring and logic to two identical 256×4 RAM components. Label the widths (in bits/lines) of the ports Address, Data In, and Data Out for each 256×4 RAM you use and for the 256×8 RAM you implement.
-
[8 points] Design a 64K×8 RAM by adding minimal wiring and logic to a single provided 32K×16 RAM. (K = “kilo” = 1024)
- Unlike the previous part, this one provided RAM component has
the exactly same capacity as the desired final product, though it has
different dimensions:
- The provided RAM has fewer addressable memory locations (32K) than the RAM you will design (64K). (Hint: How many bits do you need to specify for the address for the provided RAM? That is, what is log2(32*1024)?)
- The provided RAM has a larger addressable unit of data (16 bits = 2 bytes) than the RAM you will design (8 bits = 1 byte).
- The Data Out of the inner RAM supplies 2 bytes, but the Data Out of the outer RAM can provide only 1 byte. How will you determine which of the 2 bytes is chosen? (Hint: what combinational logic building block chooses between inputs?)
- The Data In of the outer RAM supplies 1 byte, but the Data In
of the inner RAM requires 2 bytes.
- Where will the other byte that should come from such that only the supplied byte is written? Think carefully about the combinational logic to implement this.
- If you join 2 1-byte buses to create a 2-byte bus, your wiring notation must unambiguously indicate which is the MSB and which is the LSB.
- Unlike the previous part, this one provided RAM component has
the exactly same capacity as the desired final product, though it has
different dimensions:
5 Extra Fun Affixed or Afloat in a C of Numbers
-
The Sea language (an imaginary programming language otherwise identical to C) supports a
signed fixed8ths chartype1 using an 8-bit signed fixed point number system with precision to the 1/8 (eighths) place.-
What are the minimum and maximum numbers representable by a
signed fixed8ths char? Write your answer as a base ten number using fractions or decimals. -
What are the minimum and maximum numbers representable by a
signed fixed32nds char, which uses 8-bit signed fixed-point representation with precision to the 32nds place? Write your answer as a base ten number using fractions or decimals. -
8-bit two's complement adder.
A, B, and Sum are 8-bit buses.We provide a hardware adder design (at right) that supports 8-bit two’s complement addition (e.g., for
signed charnumbers). Draw a logic diagram for a hardware adder that supports thesigned fixed8ths charnumber system. You may use the provided two’s complement adder as a black box (no need to draw the internals) and extend it with whatever wiring and logic is necessary. There is a “trick” here! Keep it simple.
-
-
Using the 6-bit floating-point encoding defined in these slides, give decimal notation of the numbers represented by the following floating-point encodings.
110101100001011100000011010010111101
Submission
Please write your answers on the cs240-arch-worksheet-f23.pdf submission sheet to streamline the grading process. Submit your work by scanning a PDF of all worksheet pages (in order) and uploading it to Gradescope. Remember that this course uses your Gradescope account attached to your Wellesley email address (the variety that matches your username). You can merge accounts across multiple email addresses if helpful.
-
charis often used for characters in C, including with literal character values like'c', but it is just a number type. It is synonymous withsigned charin most implementations, just a signed 8-bit integer.unsigned charis an unsigned 8-bit integer. ↩
