Mite’s semantics

Reuben Thomas

25th October 2000

1 Introduction

Mite’s semantics are defined in terms of an abstract machine, which consists of a state and a set of rules for transforming it according to a program.

2 Definitions

Quantity: a string of bits
q[i… j]: the quantity consisting of bits i to j inclusive of quantity q
Width: the number of bits in a quantity
A: either 32 or 64
Word: an A-bit quantity
w-aligned: a multiple of w
Size: an expression of the form b+w+r, where b, w, and r are non-negative integers, whose value is b+Aw+32⌊A/64⌋r
ρ: an undefined quantity of infinite width
[S]: a bit representing the truth of statement S; if S is true then [S] is one, otherwise it is zero
q← E: the assignment of expression E to quantity q; before assignment, E is truncated or zero-extended to make it the same width as q
Stack: a last-in-first-out stack whose items are said to be added to the top, and are numbered from one, counting from the bottom
s[i]: the ith item of stack s
s[i… j]: the stack consisting of items i to j inclusive of stack s
s⊕ E: the stack s with an extra item added whose value is E

3 State

Mite’s state consists of the following elements:

Flags f_Z, f_N, f_C, f_V: one bit each
Execution pointer EP: a word
Temporary register T: a word
Memory M: a quantity
Permutation functions p₈, p₁₆, p₃₂, p_A
Stack S: a stack
Frame stack F: a stack

An index into M is called an address.

The function p_w takes a quantity of width w and returns it with its bytes permuted.

F is a stack of pairs of naturals. FP is the index of the top-most item in F. F_S(i) and F_N(i) denote respectively the first and second component of the ith item of F. A stack position is a natural p in the range 1… F_N(FP).

S holds items of two sorts: a register is a word created by NEW(), and a chunk is a quantity of arbitrary size created by NEW(c) (see section 5.7). The stack items are held in M at word-aligned addresses.

S_p, where p is a stack position, denotes S[F_S(FP)+p−1]; &S_p denotes the address of S_p. SP is an abbreviation for F_S(FP)+F_N(FP)−1.

4 Program

An instruction consists of an operation and a tuple of operands. Each operand has a type, given by its name; a subscript is added to distinguish operands of the same type. The allowable instructions are given in section 5. The program P is an array of instructions; P[i] denotes the ith element of P.

The types are:

Stack position p: a stack position
Natural n: a non-negative integer
Register r: a stack position p such that S_p is a register, or T
Chunk c: a stack position p such that S_p is a chunk
Width w: a member of the set {8,16,32,A}
Size s: a size

5 Instructions

The state is transformed by repeatedly performing EP← EP+1 then the semantics of P[EP−1]. The semantics of each instruction are given below in terms of assignments to state elements, and other instructions; the operations are performed sequentially. An underlined expression is a predicate that must evaluate to true when the instruction is executed; otherwise the instruction has no effect.

Arithmetic is integral, performed on A-digit binary numbers using two’s complement interpretation. Quantities are evaluated with bit zero as the least significant digit.

The semantics of every instruction have the assignment T← ρ prepended, and also, for all instructions except branches (see section 5.4), the following:

f_Z← ρ
f_N← ρ
f_C← ρ
f_V← ρ

For branches, the four instructions above are added to the end of the instruction’s semantics.

5.1 Assignment

S_r₁← S_r₂
f_Z← [S_r₁=0]
f_N← [S_r₁<0] S_r← &S_c
f_Z← [S_r₁=0] T← S_r₁
S_r₁← S_r₂
S_r₂← T

5.2 Data processing

All the data processing instructions except MUL, DIVSZ and REMSZ have

f_Z← [S_r₁=0]

f_N← [S_r₁<0]

appended to the end of their semantics.

5.2.1 Arithmetic

S_r₁← −S_r₂
f_C← [S_r₁=0]
f_V← [S_r₁=−2^A−1] S_r₁← S_r₂+S_r₃
f_C← carry out of most significant bit
f_V← [signed overflow occurred] S_r₁← S_r₂−S_r₃
f_C← carry out of most significant bit
f_V← [signed overflow occurred] S_r₁← S_r₂× S_r₃ S_r₃≠ 0 S_r₁← S_r₂÷ S_r₃, treating S_r₂ and S_r₃ as unsigned, and rounding the quotient to 0 S_r₃≠ 0 S_r₁← S_r₂÷ S_r₃, treating S_r₂ and S_r₃ as signed, and rounding the quotient to −∞ S_r₃≠ 0 S_r₁← S_r₂÷ S_r₃, treating S_r₂ and S_r₃ as signed, and rounding the quotient to 0 DIV(T,r₂,r₃)
r₁← r₂−T× r₃ DIVS(T,r₂,r₃)
r₁← r₂−T× r₃ DIVSZ(T,r₂,r₃)
r₁← r₂−T× r₃

5.2.2 Logic

S_r₁← one’s complement of S_r₂ S_r₁← bitwise and of S_r₂ and S_r₃ S_r₁← bitwise or of S_r₂ and S_r₃ S_r₁← bitwise exclusive-or of S_r₂ and S_r₃ 0≤ S_r₃≤ A S_r₁← S_r₂ shifted left S_r₃ places
f_C← carry out of most significant bit, if S_r₃>0 0≤ S_r₃≤ A S_r₁← S_r₂ shifted right logically S_r₃ places
f_C← carry out of least significant bit, if S_r₃>0 0≤ S_r₃≤ A S_r₁← S_r₂ shifted right arithmetically S_r₃ places
f_C← carry out of least significant bit, if S_r₃>0

5.3 Memory

S_r₁← 0
S_r₁[0… w−1]← p_w(M[S_r₂… S_r₂+w−1]) M[S_r₂… S_r₂+w−1]← p_w⁻¹(S_r₁[0… w−1]) S_r₁+s≤ S_r₂ or S_r₂+s≤ S_r₁ M[S_r₁… S_r₁+s−1]← M[S_r₂… S_r₂+s−1] S_r+s≤ &S_c or &S_c+s≤ S_r M[S_r… S_r+s−1]← M[&S_c… &S_c+s−1] &S_c+s≤ S_r or S_r+s≤ &S_c M[&S_c… &S_c+s−1]← M[S_r… S_r+s−1] &S_c₁+s≤ &S_c₂ or &S_c₂+s≤ &S_c₁ M[&S_c₁… &S_c₁+s−1]← M[&S_c₂… &S_c₂+s−1]

5.4 Branch

EP← S_r f_Z=1 EP← S_r f_Z=0 EP← S_r f_N=1 EP← S_r f_N=0 EP← S_r f_C=1 EP← S_r f_C=0 EP← S_r f_V=1 EP← S_r f_V=0 EP← S_r f_C=1 and f_Z=0 EP← S_r f_C=0 or f_Z=1 EP← S_r f_N≠ f_V EP← S_r f_N=f_V EP← S_r f_Z=1 or f_N≠ f_V EP← S_r f_Z=0 and f_N=f_V EP← S_r

5.5 Call and return

NEW(s)
S[SP][o… o+A−1]← EP
EP← S_r
F[FP]← (F_S(FP),F_N(FP)−(p+1))
F← F⊕ (SP+1,p+1) EP← S_c[o… o+A−1]
KILL(c)
F← F[1… FP−2]⊕ (F_S(FP−1),F_N(FP−1)+F_N(FP))

o and s may vary between instructions, but should be the same for corresponding CALLs and RETs; s is at least A, and 0≤ o≤ s−A.

5.6 Catch and throw

S_r← FP EP← S_r₁
F← F[1… S_r₂]
S← S[1… SP−1]⊕ S_r₃

5.7 Stack

S← S⊕ ρ [0… A−1]
F[FP]← (F_S(FP),F_N(FP)+1) S← S⊕ ρ [0… s−1]
F[FP]← (F_S(FP),F_N(FP)+1) S[F_S(FP)+p−1… SP−1]← S[F_S(FP)+p… SP]
F[FP]← (F_S(FP),F_N(FP)−1)
S← S[1… SP]

This document was translated from L^AT_EX by H^EV^EA.

Last updated 2006/06/02