4 releases
0.2.2 | May 10, 2024 |
---|---|
0.2.1 | Apr 20, 2024 |
0.1.1 | Apr 20, 2024 |
0.1.0 | Apr 20, 2024 |
#75 in Programming languages
91KB
2K
SLoC
impc
A compiler for the IMP programming language. Currently,
impc run <program>
will parse the given program and invoke a valid tree-walk interpreter; in this condition it is ready to use for the relatively simple task of executing IMP programs. The development focus has shifted to implementing a minimal optimising backend targeting LLVM, which should supersede the interpreter when feature-complete.
Installation and Usage
Assuming you have a Rust toolchain installed on your system (if not, follow these instructions), then impc
can be installed using cargo install
.
# from crates.io
cargo install impc
# from source
cargo install --git https://github.com/eikopf/impc.git
With impc
installed, run impc --help
and impc run --help
for the specific details of its usage. To run an IMP program, you can use the impc run <filename>
command and optionally provide explicit bindings with the --let
option if you want to avoid the interactive prompt stage.
IMP
IMP is, largely speaking, a deeply uninteresting language; its entire grammar can be given in a single 30-line ANTLR4 .g4
file (see below), and its semantics are incredibly simple.
In IMP,
- all variables are always in scope, and must have bound values prior to execution;
- arithmetic expressions are evaluated with unsigned integer semantics, with the exception that subtraction saturates at 0;
- arithmetic expressions are always defined, so integer overflow is undefined behaviour;
- because integer overflow is UB and integers have no specified bitwidth, a strictly correct implementation should use arbitrary precision integers unless it can show that overflow cannot occur for some fixed-width type;
- boolean expressions have the ordinary boolean semantics;
- both boolean and arithmetic expressions do not define an order of evaluation, since they cannot produce side-effects;
- commands are executed in-order, and only assignment commands can yield side-effects (namely value-binding);
while
andif
commands behave with the expected semantics.
These properties admit the single interesting characteristic of IMP: it is can be aggressively optimised. In general, commands can be arbitrarily reordered so long as the order of assignment commands is preserved, and even then it can often be shown that arbitrary reorderings of assignment commands are valid (i.e. if they are disjoint). In more complex situations, these properties can be used to unroll, inline, or almost entirely erase while
and if
commands.
grammar IMP;
aexp : INT #Atom
| VAR #Variable
| '(' inner=aexp ')' #Brackets
| left=aexp '*' right=aexp #Mult
| left=aexp '+' right=aexp #Add
| left=aexp '-' right=aexp #Sub
;
bexp : 'true' #True
| 'false' #False
| left=aexp '=' right=aexp #Equal
| left=aexp '<' right=aexp #Smaller
| left=aexp '>' right=aexp #Greater
| left=aexp '<>' right=aexp #Inequality
| 'not' inner=bexp #Not
| '(' left=bexp 'and' right=bexp ')' #And
| '(' left=bexp 'or' right=bexp ')' #Or
;
cmd : 'skip' #Skip
| variable=VAR ':=' expression=aexp #Assignment
| first=cmd ';' second=cmd #Sequence
| 'if' condition=bexp 'then' truecase=cmd 'else' falsecase=cmd 'fi' #If
| 'while' condition=bexp 'do' body=cmd 'od' #While
;
VAR : [a-zA-Z][a-zA-Z0-9_]* ;
INT : [0-9]+ ;
WS : [ \r\n\t] -> skip ;
Dependencies
~7–15MB
~174K SLoC