Semantica HL
Semantica HL
CCOM4029
SPRING 2024
Notes mostly from CMSC
and Norbert Zeh Winter,
Dalhousie University
Overview
Parse tree
Semantic analysis and
code generation
Symbol table
Abstract syntax tree or
other intermediate form
Machine-independent
code improvement
Modified
intermediate form
Back end
• How:
Two Approaches:
• Interleaved with syntactic analysis
• As a separate phase
• Formal mechanism:
• Attribute grammars
• Static semantic rules
• Attribute
o Enforced by compiler at compile
Grammars
time
o Example: Do not use undeclared • Abstract Syntax
variable. Tree
• Parse trees follow a grammar and usually have lots of useless nodes
• An abstract syntax tree (AST) eliminates useless structural nodes, using only those
nodes that correspond to constructs in the higher level programming language
• It’s much easier to interpret an AST or generate code from it
e + +
e + int
int e+ int e+ 3
e + int 3 int
e int 3 1 2
int 2 1 2
Definition:
• But the name after “procedure” has to be the same as the name after “end”
• Cannot capture this in a CFG (in practice) because there are too many names
• Solution: annotate parse tree nodes with attributes and add a “semantic”
rules or constraints to the syntactic rule in the grammar.
<proc> => procedure <prName>[1] <prBody> end <prName>[2] ;
<prName>[1].string == <prName>[2].string
Attribute Grammars
• Let X0 => X1 ... Xn be a grammar rule
• Functions of the form S(X0) = f(A(X1),...A(Xn) define synthesized attributes i.e.
attribute defined by a node’s children, passed up the tree.
• Attributes of LHS of production are computed from attributes of RHS of
production.
• Functions of the form I(Xj) = f(A(X0),…A(Xn)) for i <= 0 <= n define inherited
attributes i.e., attribute defined by parent and siblings, passed down the tree
• Attributes flow from left to right:
•From LHS to RHS,
•From symbols on RHS to symbols later on the RHS.
• Initially, there are intrinsic attributes on the leaves, somehow predefined, e.g. the
numerical value of integer constants.
Attribute Grammars: Example
E → E+ T E1 → E2 + T {E1.val = add(E2.val, T.val) }
E→ E−T E1 → E2 −T {E1.val = sub(E2.val, T.val) }
E→ T E→ T {E.val = T.val }
T→ T∗ F T1 → T2 ∗ F {T1.val = mul(T2.val, F.val) }
T → T/ F T1 → T2 / F {T1.val = div(T2.val, F.val) }
T→ F T→ F {T.val = F.val }
F → −F F1 → −F2 {F1.val = neg(F2.val) }
F → ( E) F → ( E) {F.val = E.val }
F → const F → const {F.val = const.val }
Attribute Grammars
Example: expressions of the form id + id
•id's can be either int_type or real_type
• types of the two id's must be the same
• type of the expression must match its expected type
Attributes:
actual_type - synthesized for <var> and <expr>
expected_type - inherited for <expr>
Attribute Grammars
1. Syntax rule:
<expr> -> <var>[1] + <var>[2]
Semantic rules:
<expr>.actual_type ¬ <var>[1].actual_type
Predicate:
<var>[1].actual_type == <var>[2].actual_type
<expr>.expected_type == <expr>.actual_type
Compilers usually maintain a
2. Syntax rule: “symbol table” where they
record the names of proce-
<var> -> id dures and variables along
with type information.
Semantic rule: Looking up this information
<var>.actual_type ¬ lookup_type (id, <var>) in the symbol table is a com-
mon operation.
Attribute Grammars
How are attribute values computed?
•If all attributes were inherited, the tree could be
decorated in top-down order
•If all attributes were synthesized, the tree could be
decorated in bottom-up order
•In many cases, both kinds of attributes are used, and
some combination of top-down and bottom-up is
used. May need multiple “passes” to evaluate the AG
Attribute Grammars
3 A 3 B C
3
2 A a 2 B b 2 C c
1 A a 1 B b 1 C c
a b c
Inherited Attributes: Example
Same language:
L = {anbncn | n > 0} = {abc, aabbcc, aaabbbccc,...}
S → ABC {B.inhCount = A.count; C.inhCount = A.count}
A→ a {A.count = 1}
A1 → A2 a {A1.count = A2.count + 1}
B→ b {Condition : B.inhCount = 1}
B1 → B2 b {B2.inhCount = B1.inhCount − 1}
C→ c {Condition : C.inhCount = 1}
C1 → C2 c {C2.inhCount = C1.inhCount − 1}
Inherited Attributes: Parse Tree
• Input: aaabbccc
S
3 A 3 B C
3
2 A a 2̸
= 1 B b 2 C c
1 A a c
b 1=1 C
a c
Attribute Grammar:
Summary
• AGs are a practical extension to CFGs that allow us to annotate
the parse tree with information needed for semantic processing
• E.g., interpretation or compilation
• We call the annotated tree an abstract syntax tree
• It no longer just reflects the derivation
• AGs can move information from anywhere in the abstract syntax
tree to anywhere else, in a controlled way
• Needed for non-local syntactic dependencies and for
semantics
Static vs. Dynamic Semantics
• Attribute grammars are an example of static semantics (e.g., type
checking) that don’t reason about how things change when a
program is executed
• But understanding what a program means often requires
reasoning about how, for example, a variable’s value changes
• Dynamic semantics tries to capture this
• E.g., proving that an array index will never be out of its intended
range
Dynamic Semantics
<dec_num> ® 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
| <dec_num> (0|1|2|3|4|5|6|7|8|9)