Macro Processors: Basic Function Machine-Independent Features Design Options Implementation Examples

Q: Discuss the role of DEFTAB, NAMTAB, and ARGTAB in the macro processing algorithm.

DEFTAB, NAMTAB, and ARGTAB are the central data structures in macro processing. DEFTAB stores the complete definition of each macro, including the prototype and the body, with parameter references converted to positional notation so that arguments can be substituted efficiently. NAMTAB holds the macro names and acts as an index to DEFTAB, keeping pointers to the start and end of each definition. ARGTAB stores the arguments of a macro invocation in positional order so that they can be substituted for the corresponding parameters during expansion.

Q: What problems might arise from using labels directly within the macro body and how can they be effectively addressed?

Using labels directly within a macro body can lead to label duplication errors because the same label might be generated multiple times if the macro is invoked in different places. One solution is to avoid using labels in the macro body altogether. Alternatively, programmers can use PC-relative addressing, although this approach is inconvenient and error-prone. A better solution is to let the macro processor generate unique labels by prefixing them with a special character and inserting a unique identifier, ensuring each macro expansion has distinct labels.

Q: What implementation strategies can facilitate recursive macro processing, and why are traditional methods inadequate?

Recursive macro processing is easier to implement when the macro processor is written in a language that allows recursive calls, because local variables are then preserved on the call stack. The basic design is inadequate because it relies on global state: an inner expansion overwrites the invocation arguments of the outer one, and the expansion flag is set to FALSE as soon as the inner expansion finishes. Either an explicit stack for local variables and state or an implementation language that supports recursion resolves these issues and enables recursive macro invocation and expansion.

Q: How does the concept of macro expansion differ between a macro and a subroutine call, particularly in terms of the code generated in a program?

In macro expansion, each time a macro is invoked, the statements comprising the macro body are expanded and included in the program. This means that for each macro invocation, the expanded code appears in the final program. On the other hand, a subroutine call results in the subroutine code being written only once, regardless of how many times it is called, which reduces code duplication and program size.

Q: How does a two-pass macro processor handle macro definitions and expansions, and what limitations does this approach have?

A two-pass macro processor first processes all macro definitions in pass one, storing them for later use. In pass two, it expands all macro invocation statements using these stored definitions. However, this method cannot handle recursive macro definitions, in which the body of one macro contains definitions of other macros, because all macros would have to be defined in pass one before any invocation is expanded. Such definitions require a processor that alternates between defining and expanding macros, which a strict two-pass approach does not support.

Q: What are the advantages of using keyword parameters in macros compared to positional parameters?

Keyword parameters provide several advantages over positional parameters. They allow arguments to appear in any order, eliminating the need for null arguments to maintain sequence. This makes the macro invocation clearer and less error-prone, especially when a macro has a large number of parameters and only a few are set. Clarity and maintainability also improve because each argument is documented by its parameter name.

Q: Describe how macro-time conditional structures enhance the functionality of a macro language.

Macro-time conditional structures, such as IF-ELSE-ENDIF and WHILE-ENDW, enhance macro language functionality by allowing the macro to include or exclude certain blocks of code based on conditions evaluated at macro expansion time. This ensures that decisions about which code segments to include are made before the program is assembled. It lets the generated code be customized according to the arguments supplied and adds to the overall flexibility and power of macros.

Q: Explain the significance of macro generation of unique labels and how it enhances program safety and maintainability.

Generating unique labels within macro expansions avoids the problem of duplicate labels, which cause assembler errors when a macro is used multiple times. By inserting a unique identifier into each label, a macro processor ensures that every macro expansion produces distinct labels. This prevents conflicts and improves both safety, by avoiding label-related errors, and maintainability, by allowing the programmer to use descriptive labels within macros without concern for duplication.

Q: How does conditional macro expansion differ from conditional jump instructions, and what are the implications of this difference?

Conditional macro expansion and conditional jump instructions differ primarily in when the condition is evaluated. Conditional macro expansion evaluates conditions while macros are being expanded, so the decision to include or exclude a code segment has already been made by the time the program is assembled. In contrast, conditional jump instructions evaluate conditions during program execution, affecting runtime control flow. With macro expansion the sequence of source statements is therefore fixed before execution, whereas conditional jumps allow decisions based on runtime data values.

Q: What are the potential disadvantages of using integrated macro processors within language translators?

Integrated macro processors within language translators have several disadvantages. They must be specially designed to work with a particular assembler or compiler implementation, which increases development complexity. The cost of developing the macro processor is added to the cost of the language translator, making the software more expensive. Additionally, the assembler or compiler itself becomes considerably larger and more complex.

The document discusses macro processors and their basic functions, design options, and implementation. A macro instruction allows shorthand notation for commonly used code sequences. It is expanded by replacing it with the corresponding statements. Macro processors perform macro definition, invocation, and expansion through techniques like concatenation of parameters, generation of unique labels, and conditional expansion based on arguments. They use data structures like definition and name tables and a one-pass algorithm to process macros recursively in both definition and invocation.

Uploaded by

Charan Saini

Macro Processors

Topics:
- Basic Function
- Machine-Independent Features
- Design Options
- Implementation Examples

Macro Instructions

A macro instruction (macro):
- is simply a notational convenience that lets the programmer write a shorthand version of a program;
- represents a commonly used group of statements in the source program;
- is replaced by the macro processor with the corresponding group of source language statements. This operation is called expanding the macro.

For example, suppose it is necessary to save the contents of all registers before calling a subroutine. This requires a sequence of instructions. We can define and use a macro, SAVEREGS, to represent this sequence of instructions.

Macro Processors

A macro processor:
- essentially substitutes one group of characters or lines for another;
- normally performs no analysis of the text it handles;
- is not concerned with the meaning of the statements involved during macro expansion.

Therefore, the design of a macro processor is generally machine independent.

Macro processors are used in:
- assembly languages
- high-level programming languages, e.g., C or C++
- OS command languages
- general-purpose use

Basic Functions

- Macro Definition
- Macro Invocation
- Macro Expansion
- One-Pass Algorithm
- Data Structure

Macro Definition

Two new assembler directives are used in macro definition:
- MACRO: identifies the beginning of a macro definition
- MEND: identifies the end of a macro definition

Prototype (pattern) for the macro; each parameter begins with &:

    label    op       operands
    name     MACRO    parameters
               :
             body
               :
             MEND

Body: the statements that will be generated as the expansion of the macro.

(Figure: Example of Macro Definition)

Macro Invocation

A macro invocation statement (a macro call) gives the name of the macro instruction being invoked and the arguments to be used in expanding the macro.

The processes of macro invocation and subroutine call are quite different:
- Statements of the macro body are expanded each time the macro is invoked.
- Statements of the subroutine appear only once, regardless of how many times the subroutine is called.

Example of Macro Invocation

Macro Expansion

Each macro invocation statement is expanded into the statements that form the body of the macro. Arguments from the macro invocation are substituted for the parameters in the macro prototype.

- The arguments and parameters are associated with one another according to their positions. The first argument in the macro invocation corresponds to the first parameter in the macro prototype, and so on.
- Comment lines within the macro body are deleted, but comments on individual statements are retained.
- The macro invocation statement itself is included as a comment line.
- The label on the macro invocation statement (CLOOP in the example) is retained as a label on the first statement generated in the macro expansion. This allows the programmer to use a macro instruction in exactly the same way as an assembler language mnemonic.

(Figure: Example of Macro Expansion)
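The expansion rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `expand_invocation`, the tuple layout for statements, and the sample RDBUFF-like body are hypothetical and not taken from the original figures.

```python
def expand_invocation(params, body, invocation):
    """Expand one macro invocation by positional substitution.

    params:     parameter names from the prototype, e.g. ["&INDEV", ...]
    body:       list of (label, opcode, operand) tuples (hypothetical layout)
    invocation: (label, macro_name, [arguments])
    """
    label, name, args = invocation
    # Pair parameters with arguments by position; substitute longer names
    # first so that a short name never clobbers part of a longer one.
    binding = sorted(zip(params, args), key=lambda pair: -len(pair[0]))
    # The invocation itself is kept as a comment line ('.' starts a comment).
    out = [f".{label:<8}{name:<8}{','.join(args)}"]
    for i, (stmt_label, opcode, operand) in enumerate(body):
        for param, arg in binding:
            operand = operand.replace(param, arg)
        if i == 0 and not stmt_label:
            stmt_label = label  # invocation label moves to the first statement
        out.append(f"{stmt_label:<8}{opcode:<8}{operand}".rstrip())
    return out


sample_params = ["&INDEV", "&BUFADR", "&RECLTH"]
sample_body = [("", "CLEAR", "X"), ("", "TD", "=X'&INDEV'"), ("", "STX", "&RECLTH")]
for text in expand_invocation(sample_params, sample_body,
                              ("CLOOP", "RDBUFF", ["F1", "BUFFER", "LENGTH"])):
    print(text)
```

The sketch substitutes text inside operands only; a real processor would also handle labels and comments on individual statements.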

No Label in the Body of Macro

Problem with labels in the body of a macro: if the same macro is expanded multiple times at different places in the program, there will be duplicate labels, which the assembler treats as errors.

Solutions:
- Simply do not use labels in the body of a macro.
- Explicitly use PC-relative addressing instead. For example, in the RDBUFF and WRBUFF macros:
      JEQ   *+11
      JLT   *-14
  This is inconvenient and error-prone.
- Is there a better solution?

Two-Pass Macro Processor

A two-pass macro processor works as follows:
- Pass 1: process macro definitions.
- Pass 2: expand all macro invocation statements.

Problem: this kind of macro processor cannot allow recursive macro definition, that is, a macro whose body contains definitions of other macros, because all macros would have to be defined during the first pass before any macro invocations were expanded.

(Figure: Example of Recursive Macro Definition)

- MACROS (for SIC) contains the definitions of RDBUFF and WRBUFF written in SIC instructions.
- MACROX (for SIC/XE) contains the definitions of RDBUFF and WRBUFF written in SIC/XE instructions.
- A program that is to be run on a SIC system can invoke MACROS, whereas a program to be run on SIC/XE can invoke MACROX.
- Defining MACROS or MACROX does not define RDBUFF and WRBUFF. These definitions are processed only when an invocation of MACROS or MACROX is expanded.

One-Pass Macro Processor

A one-pass macro processor that alternates between macro definition and macro expansion in a recursive way is able to handle recursive macro definition. Because of the one-pass structure, the definition of a macro must appear in the source program before any statements that invoke that macro.

Data Structures

DEFTAB (definition table):
- Stores the macro definition, including the macro prototype and the macro body.
- Comment lines are omitted.
- References to the macro instruction parameters are converted to a positional notation for efficiency in substituting arguments.

NAMTAB (name table):
- Stores macro names.
- Serves as an index to DEFTAB, holding pointers to the beginning and the end of each macro definition.

ARGTAB (argument table):
- Stores the arguments of a macro invocation according to their positions in the argument list.
- As the macro is expanded, arguments from ARGTAB are substituted for the corresponding parameters in the macro body.

(Figure: Data Structures)
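A minimal Python sketch of how the three tables might cooperate is given here. The `?n` positional notation, the function names `define_macro` and `expand_macro`, and the use of a flat list for DEFTAB are assumptions made for illustration; the source does not fix a concrete representation.

```python
DEFTAB = []   # flat list of stored lines: prototype, body lines, MEND
NAMTAB = {}   # macro name -> (index of prototype, index of MEND) in DEFTAB


def define_macro(name, params, body_lines):
    """Enter a definition, converting &PARAM references to ?n notation."""
    start = len(DEFTAB)
    DEFTAB.append(f"{name} {','.join(params)}")  # prototype line
    # Replace longer names first so &AB is not read as &A followed by B.
    ordered = sorted(enumerate(params, start=1), key=lambda p: -len(p[1]))
    for line in body_lines:
        if line.startswith("."):  # comment lines are omitted from DEFTAB
            continue
        for pos, param in ordered:
            line = line.replace(param, f"?{pos}")
        DEFTAB.append(line)
    DEFTAB.append("MEND")
    NAMTAB[name] = (start, len(DEFTAB) - 1)


def expand_macro(name, args):
    """Load ARGTAB from the invocation, then substitute while copying."""
    argtab = list(args)  # argtab[0] holds argument 1
    start, end = NAMTAB[name]
    out = []
    for line in DEFTAB[start + 1:end]:  # skip the prototype and MEND
        # Highest positions first so that ?10 is handled before ?1.
        for pos in range(len(argtab), 0, -1):
            line = line.replace(f"?{pos}", argtab[pos - 1])
        out.append(line)
    return out
```

Because the body is stored with positions rather than names, expansion needs only an index into ARGTAB and never has to search for parameter names.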

Algorithm

MAIN procedure:
- iterations of GETLINE and PROCESSLINE

PROCESSLINE procedure, which does one of the following:
- DEFINE
- EXPAND
- output the source line

DEFINE procedure:
- makes appropriate entries in DEFTAB and NAMTAB

EXPAND procedure:
- sets up the argument values in ARGTAB
- expands a macro invocation statement (as in the MAIN procedure) through iterations of GETLINE and PROCESSLINE

GETLINE procedure:
- gets the next line to be processed, from the input file or from DEFTAB

Handling Recursive Macro Definition

In the DEFINE procedure:
- When a macro definition is being entered into DEFTAB, the normal approach is to continue until an MEND directive is reached.
- This would not work for recursive macro definition because the first MEND encountered in the inner macro would terminate the whole macro definition process.
- To solve this problem, a counter LEVEL is used to keep track of the level of macro definitions:
  - Increase LEVEL by 1 each time a MACRO directive is read.
  - Decrease LEVEL by 1 each time a MEND directive is read.
  - A MEND terminates the whole macro definition process only when LEVEL reaches 0.
- This process is very much like matching left and right parentheses when scanning an arithmetic expression.

(Figure: Algorithm)
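The LEVEL counter described above can be sketched as follows. The function name `read_definition` and the whitespace-separated line format are assumptions for illustration; detecting MACRO and MEND by scanning all fields is a simplification, since a real processor would look only at the opcode field.

```python
def read_definition(lines):
    """Collect one macro definition body, honoring nested MACRO/MEND pairs.

    `lines` starts just after the outer MACRO prototype line. Returns the
    body lines (inner definitions included) and the number of lines read.
    """
    level = 1  # the outer MACRO directive has already been read
    body = []
    for consumed, line in enumerate(lines, start=1):
        fields = line.split()
        if "MACRO" in fields:
            level += 1
        elif "MEND" in fields:
            level -= 1
            if level == 0:  # only the matching MEND ends the definition
                return body, consumed
        body.append(line)
    raise ValueError("missing MEND for macro definition")
```

Inner MACRO and MEND lines are stored as part of the outer body, so the inner macros become defined only when the outer macro is later expanded.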

Machine Independent Macro Processor Features

- Concatenation of Macro Parameters
- Generation of Unique Labels
- Conditional Macro Expansion
- Keyword Macro Parameters

Concatenation of Macro Parameters

Parameters can be concatenated with other character strings. For example:
- A program contains several series of variables: XA1, XA2, XA3, ..., XB1, XB2, XB3, ...
- The programmer wants to write a macro to process each series of variables.
- The programmer specifies the series of variables to be operated on (A, B, ...).
- The macro processor constructs the symbols by concatenating X, (A, B, ...), and (1, 2, 3, ...) in the macro expansion.

Suppose such a parameter is named &ID. The macro body may contain the statement LDA X&ID1, in which &ID is concatenated after the string X and before the string 1:

    LDA   XA1    (&ID=A)
    LDA   XB1    (&ID=B)

Ambiguity problem: X&ID1 may mean either
- X + &ID + 1, or
- X + &ID1.

This problem occurs because the end of the parameter is not marked.

Solution to this ambiguity problem: use a special concatenation operator, ->, to specify the end of the parameter:

    X&ID->1

(Figure: Example of Concatenation)
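One way to implement the concatenation operator is with a regular expression that consumes an optional `->` after each parameter name. This is a hedged sketch: the function name `substitute`, the dictionary of arguments, and the rule that parameter names consist of uppercase letters and digits are assumptions, not details from the source.

```python
import re

# A parameter is & followed by a name; a trailing -> marks where the name ends.
PARAM = re.compile(r"&([A-Z][A-Z0-9]*)(?:->)?")


def substitute(line, args):
    """Replace each &NAME with its argument and drop the -> end marker."""
    def replace(match):
        name = match.group(1)
        # Unknown names are left untouched rather than guessed at.
        return args[name] if name in args else match.group(0)
    return PARAM.sub(replace, line)


print(substitute("LDA X&ID->1", {"ID": "A"}))
print(substitute("LDA X&ID1", {"ID": "A"}))
```

Without the operator the scanner reads the longest possible name, so `X&ID1` is taken as the parameter `&ID1`, which is exactly the ambiguity the operator removes.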

Generation of Unique Labels

- Labels in the macro body may cause a duplicate-label problem if the macro is invoked and expanded multiple times.
- Use of relative addressing at the source statement level is very inconvenient, error-prone, and difficult to read.
- It is highly desirable to
  - let the programmer use labels in the macro body (labels used within the macro body begin with $), and
  - let the macro processor generate unique labels for each macro invocation and expansion.
- During macro expansion, the $ is replaced with $xx, where xx is a two-character alphanumeric counter of the number of macro instructions expanded: xx = AA, AB, AC, ...

(Figure: Labels Defined in Macro Body)

Unique Labels within Macro Expansion
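The $xx scheme might be sketched like this. The class name `LabelGenerator` is hypothetical, and the counter here runs over letters only (AA through ZZ) as a simplification of the two-character alphanumeric counter; the sketch also replaces every `$` in a line, which assumes `$` never appears inside a literal.

```python
from itertools import product
from string import ascii_uppercase


def label_suffixes():
    """Yield AA, AB, AC, ... with one suffix per macro expansion."""
    for first, second in product(ascii_uppercase, repeat=2):
        yield first + second


class LabelGenerator:
    def __init__(self):
        self._suffixes = label_suffixes()

    def expand(self, body):
        """Return the body with every $ replaced by $xx for this expansion."""
        xx = next(self._suffixes)
        return [line.replace("$", "$" + xx) for line in body]


gen = LabelGenerator()
macro_body = ["$LOOP TD =X'F1'", "      JEQ $LOOP"]
print(gen.expand(macro_body))
print(gen.expand(macro_body))
```

Each call to `expand` consumes one suffix, so two expansions of the same body never share a label.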

Conditional Macro Expansion

Arguments in a macro invocation can be used to:
- substitute for the parameters in the macro body without changing the sequence of statements expanded;
- modify the sequence of statements for conditional macro expansion (or conditional assembly, when related to an assembler). This capability adds greatly to the power and flexibility of a macro language.

Macro-time conditional structures:
- IF-ELSE-ENDIF
- WHILE-ENDW

(Figure: Example of Conditional Macro Expansion)

Two additional parameters are used in the example of conditional macro expansion:
- &EOR: specifies a hexadecimal character code that marks the end of a record
- &MAXLTH: specifies the maximum length of a record

A macro-time variable (set symbol):
- can be used to store working values during macro expansion, to store the result of evaluating a Boolean expression, and to control the macro-time conditional structures;
- is any symbol that begins with & and is not a macro instruction parameter;
- is initialized to a value of 0;
- is set by a macro processor directive, SET.

Implementation of Conditional Macro Expansion (IF-ELSE-ENDIF Structure)

A symbol table:
- This table contains the values of all macro-time variables used.
- Entries in this table are made or modified when SET statements are processed.
- This table is used to look up the current value of a macro-time variable whenever it is required.

When an IF statement is encountered during the expansion of a macro, the specified Boolean expression is evaluated:
- TRUE: the macro processor continues to process lines from DEFTAB until it encounters the next ELSE or ENDIF statement. If ELSE is encountered, it then skips to ENDIF.
- FALSE: the macro processor skips ahead in DEFTAB until it finds the next ELSE or ENDIF statement.

Conditional Macro Expansion vs. Conditional Jump Instructions

- The testing of the Boolean expression in IF statements occurs at the time macros are expanded. By the time the program is assembled, all such decisions have been made. There is only one sequence of source statements during program execution.
- In contrast, the COMPR instruction tests data values during program execution. The sequence of statements that are executed during program execution may be different.
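The IF-ELSE-ENDIF handling described above can be sketched as a small state machine. Everything concrete here is an assumption for illustration: the function name `expand_conditional`, the three-token condition form `IF a EQ|NE b`, whole-token substitution, and the use of the word NONE to stand for an omitted argument. Nested IF structures are not handled.

```python
def expand_conditional(body, args):
    """Expand a macro body containing non-nested IF / ELSE / ENDIF and SET."""
    symtab = {}      # macro-time variables; missing entries read as 0
    out = []
    copying = True   # False while skipping lines of an untaken branch

    def value(token):
        if token in args:                 # macro parameter
            return args[token]
        if token.startswith("&"):         # macro-time variable
            return str(symtab.get(token, 0))
        return token                      # ordinary text

    for line in body:
        fields = line.split()
        if fields[0] == "IF":
            left, op, right = value(fields[1]), fields[2], value(fields[3])
            copying = (left == right) if op == "EQ" else (left != right)
        elif fields[0] == "ELSE":
            copying = not copying         # switch to the other branch
        elif fields[0] == "ENDIF":
            copying = True                # resume normal expansion
        elif copying:
            if len(fields) == 3 and fields[1] == "SET":
                symtab[fields[0]] = value(fields[2])
            else:
                out.append(" ".join(value(tok) for tok in fields))
    return out


demo_body = ["IF &EOR NE NONE", "&EORCK SET 1", "ENDIF", "CLEAR X",
             "IF &EORCK EQ 1", "COMP &EOR", "ELSE", "COMP ZERO", "ENDIF"]
print(expand_conditional(demo_body, {"&EOR": "04"}))
```

All IF tests are resolved while the body is copied, so the output contains only ordinary statements and no trace of the conditional directives.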

WHILE-ENDW Structure

Implementation of Conditional Macro Expansion (WHILE-ENDW Structure)

When a WHILE statement is encountered during the expansion of a macro, the specified Boolean expression is evaluated:
- TRUE: the macro processor continues to process lines from DEFTAB until it encounters the next ENDW statement. When ENDW is encountered, the macro processor returns to the preceding WHILE, re-evaluates the Boolean expression, and takes action again.
- FALSE: the macro processor skips ahead in DEFTAB until it finds the next ENDW statement and then resumes normal macro expansion.
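A comparable sketch for WHILE-ENDW uses an index into the stored body so that ENDW can jump back to the preceding WHILE. The function name `expand_while`, the single supported test `WHILE &VAR LE n`, and SET expressions built only from `+` are assumptions chosen to keep the example short; nested loops are not handled.

```python
def expand_while(body):
    """Expand a body containing one non-nested WHILE &VAR LE n ... ENDW loop."""
    symtab = {}          # macro-time variables; missing entries read as 0
    out = []
    pc = 0               # index of the next body line to process
    loop_start = None
    while pc < len(body):
        fields = body[pc].split()
        if fields[0] == "WHILE":
            if int(symtab.get(fields[1], 0)) <= int(fields[3]):
                loop_start = pc               # remember where to return
                pc += 1
            else:
                pc = body.index("ENDW", pc) + 1   # skip past the loop
        elif fields[0] == "ENDW":
            pc = loop_start                   # go back and re-test
        elif len(fields) == 3 and fields[1] == "SET":
            terms = fields[2].split("+")
            symtab[fields[0]] = sum(
                int(symtab.get(t, 0)) if t.startswith("&") else int(t)
                for t in terms)
            pc += 1
        else:
            line = body[pc]
            # Longer names first so one variable cannot clobber another.
            for name in sorted(symtab, key=len, reverse=True):
                line = line.replace(name, str(symtab[name]))
            out.append(line)
            pc += 1
    return out


print(expand_while(["&CTR SET 1", "WHILE &CTR LE 3", "LDA X&CTR",
                    "&CTR SET &CTR+1", "ENDW", "RSUB"]))
```

The loop runs entirely at macro-expansion time: the generated program contains three separate LDA statements and no loop.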

Keyword Macro Parameters

Positional parameters:
- Parameters and arguments are associated according to their positions in the macro prototype and invocation.
- If an argument is to be omitted, a null argument must be used to maintain the proper order in the macro invocation. For example: GENER ,,DIRECT,,,,,,3
- This is not suitable if a macro has a large number of parameters and only a few of these are given values in a typical invocation.

Keyword parameters:
- Each argument value is written with a keyword that names the corresponding parameter.
- Arguments may appear in any order.
- Null arguments no longer need to be used. For example: GENER TYPE=DIRECT,CHANNEL=3
- This is easier to read and much less error-prone than the positional method.
- Parameters can be given default values.

(Figure: Example of Keyword Parameters)
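Binding keyword arguments to parameters with defaults might look like the sketch below. The function name `bind_keywords` and the prototype string in the usage line are illustrative assumptions, and the simple comma split assumes that no argument value itself contains a comma.

```python
def bind_keywords(prototype_params, invocation_args):
    """Build the parameter table for one invocation.

    prototype_params: e.g. "&INDEV=F1,&BUFADR=,&RECLTH=,&EOR=04"
    invocation_args:  e.g. "RECLTH=LENGTH,BUFADR=BUFFER"
    """
    table = {}
    for item in prototype_params.split(","):
        name, _, default = item.partition("=")
        table[name.lstrip("&")] = default     # start from the default value
    for item in invocation_args.split(","):
        if not item:                          # invocation with no arguments
            continue
        name, _, value = item.partition("=")
        if name not in table:
            raise KeyError(f"unknown keyword parameter {name}")
        table[name] = value                   # order of arguments is irrelevant
    return table


print(bind_keywords("&INDEV=F1,&BUFADR=,&RECLTH=,&EOR=04",
                    "RECLTH=LENGTH,BUFADR=BUFFER"))
```

Parameters that the invocation does not mention simply keep their defaults, which is why null arguments are no longer needed.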

Macro Processor Design Options

- Recursive Macro Expansion
- General-Purpose Macro Processors
- Macro Processing within Language Translators

Recursive Macro Expansion

RDCHAR:
- reads one character from a specified device into register A
- should be defined beforehand (i.e., before RDBUFF)

Implementation of Recursive Macro Expansion

The previous macro processor design cannot handle this kind of recursive macro invocation and expansion, e.g., RDBUFF BUFFER, LENGTH, F1.

Reasons:
- The procedure EXPAND would be called recursively, so the invocation arguments in ARGTAB would be overwritten.
- The Boolean variable EXPANDING would be set to FALSE when the inner macro expansion is finished; that is, the macro processor would forget that it had been in the middle of expanding an outer macro.
- A similar problem would occur with PROCESSLINE, since this procedure too would be called recursively.
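The overwriting problem disappears when each expansion keeps its own argument table, which happens naturally if the implementation language supports recursion. The following Python sketch illustrates the idea; the function name `expand_recursive`, the dictionary of macros, and the two tiny macro bodies are hypothetical stand-ins for RDCHAR and RDBUFF, and there is no guard against a macro that invokes itself without end.

```python
SAMPLE_MACROS = {
    "RDCHAR": (["&IN"], ["TD =X'&IN'", "RD =X'&IN'"]),
    "RDBUFF": (["&BUFADR", "&RECLTH", "&INDEV"],
               ["CLEAR X", "RDCHAR &INDEV", "STCH &BUFADR,X"]),
}


def expand_recursive(name, args, macros):
    """Expand `name`, recursively expanding invocations found in its body."""
    params, body = macros[name]
    # argtab is local to this call, so an inner expansion cannot overwrite it.
    argtab = dict(zip(params, args))
    out = []
    for line in body:
        for param, arg in argtab.items():
            line = line.replace(param, arg)
        opcode, _, operands = line.partition(" ")
        if opcode in macros:
            # Inner invocation: the language's call stack saves our state.
            out.extend(expand_recursive(opcode, operands.split(","), macros))
        else:
            out.append(line)
    return out


print(expand_recursive("RDBUFF", ["BUFFER", "LENGTH", "F1"], SAMPLE_MACROS))
```

After the inner RDCHAR expansion returns, the outer call still has its own `argtab`, so `&BUFADR` in the following line is substituted correctly.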

Solutions:
- Write the macro processor in a programming language that allows recursive calls, so that local variables are retained.
- Use a stack to take care of pushing and popping local variables and return addresses.

Another problem: can a macro invoke itself recursively?

General-Purpose Macro Processors

Goal: macro processors that do not depend on any particular programming language, but can be used with a variety of different languages.

Pros:
- Programmers do not need to learn many macro languages.
- Although the development costs are somewhat greater than those for a language-specific macro processor, this expense does not need to be repeated for each language, which saves substantial overall cost.

Cons: a large number of details must be dealt with in a real programming language:
- situations in which normal macro parameter substitution should not occur, e.g., comments
- facilities for grouping together terms, expressions, or statements
- tokens, e.g., identifiers, constants, operators, keywords
- syntax

Macro Processing within Language Translators

Macro processors can be:
- Preprocessors, which
  - process macro definitions,
  - expand macro invocations, and
  - produce an expanded version of the source program, which is then used as input to an assembler or compiler.
- Line-by-line macro processors, used as a sort of input routine for the assembler or compiler, which
  - read the source program,
  - process macro definitions and expand macro invocations, and
  - pass output lines to the assembler or compiler.
- Integrated macro processors.

Line-by-Line Macro Processor

Benefits:
- It avoids making an extra pass over the source program.
- Data structures required by the macro processor and the language translator can be combined (e.g., OPTAB and NAMTAB).
- Utility subroutines can be used by both the macro processor and the language translator:
  - scanning input lines
  - searching tables
  - data format conversion
- It is easier to give diagnostic messages related to the source statements.

Integrated Macro Processor

An integrated macro processor can potentially make use of any information about the source program that is extracted by the language translator.

As an example in FORTRAN:
- DO 100 I = 1,20 is a DO statement:
  - DO: keyword
  - 100: statement number
  - I: variable name
- DO 100 I = 1 is an assignment statement:
  - DO100I: variable (blanks are not significant in FORTRAN)

An integrated macro processor can support macro instructions that depend upon the context in which they occur.

Drawbacks of Line-by-Line or Integrated Macro Processors

- They must be specially designed and written to work with a particular implementation of an assembler or compiler.
- The cost of macro processor development is added to the cost of the language translator, which results in more expensive software.
- The assembler or compiler will be considerably larger and more complex.

ANSI C Macro Language

Definitions and invocations of macros are handled by a preprocessor, which is generally not integrated with the rest of the compiler.

Examples:

    #define NULL 0
    #define EOF (-1)
    #define EQ ==
    #define ABSDIFF(X,Y) ( (X)>(Y) ? (X)-(Y) : (Y)-(X) )

The EQ macro is an example of syntactic modification. Note that there must be no space between the macro name and the opening parenthesis of its parameter list.

Parameter substitutions are not performed within quoted strings:

    #define DISPLAY(EXPR) printf("EXPR= %d\n", EXPR)

Macro expansion example:

    DISPLAY(I*J+1)
    printf("EXPR= %d\n", I*J+1)

A special stringizing operator, #, can be used to perform argument substitution in quoted strings:

    #define DISPLAY(EXPR) printf(#EXPR "= %d\n", EXPR)

Macro expansion example:

    DISPLAY(I*J+1)
    printf("I*J+1" "= %d\n", I*J+1)

Recursive macro definitions or invocations:
- After a macro is expanded, the macro processor rescans the text that has been generated, looking for more macro definitions or invocations.
- A macro cannot invoke or define itself recursively.

    DISPLAY(ABSDIFF(3,8))
    scan:    printf("ABSDIFF(3,8)" "= %d\n", ABSDIFF(3,8))
    rescan:  printf("ABSDIFF(3,8)" "= %d\n", ( (3)>(8) ? (3)-(8) : (8)-(3) ))

Conditional compilation statements

Example 1:

    #ifndef BUFFER_SIZE
    #define BUFFER_SIZE 1024
    #endif

Example 2:

    #define DEBUG 1
        :
    #if DEBUG == 1
        printf(...)   /* debugging output */
    #endif

Miscellaneous functions of the ANSI C preprocessor:
- Trigraph sequences are replaced by their single-character equivalents, e.g., ??< becomes {.
- Any source line that ends with a backslash, \, and a newline is spliced together with the following line.
- Any source files included in response to an #include directive are processed.
- Escape sequences are converted, e.g., \n, \0.
- Adjacent string literals are concatenated, e.g., "hello," " world" becomes "hello, world".
