hoc3
adds support for:
^
Our grammar needs some additional rules to support exponents and parentheses, as well as more extensive changes to support multi-character variables. Last, we’ll add productions to call a builtin function with 0, 1, or 2 arguments (which are always expressions).
An overview of the new structure, without any parser actions, is below.
: /* nothing */
list| list terminator
| list assign terminator
| list expr terminator
| list error terminator
: assignable '=' expr
assign| assignable ':' '=' expr
: BUILTIN '(' ')'
call| BUILTIN '(' expr ')'
| BUILTIN '(' expr ',' expr ')'
: NUMBER
expr| assign
| call
| VAR
| CONST
| '@'
| expr '+' expr
| expr '-' expr
| expr '*' expr
| expr '/' expr
| expr '^' expr
| '-'expr
| '(' expr ')'
: VAR
assignable| CONST
: '\n'
terminator| ';'
Two interesting notes to keep in mind:
Even though assign
is a type of expr
,
it has its own list
production, because we don’t want to
output the value of an assignment.
We have a new assignable
production that encompases
both variables and constants.
hoc3
adds enough independent functionality that we will
split it into separate files.
symbol.c – Builtin functions and constants, as well as user-defined variables/constants, are all defined as Symbols; this module exports the symbol table interface, defined later. This interface allows us to define new symbols, and lookup the type and value of existing ones.
builtins.c – The builtins we’ve defined for the
language. All predefined names in hoc3
are declared
here.
math.c – Implements the math builtins for
hoc3
, which are declared in builtins.c.
hoc.h – For simplicity, a single header file is
used to communicate all of our interfaces; the symbol
,
math
, and builtins
modules each export their
APIs here.
hoc.y – Still contains our language grammar and main loop. It also contains helper functions for runtime errors and warnings, used by the rest of the files.
hoc3
’s Math Libraryhoc3
now supports builtin functions; all that remains is
to populate them. hoc3
’s tiny math library is mostly a
wrapper around the standard library.
We implement all math functions inside math.c
; each of
the functions in hoc3
is a wrapper around a
math.h
standard library function. The job of the wrapper
functions is simply to check for error conditions and convert them into
runtime hoc
errors. As an example function, consider our
“power” function, used to implement x ^ y
:
static double check_math_err(double result, const char *op);
double Pow(double x, double y) {
return check_math_err(pow(x, y), "exponentiation");
}
The math library functions don’t fail when given “invalid” arguments,
but they do report errors via other channels. But to expose
these errors to our caller in hoc3
, we want to use our
exec_error
function.
Now, floating-point errors in C may be reported in two different
ways, depending on whether our machine has full IEEE floating-point
support. We can check how errors will be reported using the
math_errhandling
macro, introduced in C99. It has one of
three values:
MATH_ERRNO
- Supports only using the errno
variable to check floating-point errorsMATH_ERREXCEPT
- Only the “new”
fetestexcept
function is supportedThe fetestexcept
function lets us check which kind of
exeptional case has occured; we will ignore “inexact” results,
since our core data type is double, many integer-valued operations will
have inexact results.
static double check_math_err(double result, const char *op) {
static const char *domain_msg = "argument outside domain";
static const char *range_msg = "result outside range";
static const char *other_msg = "floating-point exception occurred";
const char *msg = NULL;
if ((math_errhandling & MATH_ERREXCEPT) && fetestexcept(FE_ALL_EXCEPT)) {
// Special case: Inexact results are not errors.
if (fetestexcept(FE_INEXACT)) {
goto done;
}
if (fetestexcept(FE_INVALID)) {
= domain_msg;
msg } else if (fetestexcept(FE_DIVBYZERO | FE_OVERFLOW | FE_UNDERFLOW)) {
= range_msg;
msg } else { // unknown
= other_msg;
msg }
(FE_ALL_EXCEPT);
feclearexcept} else if (errno) {
if (errno == EDOM) {
= domain_msg;
msg } else if (errno == ERANGE) {
= range_msg;
msg } else {
= other_msg;
msg }
= 0;
errno }
if (msg != NULL) {
("math error during %s: %s", op, msg);
exec_error}
:
donereturn result;
}
Since the IEEE floating-point exceptions provide more information for
us than the old errno system, we default to using them. For embedded or
older systems that support only errno
, we leave logic in
for them. More details can be found in the math_errhandling
documentation.
Once we have our new math.c
module and a
Pow
function, we can add it to our language as a piece of
literal syntax.
First, we define our new ^
operator, alongside (but with higher precedence than) the other binary
operators.
%left '*' '/' /* left-assoc; higher precedence */+ %right '^' /* exponents */
%left UNARY_MINUS /* Even higher precedence
than * or /. Can't use '-', as we've used it already for subtraction. */
Our parser action will assume the Pow
function is
available by declaring it extern
.
extern double Pow(double, double);
So long as the hoc3
program is linked with the
math
object file, we can now simply call Pow()
in our action.
expr: NUMBER { $$ = $1; }
...
| expr '-' expr { $$ = $1 - $3; }
| expr '*' expr { $$ = $1 * $3; }
| expr '/' expr
{
if ($3 == 0) {
exec_error("Division by zero");
} else {
$$ = $1 / $3;
}
}+ | expr '^' expr { $$ = Pow($1, $3); }
| '-'expr %prec UNARY_MINUS { $$ = (-1 * $2); }+
To manage an arbitrary set of user-defined variable names, along with
the names of our builtin functions / constants, we’ll need a more
sophisticated data structure than our simple 26-element
variables
array. To that end, hoc3
adds the
concept of a Symbol
: a named, typed value. The
symbol
module also defines functions for accessing a global
symbol table.
We can call install_symbol
to add a symbol to the table,
and lookup
to fetch a symbol’s current value.
*lookup(const char *name);
Symbol *install_symbol(const char *name, short type, double value); Symbol
The symbol table interface hoc3/hoc.h
The core data type of our symbol table is struct Symbol
;
it contains all the information required to find and use a named
value.
struct Symbol {
short type; // Generated from our token types
char *name;
*next;
Symbol union {
double val; // a double-valued symbol
struct BuiltinFunc func; // a callable symbol
} data;
};
A Symbol
is treated differently depending on its “type”,
stored in the imaginatively-named type
field.
BUILTIN
symbols point to functions, and can be
called with zero or more arguments. They must return a
double
value.VARIABLE
s hold a double. Without an
operator, they will produce their current value. The assignment operator
may be used to assign a new value to a variable.CONSTANT
s operate like variables, but can only be
assigned once.We’ll define the actual values used for the type
field
when we make our grammar changes.
The symbol’s data
field has a different C type depending
on the value of Symbol.type
.
For “value symbols” (i.e. not functions), we have a simple double
value, stored in Symbol.data.val
.
For BUILTIN
types we have Symbol.data.func
,
which is a struct containing enough information to call an appropriate
function pointer. It has enough space for an integer, args
,
and a function pointer, which must return a double and take either 0, 1,
or 2 arguments. When using a Symbol
of type
BUILTIN
, we first examine its args
field,
which counts the expected number of arguments. Based on its value, we
know to use the union field call0
, call1
, or
call2
.
/** Function descriptor for builtins: these always return doubles */
struct BuiltinFunc {
int args;
union {
// anon union: e.g. use `data.func.call0` directly. */
double (*call0)(void);
double (*call1)(double);
double (*call2)(double, double);
};
};
The last two fields of Symbol
are used for
lookup. They implicitly define the symbol table.
name
for lookupnext
link to the next Symbol
in our
table.The “table”, internally, is simply a static pointer to a linked list
of Symbol
objects. This implementation suffices, at least
for “calculator” uses.
static Symbol *symbol_table = NULL;
Installing a new symbol object into the table requires us to:
install_symbol()
lookup()
.Allocation is as simple as a malloc()
to hold the
structure and the name, though we wrap this in our own allocator,
emalloc()
, with the same argument as malloc()
(covered at the end of this section).
*install_symbol(const char *name, short type, double value) {
Symbol size_t name_len = strlen(name);
*s = emalloc(sizeof(Symbol) // actual symbol
Symbol + (sizeof(char)) * (name_len + 1)); // room for name
->name = (char *)(s + 1);
s(s->name, name);
strcpy->name[name_len] = '\0';
s...
For hoc3
, we’ll assume that every symbol we install is a
simple double
value; we’ll revisit this later.
...
->type = type;
s->data.val = value; s
Finally, we link the symbol to the existing global table. Since the
static symbol_table
variable starts as NULL
,
our table is always null-terminated.
...
->next = symbol_table;
s= s;
symbol_table return s;
}
The emalloc()
function is nothing more than a version of
malloc that uses our exec_error()
functionality to report
errors:
void *emalloc(size_t nbytes) {
void *ptr = malloc(nbytes);
if (ptr == NULL) {
("Out of memory!");
exec_error}
return ptr;
}
Looking up symbols is as simple as walking our linked list, returning the first matching Symbol.
*lookup(const char *name) {
Symbol *current = symbol_table;
Symbol while (current != NULL) {
if (strcmp(current->name, name) == 0) {
return current;
}
= current->next;
current }
return NULL;
}
Notice that the way we implemented install_symbol
does
not check for duplicates, and so assigning the same name twice simply
installs a new symbol “in front” of the old one. This, combined with the
linear-search behavior of lookup
, means that re-assigning a
variable “just works” with no additional logic, at the cost of some
additional memory.
With the symbol table module, we can replace our simple
variables
array with uses of the Symbol
table.
When we produce Symbol
tokens, we’ll return the symbol
type as a token type, but the value produced for that
token must be a full Symbol
object.
We can do that by replacing our simple hoc_index
token-value-type with a Symbol version.
%union {
double hoc_value;- int hoc_index;
+ Symbol *hoc_symbol;
}
We’ll also define the token types corresponding to symbols here;
we’ll *reuse these values to populate
Symbol.type
!
%token <hoc_value> NUMBER+ %token <hoc_symbol> VAR CONST BUILTIN UNDEF
%type <hoc_value> expr assign call
Our lexer will need to be updated to recognize symbol tokens and
produce Symbol
objects. A symbol token will be any
alphanumeric identifier that starts with a letter (just as in
C). We read up to SYMBOL_NAME_LIMIT
characters, discard the
rest, and lookup the symbol. If it exists, we produce the existing
value. Otherwise, we install an UNDEF
ined symbol
with that name, and produce it. Our hope is that the symbol
token is part of an assignment expression, so we’re about to
define it. However, we don’t return the
UNDEF
token type; instead we return the type
VAR
. (See the next step for the reasons why.)
All of this logic goes at the bottom of our existing
yylex()
function
int yylex(void) {
int c;
...
if (isalpha(c)) {
*s;
Symbol
char buf[SYMBOL_NAME_LIMIT+1];
size_t nread = 0;
do {
[nread++] = c;
buf= getchar();
c } while (nread < SYMBOL_NAME_LIMIT && (isalpha(c) || isdigit(c)));
// Just in case we exceeded the limit
while ((isalpha(c) || isdigit(c))) {
= getchar();
c }
// at this point, we have a non-alphanumeric 'c'
(c, stdin);
ungetc
[nread] = '\0';
bufif ((s=lookup(buf)) == NULL) {
= install_symbol(buf, UNDEF, 0.0);
s }
// Produce symbol for parser
.hoc_symbol = s;
yylvalreturn (s->type == UNDEF ? VAR : s->type);
}
return c;
}
We’ll create a new production for any symbol that we can assign a
value to; namely, VAR
s and CONST
s. We’ll call
this an “assignable”. For now, we’ll focus on VAR
s. Our
VAR
action changes to handle any assignable
,
and simply returns the value stored inside the symbol’s
data.val
field.
expr: NUMBER { $$ = $1; }
| '@' { $$ = previous_value; }- | VAR { $$ = variables[$1]; }
+ | assignable
+ {
+ $$ = $1->data.val;
+ }
...+ assignable: VAR | CONST
+ ;
Our previous variable-assignment expression will be replaced with a
non-production called assign
.
- %token <hoc_value> NUMBER
- %token <hoc_index> VAR
+ %type <hoc_value> expr assign call
The assign
action will change the Symbol
type from UNDEF
to VAR
, then set the symbol’s
value so it can be evaluated later.
expr: NUMBER { $$ = $1; }- | VAR '=' expr { $$ = (variables[$1] = $3); }
+ | assign
...
+ assign: assignable '=' expr
+ {
+ $1->type = VAR;
+ $$ = ($1->data.val = $3);
+ }
+
We don’t want to let the user evaluate an undefined symbol, but they should be allowed to assign a value to one. To support this nuance, we have two options:
hoc3
chooses to do more in C; its lexer will always
return the token type of an undefined symbol as
VAR
, but the Symbol
object it
produces will have the correct type
field. This allows us
to check the typewithin our parser action, and reject invalid
uses of undefined symbols.
Example: Allowing Assignment of UNDEF
Symbols
: assignable '=' expr
assign{
1->type = VAR;
$= ($1->data.val = $3);
$$ }
Example: Refusing to Evaluate UNDEF
Symbols
expr: NUMBER { $$ = $1; }
...
| assignable
{+ if ($1->type == UNDEF) {
+ exec_error("Undefined variable '%s'", $1->name);
+ }
$$ = $1->data.val; }
The token type will never be UNDEF
, because we
never return that type from our lexer. (It’s mainly used as a way to add
the enumeration value.)
Much like we check Symbol.type
to detect undefined
symbols, we can use a new type to describe symbols that cannot be
reassigned once they have a value. In
hoc3
, we’ll use that to add some built-in constants for
PI
, E
, etc. We’ll also allow our users can
define new constants with the :=
operator.
The implementation of constants is relatively simple; the parser actions contain the logic.
When we are inside the “assign constant value” production, we:
Symbol
, so that it’s a
constant from now on.: ...
assign':' '=' expr
assignable {
if ($1->type == CONST) {
("Cannot reassign constant '%s'", $1->name);
exec_error} else {
1->type = CONST;
$= ($1->data.val = $4);
$$ }
}
Making a Symbol constant hoc3/hoc.y
We also have to update the variable-assignment production, to make
sure that a user cannot use PI = 10
to modify our constants
either.
: assignable '=' expr
assign{
if ($1->type == CONST) {
("Cannot reassign constant '%s'", $1->name);
exec_error} else {
1->type = VAR;
$= ($1->data.val = $3);
$$ }
}
Finally, we want to add the constants that hoc3
provides
as part of its language, PI
and E
. We create a
builtins.c
module, which exposes a single public function
via hoc.h
.
void install_hoc_builtins(void);
This should be called on program startup, and should install all the
symbols we want to “build into” the hoc3
language.
int main(int argc, char *argv[]) {
(void)argc;
= argv[0];
program_name (); <<<< Before setjmp!
install_hoc_builtins(begin);
setjmpreturn yyparse();
}
To try to keep the modules as independent as possible, we’ll let
install_symbol
continue to be the only function that
constructs a Symbol. We’ll use a separate, internal structure to hold
the constants before installation (though using Symbol
would work just as well, and use less memory). Notice that the last
value in our constants
array has a NULL
name,
informing us when we’re done.
static struct {
const char *name;
double value;
} constants[] = {
{"PI", 3.14159265358979323846},
{"E", 2.71828182845904523536},
{NULL, 0},
};
Our “install” function merely creates symbols for each of them.
void install_hoc_builtins(void) {
for (int i = 0; constants[i].name != NULL; i++) {
(constants[i].name, CONST, constants[i].value);
install_symbol}
Once we have the infrastructure for Symbol
s and language
builtin installation, adding Symbol
s for function calls
requires surprisingly little code. First, we’ll add a single builtin
function, to ensure the language structures are working. The next
section will add the math functions described above.
Our language’s builtin functions will be declared in
builtins.c
, just as our constants are. We’ll use another
anonymous structure to hold the data about the builtin functions. (You
could re-use the BuiltinFunc
struct, but it has no `namea
field, since .)
#include <math.h> // From standard library
...
static struct {
const char *name;
int args;
union {
double (*call0)(void);
double (*call1)(double);
double (*call2)(double, double);
};
} builtins[] = {
{"abs", 1, .call1 = fabs}, // temporary function
{NULL, 0, .call0 = NULL},
};
And update our installation function to allow installing functions with 0, 1, or 2 arguments.
void install_hoc_builtins(void) {
...
for (int i = 0; builtins[i].name != NULL; i++) {
*s = install_symbol(builtins[i].name, BUILTIN, 0.0);
Symbol
// Overwrite `data` for the symbol; this is a function
->data.func.args = builtins[i].args;
sswitch (builtins[i].args) {
case 0:
->data.func.call0 = builtins[i].call0;
sbreak;
case 1:
->data.func.call1 = builtins[i].call1;
sbreak;
case 2:
->data.func.call2 = builtins[i].call2;
sbreak;
default:
("Cannot start hoc: Argument count error for builtin '%s'",
exec_error[i].name);
builtins}
}
}
With our symbol installed, we can now evaluate functions. We’ll add a
new expression type, call
, which has three different forms,
one for each number of arguments.
: BUILTIN '(' ')'
call{ ... actions here ... }
| BUILTIN '(' expr ')'
{ ... actions here ... }
| BUILTIN '(' expr ',' expr ')'
{ ... actions here ... }
Calling a function has two parts:
We’ll use call1
as our example; the other two are the
same.
: ...
call| BUILTIN '(' expr ')'
{
($1, 1);
check_call_args= $1->data.func.call1($3);
$$ }
$3
, our argument, is guaranteed to have a
double
value, since expr
is declared to
produce a hoc_value
. That’s the value we produce as well,
as builtin calls are expressions too. (Note that the call
non-terminal is declared to produce a hoc_value
in the
preamble.)
check_call_args
verifies our argument count &
produces an easy-to-read error message, thanks to our
exec_error
helper.
/*
* Verify Symbol's declared arg count matches actual,
* or display a helpful error message with mismatch
*/
void check_call_args(Symbol *s, int actual) {
int expected = s->data.func.args;
if (expected != actual) {
("Wrong number of arguments for %s: expected %d, got %d",
exec_error->name, expected, actual);
s}
}
builtins
ListWe can now replace the temporary builtin
for absolute
values with a list of the math functions from math.c
.
First, we’ll need extern declarations in builtins
, since we
don’t want to expose the entire math library via hoc.h
.
extern double Abs(double), Acos(double), Atan(double), Atan2(double, double),
(double), Exp(double), Integer(double), Lg(double), Ln(double),
Cos(double), Sqrt(double), Sin(double); Log10
Then, we can replace the temporary abs
function with a
list of the math.c
functions.
...
} builtins[] = {
{"abs", 1, .call1 = Abs},
{"acos", 1, .call1 = Acos},
{"atan", 1, .call1 = Atan},
{"atan2", 2, .call2 = Atan2},
{"cos", 1, .call1 = Cos},
{"exp", 1, .call1 = Exp},
{"int", 1, .call1 = Integer},
{"lg", 1, .call1 = Lg},
{"ln", 1, .call1 = Ln},
{"log10", 1, .call1 = Log10},
{"sin", 1, .call1 = Sin},
{"sqrt", 1, .call1 = Sqrt},
{NULL, 0, .call0 = NULL},
};
hoc3
allowed us to build the foundations of our symbol
table and math library without changing the core language too
drastically. In hoc4
, we’ll add an
interpreter layer to hoc, which will handle our core expression
evaluation, and set the stage for flow control.
?= -std=c2x -Wall -Wextra -pedantic -Wformat -Wformat-extra-args -Wformat-nonliteral
CFLAGS ?= -d
YFLAGS := hoc.o builtins.o math.o symbol.o
objects
: hoc
all
: $(objects)
hoc
.o symbol.o math.o: hoc.h hoc.tab.h
builtins
.tab.%: hoc.y
hoc(YFLAGS) hoc.y -o hoc.tab.c
yacc $
.PHONY:clean
:
clean-f hoc.tab.h
rm -f hoc.tab.c
rm -f y.tab.h
rm -f $(objects)
rm -f hoc rm
#include "hoc.h"
#include "hoc.tab.h"
#include <stdlib.h>
/* Defined in math.c */
extern double Abs(double), Acos(double), Atan(double), Atan2(double, double),
(double), Exp(double), Integer(double), Lg(double), Ln(double),
Cos(double), Sqrt(double), Sin(double);
Log10
static struct {
const char *name;
double value;
} constants[] = {
{"PI", 3.14159265358979323846},
{"E", 2.71828182845904523536},
{NULL, 0},
};
static struct {
const char *name;
int args;
union {
double (*call0)(void);
double (*call1)(double);
double (*call2)(double, double);
};
} builtins[] = {
{"abs", 1, .call1 = Abs},
{"acos", 1, .call1 = Acos},
{"atan", 1, .call1 = Atan},
{"atan2", 2, .call2 = Atan2},
{"cos", 1, .call1 = Cos},
{"exp", 1, .call1 = Exp},
{"int", 1, .call1 = Integer},
{"lg", 1, .call1 = Lg},
{"ln", 1, .call1 = Ln},
{"log10", 1, .call1 = Log10},
{"sin", 1, .call1 = Sin},
{"sqrt", 1, .call1 = Sqrt},
{NULL, 0, .call0 = NULL},
};
void install_hoc_builtins(void) {
for (int i = 0; constants[i].name != NULL; i++) {
(constants[i].name, CONST, constants[i].value);
install_symbol}
for (int i = 0; builtins[i].name != NULL; i++) {
*s = install_symbol(builtins[i].name, BUILTIN, 0.0);
Symbol ->data.func.args = builtins[i].args;
sswitch (builtins[i].args) {
case 0:
->data.func.call0 = builtins[i].call0;
sbreak;
case 1:
->data.func.call1 = builtins[i].call1;
sbreak;
case 2:
->data.func.call2 = builtins[i].call2;
sbreak;
default:
("Cannot start hoc: Argument count error for builtin '%s'",
exec_error[i].name);
builtins}
}
}
/*
* Global types & declarations for use in hoc
*/
#define SYMBOL_NAME_LIMIT 100
typedef struct Symbol Symbol;
/** Function descriptor for builtins: these always return doubles */
struct BuiltinFunc {
int args;
union {
// anon union: e.g. use `data.func.call0` directly. */
double (*call0)(void);
double (*call1)(double);
double (*call2)(double, double);
};
};
struct Symbol {
short type; // Generated from our token types
char *name;
*next;
Symbol union {
double val;
struct BuiltinFunc func;
} data;
};
*install_symbol(const char *name, short type, double value);
Symbol *lookup(const char *name);
Symbol
/** global error handling */
((format(printf, 1, 2))) void warning(const char *msg, ...);
__attribute__((format(printf, 1, 2))) void exec_error(const char *msg, ...);
__attribute__
/** builtins */
void install_hoc_builtins(void);
%{
///----------------------------------------------------------------
/// C Preamble
///----------------------------------------------------------------
/*
* "higher-order calculator" - Version 3
* From "The UNIX Programming Environment"
*/
#include <stdio.h>
#include <ctype.h>
#include <signal.h>
#include <setjmp.h>
#include <stdarg.h>
#include "hoc.h"
///----------------------------------------------------------------
/// external declarations
///----------------------------------------------------------------
extern double Pow(double, double);
///----------------------------------------------------------------
/// global state
///----------------------------------------------------------------
double previous_value = 0.0;
;
jmp_buf begin
///----------------------------------------------------------------
/// local declarations
///----------------------------------------------------------------
int yylex(void);
void yyerror(const char *s);
void check_call_args(const Symbol *builtin, int expected);
// Used for the '@' feature
#define remember(v) (previous_value = (v))
%}
%union {
double hoc_value;
*hoc_symbol;
Symbol }
%token <hoc_value> NUMBER
%token <hoc_symbol> VAR CONST BUILTIN UNDEF
%type <hoc_value> expr assign call
%type <hoc_symbol> assignable
%right '=' /* right-associative, much like C */
%left '+' '-' /* left-associative, same precedence */
%left '*' '/' /* left-assoc; higher precedence */
%right '^' /* exponents */
%left UNARY_MINUS /* Even higher precedence
than * or /. Can't use '-', as we've used
it already for subtraction. */
%%
: /* nothing */
list| list terminator
| list assign terminator
| list expr terminator
{
("\t%.8g\n", remember($2));
printf}
| list error terminator {yyerrok;}
;
: NUMBER { $$ = $1; }
expr| '@' { $$ = previous_value; }
| assignable
{
if ($1->type == UNDEF) {
("Undefined variable '%s'", $1->name);
exec_error}
= $1->data.val;
$$ }
| assign
| call
| expr '+' expr { $$ = $1 + $3; }
| expr '-' expr { $$ = $1 - $3; }
| expr '*' expr { $$ = $1 * $3; }
| expr '/' expr
{
if ($3 == 0) {
("Division by zero");
exec_error} else {
= $1 / $3;
$$ }
}
| expr '^' expr { $$ = Pow($1, $3); }
| '-'expr %prec UNARY_MINUS { $$ = (-1 * $2); }
| '(' expr ')' { $$ = $2; }
;
: assignable '=' expr
assign{
if ($1->type == CONST) {
("Cannot reassign constant '%s'", $1->name);
exec_error} else {
1->type = VAR;
$= ($1->data.val = $3);
$$ }
}
| assignable ':' '=' expr
{
if ($1->type == CONST) {
("Cannot reassign constant '%s'", $1->name);
exec_error} else {
1->type = CONST;
$= ($1->data.val = $4);
$$ }
}
;
: BUILTIN '(' ')'
call{
($1, 0);
check_call_args= $1->data.func.call0();
$$ }
| BUILTIN '(' expr ')'
{
($1, 1);
check_call_args= $1->data.func.call1($3);
$$ }
| BUILTIN '(' expr ',' expr ')'
{
($1, 2);
check_call_args= $1->data.func.call2($3,$5);
$$ }
;
: VAR
assignable| CONST
;
: '\n'
terminator| ';'
;
%% // end of grammar
/* error tracking */
char *program_name;
int line_number = 1;
int main(int argc, char *argv[]) {
(void)argc;
= argv[0];
program_name ();
install_hoc_builtins(begin);
setjmpreturn yyparse();
}
/* our simple, hand-rolled lexer. */
int yylex(void) {
int c;
do {
=getchar();
c} while (c == ' ' || c == '\t');
if (c == EOF) {
return 0;
}
if (c == '.' || isdigit(c)) {
(c, stdin);
ungetc("%lf", &yylval.hoc_value);
scanfreturn NUMBER;
}
if (c == '\n') {
++;
line_number}
if (isalpha(c)) {
*s;
Symbol
char buf[SYMBOL_NAME_LIMIT + 1];
size_t nread = 0;
do {
[nread++] = c;
buf= getchar();
c } while (nread < SYMBOL_NAME_LIMIT && (isalpha(c) || isdigit(c)));
// Just in case we exceeded the limit
while ((isalpha(c) || isdigit(c))) {
= getchar();
c }
// at this point, we have a non-alphanumeric 'c'
(c, stdin);
ungetc
[nread] = '\0';
bufif ((s=lookup(buf)) == NULL) {
= install_symbol(buf, UNDEF, 0.0);
s }
.hoc_symbol = s;
yylvalreturn (s->type == UNDEF ? VAR : s->type);
}
return c;
}
void yyerror(const char *s) {
("%s", s);
warning}
/*
* Verify Symbol's declared arg count matches actual,
* or display a helpful error message with mismatch
*/
void check_call_args(const Symbol *s, int actual) {
int expected = s->data.func.args;
if (expected != actual) {
("Wrong number of arguments for %s: expected %d, got %d",
exec_error->name, expected, actual);
s}
}
#define print_error_prefix() fprintf(stderr, "%s: ", program_name)
#define print_error_suffix() fprintf(stderr, " (on line %d)\n", line_number)
void warning(const char *msg, ...) {
va_list args;
(args, msg);
va_start
();
print_error_prefix(stderr, (msg), args);
vfprintf();
print_error_suffix
(args);
va_end}
void exec_error(const char *msg , ...) {
va_list args;
(args, msg);
va_start
();
print_error_prefix(stderr, (msg), args);
vfprintf();
print_error_suffix
(args);
va_end
(begin, 0);
longjmp}
#include "hoc.h"
#include <errno.h>
#include <fenv.h>
#include <math.h>
#include <stddef.h>
static double check_math_err(double result, const char *op);
double Abs(double x) {
return check_math_err(fabs(x), "fabs");
}
double Acos(double x) {
return check_math_err(acos(x), "acos");
}
double Atan(double x) {
return check_math_err(atan(x), "atan");
}
double Atan2(double x, double y) {
return check_math_err(atan2(x, y), "atan2");
}
double Cos(double x) {
return check_math_err(cos(x), "cos");
}
double Exp(double x) {
return check_math_err(exp(x), "exp");
}
double Integer(double x) {
return (double)(long)x;
}
double Lg(double x) {
return check_math_err(log2(x), "lg");
}
double Ln(double x) {
return check_math_err(log(x), "log");
}
double Log10(double x) {
return check_math_err(log10(x), "log10");
}
double Pow(double x, double y) {
return check_math_err(pow(x, y), "exponentiation");
}
double Sin(double x) {
return check_math_err(sin(x), "sin");
}
double Sqrt(double x) {
return check_math_err(sqrt(x), "sqrt");
}
static double check_math_err(double result, const char *op) {
static const char *domain_msg = "argument outside domain";
static const char *range_msg = "result outside range";
static const char *other_msg = "floating-point exception occurred";
const char *msg = NULL;
if ((math_errhandling & MATH_ERREXCEPT) && fetestexcept(FE_ALL_EXCEPT)) {
// Special case: Inexact results are not errors.
if (fetestexcept(FE_INEXACT)) {
goto done;
}
if (fetestexcept(FE_INVALID)) {
= domain_msg;
msg } else if (fetestexcept(FE_DIVBYZERO | FE_OVERFLOW | FE_UNDERFLOW)) {
= range_msg;
msg } else { // unknown
= other_msg;
msg }
(FE_ALL_EXCEPT);
feclearexcept} else if (errno) {
if (errno == EDOM) {
= domain_msg;
msg } else if (errno == ERANGE) {
= range_msg;
msg } else {
= other_msg;
msg }
= 0;
errno }
if (msg != NULL) {
("math error during %s: %s", op, msg);
exec_error}
:
donereturn result;
}
#include "hoc.h"
#include "hoc.tab.h" // generated from yacc -d on our grammar
#include <stddef.h> // NULL
#include <stdlib.h> // malloc
#include <string.h> // strcmp
static Symbol *symbol_table = NULL;
/** A malloc() implementation that's aware of our runtime error system. */
void *emalloc(size_t nbytes);
*lookup(const char *name) {
Symbol *current = symbol_table;
Symbol while (current != NULL) {
if (strcmp(current->name, name) == 0) {
return current;
}
= current->next;
current }
return NULL;
}
*install_symbol(const char *name, short type, double value) {
Symbol size_t name_len = strlen(name);
*s = emalloc(sizeof(Symbol) // actual symbol
Symbol + (sizeof(char)) * (name_len + 1)); // room for name
->name = (char *)(s + 1);
s(s->name, name);
strcpy->name[name_len] = '\0';
s
->type = type;
s->data.val = value;
s->next = symbol_table;
s= s;
symbol_table return s;
}
void *emalloc(size_t nbytes) {
void *ptr = malloc(nbytes);
if (ptr == NULL) {
("Out of memory!");
exec_error}
return ptr;
}