skx / math-compiler Goto Github PK

View Code? Open in Web Editor NEW

60.0 3.0 7.0 140 KB

A simple intel/AMD64 assembly-language compiler for mathematical operations

License: GNU General Public License v2.0

Go 91.32% Shell 8.68%

golang compiler maths toy trivial reverse-polish

math-compiler's Introduction

math-compiler
Installation
Quick Overview
About Our Output
Test Cases
- Debugging the generated programs
Possible Expansion?
- See Also
Github Setup

math-compiler

This project contains the simplest possible compiler, which converts mathematical operations into assembly language, allowing all the speed in your sums!

Because this is a simple project it provides only a small number of primitives:

+ - Plus
- - Minus
* - Multiply
/ - Divide
^ - Raise to a power
% - Modulus
! - Factorial
abs
sin
cos
tan
sqrt
Stack operations:
- swap - Swap the top-two items on the stack
- dup - Duplicate the topmost stack-entry.
Built-in constants:
- e
- pi

Despite this toy-functionality there is a lot going on, and we support:

Full RPN input
Floating-point numbers (i.e. one-third multipled by nine is 3)
- 1 3 / 9 *
Negative numbers work as you'd expect.

Some errors will be caught at run-time, as the generated code has support for:

Detecting, and preventing, division by zero.
Detecting insufficient arguments being present upon the stack.
- For example this program is invalid 3 +, because the addition operator requires two operands. (i.e. 3 4 +)

Installation

If you just need a binary you can find them upon the project release page, however if you wish to build and install locally you can do that in either of the standard ways:

Install from the latest revision:

$ go install github.com/skx/math-compiler@master

Or you can clone the source, and build from it:

$ git clone https://github.com/skx/math-compiler
$ cd math-compiler
$ go install .

Quick Overview

The intention of this project is mostly to say "I wrote a compiler", because I've already experimented with a language, an embedded evaluation engine, and implemented a BASIC interpreter. The things learned from those projects were pretty useful, even if the actual results were not so obviously useful in themselves.

Because there are no shortages of toy-languages, and there is a lot of complexity in writing another for no real gain, I decided to just focus upon a simple core:

Allowing "maths stuff" to be "compiled".

In theory this would allow me to compile things like this:

2 + ( 4 * 54 )

However I even simplified that, via the use of a "Reverse Polish" notation, so if you want to run that example you'd enter the expression as:

4 54 * 2 +

About Our Output

The output of math-compiler will be an assembly-language file, which then needs to be compiled before it may be executed.

Given our previous example of 2 + ( 4 * 54) we can compile & execute that program like so:

$ math-compiler '4 54 * 2+' > sample.s
$ gcc -static -o sample ./sample.s
$ ./sample
Result 218

There you see:

math-compiler was invoked, and the output written to the file sample.s.
gcc was used to assemble sample.s into the binary sample.
The actual binary was then executed, which showed the result of the calculation.

If you prefer you can also let the compiler do the heavy-lifting, and generate an executable for you directly. Simply add -compile, and execute the generated a.out binary:

$ math-compiler -compile=true '2 8 ^'
$ ./a.out
Result 256

Or to compile and execute directly:

$ math-compiler -run '3 45 * 9 + 12 /'
Result 12

Test Cases

The codebase itself contains some simple test-cases, however these are not comprehensive as a large part of our operation is merely to populate a simple template-file, and it is hard to test that.

To execute the integrated tests use the standard go approach:

$ go test [-race] ./...

In addition to the internal test cases there are also some functional tests contained in test.sh - these perform some calculations and verify they produce the correct result.

frodo ~/go/src/github.com/skx/math-compiler $ ./test.sh
...
Expected output found for '2 0 ^' [0]
Expected output found for '2 1 ^' [2]
Expected output found for '2 2 ^' [4]
Expected output found for '2 3 ^' [8]
Expected output found for '2 4 ^' [16]
Expected output found for '2 5 ^' [32]
...
Expected output found for '2 30 ^' [1073741824]
...

Debugging the generated programs

If you run the compiler with the -debug flag a breakpoint will be generated immediately at the start of the program. You can use that breakpoint to easily debug the generated binary via gdb.

For example you might generate a program "2 3 + 4 /" like so:

$ math-compiler -compile -debug '2 3 + 4 /'

Now you can launch that binary under gdb, and run it:

$ gdb ./a.out
(gdb) run
..
Program received signal SIGTRAP, Trace/breakpoint trap.
0x00000000006b20cd in main ()

Dissassemble the code via disassemble, and step over instructions one at a time via stepi. If your program is long you might see a lot of output from the disassemble step:

(gdb) disassemble
Dump of assembler code for function main:
   0x00000000006b20cb:	push   %rbp
   0x00000000006b20cc:	int3
=> 0x00000000006b20cd:	fldl   0x6b20b3
   0x00000000006b20d4:	fstpl  0x6b2090
   0x00000000006b20db:	mov    0x6b2090,%rax
   0x00000000006b20e3:	push   %rax
   0x00000000006b20e4:	fldl   0x6b20bb
   0x00000000006b20eb:	fstpl  0x6b2090
   0x00000000006b20f2:	mov    0x6b2090,%rax
   0x00000000006b20fa:	push   %rax
   ...
   ...

You can set a breakpoint at a line in the future, and continue running till you hit it, with something like this:

 (gdb) break *0x00000000006b20fa
 (gdb) cont

Once there inspect the registers with commands like these two:

 (gdb) print $rax
 (gdb) info registers

My favourite is info registers float, which shows you the floating-point values as well as the raw values:

 (gdb) info registers float
 st0            0.140652076786443369638	(raw 0x3ffc90071917a6263000)
 st1            0	(raw 0x00000000000000000000)
 st2            0	(raw 0x00000000000000000000)
 ...
 ...

Further documentation can be found in the gdb manual, which is worth reading if you've an interest in compilers, debuggers, and decompilers.

Possible Expansion?

The obvious thing to improve in this compiler is to add support for more operations. At the moment support for the most obvious/common operations is present, but perhaps more functions could be added.

Github Setup

This repository is configured to run tests upon every commit, and when pull-requests are created/updated. The testing is carried out via .github/run-tests.sh which is used by the github-action-tester action.

Releases are automated in a similar fashion via .github/build, and the github-action-publish-binaries action.

Steve

math-compiler's People

Contributors

Stargazers

Watchers

Forkers

watmough r3dsm0k3 ant4g0nist walcz-hu omkz

math-compiler's Issues

Alert if the syntax is unexpected

Catch errors of the form:

3 3 3 3 ..

Catch division by zero at run-time.

Now we're using RPN we can't catch this at compile-time.

Detect overflow ..

We can't hold all numbers in our result-values, as we only have 64-bits.

This presents itself most obviously in the case of the factorial operation:

  $ ./math-compiler -run '20!'
  Result 2.4329e+18

   $ ./math-compiler -run '21!'
   Result -4.24929e+18

The first output there is valid, and correct. The second is negative, as a result of overflow.

We should be able to catch that at run-time, and error rather than showing the bogus-result.

Implement factorial

For example:

1! => 1
2! => 2
3! => 6
4! => 4 * 3 * 2 * 1 = 24
5! => 5 * 4 * 3 * 2 * 1 = 120

Not sure it will be hugely useful, but it's a simple thing to add.

Convert to being 100% RPN-based.

Currently the input we parse is like a reverse-polish stack-based calculator, but it is not reall.

We should make it such that we are, allowing input such as this:

 1 2 3 4 5 6 7 8 9 10+++++++++
  (-> 55 )

This isn't a significant change, but could be improved by code-folding. The biggest difficulty will be that we want to split the template we use for generating the program into distinct steps.

Add `dup`

This would be useful, for duplicating the stack-top.

For example:

  ' 3 dup *  '
   -> 9

Simplify the internal API

The main.go driver needs to know about tokens, internal form, etc.

Export a single function in the compiler. package which does the compilation, and hides those details.

That makes usage simpler.

Allow compiling/running.

Rather than forcing the user to run the program, compile and run interactively.

Prevent division by zero.

Since that is bad :)

Show the output - don't set the exit code

As things stand the exit-code of Linux is capped to 128. Show the output instead :)

Our modulus results are wrong.

Just noticed when I was sharing the program with a colleague .. Oops!

Support floating-point operations.

I spent a few hours over the weekend re-acquainting myself with floating-point operations, and it wouldn't be hard to switch from integers to using them.

Currently we always keep the starting number inf rax, then make sure that all operations use rax as the source. We'll do something similar with the FPU instructions, keep [acc] as the accumulator, and use that as the source for all issues.

The biggest different I guess, is that we'll use the data-region to store our constants. Since you can't run:

 mov eax, 3.13

Instead you need something like this:

    ; this example sets `acc = a + b`
    section .data
       a:      dq      3.0   
       b:      dq      4.0

    section .bss
       acc:      resq    1    ; our temporary / result / accumulator.


    section .text

    global  main

    main:                           
       fld    qword [a]       
       fadd qword [b]       
       fstp  qword [acc]  

     ..

It's not horrid, but it means that we need to generate a list of constants, which can happen from any operation.

It's probably worth templating add, div, mul, & other sections. Rather than using one constant template.

As long as all operations work on the acc temporary value, and their own operand it'll be simple enough though :)

This is probably a pretty mechanical transformation..

Handle the case of insufficient arguments.

All our operators pop items from the stack, do something, then store the result back there.

We should abort, cleanly, if there are insufficient arguments present upon the stack.

This would allow the following program to abort cleanly:

    $ math-compiler -run '3 +'
    Error launching a.out: signal: segmentation fault (core dumped)

We generate code to push (numbers) items upon the stack. I guess we allocate a register, or address, to keep a running count of stack-entries. Increment when we add, and decrement when we remove.

Should be a simple change. Also a good opportunity to cleanup the termination behaviour.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.