gluon-lang / gluon Goto Github PK

View Code? Open in Web Editor NEW

3.1K 3.1K 142.0 12.42 MB

A static, type inferred and embeddable language written in Rust.

Home Page: https://gluon-lang.org

License: MIT License

Rust 99.60% Shell 0.40%

compiler embeddable functional gluon language programming-language repl rust type-inference

gluon's People

Contributors

Stargazers

Watchers

Forkers

neuroradiology rockymeza beneills ricelang brendanzab raj347 leshow zjhmale mxmurw agatan lpelleau rzel marwes nxnfufunezn lifeisstrange vbarrielle nlinker salpalvv happy-ferret zipper97412 jgreene badfactor jgrowl dwnusbaum jdomantas ordovicia endeav0r dantengsky gogobook zummenix colelawrence caulagi oooooba ix5231 samanpa dylan-dpc-zz vishalsodani pgarland mapbased shalokshalom ethanpailes memoryruins fduxiao tempbottle laegluin talldan elcritch hhy5277 shaunstanislauslau linecode chrisosaurus jithinraj hwchen mikeyhew sunng87 topelinux c410-f3r traviscross jamesmacaulay ftilde freekh highwayns arnau joprice teresy etherian deepankarsharma playxe comfanter ifwu jheinemeyer uuhan jameekim abruel e7dal mluszczyk toastal zoosky hce jason-cooke leperdu hoangpq nomsi omegablitz happydpc snarpix sailfish009 tshepang zoechi mwatts ngirard rivy kindlychung juaby joefiorini shuenhoy sknji jim-holmstroem klotzambein skyne98

gluon's Issues

Rename `*` to `Type`

This seems to be the general movement of programming languages of late. It's also probably clearer, and easier for newcomers to understand.

A new functional language with a GC is not a earth shattering. If you can do static analysis to figure out the lifetimes without having to specify them at all, this would be a breakthrough. Also see: https://github.com/carp-lang/Carp

Make the library compile under stable rust

Currently the following features are needed to build the library.

core_intrinsics
heap_api - Used to allocate memory for the gc. Could likely be removed using Vec for memory allocation
raw - Used for transmutes into &[T]
slice_bytes
test - Only for needed for testing (benchmarks)

Compile the pattern matching of large structures more efficiently

Currently all patterns always compiled as Split which means that large objects such as the prelude push a lot of values to the stack when they are pattern matched. Often though only a few of them are actually used. For these cases the compiler should produce code which only takes the necessary arguments from the object.

Favor giving names to functions over operators

It might be nice to provide named functions for interfaces first, then have the operators as shorcuts. So for example, (>>=) would become flat_map (I personally prefer this over bind, heh), and make_Monad would define the operator based on that.

Explore possibilities for modular implicits

At the moment module polymorphism is kind of clunky. Whilst our priority is more focused on polishing what we already have, it might be good to note down our thoughts here.

Some references:

Add an environment variable which specifies directories to look for imported files

The import.rs module would should be extended to look for an environment variable (maybe GLUON_PATH?) and use that for import lookup as well.

Consider using a chat service for community discussion

Would be nice to have some kind of place for free-form, asynchronous discussions that don't really fit into github issues. Perhaps Gitter? http://gitter.im/ - we could put a badge to it on the README.

Improve compilation times of gluon_parser

This is causing timeout woes with travis and makes things hard for users on stable. As discussed on https://github.com/Marwes/gluon/issues/33#issuecomment-233157725 and https://github.com/Marwes/gluon/pull/66#issuecomment-234803447 we could possibly consider switching to a parser generator like rustpeg of LALRPOP.

Update rustyline when Windows support appears on crates.io

As discussed in https://github.com/Marwes/gluon/pull/35#issuecomment-233140511:

I started some work to replace the linenoise dependency with rustyline. There shouldn't be any problems porting over any changes since it is mostly a rust implementation of linenoise but it also has a better completion api which should make it possible to add tab-completion in the REPL!

Add a command which shows the kind of a type to the REPL

Example:

>> :k Option
* -> *

>> :k Monad
(* -> *) -> *

Add proper error variants instead of the current use of TypeError::StringError

The use of the StringError variant is rather lazy and does not provide good error messages. If all uses of it are replaced by adding additional variants to the TypeError variants those errors can have the data they need to explain the error properly.

Add functional record update

Currently there is no good way of creating a new record which has most of the fields of an existing record with a few of them modified. Adding functional record update ala Haskell or Rust would make this easier in some cases.

Functiona record update may be a bit limiting in some/many cases so it may be worth investigating a more general way of updating data structures (something like lenses).

Surround operators with parens when pretty-printing

For example:

> { (+) = \x y -> "squiggly squiggle" }
<top>:Line: 1, Column: 1: Expected the following types to be equal
Expected: { +: 1476 -> 1477 -> String }
Found: { ord: std.prelude.Ord 1479, +: 1479 -> 1479 -> 1479, -: 1479 -> 1479 -> 1479, *: 1479 -> 1479 -> 1479, /: 1479 -> 1479 -> 1479, negate: 1479 -> 1479 }
1 errors were found during unification:
Types do not match:
    Expected: { +: 1476 -> 1477 -> String }
    Found: { ord: std.prelude.Ord 1479, +: 1479 -> 1479 -> 1479, -: 1479 -> 1479 -> 1479, *: 1479 -> 1479 -> 1479, /: 1479 -> 1479 -> 1479, negate: 1479 -> 1479 }
{ (+) = \x y -> "squiggly squiggle" }
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Consider removing the IO monad

From writing the REPL I get the feeling that a strict language does not work that well with monadic action once a few actions are chained. Adding better syntax for monadic actions could maybe make it convenient enough but as the language isn't pure anyway it is likely better to remove the IO monad for now.

Formalize the type system

It is probably a far-future issue to actually formalize the type system, however, more pressing is that there is not a good way of determining whether a program should typecheck or not. This is mainly because the fact that type aliases can be used as type lambdas which makes the current typechecking undecidable.

I want to investigate whether it is possible to restrict type aliases in a way that avoids the need of higher order unification (which is not decidable) while not going as far as Haskell does when restricting aliases (as that would require the ability to define new distinct types, something I think would be interesting to avoid TODO explain reasoning).

--allowed to use partially
type Error = Either String
--Not allowed to use partially
type IntEither e = Either e Int

Dont print the entirety of large types when pretty printing

Large types such as the prelude should not be printed in their entirety. See https://github.com/Marwes/gluon/issues/58#issuecomment-234927599.

Type `{
    Bool = | False | True,
    Ordering = | LT | EQ | GT,
    Option a = | None | Some a,
    Result e t = | Err e | Ok t,
    List a = | Nil | Cons a (std.prelude.List a),
    Monoid m = { append: m -> m -> m, empty: m },
    ... 'x' fields omitted
}` does not have the field `Option Int`

Fix the Type::Function variant

Currently the Function variant has holds a Vec<Type> for function arguments but it is always assumed that the vector only has one element as functions are curried and are always treated as 1-argument functions.

The variant should either just hold one argument type, mirroring how the type checker sees it, or the code should take advantage of the Vec to store multiple arguments (possibly for efficency's sake).

Add F#/OCaml/Elm-style piping operators

This seems to be what all the cool kids are doing! It is also much easier for people to understand vs Haskell's (.) and ($):

/// Backward function application, where `f <| x == f x`
let (<|) f x : a -> (a -> b) -> b = f x

/// Forward function application, where `x |> f == f x`
let (|>) x f : (a -> b) -> a -> b = f x

/// Function composition, where `f >> g == (\x -> f (g x))`
let (>>) f g : (a -> b) -> (b -> c) -> a -> c = \x -> f (g x)

/// Function composition, where `f << g == (\x -> g (f x))`
let (<<) f g : (b -> c) -> (a -> b) -> a -> c = \x -> g (f x)

Seeing as we are thinking in higher kinds, I wonder if we could define these in terms of Applicative and Category?

Add more precise locations to type errors

Currently the location of a type error is reported as being in the same place as the expression it occured in. This location can be rather misleading in many common cases.

Type errors in function calls should be reported at the argument rather than the call.

Current:
map 1 Nil
^~~~~~~~
Better version:
map 1 Nil
    ^

Type errors in patterns should be reported at the patterns (or expressions) location rather than the enclosing expression

Expression:
let { x, y } = 1 in ...
Current:
let { x, y } = 1 in ...
^~~~~~~~~~~~~~~~~
Better version:
let { x, y } = 1 in ...
    ^~~~~~~~

Type/identifier search in the repl

Would be super groovy if you could search for functions in-scope with :s. Here is what you can do in the Idris repl:

Idris> :search Monad f => f a -> f b -> f a
< Prelude.Applicative.(*>) : Applicative f => f a -> f b -> f b


< Prelude.Applicative.(<*) : Applicative f => f a -> f b -> f a


> Prelude.List.intercalate : List a -> List (List a) -> List a
Given a separator list and some more lists, produce a new list
by placing the separator between each of the lists.

> Prelude.List.(++) : List a -> List a -> List a
Append two lists

> Language.Reflection.Elab.Prim__Try : Elab a ->
                                       Elab a -> Elab a


> Prelude.List.(\\) : Eq a => List a -> List a -> List a
The \\ function is list difference (non-associative). In the
result of xs \\ ys, the first occurrence of each element of ys
in turn (if any) has been removed from xs, e.g.,

> Prelude.List.merge : Ord a => List a -> List a -> List a
Merge two sorted lists using the default ordering for the type
of their elements.

> Prelude.List.union : Eq a => List a -> List a -> List a
Compute the union of two lists according to their Eq
implementation.

> Prelude.while : IO' l Bool -> IO' l () -> IO' l ()
Loop while some test is true

Debugging

List of some things that are needed to make it easier to debug programs.

Stacktraces
Source maps
Breakpoints

Avoid rust panics when slicing embed_lang strings

After Rust 1.9 is released is_char_boundary will be stabilized and this is easy to implement.

Add exhaustiveness checking for patterns

Either an error or a warning should be triggered when not all alternatives are matched on.

Make it easier to use (+), (-), (==), etc

As no form of overloading currently exist these basic functions require explicit importing like let { (+), (-), (*) } = prelude.num_Int in 2 + 2. This is obviously not so nice so some sort of resolution for this problem will be needed. Following is a list of possible ways to resolve this.

Implementing overloading with traits/typeclasses/"Ocaml implicit modules"
Pros
- Well explored territory
Cons
- Adds significant complexity to the compiler
Use differently named functions for different types (like OCaml)
Pros
- No additional complexity added to the compiler
Cons
- Inconvenient for users
- Does not solve (==) well as it is often implemented for lots of types. Ocaml has a single (==) operator which takes any any two values as long as they have the same type which works most of the time but then throws errors when comparing functions which is not very friendly.

Allow types defined in the host language to be pattern matched on

Currently only "gluon types" (records and enumerations) can be matched on in case expressions. Allowing userdata types to be matched on would let them integrate better.

Store arrays more efficiently

Currently arrays are just stored as Array<Value> which wastes a lot of space as the tag information in the enum is duplicated. A better way would be to have multiple array types (Array, Array, Array and maybe even Array if a byte type were added).

Though the idea is straightforward the implementation may have some subtleties. For instance consider the following code

let twice x = [x, x]
twice 1
twice { x = 1 }

When constructing an array in the twice function the tag of x needs to be inspected so the correct array can be constructed. A similar problem comes when indexing an array where the code has to check the type of the array to construct the correct Value variant. Theses issues aren't to complicated but there may be other problems hiding as well.

Consider renaming case ... of to match ... with

case ... of expressions are a hold over from the language's roots in Haskell but currently the syntax has more in common with F# and Ocaml which uses match ... with. Rust also uses match but that could be both an argument for (to make it more similiar) or against (the syntax still differs so having case instead could be a good distinction.

Consider having a special syntax for macros

Macros currently just look like functions import "test.glu"which can easily lead to confusing behaviour as Macros override any other bindings (essentially making macros reserved key words). Rust ends all macros with a ! which makes macros distinct and does essentially namespaces macros. Copying this behaviour may be a good idea and shouldn't require to much effort to implement

Improve sharing of types and kinds

Anytime Type::int, Type::float, Kind::star, is called a new reference count type or kind is allocate which is rather wasteful. As these types are quite common they could be cached in the typechecker (and else where) so that the same pointer could be reused. The kindchecker already does some of this but in the typechecker this is not done.

https://github.com/Marwes/embed_lang/blob/master/check/src/kindcheck.rs

PrimitiveEnv might be possible to extend for this (though it was originally created to retrieve a canonical boolean).

Don't allocate unit values

() should not need allocation

Allow use of non-Sync data in the interpreter

To allow the interpreter to be used safely in multiple threads the interpreter needs to implement Sync and in turn this means that all values it owns also needs to be Sync. This can be problematic though as this requires mutable data to either just use atomic operations or using Mutex/RwLock which has more overhead than Cell and RefCell.

A possible way of allowing this is to create two ways of spawning gluon threads. A sync thread would work exactly as it does now. A non-sync thread on the other hand is allowed to contain non-sync data but it has some additional restrictions to make it safe.

// Thread does not implement `Sync` or `Send by default`
impl Thread {
    fn new_sync_thread() -> RootedSyncThread;
    fn new_thread() -> RootedThread;

    fn new_child_thread(&self) -> RootedThread;
    // A `&Thread` can be converted into a thread implementing `Sync` if it was spawned as a sync thread
    fn to_sync_thread(&self) -> Result<RootedSyncThread>;
    // It is always possible to convert a `Thread` into a `RootedThread` same as now
    fn to_rooted_thread(&self) -> RootedThread;
}

impl RootedSyncThread {
    fn new_child_sync_thread(&self) -> RootedSyncThread;
}

Restrictions:

Non sync threads can only spawn other non sync threads - Necessary as any child threads can access data from a parent
Non sync threads can only send sync data upwards to a parent thread - The data would of course still need to be copied but care also needs to be taken that non non-sync data is created.

I don't feel confident about this being all restrictions necessary so I will leave this as a tracking issue in which I can consolidate more data before actually starting on an implementation.

Flesh out the C api

Freeing garbage collected objects does not call drop

Implement Applicative and Functor for State and Writer

I reckon it would be nice to make Monad inherit from Applicative, and to remove return in favor of pure. In order to do that however, we need to implement the missing interfaces for the State and Writer types.

Only use the (->) operator for function type constructors

At the moment the Gluon's syntax overloads the (->) operator for three things:

as a function type constructor
to separate arguments from the body in lambdas
to separate the pattern from the body match arms

It would be nice to use (=>) for the latter two cases to disambiguate this, and to leave us open to unifying the type and value syntaxes in the future.

For example:

 type Functor f = {
     map : (a -> b) -> f a -> f b
 }

 let functor_Option : Functor Option = {
-    map = \f x ->
-        match x with
-            | Some y -> Some (f y)
-            | None -> None
+    map = \f x =>
+        match x with
+            | Some y => Some (f y)
+            | None => None
 }

Simplify the Type representation and reduce its heap usage

Currently there are quite a lot of different representation in the Type enum this leads to complications in unification (and elsewhere) as types that should be equivalent can have multiple representations. To some extent these different representations are necessary either for convenience or for efficiency (Function(Int, Float) is a lot nicer to work with than App(App(->, Int), Float)).

Currently the App variant only exists to avoid allocation a one element vector for single argument constructors which are very common but with smallvec (or equivalent) the App variant could be completely superseded by Data.

Other simplifications may be to replaceFunction(a, b) with Data(-> [a, b]) and Array(a) with Data([], [a].

Marking this as beginner as this change (though maybe a bit tedious) should just involve replacing all uses of Type::App(a, b) with Type::Data(a, [b]).

Checklist

Rename the language

embed_lang is just a working title, need to think of a better name.

Allow type signatures to be placed above declarations, ala-Haskell

As discussed on /r/rust, I've always found that typed declarations in MLs get long fast. Leads to people writing code that relies too much on global type inference, and is therefore is hard to figure out later. :(

let my_long_function : LongType -> LongerType -> LongestType -> Return
    my_long_function x y z = ...

Add proper scoping for types

Currently worked on in the https://github.com/Marwes/embed_lang/tree/associated_types branch.

The idea is that record types can have both value and type fields so the prelude module might have the type defined as (making records similiar to first class modules in OCaml):

{ Int, Float, Option, Result, id, map, /* etc */ }

Types can then be brought into scope in the same way as variables

let { Int, num_Int = { (+), (-), (*) } } = prelude
and fac n : Int -> Int = ...

There is certainly going to be some issues with this approach which hasn't been considered yet but I think the basic idea is sound.

Merge duplicate calls of make_Monad, make_Applicative etc

As mentioned in #56 each call to make_Monad or similiar functions make a completely new set of functions for each invocation. These could be removed with optimizations but that is a long way away of being a solution. An alternative solution may be to add a bake macro which functions as follows.

This:

let prelude = import "std/prelude.glu"
...
let monad = bake (prelude.make_Monad prelude.monad_IO)

Becomes this:

// This expression is evaluated first and stored in a global location
let prelude = import "std/prelude.glu"
prelude.make_Monad prelude.monad_IO

Then this expression can access the cached value.

let prelude = import "std/prelude.glu"
...
let monad = make_Monad_IO_global)

The bake macro would function a bit like import in that it evaluates its argument once and caches the result in some globally accessible place. It becomes far trickier though as it would need to track from where all identifiers come from to be able to reconstruct a valid expression.

I may have been a bit to optimistic about this solution as taking it to the limit is more or less as much work as implementing a proper optimization to do this. It might be that this handles 90% of the cases though

Add a bytecode loader

This would allow the vm crate to be included and used separately though limited to only loading pre-compiled bytecode. For starters it doesn't need any checking for bytecode validity but eventually it would be useful to have safe loading as well.

Make it possible to test the REPL

As more features gets added to the REPL it would be nice to improve the testing for it. The REPL can however be tricky to test as one needs to be able to send commands to the terminal programmatically. In python I found the pexpect library which seems to be able to do this. It is a bit of a heavy weight solution to take on a python dependency just for some tests though and it would really nice if a library like pexpect could be ported to RUST. I don't have a good estimate to how much work such a port would be but it might be an interesting project to do (which would likely be useful outside of gluon as well)!

Precedence of operators

Currently the precedence for common operators such as + are predefined in the parser which makes it possible to at least write arithmetic without any surprises. This won't scale for user defined operators however so language construct needs to be added which binds precedence to an operator.

The current line of thinking is to attach fixity to bindings and let them propagate where they are used as long as it is not ambigious.

let { (+) } = 
    let lassoc 3 (+) = ....
    in { (+) }
in /* assoc is still 3 for + here */

Allow nested block comments

Nested block comments (e.g. /* /* foo */ */) are my favorite feature from Rust. It would be nice if Gluon supports them too.

Suggest improvements to the documentation

There is still a lot of parts of gluon which lack documentation and while I will do my best to recognize and improve it where I can, it would be very helpful to get suggestions for which part I should prioritize. Opening a new issue for any needed documentation improvements is often still a good idea but consider this issue a place to add suggestions which do not seem to warrant a separate issue.

Example

//Experimental syntax
let { * } = prelude.num_Int in 1 + 2 * 3  - 4

gluon-lang / gluon Goto Github PK

gluon's People

Contributors

Stargazers

Watchers

Forkers

gluon's Issues

Pros

Cons

Pros

Cons

Checklist

Example

Recommend Projects

Recommend Topics

Recommend Org