|
|
──────────────────────────────────
Oryx — Programming Made Better
──────────────────────────────────
Oryx is named after the oryx animal. This means that when referring to
Oryx the programming language in languages other than English you should
use the given language’s translation of the animal’s name (e.g. ‘Órix’ in
Portuguese or ‘Όρυξ’ in Greek) as opposed to using the English name.
Oryx is intended to be a sane programming language for serious software
development. To be more specific Oryx aims to be the ideal language for
general-purpose application development for modern systems. We do not
waste our time attempting to support or perform well on legacy systems or
on your dishwasher.
Oryx assumes that the programmer is competent, and allows the programmer
to do what the programmer wishes to do without getting in their way.
Oryx rejects the notion that your tools need to be actively defensive,
and assume that you are a web developer that lacks real programming
skills.
Oryx also aims to be a very simple language. Learning most of the
languages useful features should be possible within a day of
experimentation, and language features and syntax should be as consistent
and common-sense as possible.
──────────────────────
Build Instructions
──────────────────────
Building the Oryx compiler is rather trivial. The steps are as follows:
1. Install the LLVM libraries and -headers. They should be available
through your systems package manager. Do note that as of 17/07/2024
the version of LLVM being utilized is 18.1.x. The compiler may work
with other versions, but it isn’t guaranteed.
2. Install Gperf. It should be available in your systems package
manager, and if not then you can easily find instructions online to
build from source. Gperf 3.1 is the oldest version that is actively
tested on.
3. Clone the compiler repository.
$ git clone https://github.com/Mango0x45/oryx.git
4. Bootstrap and run the build script.
$ cc -o make make.c
$ ./make # See below for more details
If you followed the above steps, you should find the compiler located in
the root directory of the git repository under the name ‘oryx’.
The build script takes a few optional parameters that might be of
interest. They are as follows:
-F Force rebuild the compiler and its dependencies in vendor/.
-f Force rebuild the compiler but not its dependencies in vendor/.
-r Build a release build with optimizations enabled.
-S Do not build with the GCC sanitizer. This option is not required
if -r was specified.
The build script also accepts some subcommands. They are as follows:
clean Delete all build artifacts and compiled binaries.
distclean Delete all build artifacts and compiled binaries, as well
as those creates by any dependencies in vendor/.
test Run the tests in test/. This subcommand should only be
run after a regular invocation of the build script so
that the tests get compiled.
──────────────────────────────
Existing Language Features
──────────────────────────────
1. The following datatypes are supported. The unsized integer types
default to the systems word size (typically 64 bits). The rune type
is an alias for the i32 type and serves a purely semantic purpose.
In the future it will be a distinct type.
/* Integer types */
i8, i16, i32, i64, i128, int
u8, u16, u32, u64, u128, uint
rune
/* Floating-point types */
f16, f32, f64, f128
2. C-style block comments. Line comments are intentionally not
included.
3. Declaration of mutable variables with optional type-inference. The
syntax is simple and consistent regardless of if type-inference is
used or not. Variables are also zero-initialized unless ‘…’
(U+2026 HORIZONTAL ELLIPSIS) or ‘...’ is given as a value.
x: int; /* Declare a zero-initialized integer */
x: int = 69; /* Declare an integer and set it to 69 */
x: = 69; /* Same as above but infer the type */
x := 69; /* Recommended style when inferring types */
x: int = …; /* Declare an uninitialized integer */
x: int = ...; /* Same as above when Unicode is not possible */
When declaring an uninitialized variable, the recommended style is to
use U+2026 HORIZONTAL ELLIPSIS. If you cannot bind that codepoint to
your keyboard, you should investigate the key-remapping faculties of
your text editor. For example, (Neo)Vim users may try the following:
inoremap ... …
" or if you don’t like the above…
inoremap <C-.> …
4. Declaration of constant variables with optional type-inference
including constants of arbitrary precision. The syntax is
intentionally designed to be consistent with mutable variable
declaration.
Constants are unordered, meaning that a constant may refer to another
constant that is declared later in the source file.
FOO: u8 : BAR
BAR: u8 : 69;
REALLY_BIG :: 123'456'789'876'543'210;
pub my_func :: () int {
return BAR;
}
5. Constants of arbitrary precision (overflow is not possible), with ‘'’
(U+0027 APOSTROPHE) as an optional digit seperator.
REALLY_BIG :: 123'456'789'876'543'210;
6. No implicit type conversions between types. This includes between
different integer types which may have the same size (i.e. int and
int64)
pub my_func :: () {
x: int = 69;
y: i64 = x; /* Compile-time error */
}
7. Nested functions are supported, but not closures. Closures will
never be supported in the language.
/* Recall that constants (including functions!) can be declared
in any order. This lets us define inner *after* it gets
called by the assignment to ‘x’. */
outer :: () {
x := inner(5);
inner :: (x: int) int {
return x;
}
}
8. No increment/decrement operators. The following functions both
return 42 as the return values are parsed as (+ (+ 42)) and
(- (- 42)) respectively.
x := 42;
returns_42 :: () int {
return ++x;
}
returns_42′ :: () int {
return --x;
}
9. Assignment statements (not expressions). Unlike in C, you cannot put
an assignment inside of an expression.
return_42 :: () int {
x := 4;
y := 2;
x = x*10 + y;
return x;
}
Due to quirks of the language grammar identifiers may be wrapped in
(arbitrary levels of) parenthesis, however assignments are only
permitted if the left-hand side with parenthesis removed is a lone
identifier. The rationale behind this is to allow in the future
assignments to expressions that return pointers.
((x)) = x*10 + y; /* legal */
(true ? x : y) = x*10 + y; /* illegal */
10. Static local variables allow for block-scoped global variables. This
is useful for having function state persist across multiple calls.
iota :: () int {
static x := -1;
x = x + 1;
return x;
}
pub main :: () {
zero := iota();
one := iota();
two := iota();
three := iota();
}
|