The accelerated-zig-parser from validark

accelerated-zig-parser's Introduction

A little about me

Accelerated the Zig parser up to 3x 🚀🚀🚀
- ⚡ Zig is my personal favorite programming language
Assembly auditor
- Reduced the instruction count of must_be_2_3_continuation in simdutf/simdjson from 6 to 4 on x86-64 (with similarly small improvements on other architectures). Although this sounds trivial, this garnered a 4% performance uplift in utf8 validation!
📃 Invented a data structure [demo, paper] that improves upon prefix trees (i.e. tries) to solve the scored autocomplete problem orders of magnitude faster 🚀🚀🚀🚀🚀
Co-Developed the initial version of roblox-ts with @Osyris_rblx
- Created design to eliminate immediately-invoked function expressions (IIFE's)
- Ask me how to parse and transform your complex type in the TypeScript type-language!
Created the RoStrap project
- Certifiable material design enjoyer
My first programming language was Lua
- The __index metamethod makes the Aho-Corasick algorithm so clean!
Big-Theta connoisseur
- Did you know that a priority queue implemented as a 1-2-3 Skip list can perform the extract-min operation in amortized constant time?
  - And yes, insertions are still logarithmic! And any other value can be extracted in logarithmic time!

Click the following image for a demo of my data structure:

Favorite talks


Performance Matters (Strange Loop 2019)
Data-Oriented Design and C++
Practical Data-oriented Design

Exciting new tech

Mill instruction set architecture
- An in-order statically scheduled architecture that achieves the performance of an out-of-order superscalar with extremely innovative tricks
- Eats loops like goats eat underwear
LuaJIT Remake
- Automatically generates a blazingly fast interpreter and multi-level JIT compiler given only a semantic description of a language's bytecodes
Pijul version control system
- Based on the theory of patches and not slow like DARCS
Bun JavaScript runtime
- Blazingly fast runtime and toolkit for JavaScript

accelerated-zig-parser's People

Contributors

Stargazers

Watchers

accelerated-zig-parser's Issues

embedded as fallback without simd (1) and parser design (2-4)

As far as I understand, using Zig compiler on embedded (100-300MB RAM) is a potential use cases. Is compatibility without SIMD possible/planned and what would be slowdown and/or drawback?
I guess you plan to make the parser iterative to backtrack: Can the same be applied to zig fmt, which as of now recursively traverses things?
if yes: Incremental parsing planned/possible?
if yes: api to incremental parsing for language server error recovery/robust parsing?

I guess you have probably made up your mind about 1-2 with a draft impementation, so I wanted to ask about it.

Investigate slowdown of reading files

I randomly noticed that the runtime of reading files is now ~70ms, whereas it used to be ~30ms? Not sure what's going on there but something is not right.

more crazy idea for benchmarking: compare against optimal AST structure (found by applying simplex)

A relative optimal AST structure can be found from computing simplex with the derived constrains on the first AST parse + copy things over.

See also https://en.wikipedia.org/wiki/Simplex_algorithm. The general idea of non-recursive tree traversal is given here: https://stackoverflow.com/questions/28544980/data-oriented-tree-traversal-without-recursion/28616278#28616278.

I got the idea from user Prokop on discord, which was asking "is the adjacency of ast nodes in the node list used somehow to store information?".

split files by license

It sounds nicer to me to split Apache derived files and ideally put them into a package.
This would also allow eventual usage of SPDX ids via package manager. Opinions?

Afaik, SPDX does not allow explicit tagging of code ranges, so doing things differently would not be strictly standard-conform. However, feel free to disagree in what the package manager should use.

Recommend Projects

validark / accelerated-zig-parser Goto Github PK

accelerated-zig-parser's Introduction

A little about me

Favorite talks

Performance Matters (Strange Loop 2019)

Data-Oriented Design and C++

Practical Data-oriented Design

Exciting new tech

accelerated-zig-parser's People

Contributors

Stargazers

Watchers

accelerated-zig-parser's Issues

embedded as fallback without simd (1) and parser design (2-4)

Investigate slowdown of reading files

more crazy idea for benchmarking: compare against optimal AST structure (found by applying simplex)

split files by license

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent