Git Product home page Git Product logo

Sebastian Müller's Projects

dspy icon dspy

DSPy: The framework for programming—not prompting—foundation models

guarantees icon guarantees

Python: guarantee test coverage, guarantee type and runtime-guarantees

hlb-cifar10 icon hlb-cifar10

Train to 94% on CIFAR-10 in ~6.84 seconds on a single A100, the current world speed record. Or ~95.78% in ~114 seconds (or less!)

hlb-gpt icon hlb-gpt

Minimalistic, fast, and experimentation-friendly researcher's toolbench for GPT-like models in ~<365 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in ~138 seconds.

hlb-gpt-value-activation icon hlb-gpt-value-activation

Check out how much of a difference the activation of the value makes vs. keeping it linear as in standard attention

kan icon kan

Ablate KAN and Fourier KAN vs. normal Linear Layers in LLMs

mask icon mask

Some experiments with Attention masks

neuralsort icon neuralsort

Sort lists with the help of an ANN to allow maximal parallelism in execution.

parameter-checks icon parameter-checks

Extend typehints to include dynamic checks (that might otherwise be dealt with by assertions) in Python.

rebasin icon rebasin

Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)

sglang icon sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable.

torch-nested icon torch-nested

Easily manipulate torch.Tensors inside highly nested data-structures.

typing-exe icon typing-exe

Executable typehints for Python: make assertions about and/or modify parameters & return values

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.