stellaathena Goto Github PK

followers: 443.0 following: 20.0 repos: 65.0 gists: 5.0

Name: Stella Biderman

Type: User

Company: Booz Allen Hamilton, EleutherAI

Bio: Democratizing language models and understanding how they work

Twitter: blancheminerva

Blog: www.stellabiderman.com

Hi there 👋, my name is Stella Biderman

I'm an AI researcher seeking to understand how large language models work better.

🔭 I’m currently working on language model interpretability with Pythia
🤔 I’m looking for help with statistical models of learning dynamics and designing custom datasets to test theories about language models.
💬 Ask me about training large language models
😄 Pronouns: she/her

Catch me on:

Google Scholar Twitter Stack Exchange

Some stats:

Stella Biderman's Projects

annotated-transformer

http://nlp.seas.harvard.edu/2018/04/03/attention.html

auditing-text-generation

Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)

backdoored_transformers

big-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

cc-licenses

Creative Commons Licenses for Github

city-circuits

client

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

deeperspeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

deepspeedexamples

Example models using DeepSpeed

document-winnowing

Implementation of the plagiarism-detection algorithms behind MOSS

egnn-pytorch

Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch

eleuther.ai

equivariance

A framework for implementing equivariant DL

feschenko

My personal repo.

$fractal-ml icon$ fractal-ml

Fun stuff with fractal machine learning

gnn-meta-attack

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

gpt-neo

An implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

graph-universal-attack

graph_nets

Build Graph Nets in Tensorflow

hsmm

:exclamation: This is a read-only mirror of the CRAN R package repository. hsmm — Hidden Semi Markov Models

huggingface.js

Utilities to use the Hugging Face Hub API

impact

ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated latex template

lichless_chess_scraper

scrapes data from https://database.lichess.org/ and converts it to json

llama

Inference code for LLaMA models

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

magicdraftbot

My Initial Attempt at a Magic: the Gathering Draft AI

magma

MAGMA - a GPT-style multimodal model that can understand any combination of images and language

stellaathena Goto Github PK

Hi there 👋, my name is Stella Biderman

Catch me on:

Some stats:

Stella Biderman's Projects

Recommend Projects

Recommend Topics

Recommend Org