Git Product home page Git Product logo

canrager's Projects

algorithms icon algorithms

Coding exercises in algorithms and data structures

articulate_rules icon articulate_rules

Investigating the ability of LLMs to articulate rules for classification tasks they can solve.

chess_llm_interpretability icon chess_llm_interpretability

Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.

clas icon clas

Circuit-Level Activation Steering

eap icon eap

Edge Attribution Patching for TransformerLens

expert-sae icon expert-sae

Examining the role of experts in MoE models using dictionary learning.

feature-clustering-webapp icon feature-clustering-webapp

Unsupervised method for clustering sentences with diverse contexts into groups where similar features are involved for next token prediction.

indirect_object_identification icon indirect_object_identification

Replicating the results of the paper "Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small" by Wang et al.

nanogpt icon nanogpt

Rebuilding a minimalistic version of GPT along Andrej Karpathy's lecture

nnsight_workshop icon nnsight_workshop

Playground for manipulating activations in large language models with the nnsight module by BauLabs.

ravel icon ravel

Evaluate interpretability methods on localizing and disentangling concepts in LLMs.

sparse-coding icon sparse-coding

Improvements on dictionary learning for language model interpretability: (1) training and (2) evaluation of sparse autoencoders.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.