ltc

An ambitious project to archive one million Letters To Crushes.

#119507: What other 16-year-old girl would ask her neighbors to borrow blankets so we could finish our fort? I love you more than endless Swedish Fish.

A website and more experimental media to present the letters:

  • /site: a lightning fast website to browse the letters
  • /experiments:
    • /discord-bot: a frontend for Discord, again, to browse letters
    • /machine: crap AI to write letters
  • /analysis: SQL, Sheets, and CSVs to analyze the letters

#668440: I think you'd like some of the music I recently discovered. I don't know, I'm too shy to do anything but orchestrate(ha) coincidences.

Open source?

NOT EVERYTHING IS OPEN SOURCE

/site/*, /experiments/discord-bot/*, and /analysis/* (both SQL scripts and results) will be open source; license TBD.

Dataset

Please Note: The dataset is not open source and there are no plans to open source it for the following reasons:

  • Touchy copyright issues. I don't own these letters, and I don't have the right to open source them in any more significant way than I already have
    • The authors of each letter released them expecting them to be read by a human one by one, not to be potentially abused automatically for evil
  • Touchy ethical issues. It's already dubious to be doing what I'm doing (misusing public APIs), let alone exposing those intimate moments of 200k people to the internet, to be analyzed and warped.
    • To some degree I have already done this by open sourcing the archival scripts, but it would take many days to obtain this dataset
  • The dataset contains many columns. For privacy and obvious reasons, I would need to:
    • Remove IP addresses
    • Shuffle ids
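
As a rough illustration only (the dataset is not being released), the two steps above might be sketched like this. The column names `id`, `ip`, and `body` are hypothetical, and "shuffle ids" is read here as replacing each id with a randomly assigned surrogate; the real schema and approach may differ.

```python
import random

def anonymize(rows, seed=None):
    """Drop the (hypothetical) "ip" column and replace ids with shuffled surrogates."""
    rng = random.Random(seed)
    # Surrogate ids 1..n in random order, so they say nothing about the
    # original ids or the order in which the letters were stored.
    new_ids = list(range(1, len(rows) + 1))
    rng.shuffle(new_ids)
    cleaned = []
    for row, new_id in zip(rows, new_ids):
        safe = {k: v for k, v in row.items() if k != "ip"}  # remove IP addresses
        safe["id"] = new_id
        cleaned.append(safe)
    # Sort by surrogate id so the output order does not leak the original order.
    cleaned.sort(key=lambda r: r["id"])
    return cleaned

# Made-up rows, not real letters.
rows = [
    {"id": 101, "ip": "203.0.113.7", "body": "first letter"},
    {"id": 102, "ip": "203.0.113.8", "body": "second letter"},
    {"id": 103, "ip": "203.0.113.9", "body": "third letter"},
]
anonymized = anonymize(rows, seed=0)
```

Even with both steps applied, the letter text itself can still identify people, which is part of why the dataset stays closed.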

That being said, I am happy to explore this medium. Open a pull request with a SQL script; if it deals only with aggregate data, I am happy to run it and upload the results as a CSV. I hope this will strike a happy medium.

  • Open data
  • NOT available for evil or privacy infringement
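
To give a feel for what an acceptable pull request could contain, here is a minimal sketch of an aggregate-only query being run and written out as CSV. The `letters` table, its columns, and the `run_to_csv` helper are all assumptions made for the example, and SQLite is used only because it ships with Python; the real schema and database are not published.

```python
import csv
import io
import sqlite3

# Hypothetical schema with made-up rows; the real table is not public.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE letters (id INTEGER PRIMARY KEY, posted_year INTEGER, body TEXT)"
)
conn.executemany(
    "INSERT INTO letters (posted_year, body) VALUES (?, ?)",
    [(2010, "a"), (2010, "bb"), (2011, "ccc")],
)

# Aggregate-only: the result holds counts and averages, never a letter's text.
AGGREGATE_SQL = """
SELECT posted_year, COUNT(*) AS letters, AVG(LENGTH(body)) AS avg_length
FROM letters
GROUP BY posted_year
ORDER BY posted_year
"""

def run_to_csv(connection, sql):
    """Run a query and return its result as CSV text, header row included."""
    cursor = connection.execute(sql)
    out = io.StringIO()
    writer = csv.writer(out)
    writer.writerow([col[0] for col in cursor.description])
    writer.writerows(cursor)
    return out.getvalue()

result_csv = run_to_csv(conn, AGGREGATE_SQL)
```

A query that selects `body` directly, or groups so finely that each row maps to one letter, would not count as aggregate in this sense.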

Toolkit

Source available, NOT OPEN SOURCE (that means you are not allowed to use it). I don't want you to scrape the site, for the reasons above.

Migrations

I hate writing migrations, and they really only exist to make your life easier, and I really don't want to do that.

ltc's People

Contributors

boehs

ltc's Issues

scrape hidden letters

Not returned via the API, but often available with direct navigation unless truly deleted.

archive all comments

(including disqus?!)

Normal Comments

  • 25k/325k
  • 50k/325k
  • 75k/325k
  • 100k/325k
  • 125k/325k
  • 150k/325k
  • 175k/325k
  • 200k/325k
  • 225k/325k
  • 250k/325k
  • 275k/325k
  • 300k/325k
  • 325k/325k

Disqus

??/??
