Git Product home page Git Product logo

Comments (2)

rbjanis avatar rbjanis commented on September 18, 2024

I have a similar organizational issue! My work collects annual data sets, which we've been storing in a central Data folder. These data files get used in numerous projects, and it's been difficult to know how to store data and other project files that still works with RStudio Projects. Besides copying the data files into the folders from each project, is there a better way to handle this that avoids having many versions of data files that can easily get out of sync with the "true" data file in the Data folder?

from wtf-2019-rsc.

jennybc avatar jennybc commented on September 18, 2024

I'll recap an in-person conversation here. There are a few points to consider:

  • If you have a dataset that is used in many projects, the dataset should probably be its own "thing".
  • You might make into a proper R package (a data package, like gapminder or babynames, but for your data). You could still keep it personal to you or your group, i.e. it doesn't have to go to CRAN. But the packaging infrastructure gets you a lot of useful structure.
  • Otherwise, you could create a symlink (a.k.a. shortcut or alias) from the central data store into the relevant projects. This makes the data look local to each project, but keeps you from copying it several times. I.e. you'd still have a single source of data truth. Some more words about this here: https://community.rstudio.com/t/project-oriented-workflow-setwd-rm-list-ls-and-computer-fires/3549/35?u=jennybryan

from wtf-2019-rsc.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.