Comments (10)

aqw commented on August 12, 2024

@bpoldrack I used the term. There was no intent other than to communicate what a data handle is. The content is very much in flux. If you have a problem with it, please suggest language that is a clear win over it.

---Alex

from datalad.org.

mih commented on August 12, 2024

It is true that we stopped using that. However, when reading it again, I found that in this particular place it is more useful than "dataset" to convey the idea that our datasets are not the same as a tarball -- without going down the rabbit hole of explanation. Just the right vibes....

aqw commented on August 12, 2024

@mih I was using "data handle" in the sense that a given file can be transparently acquired over a myriad of different protocols and mirrors. Not sure what else to call it, because I find the term "special remote" to be uninformative.

---Alex

bpoldrack commented on August 12, 2024

Well, that's a good point, @mih.
I don't really have a problem with the term, Alex. I just wondered whether we introduce confusion here, if we don't use it elsewhere anymore.

aqw commented on August 12, 2024

@bpoldrack No worries. I was just explaining, and was unaware of "data handle"'s deprecated status.

If no one has a problem, I'll close this.

---Alex

yarikoptic commented on August 12, 2024

I have tripped on it as well, but forgot to complain. I don't think people associate dataset with tarball (re @Hanke). I know that they know the term dataset. I know that they most likely don't know the term data handle. If we could avoid introducing terms on the main page, it would be easier to consume/digest and less scary. So I would keep this open.

aqw commented on August 12, 2024

@yarikoptic What intuitive term do you suggest that reflects the placeholder for a file that isn't actually a file that can acquire the actual data from potentially a wide range of sources?

---Alex

yarikoptic commented on August 12, 2024

This is an implementation detail AFAIK and as such not worth putting on the front page. That sentence would read smoother without it... what about something like:

Leveraging git-annex and taking git's concept of decentralization a step further, DataLad enables scalable and version-controlled management of datasets, efficient collaboration with selective acquisition and sharing of data across dataset instances via a variety of transport methods (HTTP, FTP, rsync, and more). Learn more on how DataLad relates to Git and git-annex.

aqw commented on August 12, 2024

@yarikoptic I'm ok with dropping that. Could you please reword your proposed sentence so that it doesn't list a pair of pairs? The easy fix to your sentence results in 3 confusing "and"s in a row.

DataLad enables A) the 1) [scalable] and 2) [version-controlled management] of datasets and B) efficient collaboration with selective 1) [acquisition] and 2) [sharing] of data across dataset instances via a variety of transport methods (HTTP, FTP, rsync, and more).

And I would recommend dropping the word "instances" from "dataset instances".

---Alex

aqw commented on August 12, 2024

@yarikoptic Ignore the previous suggestion. That paragraph has been deleted from the main page, so better wording would only be needed if the text is resuscitated.

Closing as "data handles" are no longer mentioned anywhere.

---Alex
