Comments (10)
@bpoldrack I used the term. There was no intent other than to communicate what a data handle is. The content is very much in flux. If you have a problem with it, please suggest language that is a clear win over it.
---Alex
from datalad.org.
It is true that we stopped using that. However, when reading it again, I found that in this particular place it is more useful than "dataset" to convey the idea that our datasets are not the same as a tarball -- without going down the rabbit hole of explanation. Just the right vibes....
@mih I was using "data handle" in the sense that a given file can be transparently acquired over a myriad of different protocols and mirrors. Not sure what else to call it, because I find the term "special remote" to be uninformative.
---Alex
Well, that's a good point, @mih.
I don't really have a problem with the term, Alex. I just wondered whether we introduce confusion here, if we don't use it elsewhere anymore.
@bpoldrack No worries. I was just explaining, and was unaware that "data handle" had been deprecated.
If no one has a problem, I'll close this.
---Alex
I tripped on it as well but forgot to complain. I don't think people associate dataset with tarball (re @Hanke). I know that they know the term dataset. I know that they most likely don't know the term data handle. If we could avoid introducing new terms on the main page, it would be easier to consume/digest and less scary. So I would keep this open.
@yarikoptic What intuitive term do you suggest for a placeholder that stands in for a file, isn't itself the file, and can acquire the actual data from potentially a wide range of sources?
---Alex
This is an implementation detail AFAIK and as such not worth putting on the front page. That sentence would read more smoothly without it... what about something like:
Leveraging git-annex and taking git's concept of decentralization a step further, DataLad enables scalable and version-controlled management of datasets, efficient collaboration with selective acquisition and sharing of data across dataset instances via a variety of transport methods (HTTP, FTP, rsync, and more). Learn more on how DataLad relates to Git and git-annex.
@yarikoptic I'm ok with dropping that. Could you please reword your proposed sentence so that it doesn't list a pair of pairs? The easy fix to your sentence results in 3 confusing "and"s in a row.
DataLad enables A) the 1) [scalable] and 2) [version-controlled management] of datasets and B) efficient collaboration with selective 1) [acquisition] and 2) [sharing] of data across dataset instances via a variety of transport methods (HTTP, FTP, rsync, and more).
And I would recommend dropping the word "instances" from "dataset instances".
---Alex
@yarikoptic Ignore the previous suggestion. That paragraph has been deleted from the main page, so better wording would only be needed if the text were resuscitated.
Closing as "data handles" are no longer mentioned anywhere.
---Alex