Git Product home page Git Product logo

Comments (17)

kdere avatar kdere commented on July 19, 2024 2

Public domain or CC0 sound like something that would be appropriate and I will see if I can convince the CHIANTI crowd. We are have a Skype in the middle of June.

from chiantipy.

namurphy avatar namurphy commented on July 19, 2024 1

@kdere - Is there any news about adding a license to the Chianti database? I recently ran across a website that describes the potentially serious disadvantages of having no license, and it would be really helpful for Chianti to have one!

I am wondering if the Creative Commons Attribution 4.0 International (CC BY 4.0) license would be the best choice for Chianti for the same reason that the database itself contains references to the source of the data. The attribution requirement would make sure that researchers in the future will be able to figure out the provenance of the data. This in turn will help reproducibility.

Thanks!
-Nick

from chiantipy.

namurphy avatar namurphy commented on July 19, 2024 1

They might be thinking of the CC0 license, which approximates putting the database into the public domain and gives anyone broad freedom to use, redistribute, and modify the work. The developers would effectively give up the copyright under this license, but would still be able to update and maintain the database. It sounds like the objection was related to this license.

The CC BY 4.0 license (or other Creative Commons licenses) would allow the Chianti developers to maintain the copyright but would allow others to use, redistribute, and modify the work as long as attribution is given. A license that allows derivative works would be necessary to allow users to put the database in HDF5 form or to put ionization and recombination rates into an eigenvalue matrix, for example. The attribution is important, in my opinion, because it requires users to provide metadata on where the data they use or adapted came from, which in turn leads to better science.

The Creative Commons site for choosing a license is a straightforward resource to navigate the different options.

As discussed earlier in this issue, the source code and database can and should be licensed separately, which would involve specifying what each license would cover. I've been thinking about this too much over the last two days for some PlasmaPy repositories that will contain both source code and written/graphical content. It would be possible to have language that says something like:

Source code files and code snippets provided in this distribution are licensed under the MIT license. All other content is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Copyright 199X-2017, CHIANTI developers. All rights reserved.

Thank you for discussing this with the CHIANTI team!

from chiantipy.

Cadair avatar Cadair commented on July 19, 2024 1

For the avoidance of all doubt, the relevant part of the licence is:


Section 3 – License Conditions.

Your exercise of the Licensed Rights is expressly made subject to the following conditions.

a. Attribution.

  1. If You Share the Licensed Material (including in modified form), You must:

A. retain the following if it is supplied by the Licensor with the Licensed Material:

i. identification of the creator(s) of the Licensed Material and any others designated to receive attribution, in any reasonable manner requested by the Licensor (including by pseudonym if designated);

ii. a copyright notice;

iii. a notice that refers to this Public License;

iv. a notice that refers to the disclaimer of warranties;

v. a URI or hyperlink to the Licensed Material to the extent reasonably practicable;

B. indicate if You modified the Licensed Material and retain an indication of any previous modifications; and

C. indicate the Licensed Material is licensed under this Public License, and include the text of, or the URI or hyperlink to, this Public License.

...

(emphasis mine)


The core part of CC-BY here is that if someone uses the database they must acknowledge that use of the database in a form that is specified by the CHIANTI team. (i.e. a list of all authors or just "The CHIANTI team").

In return for obeying these terms of the licence the CHIANTI team grants the user the right to modify, and redistribute the database. (Unlike at the moment where no rights are granted to the user, so the user can not actually use the database).

from chiantipy.

kdere avatar kdere commented on July 19, 2024

from chiantipy.

Cadair avatar Cadair commented on July 19, 2024

I agree on the citation requirement thing, although other open source packages have had issues with the specific wording of that "request" being interpreted as a "requirement" so one should tread carefully around such things.

What would a good way be to make the CHIANTI group consider a license? To my mind (and probably to that of any lawyer) "freely available" does not grant any rights for re-distribution or modification at all, and probably nothing else beyond download and read this (with your eyes).

from chiantipy.

dpshelio avatar dpshelio commented on July 19, 2024

I'll have to look more in detail to give a better suggestion from my side, check also a couple of sources because databases are in a special situation, don't know whether that was in US or in EU. Also, it probably matter where was done. The more permissive would be Public Domain, anyone could do whatever it pleased with it. However, I'm a bit confused between what "Package" means, is it the code? is it the database? is it both together? I imagine there is people who uses one or the other or both. For simplicity I would separate the two parts of CHIANTI.

from chiantipy.

Cadair avatar Cadair commented on July 19, 2024

The CHIANTI tarball includes both code and data I think, but they don't have to be licensed the same even if they are distributed together.

from chiantipy.

kdere avatar kdere commented on July 19, 2024

there are two separate tarballs, one for the database and one for the IDL code

from chiantipy.

wtbarnes avatar wtbarnes commented on July 19, 2024

But in the excerpt from the CHIANTI webpage that @Cadair cited above, does "package" mean the database, the code, or both? I think this is what @dpshelio is getting at as well.

To me it seems like the two should be licensed separately as they'll be used in very different ways.

from chiantipy.

wtbarnes avatar wtbarnes commented on July 19, 2024

This licensing issue seems intertwined with the issue of distributing the data. This is partly what is holding up sunpy/sunpy#1897 as ChiantiPy (and thus the CHIANTI database) cannot become a dependency of SunPy unless it can be installed with conda (e.g. from conda-forge).

@Cadair do you know of any way of distributing a database/dataset as an installable (pip or conda) package? Is there any precedent for this? I've done a fair bit of googling and have not found anything.

One possible alternative to distributing a tarball/(a bunch of files and directories) would be to repackage the CHIANTI database as a single HDF5 file. However, such a file still comes out to ~600 Mb. Though this could be compressed a bit, this still seems too large to just throw in conda-forge. Maybe it could be hosted somewhere else (or just grabbed from its current location) and then downloaded automatically?

Either way, the database must be licensed before it can be placed on conda-forge. Re: licensing, Creative Commons may be a good option though there is still the issue of getting the CHIANTI team to include a license when distributing the data.

Another possible issue: the database contains data from other sources, e.g. NIST, National Bureau of Standards, etc. (see footer in h_1.wgfa file), so could the CHIANTI team license this data even if they wanted? Would it need to be distributed under multiple licenses? Unsurprisingly, the NIST atomic database also appears to be unlicensed.

from chiantipy.

dpshelio avatar dpshelio commented on July 19, 2024

I think NIST ADS has a copyright and the software as Public Domain. I've looked around and I've found @HEPData from CERN that uses CC0 for their database. I would suggest to use the same for CHIANTI, but that has to be decided by them. I've also written to CDS asking what they do in their cases (I didn't find any mention on their website, only to "how to acknowledge it").

Regarding format, I would suggest something like sharing the text tables via zenodo (now it includes versioning!) so the data could be mined easily (probably you could even search it even from google). Software to convert that into something more usable (sqlite, hdf5, idl sav files,...) could be done as a external thing - via a package under @chianti-atomic.

from chiantipy.

kdere avatar kdere commented on July 19, 2024

I doubt that the CHIANTI team wants to put its resources into this. We don't really have that much funded time to work on CHIANTI and the database itself is in need up data upgrades. When we started, there was not such thing as zenodo and are formats are pretty well fixed aside from changing one or other as is needed.

from chiantipy.

dpshelio avatar dpshelio commented on July 19, 2024

At the moment, all it's needed from CHIANTI team is to choose a license (Public Domain, CC0 are the most permissive; CC-By would require people t acknowledge CHIANTI). Putting it in something like zenodo is secondary, but it would be pretty useful as you would get a DOI for easy reference to a particular version. I don't think there's need to change the format of the data files - they are plain text, so that's good.

from chiantipy.

wtbarnes avatar wtbarnes commented on July 19, 2024

@dpshelio Thanks for looking into this and for contacting CDS. I agree that CC0 and CC-By seem to be the most sensible choices. The statement on the CHIANTI webpage seems most consistent with CC-By.

@kdere I didn't mean to imply the CHIANTI team should change the format of the database. This is certainly nontrivial and I recognize time and funding are limited. Rather, once it is licensed appropriately, any one of us could upload it to Zenodo so that it could be assigned a DOI and be more easily accessed (though I'm still not sure if this really addresses the "database-as-a-dependency" problem). This data could be reshaped into any format we like (e.g. HDF5 as opposed to a tarball).

from chiantipy.

kdere avatar kdere commented on July 19, 2024

It has been difficult getting agreement on this within the CHIANTI team. I seem to remember a comment that one of the licenses would have us announce that we were giving up all claims(?) to the data forever and this did not sound righe

from chiantipy.

kdere avatar kdere commented on July 19, 2024

when posting on github-ChiantiPy, you are really only talking to me. It would be better to start a discussion on the chianti google groups which will be seen by the whole CHIANTI team. I would start off with why licensing the CHIANTI database is important, what you see has the best way to do it and how have other databases been licenses. One of our problems is that we do some calculations ourselves but the main source of data is from the journals.

from chiantipy.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.