Comments (7)
Also, if you can confirm that (i) I listed the correct references and (ii) the datasets I restored are correct (i.e. not out of date relative to your latest commit), that would be helpful.
from matminer.
Yeah, I'll look into this today and make those updates. I'll see if I can make the dataframes exactly the same (i.e. the same columns). Sorry I didn't get the tests and references done when I first wrote up the datasets package; I was rushing a bit too much.
Also, one of the reasons git lfs is useful (and why we will probably need a new repo or data storage repo if we do a bunch of big datasets) is that making big changes to big files can make the git log very large (e.g. every time I commit changes to the CSV files I could be changing like a MB of data, which all has to go into the git log). This is fine for just these three datasets if we can get them the way we want them and leave them that way, but we might want to get rid of the git log for those datasets to save space once we finalize them.
Hi Kyle
Sure, we can use something like git BFG to clean out the logs later. I'll probably want to do this for some of the IPython notebooks too since those are really the biggest culprit. For now I think it's OK.
It looks like the citations and data are good to go, except that you listed the dielectric paper as 2017, whereas the link I originally downloaded it from says 2016. I'll fix that when I add the additional columns.
Hi @computron ,
I pulled the useful descriptors out of the meta dictionary for the piezoelectric and dielectric datasets, and I fixed the date for the dielectric citation. Everything should be up to date, but I saved an exact copy of the previous datasets in case something gets messed up.
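For anyone following along, here is a minimal sketch of what pulling descriptors out of a meta dictionary could look like. It is not the code used for these datasets: the record layout, the `promote_meta` helper, the `USEFUL_KEYS` tuple, and all field names and values are assumptions made up for illustration.

```python
# Hypothetical records: each row keeps extra descriptors in a nested "meta" dict.
rows = [
    {"material_id": "example-1", "meta": {"formula": "SiO2", "nsites": 3, "notes": "raw"}},
    {"material_id": "example-2", "meta": {"formula": "GaN"}},
]

# Assumed set of descriptors worth promoting to top-level columns.
USEFUL_KEYS = ("formula", "nsites")


def promote_meta(row, keys=USEFUL_KEYS):
    """Return a copy of row with the selected meta entries as top-level fields."""
    # Copy every field except the nested dictionary itself.
    flat = {k: v for k, v in row.items() if k != "meta"}
    meta = row.get("meta", {})
    for key in keys:
        # Missing descriptors become None so every row has the same columns.
        flat[key] = meta.get(key)
    return flat


flat_rows = [promote_meta(r) for r in rows]

# Check that all rows expose an identical set of columns.
columns = [sorted(r) for r in flat_rows]
assert all(c == columns[0] for c in columns)
```

Filling missing descriptors with `None` instead of skipping them is what keeps the columns identical across rows, which is the property the datasets are meant to share.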
Thanks! I updated the unit tests.
Ah shoot sorry, I updated them too but forgot to push them. Thanks!
Related Issues (20)
- Materials Project time split dataset - `load_data_from_json` returns `None` during debugging (conditionally)
- `matminer.datasets.utils._validate_dataset()` flaky on Windows?
- [FEATURE REQUEST] SkipAtom compositional featurizer
- AttributeError: 'DensityFeatures' object has no attribute 'desired_features'
- SOAP features
- Suggestion: OPTIMADE data retriever
- Fail to approach MPData
- Handling NaNs from ElementProperty
- CI failing due to broken mongo service
- Fixing matminer's multiprocessing problem
- New release 0.9.0?
- WenAlloy wrong valence electron counts
- mp-api for MPDataRetrieval needs upgrade badly
- Missing compatibility with pandas v2
- Issue link to matsci.org broken
- compatibility request: pandas-2+
- Error in import composition
- Simple composition-based featurization fails due to an upgrade in pymatgen
- Re-enable tests that are skipped in CI
- when I import matminer, mistake as following: ValueError: Unexpected atomic number Z=119.