Git Product home page Git Product logo

iclabel-dataset's Introduction

ICLabel Dataset


What is ICLabel?

ICLabel is a project aimed at advancing automated electroenephalographic (EEG) independent component (IC) classification. It is comprised of three interlinked parts: the ICLabel classifier, the ICLabel website, and this dataset. The website crowdsources labels for the dataset that in turn is used to train the classifier.

See the accompanying publication [Coming Soon].


What is this dataset?

The ICLabel dataset contains an unlabeled training dataset, several collections of labels for small subset of the training dataset, and a test dataset 130 ICs where each IC was labeled by 6 experts. In total it is comprised of features from hundreds of thousands of unique EEG ICs (millions if you count similar ICs from different processing stages of the same datasets). Roughly 8000 of those have labels, though the actual usable number is typically closer to 6000 depending on which features are being used. The features included are:

  • Scalp topography images (32x32 pixel flattened to 740 elements after removing white-space)
  • Power spectral densities (1-100 Hz)
  • Autocorrelation functions (1 second)
  • Equivalent current dipole fits (1 and 2 dipole)
  • Hand crafted features (some new and some from previously published classifiers)

The original time series data are not available. All that is provided is included in this repository as is. I realize having the original time series would make this dataset much more versatile, but unfortunately that's not possible.


Usage

  1. Load the class, passing any desired options.
  2. Load the dataset.

Example:

icl = ICLabelDataset()
icl.download_trainset_features()
icldata = icl.load_semi_supervised()

iclabel-dataset's People

Contributors

lucapton avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Forkers

jacobf18

iclabel-dataset's Issues

Can't use the dataset :(

Hi! I'm working on my master thesis and trying to use your ICA labeled data to train my model to detect artifacts.
I keep getting 500 Error:
Downloading individual ICLabel training set CL label files...
Downloading label file 0 of 2...
HTTP Error: 500 https://labeling.ucsd.edu/download/ICLabels_experts.pkl
Downloading label file 1 of 2...
Done.
Loading full dataset...

and cant open features dataset:
`---------------------------------------------------------------------------
IOError Traceback (most recent call last)
in ()
----> 1 icl.load_data()

/Users/ivkitov/univer/diploma/diploma_code/data/ICLabel-Dataset/icldata.py in load_data(self)
954 self.check_for_download('train_features')
955 # topo maps, old psd, dipole, and handcrafted
--> 956 with h5py.File(join(self.datapath, 'features', 'features_0D1D2D.mat'), 'r') as f:
957 print('Loading 0D1D2D features...')
958 features.append(np.asarray(f['features']).T)

/Users/ivkitov/anaconda3/envs/python2Env/lib/python2.7/site-packages/h5py/_hl/files.pyc in init(self, name, mode, driver, libver, userblock_size, swmr, rdcc_nslots, rdcc_nbytes, rdcc_w0, track_order, **kwds)
406 fid = make_fid(name, mode, userblock_size,
407 fapl, fcpl=make_fcpl(track_order=track_order),
--> 408 swmr=swmr)
409
410 if isinstance(libver, tuple):

/Users/ivkitov/anaconda3/envs/python2Env/lib/python2.7/site-packages/h5py/_hl/files.pyc in make_fid(name, mode, userblock_size, fapl, fcpl, swmr)
171 if swmr and swmr_support:
172 flags |= h5f.ACC_SWMR_READ
--> 173 fid = h5f.open(name, flags, fapl=fapl)
174 elif mode == 'r+':
175 fid = h5f.open(name, h5f.ACC_RDWR, fapl=fapl)

h5py/_objects.pyx in h5py._objects.with_phil.wrapper()

h5py/_objects.pyx in h5py._objects.with_phil.wrapper()

h5py/h5f.pyx in h5py.h5f.open()

IOError: Unable to open file (truncated file: eof = 6178545152, sblock->base_addr = 512, stored_eof = 14419006177)`

Can you help me please?

Raw data of training dataset?

Idr if this was asked before but do you have access to the raw EEG data used to generate the features? Or the ICA time series?

Original Time Series Data

Is the original time series data in the ICLabel dataset as well? If not, is there a place I could access it?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.