Git Product home page Git Product logo

Comments (3)

BirkhoffLee avatar BirkhoffLee commented on May 27, 2024

Here's the python code shared by someone on Kaggle that transforms the raw .mat files into numpy arrays of brain tumor MRI images: https://www.kaggle.com/code/tasni18/brain-tumor-classification

from datasets.

ccl-core avatar ccl-core commented on May 27, 2024

Hello @BirkhoffLee and thank you for raising this issue!

Are you planning to add this dataset to TFDS yourself? If yes, you can follow this guide to adding a dataset.

As an example, you can refer to this recent commit that introduced the Databricks Dolly dataset.

from datasets.

BirkhoffLee avatar BirkhoffLee commented on May 27, 2024

Hello @BirkhoffLee and thank you for raising this issue!

Are you planning to add this dataset to TFDS yourself? If yes, you can follow this guide to adding a dataset.

As an example, you can refer to this recent commit that introduced the Databricks Dolly dataset.

I'd love to, but I have a few questions:

  1. Removal of some data. I currently use the dataset on an image classification research project. The original dataset was published with MATLAB format. I have extracted the images as .PNG files (i.e.: removing some data in the orig dataset). Can I keep it as-is in the TFDS repo? To be more specific, only retaining cjdata.label and cjdata.image.
  2. Training split. The original dataset does not split the data for training and testing. How am I supposed to handle it in this repo?
  3. Hosting. Does the TFDS / Tensorflow project offer any place to store the dataset files? I do not see other datasets hosted here.

I have another dataset that I wish to be added into this repo: https://www.kaggle.com/datasets/sartajbhuvaji/brain-tumor-classification-mri. If I can have guidelines clarified then I'd be able to add it as well.

I'm new to the sector and apologies for any naive questions that I may have above, however I do wish to contribute to this repo because it makes research a lot easier. Much thanks :-)

from datasets.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.