Git Product home page Git Product logo

eda-and-modelling-of-retracted-papers's Introduction

EDA and Modelling of Retracted Papers

Project Overview

This project performs exploratory data analysis (EDA) and predictive modeling on retracted papers. The aim is to understand the characteristics of retracted papers and to develop models to predict retractions.

Directory Structure

your-github-repo/
│
├── README.md
├── requirements.txt
│
├── data/
│   ├── raw/
│   └── processed/
│
├── src/
│   ├── eda/
│   │   └── EDA_retraction.py
│   ├── modeling/
│   │   ├── Preparation_modelling.py
│   │   ├── predictive_modelling_approach_2.py
│   │   ├── predictive_modeling_approach_3.py
│   │   └── confusion_matrix_random_forest.py
│
├── results/
│   ├── figures/
│   └── reports/
│
└── scripts/
    ├── run_modeling.py
    └── run_all.py

Setup Instructions

Prerequisites

  • Python 3.6 or higher
  • Git (optional, for cloning the repository)

Steps to Run the Project

  1. Clone the Repository:

    git clone https://github.com/your-username/your-repo.git
    cd your-repo
  2. Create a Virtual Environment (Recommended):

  3. Install Dependencies:

    pip install -r requirements.txt
  4. Run the python file: For example

    python scripts/run_all.py

Explanation of Key Scripts

EDA_retraction.py

This script performs exploratory data analysis on the retracted papers dataset. It generates visualizations and descriptive statistics to understand the characteristics of the data. Before this data cleaning has been done.

Preparation_modelling.py

This script prepares the data for modeling by preprocessing and transforming the dataset. It ensures the data is in the correct format for the predictive models.

predictive_modelling_approach_1.py

check https://github.com/bibekdhakal/research-retraction

predictive_modelling_approach_2.py

This script implements the second approach for predictive modeling. It trains and evaluates a machine learning model to predict retractions.

predictive_modeling_approach_3.py

This script implements the third approach for predictive modeling. It trains and evaluates another machine learning model to predict retractions. Applying clustering techniques (e.g., K-Means) to group similar data points, thereby capturing underlying patterns in the data.

confusion_matrix_random_forest.py

This script generates a confusion matrix for the Random Forest model. It evaluates the performance of the model and visualizes the results.

run_all.py

This script orchestrates the execution of all the key scripts in the correct order. It ensures that the entire workflow from data preparation to model evaluation is completed.

Results

The results of the analysis and modeling are saved in the results directory. This includes figures. Based on this a report has been made using Texmaker.

Contact

For any questions or issues, please contact at [email protected]

eda-and-modelling-of-retracted-papers's People

Contributors

mahesh989 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.