- Introduction
- Problem Statement
- Data Collection
- Data Checks
- Feature Wise Visualization
- Model Training and Evaluation
- Deployment
- Installation
- Usage
- Contributing
- License
- Contact
This project aims to understand the influence of various factors such as parents' background, test preparation, etc., on students' academic performance. Analyzing student performance is crucial as it allows educators to assess and improve teaching methods. Many factors can influence a student's performance, including the parents' educational background, test preparation, student health, and more.
Analyzing student work is an essential part of teaching. Teachers assign, collect, and examine student work regularly to assess student learning and to revise and improve teaching. Ongoing assessment of student learning allows teachers to engage in continuous quality improvement of their courses. This project seeks to understand how various factors, including parental background and test preparation, impact student performance.
- Dataset Source: Kaggle Dataset
- Dataset Information: The dataset comprises 8 columns and 1000 rows detailing the marks secured by students in various subjects. The columns are:
- gender
- race/ethnicity
- parental level of education
- lunch
- test preparation course
- math score
- reading score
- writing score
Before diving into the analysis, it's essential to ensure the data's integrity:
- Check for missing values.
- Verify there are no duplicates.
- Confirm the data types of each column.
- Examine the number of unique values in each column.
- Review the dataset's statistics.
- Investigate the categories present in the categorical columns.
This section delves into the insights derived from the data:
- Distribution of gender and its impact on performance.
- Influence of lunch type on exam scores.
- The effect of parental education on student performance.
- Score distributions for different subjects.
- Demographic insights based on ethnicity, lunch type, test preparation, and parental education.
For this project, two models were trained to predict student performance: Xgboost and Catboost. After rigorous training and evaluation, the models achieved an accuracy of 88%.
The trained model has been deployed using Flask, making it accessible for real-time predictions. To run the application, use the app.py
script.
- Clone the repository:
git clone https://github.com/nani2357/ML_project_Students-Performance-in-Exams_with-Deployment.git
pip install -r requirements.txt
python app.py
This project is licensed under the MIT License.
For any queries or feedback, please reach out to the repository owner.