scei-statistics's Introduction

SCEI statistics

Question I want to answer

To reduce bias, I state what I want to look at before starting the analysis, rather than looking at the data first and then reporting my findings.

Main question

Is there a gender difference in the performance at the selective exam “concours grandes écoles d’ingénieurs”?
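
One possible way to frame this question, sketched under assumptions: the README does not fix a performance metric or a statistical test, so the block below assumes performance is summarized as a success rate per gender (for example, candidates admitted out of candidates registered) and compares the two rates with a pooled two-proportion z-test. The function name and the counts in the usage lines are hypothetical placeholders, not SCEI figures.

```python
import math


def two_proportion_z_test(success_a, total_a, success_b, total_b):
    """Return (z, two-sided p-value) for H0: both groups share one success rate.

    Assumption: each candidate is an independent success/failure trial,
    and both groups are large enough for the normal approximation.
    """
    rate_a = success_a / total_a
    rate_b = success_b / total_b
    # Pooled rate under the null hypothesis of no difference.
    pooled = (success_a + success_b) / (total_a + total_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    z = (rate_a - rate_b) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Placeholder counts for illustration only (not real data).
z, p = two_proportion_z_test(success_a=60, total_a=100, success_b=40, total_b=100)
print(f"z = {z:.3f}, p = {p:.4f}")
```

A significant difference in rates would still need care before being read as a performance gap, since the two populations may differ in branch, CPGE, or choice of Ecoles.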

Follow ups?

There are other interesting questions:

  • Evolution over time of the male / female ratio.
  • Evolution over time of the size of the global population.
  • Does the male / female ratio correlate with female performance?
  • If I do find a gender difference in the performance, does it correlate with the exams’ weighting coefficient at the Ecoles? I probably won’t answer this one since it takes too much effort to gather the data.

Explore

I might also report on interesting features I find while looking at the data. If these are not stated above, then they are things I didn’t look for but stumbled upon.

Data available

A more detailed, work-in-progress summary of what I’ve done so far is available at project.org.

Broader look

SCEI (Service concours écoles d’ingénieurs) publishes yearly statistics about the results. Each year, SCEI gives global statistics for all branches merged. Each year, for each branch, SCEI also gives statistics on a per-Ecole basis. This data is mostly complete, but there are some blanks. For some years and some branches, SCEI also makes per-CPGE (Classe préparatoire aux grandes écoles) data available.

More detailed look

From 2004 to 2017 (both included), the rank of the last candidate called is available. From 2018 onward, the median and mean rank of accepted candidates are made available instead. These data can be used to get an idea of the desirability / selectivity of Ecoles within a school.

License

The code is licensed under the GPL, but the data is © SCEI and was scraped in February / December 2021. It is not my property, and it is stored here for archival and research purposes. I will delete it if SCEI asks.

scei-statistics's People

Contributors

mhaz

scei-statistics's Issues

Also offer SQLite file(s) for the TSV data as easily explorable datasets

Hi again,
I am very interested in these datasets, and I would like to have SQLite files (or just one file) containing the data.

When exploring such "small" databases, I like to do everything in SQL from Jupyter notebooks.
This can be done from a Python notebook with ipython-sql, or with pandas and Pandas-SQL, which gives a really neat interface: the result of any SQL query is a pandas DataFrame, which is displayed nicely and can be plotted easily.

It seems more efficient to me than using a specific Jupyter kernel, like xeus-sqlite or xeus-sql.

Tell me if you're interested and able to do this yourself, or I'll come back in a couple of months and do it myself.
Thanks, regards from Rennes in France, @Naereen
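
A minimal sketch of what this request could look like, using only the Python standard library. It assumes each TSV file starts with a header row, that every data row has as many fields as the header, and that column names contain no double quotes. The function name, the file name `example.tsv`, and the table name are hypothetical and are not taken from the repository.

```python
import csv
import sqlite3


def load_tsv_into_sqlite(conn, tsv_path, table_name):
    """Create `table_name` from a TSV file whose first row is the header.

    Columns are created without a declared type, so every value is stored
    as text; cast in SQL where numbers are needed.
    Returns the number of columns found in the header.
    """
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, delimiter="\t")
        header = next(reader)
        columns = ", ".join(f'"{name}"' for name in header)
        placeholders = ", ".join("?" for _ in header)
        conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
        # The reader yields the remaining rows one list at a time.
        conn.executemany(
            f'INSERT INTO "{table_name}" VALUES ({placeholders})', reader
        )
    conn.commit()
    return len(header)


if __name__ == "__main__":
    # Hypothetical file and table names, for illustration only.
    connection = sqlite3.connect("scei.sqlite")
    load_tsv_into_sqlite(connection, "example.tsv", "example")
    connection.close()
```

The resulting file can then be queried from a notebook, for instance by passing a `sqlite3` connection and a query string to `pandas.read_sql_query`, which returns a DataFrame.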
