Git Product home page Git Product logo

data's Introduction

Datasets

A collection of open data sets created by Code for Nepal team. The data are in standard CSV format.

Data Source Year
Complete Census Data: All Districts National Population and Housing Census (PDF) 2011
Districts at glance Nepal Census 2011
Population by disability National Population and Housing Census (PDF) 2011
Population aged 5 years and above by literacy status National Population and Housing Census (PDF) 2011
Literate population aged 5 years and above by educational attainment National Population and Housing Census (PDF) 2011
Population above SLC by major field of study National Population and Housing Census (PDF) 2011
Literate population in Nepal UNESCO 2013
Life expectancy and income in Nepal Human Development Report 2014 (PDF) 2014
Women prisoners in Nepal International Centre for Prison Studies 2015

data's People

Contributors

amitness avatar andy-j avatar cliftonmcintosh avatar ferranmarin avatar hbvj99 avatar hrhwilliams avatar insiyaa avatar msdeibel avatar nhorning avatar nikeshbalami avatar pratimakandel avatar rafagarciac avatar ravinepal avatar sarahlizdettloff avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

data's Issues

Create data set on oral polio vaccination rates in children ages 12-23 months

For the upcoming federal version of NepalMap, we would like to present data on oral polio vaccination rates. These data are available in Table 10.3 on page 212 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 10.3 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, number of doses, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "number of doses" options should be "NONE", "ONE", "TWO", "THREE"
  • The "total" should be the number of children age 12-23 months in a province whose parents were surveyed and whose dosage under the "Polio" header in the table fits each "number of doses" category. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the first "Number of children" column in Table 10.3.
  • Note that the values are cumulative. For example, if 1 dose is at 75%, 2 doses is at 65%, and 3 doses is at 40%, that means that 10% of the children have received only one dose, 25% of the children have received only 2 doses, 40% have received 3 doses and 25% have received no doses.
  • Calculate raw numbers from the percentages. For example, if there are 100 total children age 12-23 months in Province 1 and 50.1% of the children have received one dose of oral polio vaccine, then there should be a row that looks like this: "Province 1,ONE,50". Please note that the number of children who have received no oral polio vaccine doses will need to be inferred from the percentage of children who have received one or more. Please note that there are TWO "number of children" columns. You should use the first "number of children" column (16th column overall) because it is the one that is for children ages 12-23 months. The last column in the table is NOT the one to use because it is for older children.
  • The file should be named "oral_polio_vaccination_twelve_to_twenty_three_months.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on mosquito net availability

For the upcoming federal version of NepalMap, we would like to present data on possession of mosquito nets by each household. These data are available in Table 2.17 on page 39 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 2.17 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, number of nets, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "number of nets" options should be "NONE", "ONE", "TWO_TO_THREE", "FOUR_OR_MORE"
  • The "total" should be the number of households in a province that fit each "number of nets" category. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the "Number of households" column in Table 2.17. For example, if there are 100 total households in Province 1 and 50.1% of the households have 2-3 nets, then there should be a row that looks like this: "Province 1,TWO_TO_THREE,50". Please note that the households with no nets will need to be inferred from the percentage of households with nets.
  • The file should be named "mosquito_nets.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on BCG (tuberculosis) vaccination rates among children ages 12-23 months

For the upcoming federal version of NepalMap, we would like to present data on BCG (tuberculosis) vaccination rates among children ages 12-23 months. These data are available in Table 10.3 on page 212 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 10.3 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, bcg vaccine received, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "bcg vaccine received" options should be "YES", "NO"
  • The "total" should be the number of children age 12-23 months in a province whose parents were surveyed and reported that they have received the BCG vaccine. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the first "Number of children" column in Table 10.3. For example, if there are 100 total children age 12-23 months in Province 1 and 50.1% of the children have received the BCG vaccine, then there should be a row that looks like this: "Province 1,YES,50". Please note that the number of children who have received no BCG vaccine will need to be inferred from the percentage of children who have received it. Please note that there are TWO "number of children" columns. You should use the first "number of children" column (16th column overall) because it is the one that is for children ages 12-23 months. The last column in the table is NOT the one to use because it is for older children.
  • The file should be named "bcg_vaccination_twelve_to_twenty_three_months.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

In the new federal system, how many local bodies are there?

The geo data used to create the geography table for https://github.com/Code4Nepal/nepalmap_federal has 774 local bodies.

The data in https://github.com/Code4Nepal/data/tree/master/Federal%20Data/753%20Local%20Unit%20Population%20and%20HouseHold suggests that there are 753 local bodies.

What is the right answer? There needs to be a match in order for the geographic data and the census data to work together correctly in nepalmap.

Here is a CSV file showing the localities that have been imported into the geography table for nepalmap_federal:
localities.csv.zip

Create data set on food security

For the upcoming federal version of NepalMap, we would like to present data on food security for each household. These data are available in Table 2.20 on page 42 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 2.20 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, food security, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "food security" options should be "FOOD_SECURE", "MILDLY_FOOD_INSECURE", "MODERATELY_FOOD_INSECURE", "SEVERELY_FOOD_INSECURE"
  • The "total" should be the number of households in a province that fit each "food security" category. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the "Number of households" column in Table 2.20. For example, if there are 100 total households in Province 1 and 50.1% of the households are food secure, then there should be a row that looks like this: "Province 1,FOOD_SECURE,50".
  • The file should be named "food_security.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on DPT-HepB-Hib (pentavalent) vaccination rates among children ages 12-23 months

For the upcoming federal version of NepalMap, we would like to present data on DPT-HepB-Hib (pentavalent) vaccination rates. These data are available in Table 10.3 on page 212 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 10.3 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, number of doses, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "number of doses" options should be "NONE", "ONE", "TWO", "THREE"
  • The "total" should be the number of children age 12-23 months in a province whose parents were surveyed and whose dosage fits each "number of doses" category under the heading "DPT-HepB-Hib." In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the first "Number of children" column in Table 10.3.
  • Note that the values are cumulative. For example, if 1 dose is at 75%, 2 doses is at 65%, and 3 doses is at 40%, that means that 10% of the children have received only one dose, 25% of the children have received only 2 doses, 40% have received 3 doses and 25% have received no doses.
  • Calculate raw numbers from the percentages. For example, if there are 100 total children age 12-23 months in Province 1 and 50.1% of the children have received one dose of DPT-HepB-Hib vaccine, then there should be a row that looks like this: "Province 1,ONE,50". Please note that the number of children who have received no DPT-HepB-Hib vaccine doses will need to be inferred from the percentage of children who have received one or more. Please note that there are TWO "number of children" columns. You should use the first "number of children" column (16th column overall) because it is the one that is for children ages 12-23 months. The last column in the table is NOT the one to use because it is for older children.
  • The file should be named "pentavalent_vaccination_twelve_to_twenty_three_months.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on antenatal care

For the upcoming federal version of NepalMap, we would like to present data on antenatal care. These data are available in Table 9.1 on page 165 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 9.1 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, antenatal care, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "antenatal care" options should be "NONE", "DOCTOR", "NURSE_OR_MIDWIFE", "HEALTH_ASSISTANT", "MCH_WORKER", "COMMUNITY_HEALTH_VOLUNTEER"
  • The "total" should be the number of women who had a live birth in the last five years in a province that fit each "antenatal care" category. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the "Number of women" column in Table 9.1. For example, if there are 100 total women giving birth in the last five years in Province 1 and 50.1% of the women received antenatal care from a doctor, then there should be a row that looks like this: "Province 1,DOCTOR,50".
  • The file should be named "antenatal_care.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on travel time to nearest government health facility

For the upcoming federal version of NepalMap, we would like to present data on the travel time to nearest government health facility. These data are available in Table 2.6 on page 25 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 2.6 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, travel time, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "travel time" options should be "LESS_THAN_THIRTY_MINUTES", "THIRTY_TO_SIXTY_MINUTES", "MORE_THAN_SIXTY_MINUTES", "UNKNOWN"
  • The "total" should be the number of households in a province that fit each "travel time" category. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the "Number of households" column in Table 2.6. For example, if there are 100 households in Province 1 and 50.1% of the households are within less than thirty minutes from a health facility, then there should be a row that looks like this: "Province 1,LESS_THAN_THIRTY_MINUTES,50".
  • The file should be named "travel_time_to_nearest_govt_health_facility.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on places for washing hands

For the upcoming federal version of NepalMap, we would like to present data on handwashing facilities in each household. These data are available in Table 2.8 on page 27 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 2.7 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, handwashing place, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "handwashing place" options should be "FIXED", "MOBILE", "NONE"
  • The "total" should be the number of households in a province that fit each "handwashing place" category. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the "Number of households" column in Table 2.8. For example, if there are 100 households in Province 1 and 50.1% of the households have a fixed place for washing hands, then there should be a row that looks like this: "Province 1,FIXED,50".
  • The file should be named "hand_washing.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on pneumococcal vaccination rates among children ages 12-23 months

For the upcoming federal version of NepalMap, we would like to present data on pneumococcal vaccination rates. These data are available in Table 10.3 on page 212 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 10.3 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, number of doses, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "number of doses" options should be "NONE", "ONE", "TWO", "THREE"
  • The "total" should be the number of children age 12-23 months in a province whose parents were surveyed and whose dosage fits each "number of doses" category under the heading "Pneumococcal." In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the first "Number of children" column in Table 10.3.
  • Note that the values are cumulative. For example, if 1 dose is at 75%, 2 doses is at 65%, and 3 doses is at 40%, that means that 10% of the children have received only one dose, 25% of the children have received only 2 doses, 40% have received 3 doses and 25% have received no doses.
  • Calculate raw numbers from the percentages. For example, if there are 100 total children age 12-23 months in Province 1 and 50.1% of the children have received one dose of pneumococcal vaccine, then there should be a row that looks like this: "Province 1,ONE,50". Please note that the number of children who have received no pneumococcal vaccine doses will need to be inferred from the percentage of children who have received one or more. Please note that there are TWO "number of children" columns. You should use the first "number of children" column (16th column overall) because it is the one that is for children ages 12-23 months. The last column in the table is NOT the one to use because it is for older children.
  • The file should be named "pneumococcal_vaccination_twelve_to_twenty_three_months.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on measles/rubella vaccination rates for children ages 12-23 months

For the upcoming federal version of NepalMap, we would like to present data on measles/rubella vaccination rates for children ages 12-23 months. These data are available in Table 10.3 on page 212 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 10.3 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, bcg vaccine received, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "bcg vaccine received" options should be "YES", "NO"
  • The "total" should be the number of children age 12-23 months in a province whose parents were surveyed and reported that they have received the measles/rubella vaccine. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the first "Number of children" column in Table 10.3. For example, if there are 100 total children age 12-23 months in Province 1 and 50.1% of the children have received the measles/rubella vaccine, then there should be a row that looks like this: "Province 1,YES,50". Please note that the number of children who have received no measles/rubella vaccine will need to be inferred from the percentage of children who have received it. Please note that there are TWO "number of children" columns. You should use the first "number of children" column (16th column overall) because it is the one that is for children ages 12-23 months. The last column in the table is NOT the one to use because it is for older children.
  • The file should be named "measles_rubella_vaccination_twelve_to_twenty_three_months.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Create data set on overall vaccination rates for children ages 12-23 months

For the upcoming federal version of NepalMap, we would like to present data on overall vaccination rates for children ages 12-23 months. These data are available in Table 10.3 on page 212 of the Nepal Demographic and Health Survey of 2016 (PDF here). Extract the data from Table 10.3 so that it is in a csv that conforms to the following specifications:

  • It should have three columns: province, vaccinations, total
  • The columns should be separated by commas
  • The headers should be the first row in the CSV
  • The "vaccinations" options should be "NONE", "SOME", "BASIC", "ALL"
    -- "NONE" is for the column "No vaccinations" in Table 10.3
    -- "BASIC" is for the column "All basic vaccinations" in Table 10.3
    -- "ALL" is for the column "All age-appropriate vaccinations" in Table 10.3
    -- "SOME" is for 100% minus the percent who have received basic vaccinations minus the percent who have received no vaccinations.
  • Note that the values are cumulative. Here is an example using Province 1 data:
All basic vaccinations All age-appropriate vaccinations No vaccinations Number of children
79.4 43.1 0.8 169

Since 0.8% have received none, the value for "NONE" is 169 * 0.008.
Since 43.1% have received "All age-appropriate vaccinations", the value for "ALL" is 169 * 0.431
The percent receiving "BASIC" is calculated by subtracting the percentage receiving "All age-appropriate vaccinations" from the percentage receiving "All basic vaccinations". So 169 * (0.794 - 0.431)
The percent receiving "SOME" is calculated by subtracting the percentage receiving none and the percentage receiving all basic from 100%. So 169 * (1 - 0.794 - 0.008).

  • The "total" should be the number of children age 12-23 months in a province who fit in one of the four groups described above. In other words, the percent statistics in the table should be converted to raw numbers. Percents should be rounded with a sensible strategy. Totals after rounding should match the total in the first "Number of children" column in Table 10.3. Please note that the last two columns in the table are not for children ages 12-23 months. The columns to be used for this data set are in columns 13-16 in the table. For example, if there are 100 total children age 12-23 months in Province 1 and 50.1% of the children are in the "All basic vaccinations" column, then there should be a row that looks like this: "Province 1,BASIC,50".
  • The file should be named "overall_vaccination_twelve_to_twenty_three_months.csv"
  • The file should be placed in the directory "Federal Data/Nepal Demographic and Health Survey 2016"

Consolidate data for "PRADESH 6"

The 753 Local Unit Population and HouseHold directory has two directories for Province 6. One of them is "PRADESH 6" and has one subdirectory. The other is "Pradesh 6" and has several subdirectories. We should consolidate these in one directory, preferably as "PRADESH 6" to match the all-caps of the other province directories.

Scrape data about agricultural production in Nepal

Here's the dataset published by the Government of Nepal.

Here's a detailed guide by @cliftonmcintosh on how we will use the data and how your data should look like when you submit here.

We'd like to scrape and visualize from the document:
Major Cereal Crops by Districts
Cash Crops by Districts
Oil Seed Crops by Districts
Jute by Districts Done in #3
Cotton by Districts Done in #3
Tea by Districts
Coffee by Districts
Pulses by Districts
Live Stock Population by Districts
Milk Animals and Milk Production by Districts
Net Meat Production by Districts
Egg Production by Districts
Wool Production by Districts
Yak/ Nak/ Chauri Population by Districts
Rabbit Population by Districts
Horse/Asses Population by Districts
Water Surface Area and production of fish by Districts
done in #5
Citrus Fruits: Area, Productive Area, Production, and Yield by Districts
Summer Fruits: Area, Productive Area, Production, and Yield by Districts
Fresh Vegetable Crops: Districtwise Area, Production and Yield

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.