Git Product home page Git Product logo

python-pandas-tutorial's Introduction

Pandas for machine learning

Pandas is the best and most popular Python library for machine learning. This library offers a wide variety of functions that will help you manipulate data, optimize your machine-learning algorithm, and much more. This tutorial will help you to get familiar with this library and master the most used functionalities with code samples and video tutorials that will help you to create your first data frame, clean a dataset of information, read CSV files, etc...

The exercises in this tutorial have been created in about 60 hours of development by many experts in machine learning and carefully reviewed by our contributors to make sure you have the most accurate and important information that will help to start your machine learning career.

Content table

In this tutorial, we will see the most important and basic functions provided by Pandas that will help you to work with data in machine learning, the following are some of the topics that will be covered in this tutorial.

Exercise Description of the topic
Install Pandas These exercises cover how to install Pandas, how to import the Pandas library in a Python file, and how to create your first Python script.
DataSets These exercises explain what datasets are and how to work with them.
Series These exercises explain what series are in Pandas and how to use them.
DataFrames These exercises explain how to create an information data frame and what functions can be used to work with them.
Clean DataSets This class covers what data cleaning is, the functions Pandas offers to clean up a dataset, and the best practices to use when cleaning a dataset.

Tutorial Installation

There are two ways to initialize this tutorial, the first and easiest is to open the tutorial in a cloud environment such as Codespaces or Gitpod, and the second is to clone this repository in your local environment.

We recommend that you use Codespaces because it is the easiest en fastest way to start the tutorial.

1. Open the tutorial in a cloud environment

You can start this tutorial in just a few seconds with Codespaces by clicking on the following link: open in codespaces (recommended) or you can use Gitpod by clicking on: open in gitpod.

Once you have opened the cloud environment either Codespaces or Gitpod, the LearPack exercises should start automatically. If the exercises do not start automatically you can open a terminal and type the command: learnpack start

2. Open the tutorial in your local environment

To start this tutorial in your local environment, follow the steps below:

  1. Open a terminal and clone this repository on your local environment, you can use the following command:
git clone https://github.com/4GeeksAcademy/python-pandas-tutorial.git 
  1. Make sure you have installed a Node.js version of 12.01.1 or higher:
node --version
  1. Install Learnpack, the package manager for learning tutorials, and also run the Python compiler plugin for LearnPach, you can do this with the following commands:
> npm i learnpack -g
> learnpack plugins:install learnpack-python
  1. Finally, Install Jest to perform the necessary tests throughout the tutorial and start the exercises with the following commands:
> npm i [email protected] -g
> learnpack start

Contributors

We would like to express our deepest gratitude to the following contributors for their invaluable support in making this tutorial possible.

Contributor Github account
Alejandro Sanchez alesanchezr
Martín Suárez kiddopro
lorena Gubaira Lorenagubaira
Tomas Gonzalez tommygonzaleza
Hernán García hernanjkd
Ernesto Gonzalez UmiKami
Hector Chocobar hchocobar
Charly Chacón Charlytoc
Agustín Fernández Dasher83
Ignacio Cordoba nachovz

This tutorial and many other exercises are designed for students as part of the 4Geeks academy's Coding Bootcamp. Currently, we have two courses available. The first one is the Full Stack developer Course, in this course, you will learn technologies like HTML5, CSS3, Javascript, Python, Flask, SQL and many others. The second one is the Data Science Bootcamp where you will learn technologies like Python, Algorithms' basics, Pandas, SQL Database, and many other technologies. You can find more information about these courses and the upcoming Blockchain and Web3 course on the official 4Geeks Academy web page.

python-pandas-tutorial's People

Contributors

alesanchezr avatar tommygonzaleza avatar kiddopro avatar lorenagubaira avatar umikami avatar hernanjkd avatar dasher83 avatar charlytoc avatar fernandorojascarrillo avatar hchocobar avatar nachovz avatar jsolis4geeks avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.