Git Product home page Git Product logo

carnd-advanced-lane-lines's Introduction

Advanced Lane Finding Project

The goals / steps of this project are the following:

  • Compute the camera calibration matrix and distortion coefficients given a set of chessboard images.
  • Apply a distortion correction to raw images.
  • Use color transforms, gradients, etc., to create a thresholded binary image.
  • Apply a perspective transform to rectify binary image ("birds-eye view").
  • Detect lane pixels and fit to find the lane boundary.
  • Determine the curvature of the lane and vehicle position with respect to center.
  • Warp the detected lane boundaries back onto the original image.
  • Output visual display of the lane boundaries and numerical estimation of lane curvature and vehicle position.

Rubric Points

Here I will consider the rubric points individually and describe how I addressed each point in my implementation.


Writeup / README

1. Writeup / README includes all the rubric points. Here is a template writeup for this project.

You're reading it!

Camera Calibration

1. Compute camera matrix and distortion coefficients.

The code for this step is contained in the code cells #2-3 of the IPython notebook located in "./CarND-Advanced-Lane-Lines.ipynb".

Step 1: Prepare "object points", which will be the (x, y, z) coordinates of the chessboard corners in the world.

Assuming the chessboard is fixed on the (x, y) plane at z=0, such that the object points are the same for each calibration image. Thus, objp is just a replicated array of coordinates, and objpoints will be appended with a copy of it every time algorithm successfully detects all chessboard corners in a test image.

imgpoints will be appended with the (x, y) pixel position of each of the corners in the image plane with each successful chessboard detection.

Step 2: Use the output objpoints and imgpoints to compute the camera calibration and distortion coefficients using the cv2.calibrateCamera() function.

Step 3: Apply this distortion correction to the test image using the cv2.undistort() function and obtained this result:

alt text

Pipeline (single images)

1. Example of a distortion-corrected image.

alt text

2. Methods to create a thresholded binary image.

Pipeline contains combination of color and gradient thresholds to generate a binary image (thresholding steps at cell #5 through #7.

Step 1: Apply Sobel on x-axis (calculate directional gradient and apply gradient threshold)

Step 2: Apply Sobel on y-axis

Step 3: Apply Sobel x and y, compute the magnitude of the gradient and apply a threshold

Step 4: Apply Sobel x and y, computes the direction of the gradient and apply a threshold

Step 5: Combine the thresholds from steps 1-4

Step 6: Threshold color channel (S channel from HLS color space)

Step 7: Combine color and gradient thresholds (steps 5-6)

Results:

alt text

alt text

alt text

alt text

alt text

3. Perspective transform.

The code for perspective transform includes a function called warp(), which appears in code cell #12 of the IPython notebook. The warp() function takes as inputs an image (thresholded). The source and destination points were hardcoded in the following manner:

src = np.float32([[180, img.shape[0]], [575, 460], [705, 460], [1150, img.shape[0]]])

dst = np.float32([[320, img.shape[0]], [320, 0], [960, 0], [960, img.shape[0]]])

This resulted in the following source and destination points:

Source Destination
180, 720 320, 720
575, 460 320, 0
705, 460 960, 0
1150, 720 960, 720

Draw the src and dst points onto a test image and its warped counterpart to verify that perspective transform works as expected. Lines appear parallel in the warped image:

alt text

Example with warped and masked image:

alt text

4. Identify lane-line pixels and fit their positions with a polynomial.

This step implemented in cell #13 in the function detect_lines()

Steps to fit lane lines with a 2nd order polynomial:

Step 1: Find the starting point for the left and right lines (take a histogram of the bottom half of the masked image)

alt text

Step 2: Set the width of the windows +/- margin (= 100), set minimum number of pixels found to recenter window (= 50)

Step 3: If lines are not detected - step through the windows one by one

alt text

Step 4: Extract left and right line pixel positions and Fit a second order polynomial to each (using np.polyfit

Step 5: Generate x and y values for plotting

5. Calculate the radius of curvature of the lane and the position of the vehicle with respect to center.

This step implemented in cell #14 in the function rad_and_offset

Fit the left and right lines with a second order polynomial shown in green:

alt text

6. Result plotted back down onto the road such that the lane area is identified clearly.

This step implemented in cell #15 in the function draw_res().

Here is an example of result on a test image:

alt text


Pipeline (video)

Here's a link to my video result

Here's link to my video result on YouTube


Discussion

Difficulties:

Pipeline becames relatively nonstable in cases of very fast turns and difficult light conditions (experimental way to fix this issue is to use high light sensitive camera with high fps values).

What could be improoved:

  • Drawing procedure could be improved by adding stabilization features

  • More experiments with threshold values and color spaces (and different combination of both) could be performed

  • Include additional information on each frame (statistics, etc.)

carnd-advanced-lane-lines's People

Contributors

olpotkin avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Forkers

jandal487

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.