Git Product home page Git Product logo

making-the-computer-see-ndc-2014's Introduction

Making The Computer See - Computer Vision For Everyday Applications

Presentation and code for NDC Oslo 2014.

Abstract

With a camera and a powerful computer in every pocket, the shift from typing to computer vision is just about to happen.

Instead of entering information about a wine for our tasting notes, we snap a photo and let the computer recognize it and look up the information. Instead of having the shopper enter his credit card info, we can read the information straight from the card with the camera. We can even use computer vision to play a fanfare when someone arrives at our doorstep carrying a pizza box.

The tools to make this happen are in the field of computer vision. Today, the are solid techniques and cross-platform open-source libraries such as OpenCV available that make it easy to build this into everyday applications.

This presentation will show you how.

We will look at practical applications of computer vision such as using the camera to scan text (OCR), reading the info from a credit card or a license plate from a passing car, recognizing the pizza box in the image or using the wine label as a visual query against our wine database to find the producer, vintage, grapes and other information with no manual data entry.

The talk combines practical examples with a presentation of the basic image processing techniques that make it possible.

You will learn about basic image transformations, how to find interesting or characteristic parts of image, extract and recognize text and and ways to compare images and how to recognize known shapes and objects in images.

Be prepared for a highly visual talk that provides not only an introduction to making the computer see but also presents its mathematical and statistical machine learning underpinnings in a very practical context.

Intended audience:

Developers curious about how to max out the CPU and camera on their smartphone while saving their users a lot of trouble. Entry level talk: No Ph.D., machine learning or computer vision background required.

About the speaker

Martin Jul is a developer from Copenhagen, Denmark working in startup stealth mode in the field of computer vision and machine learning.

Some time before the Internet Age he studied mathematics and computer science in Copenhagen, and he is very happy that libraries and powerful processors are finally making the hitherto exotic and inaccesible academic disciplines involved computing with images accessible to everyone.

making-the-computer-see-ndc-2014's People

Contributors

mjul avatar

Watchers

 avatar James Cloos avatar  avatar

Forkers

rprouse haf

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.