Git Product home page Git Product logo

vidscraper's Introduction

Vidscraper is a library for retrieving information about videos from various sources โ€“ video feeds, APIs, page scrapes โ€“ combining it, and presenting it in a unified manner, all as efficiently as possible.

Vidscraper comes with built-in support for popular video sites like blip, vimeo, ustream, and youtube, as well support for generic RSS feeds with feedparser.

Quick example

>>> import vidscraper
>>> video = vidscraper.auto_scrape('http://www.youtube.com/watch?v=PMpu8jH1LE8')
>>> video.title
u"The Magic Roundabout - Ermintrude's Folly"
>>> video.description
u"Ermintrude's been at the poppies again, but it's Dougal who ends up high as a kite!"
>>> video.user
u'nickhirst999'
>>> video.guid
'http://gdata.youtube.com/feeds/api/videos/PMpu8jH1LE8'

Command line

vidscraper also comes with a command line utility allowing you to get video metadata from the command line. The example above could look like this:

$ vidscraper video http://www.youtube.com/watch?v=PMpu8jH1LE8 \
  --fields=title,description,user,guid
Scraping http://www.youtube.com/watch?v=PMpu8jH1LE8...
{
  "description": "Ermintrude's been at the poppies again, but it's Dougal who ends up high as a kite!",
  "fields": [
    "title",
    "description",
    "user",
    "guid"
  ],
  "guid": "http://gdata.youtube.com/feeds/api/videos/PMpu8jH1LE8",
  "title": "The Magic Roundabout - Ermintrude's Folly",
  "url": "http://www.youtube.com/watch?v=PMpu8jH1LE8",
  "user": "nickhirst999"
}

Project links

code:https://github.com/pculture/vidscraper/
docs:http://vidscraper.readthedocs.org/
bugtracker:http://bugzilla.pculture.org/
code:https://github.com/pculture/vidscraper/
irc:#vidscraper on irc.freenode.net
build status:build-image

Requirements

Optional

  • requests-oauth 0.4.1+ (for some APIs *cough* Vimeo searching *cough* which require authentication)
  • lxml 2.3.4+ (recommended for BeautifulSoup; assumed parser for test results.)
  • unittest2 0.5.1+ (for tests)
  • mock 0.8.0+ (for tests)
  • tox 1.4.2+ (for tests)

vidscraper's People

Contributors

melinath avatar paulswartz avatar cwebber avatar paulproteus avatar openhatched avatar willkg avatar msabramo avatar

Watchers

Jude Mwenda avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.