Git Product home page Git Product logo

scrape-google-images's Introduction

Scrape Google Images

A TypeScript library for scraping images from Google Images using Puppeteer.

npm GitHub

Installation

To use this library, you'll need Node.js installed on your machine. Then, you can install it via npm:

npm install scrape-google-images

Usage

import { scrapeImages } from 'scrape-google-images';

const query = 'cats';
const options = {
    limit: 10,
    imgSize: 'm',
    imgType: 'photo',
    imgColor: 'color',
    imgar: 'xw',
    fileType: 'jpg',
    safe: false,
    siteSearch: '',
    rights: '',
    metadata: true,
    imgData: false,
    engine: 'pupeeteer'
};

const images = await scrapeImages(query, options);

Options

The pick function accepts an optional options object with the following properties:

Option Type Default Description
limit number 10 Maximum number of images to retrieve. (max: 100)
imgSize string '' Filter images by size (e.g., 'l''m''i''qsvga''vga''svga''xga').
imgType string '' Filter images by type (e.g., 'clipart''face''lineart''stock''photo').
imgColor string '' Filter images by color (e.g., 'mono''gray''color''trans').
imgar string '' Filter images by aspect ratio (e.g., 't', 's', 'w', 'xw')
fileType string '' Filter images by file type (e.g., 'jpg''gif''png''bmp''svg''webp').
safe boolean false Enable safe search filter.
siteSearch string '' Search for images from a specific website.
rights string '' Filter images by usage rights (e.g., 'cl' for creative commons, 'ol' for other).
random boolean false Randomize the order of search results.
metadata boolean true Include image metadata in the results.
imgData boolean false Include base64 image data in the results.
engine string pupeeteer Lib used to scrap Google Images Data. pupeeteer or cheerio

Result

The pick function returns a Promise that resolves to an array of PickResult objects, each containing the following properties:

Property Type Description
src string The URL of the image.
imgData string The base64 image data (if imgData is true).
description string The image description or alt text.
source string The URL of the source website.
metadata object The image metadata (if metadata is true).


The metadata object has the following properties:

Property Type Description
width number The width of the image.
height number The height of the image.
format sharp.FormatEnum The format of the image.

Licence

This library is licensed under the MIT License. See the LICENSE file for more information.

scrape-google-images's People

Contributors

eddydezuraud avatar

Stargazers

Tyler Mills avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.