Git Product home page Git Product logo

inat-backlog-slogger's Introduction

iNat Backlog Slogger

Work in progress!

This is a proof-of-concept of a tool to assist iNaturalist identifiers, with the goal of making it easier to sort through the backlog of unidentified iNaturalist observations.

Using image quality assessment and other criteria, it will attempt to find observations that are most likely to be high-quality ones, and filter out observations that are most likely to be unidentifiable.

inat-backlog-slogger's People

Contributors

jwcook avatar

Stargazers

 avatar

Watchers

 avatar  avatar

inat-backlog-slogger's Issues

Custom observation ranking proof-of-concept

Moved from pyinat/pyinaturalist#106 since this has kind of turned into a project of its own.

The purpose of this tool is to to help identifiers prioritize observations that are more likely to be high-quality ones, based on additional criteria that aren't available as observation search parameters.

Some potential indicators of a 'high-quality' observation for a given iconic taxon might be (with different ranking weights, roughly in order from high to low):

  • User's number of IDs for target taxon & descendants
  • User's number of research-grade observations for target taxon & descendants
  • Observation has at least 1 non-observer ID, but isn't RG yet due to a disagreement
  • Number of photos in observation
  • User's total observation count of all taxa
  • User has an ORCID ID
  • Observation has annotations and/or observation fields

If possible, it would also be helpful to get photo metadata to check for things like camera, lens, and flash info.

There are many pieces related to this that I would like to take the time to polish up and integrate into the rest of pyinaturalist (not just in a standalone script):

  • #98 : Web scraping for photo info page to get EXIF/XMP metadata
  • #104 : /identifiers endpoint is needed for user stats
  • #105 : This will require a better rate-limiting mechanism because this could potentially generate a large number of API requests
  • #107 : This will need to generate an HTML report that's easy to skim over for ID purposes; probably a simple image grid with medium-large images that link to the original observations
  • #111, #112 : After implementing rate-limiting, pagination can be improved as well.
  • #118 : We will need to work with CSV files from the iNat export tool once this is ready to work with larger datasets (queries fro which are too big for the API to handle)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.