inat-backlog-slogger's Introduction

iNat Backlog Slogger

Work in progress!

This is a proof-of-concept of a tool to assist iNaturalist identifiers, with the goal of making it easier to sort through the backlog of unidentified iNaturalist observations.

Using image quality assessment and other criteria, it will attempt to find observations that are most likely to be high-quality ones, and filter out observations that are most likely to be unidentifiable.

Frontend repo: https://github.com/JWCook/inat-backlog-viewer
Example report for Arachnid observations in North America: https://inat-backlog-viewer.netlify.app

inat-backlog-slogger's Issues

Download CC-licensed images from iNaturalist Open Data

Get all user info in bulk

Move observation data conversion tools to pyinaturalist-convert

Fetch updates for previously saved observations

Custom observation ranking proof-of-concept

Moved from pyinat/pyinaturalist#106 since this has kind of turned into a project of its own.

The purpose of this tool is to help identifiers prioritize observations that are more likely to be high-quality ones, based on additional criteria that aren't available as observation search parameters.

Some potential indicators of a 'high-quality' observation for a given iconic taxon might be (with different ranking weights, roughly in order from high to low):

User's number of IDs for target taxon & descendants
User's number of research-grade observations for target taxon & descendants
Observation has at least 1 non-observer ID, but isn't RG yet due to a disagreement
Number of photos in observation
User's total observation count of all taxa
User has an ORCID ID
Observation has annotations and/or observation fields
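
As a rough illustration of how these indicators could be combined, here is a minimal sketch of a weighted score. Everything in it is an assumption made for the example: the field names, the weights, and the normalization caps are hypothetical placeholders, not values taken from this project or from the iNaturalist API. Count-based indicators are capped and scaled to 0-1 so that a few very prolific users don't dominate the ranking.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical weights, ordered high to low to mirror the list above.
# They sum to 1.0 so that the final score falls between 0 and 1.
WEIGHTS = {
    "user_taxon_ids": 0.30,
    "user_taxon_rg_observations": 0.25,
    "has_disputed_id": 0.15,
    "photo_count": 0.12,
    "user_total_observations": 0.08,
    "has_orcid": 0.05,
    "has_annotations_or_fields": 0.05,
}

# Hypothetical caps: counts at or above the cap get the full weight.
CAPS = {
    "user_taxon_ids": 500,
    "user_taxon_rg_observations": 200,
    "photo_count": 4,
    "user_total_observations": 5000,
}


@dataclass
class ObservationFeatures:
    """Indicators for one observation (field names are illustrative)."""
    user_taxon_ids: int = 0
    user_taxon_rg_observations: int = 0
    has_disputed_id: bool = False
    photo_count: int = 0
    user_total_observations: int = 0
    has_orcid: bool = False
    has_annotations_or_fields: bool = False


def score(features: ObservationFeatures) -> float:
    """Return a weighted sum of normalized indicators."""
    total = 0.0
    for name, weight in WEIGHTS.items():
        value = getattr(features, name)
        if isinstance(value, bool):
            normalized = 1.0 if value else 0.0
        else:
            cap = CAPS[name]
            normalized = min(value, cap) / cap
        total += weight * normalized
    return total


def rank(observations: Dict[int, ObservationFeatures]) -> List[int]:
    """Return observation IDs sorted from highest to lowest score."""
    return sorted(observations, key=lambda obs_id: score(observations[obs_id]), reverse=True)
```

Calling rank() on a dict of observation ID to ObservationFeatures gives the order in which observations would be shown to an identifier. A linear weighted sum is only one possible choice; it is used here because the weights are easy to tune by hand.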

If possible, it would also be helpful to get photo metadata to check for things like camera, lens, and flash info.

There are many pieces related to this that I would like to take the time to polish up and integrate into the rest of pyinaturalist (not just in a standalone script):

#98 : Web scraping for photo info page to get EXIF/XMP metadata
#104 : /identifiers endpoint is needed for user stats
#105 : This will require a better rate-limiting mechanism because this could potentially generate a large number of API requests
#107 : This will need to generate an HTML report that's easy to skim over for ID purposes; probably a simple image grid with medium-large images that link to the original observations
#111, #112 : After implementing rate-limiting, pagination can be improved as well.
#118 : We will need to work with CSV files from the iNat export tool once this is ready to work with larger datasets (those whose queries are too large for the API to handle)
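
To make the rate-limiting item (#105) more concrete, here is a minimal token-bucket sketch. It is not the mechanism pyinaturalist uses or plans to use; the class name and parameters are hypothetical, and the rate in the usage note is a placeholder rather than a documented API limit. The clock and sleep functions are injectable so the behavior can be checked without actually waiting.

```python
import time


class TokenBucket:
    """Allow short bursts up to `capacity`, then throttle to `rate` requests per second."""

    def __init__(self, rate, capacity, clock=time.monotonic, sleep=time.sleep):
        self.rate = float(rate)
        self.capacity = float(capacity)
        self.tokens = float(capacity)  # start with a full bucket
        self.clock = clock
        self.sleep = sleep
        self.last = clock()

    def _refill(self):
        # Add tokens for the time elapsed since the last refill, up to capacity.
        now = self.clock()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now

    def acquire(self):
        """Wait until one token is available, consume it, and return seconds waited."""
        self._refill()
        waited = 0.0
        if self.tokens < 1:
            waited = (1 - self.tokens) / self.rate
            self.sleep(waited)
            self._refill()
        # Clamp at zero to absorb floating-point error or a short sleep.
        self.tokens = max(0.0, self.tokens - 1)
        return waited
```

A caller would create one bucket, for example TokenBucket(rate=1.0, capacity=5), and call acquire() before each API request. A token bucket lets a small batch of requests go out immediately while still bounding the long-run request rate, which suits workloads that alternate between bursts of lookups and idle time.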
