Git Product home page Git Product logo

uaconflict_equipmentloss's Introduction

Equipment losses in Russia-Ukraine War 2022

This repo scrapes this list by Oryxspioenkop (daily) to document and visualize equipment losses in the Russia-Ukraine war.

Oryxspioenkop says about this dataset:

This list only includes destroyed vehicles and equipment of which photo or videographic evidence is available. Therefore, the amount of equipment destroyed is significantly higher than recorded here. Small arms, munitions, civilian vehicles, trailers and derelict equipment (including aircraft) are not included in this list. All possible effort has gone into discerning the status of equipment between captured or abandoned. Many of the entries listed as ‘abandoned’ will likely end up captured or destroyed. Similarly, some of the captured equipment might be destroyed if it can’t be recovered. ATGMs and MANPADS are included in the list but not included in the ultimate count. The Soviet flag is used when the equipment in question was produced prior to 1991.

Note: since this relies on publicly shared data there may also be a bias where losses for Ukraine and Russia are underreported or overreported, respectively. While it may be true that Russia is losing much more equipment than Ukraine, it might be faulty to assume so based on this data alone.

Main dataset

The main dataset is data/oryx_data.rds but there are also daily .csv to be found in the data/daily subfolder. Simply retrieve the data from the latest available day or from any other timestamp that you would like to analyze.

read_csv("data/daily/2022-03-26_oryx_data.csv")
#> Rows: 1798 Columns: 13
#> ── Column specification ────────────────────────────────────────────────────────
#> Delimiter: ","
#> chr  (6): equipment_type, cntry_army, flag, system, status, image_link
#> dbl  (6): total_equipment_type_oryx, total_system_oryx, total_destroyed_oryx...
#> dttm (1): timestamp
#> 
#> ℹ Use `spec()` to retrieve the full column specification for this data.
#> ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#> # A tibble: 1,798 × 13
#>    equipment_type cntry_army flag      system status image_link total_equipment…
#>    <chr>          <chr>      <chr>     <chr>  <chr>  <chr>                 <dbl>
#>  1 Tanks          Russia     https://… T-64BV destr… https://i…                7
#>  2 Tanks          Russia     https://… T-64BV destr… https://i…                7
#>  3 Tanks          Russia     https://… T-64BV destr… https://i…                7
#>  4 Tanks          Russia     https://… T-64BV destr… https://i…                7
#>  5 Tanks          Russia     https://… T-64BV damag… https://i…                7
#>  6 Tanks          Russia     https://… T-64BV captu… https://i…                7
#>  7 Tanks          Russia     https://… T-64BV captu… https://i…                7
#>  8 Tanks          Russia     https://… T-72A  destr… https://i…                8
#>  9 Tanks          Russia     https://… T-72A  destr… https://i…                8
#> 10 Tanks          Russia     https://… T-72A  captu… https://i…                8
#> # … with 1,788 more rows, and 6 more variables: total_system_oryx <dbl>,
#> #   total_destroyed_oryx <dbl>, total_abandoned_oryx <dbl>,
#> #   total_captured_oryx <dbl>, total_damaged_oryx <dbl>, timestamp <dttm>

Timestamping Equipment Losses

OCR

The data for dates is kindly provided by @Narretz (click here) who runs the Oryx source images through OCR in Azure Vision API. Since there could be errors in reporting, I only ever aggregate the data by week (not showing daily counts as that might be misleading). The graphs always exclude the latest week, as that one is not finished yet and might give a biased impression.

Visualizations

Overall losses by type I

Overall losses by type II

Tank losses by status

Armor losses by status

Artillery losses by status

Aircraft losses by status

Overall losses over time

Overall losses over time by (some) vehicle types

Artillery losses over time

Overall tank losses over time

uaconflict_equipmentloss's People

Contributors

favstats avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

uaconflict_equipmentloss's Issues

Wrong counting in oryx_data_dates_com.rds

Great work! However, something goes wrong during the date matching in oryx_data_dates_com.rds. Russian trucks are missing completely and many other counts differ from the regular data due to duplicates and/or missing entries. I guess the trucks are missing due to the comma in their type.

PS: If it is not too much work I would appreciate it if you kept the oryx_data_dates.csv updated daily as well.

Access latest data as CSV without knowing the date of last scrape

To make analysis and using this data as easy as possible, a copy of the latest daily CSV could be kept with a fixed name.

This way, I can write a script, notebook or visualisation without changing the date or having to write a script that looks for a particular file and always retrieving the latest data at e.g. https://raw.githubusercontent.com/favstats/uaconflict_equipmentloss/main/data/daily/oryx_data.csv (instead of 2022-03-23_oryx_data.csv).

I think this can be achieved by simply rewriting this file upon scrape.

Counting Artillery

When I count the artillery losses, the numbers from different graphs don't add up. At one time it is 263, on time 181. Furthermore on the Orxy website it says RS lost is over 4000. Do you know why? Thanks and cheers

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.