Git Product home page Git Product logo

tech-diff's Introduction

tech-diff

Compare different technologies. No BS and all sources linked.

Motivation

  • Many new/existing technologies
  • Hard to compare objectively
  • Biased sources/marketing of data technologies skews views/opinions
  • Find answers to simple questions fast
  • Provide links for those wanting deeper information

How it works

  1. JSON Schema defined here
  2. Write YAML files under here
  3. Template markdown file here
  4. Run ./gradlew generate to generate markdown files
  5. Run mkdocs build to generate site
  6. Open site/index.html to see result

Step 1 and 2 => Base data of different technologies Step 3-6 => Generate UI

JSON Schema

JSON Schema for tech-diff can be found here. You can import this schema into your IDE for validation of your YAML files. Links below show how you can import the schema:

How to contribute

  • Add new high level type of technology (i.e. databases, object stores, messaging system)
    • Add properties to JSON schema here. Ensure they are all required fields
    • Add folder under tech folder to house the corresponding YAML files
    • Add in YAML file(s) for each respective implementation of that technology
  • Add new implementation of technology (i.e. Apache Parquet for file formats)

Try to ensure all sources linked are high quality sources. This means it should preferably come directly from the project itself (either from documentation or source code).

Categories

  • Databases
    • Relational
    • No-SQL
    • Graph
    • Vector
  • File types
    • Parquet
    • DeltaLake
    • ORC
    • CSV
    • JSON
    • Fixed width
  • Object stores
    • AWS S3
    • Azure Blob
    • GCP Cloud Storage
  • Messaging
    • RabbitMQ
    • AMQ
    • Solace
    • Kafka
    • AWS Kinesis
    • AWS SQS
    • Azure Web PubSub
    • GCP PubSub
  • Query Analytics (OLAP)
    • AWS Athena
    • Azure Data Lake Analytics
    • GCP BigQuery
    • Presto
    • Trino
    • Snowflake
    • Redshift
    • Clickhouse
  • Data visualisation
    • Superset
    • Tableau
    • Looker
    • Redash
    • Metabase
    • Highcharts
    • Power BI
  • Job orchestration
    • Airflow
    • Dagster
    • Prefect
    • Mage
  • ML pipelines
    • KubeFlow
    • MLFlow
    • AWS SageMaker
    • Vertex AI
  • Data validation
    • SodaCL
    • Pydantic
    • Deequ
    • Great Expectations

Cloud Providers

  • AWS
  • Azure
  • GCP
  • IBM Cloud
  • Oracle Cloud
  • Alibaba Cloud
  • Huawei Cloud

Comparison Overview

  • Description
  • Link
    • Source code
    • Webpage
  • Category
  • Sub categories
  • Closest alternatives
  • Differences
    • Tab for option 1 better high level metrics
    • Tab for option 2 better high level metrics
  • Detailed comparison
  • Metric is either boolean or number

Repo Structure

Base tech spec that all solutions have (i.e. name, description, logo, links, license, year created, company). Each category has a spec where certain features are defined (i.e. language support for data validation tools)

  • Tech spec
  • Tech YAML
  • Markdown template

Site Structure

  • Home page
    • Description
    • Card grid of categories
  • Categories list
    • Select technology to inspect
    • Inspect via table structure
    • Select other technologies to compare

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.