Git Product home page Git Product logo

Comments (3)

armandcismaru avatar armandcismaru commented on June 24, 2024 1

I selected a few ETL Tools that are popular in the market right now. Listed few features and specific details about each one.
(more to be added)

1. AWS Glue | Product documentation

  • Automatic schema discovery
  • This ETL tool automatically generates the code to extract, transform, and load your data.
  • AWS Glue jobs allow you to invoke on a schedule, on-demand, or based on a specific event.
  • Serverless. There is no infrastructure to provision or manage.
  • AWS Glue will generate Apache Spark ETL code in Scala or Python.
  • Pricing:
    -$0.44 per DPU-Hour, billed per second, with a 1-minute minimum (Glue version 2.0) or 10-minute minimum (Glue version
    0.9/1.0) for each ETL job of type Apache Spark.
    -$0.44 per DPU-Hour, billed per second, with a 1-minute minimum for each ETL job of type Python shell.
    -$0.44 per DPU-Hour, billed per second, with a 10-minute minimum for each provisioned development endpoint.

2. Informatica PowerCenter | Product documentation

  • It has a centralized error logging system which facilitates logging errors and rejecting data into relational tables
  • Build-in Intelligence to improve performance
  • Limit the Session Log
  • Ability to Scale-up Data Integration
  • Foundation for Data Architecture Modernization
  • Better designs with enforced best practices on code development
  • Code integration with external Software Configuration tools
  • Synchronization amongst geographically distributed team members.
  • Supports AWS and Microsoft Azure
  • Web-Based, Cloud, SaaS deployment
  • Pricing: Starting from $3.50/hr or from $24,528/yr

3. Talend Open Studio for Data Integration | Product documentation

  • It is the first commercial open source software vendor for data integration.
  • Over 900 inbuilt components for connecting various data sources.
  • Drag and drop interface.
  • Improves the productivity and time required for deployment are using GUI and inbuilt components.
  • Easily deployable in a cloud environment.
  • Data can be merged and transforms traditional and Big Data into Talend Open Studio.
  • The online user community is available for any technical support.
  • Pricing: Talend is a free open source ETL tool.

4. Fivetran | Product documentation

  • Helps you to build robust, automated pipelines with standardized schemas.
  • Adding new data sources as fast as you need.
  • No training or custom coding required.
  • Support for BigQuery, Snowflake, Azure, Redshift, etc.
  • Access to all your data in SQL.
  • Complete replication by default.
  • Automated connectors for Database sources
  • API Access for programmatic management and access of Fivetran
  • Pricing: $1.50 / Credit, only pay or data that is new or changed in any month.

from 2020-healthcarelake.

armandcismaru avatar armandcismaru commented on June 24, 2024

https://www.softwaretestinghelp.com/best-etl-tools/

https://www.guru99.com/best-etl-tools.html

from 2020-healthcarelake.

joekendal avatar joekendal commented on June 24, 2024

Thanks for the research, this is a great digest. This will be useful later in the week.

Thoughts on these? Great integration with AWS services and provides great AI analytics for healthcare.

https://databricks.com/aws
https://databricks.com/solutions/industries/healthcare

from 2020-healthcarelake.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.