jeromebanks Goto Github PK
Name: Jerome Banks
Type: User
Name: Jerome Banks
Type: User
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Mirror of Apache Ambari
A platform for visualization and real-time monitoring of data workflows
A test framework for working with test corpora for unit tests.
Mirror of Apache Avro
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
Work in progress transmit from Google Code
Hive UDF's for the data warehouse
This repository contains two Python scripts that demonstrate how to create a chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake.
Scala-friendly, fast class-finder library (using ASM under the covers)
Docker image for running Spark 3 on Kubernetes on AWS
Create and modify Tableau workbook and datasource files
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Google BigQuery support for Spark, SQL, and DataFrames
This project generalizes the Spark MLLIB Batch and Streaming K-Means clusterers in every practical way.
Mirror of Apache Giraph
Example project using GitHub Maven Plugins
Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
Mirror of Apache Hive
GeoIP Functions for hive
Hive I/O Library
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
Mirror of Apache Hivemall (incubating)
Mirror of Apache Zeppelin (Incubating)
Java client for InfluxDB
A JavaScript implementation of the 128bit variant of Murmur3 (that is compatible with Guava)
A tool for managing Apache Kafka.
Apache Nutch
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.