yangchenghuang Goto Github PK
Type: User
Type: User
Example for an airflow plugin
A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces
A beautiful, simple, clean, and responsive Jekyll theme for academics
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Apache Ranger Plugin for S3
Spark DataSouce plugin for reading files from various formats like Parquet into Arrow compatible columnar vectors.
Apache Arrow DataFusion and Ballista query engines
ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.
ARXaaS is a "Anonymization as a Service" project built ontop of the ARX library
A curated list of resources for Document Understanding (DU) topic
This sample shows you how to use the Node.js SDK to interact with Microsoft Azure Cosmos DB.
This repo contains the sample code of the Azure Search and Cognitive Services used to provide insights and analysis around the JFK Files.
Bandit is a tool designed to find common security issues in Python code.
deidentify patient notes using pre-trained BERT
PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper
Bidirectional Encoder Representations from Transformers (BERT) transfer learning for named entity recognition and de-identification of sensitive data
An awesome README template to jumpstart your projects!
BigDebug: Debugging Primitives for Interactive Big Data Processing in Spark (ICSE 2016)
Boosting BERT performances in FSL context
A Python library for large-scale nearest neigbhor computations via k-d trees and GPUs.
Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logic in a scalable and resilient way.
2020 Census 2010 Demonstration Data Products Disclosure Avoidance System
Cerbos is the open core, language-agnostic, scalable authorization solution that makes user permissions and authorization simple to implement and manage by writing context-aware access control policies for your application resources.
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
CKAN extension that allows a user to create private datasets only visible to certain users. The extension provides also an API to specify which users can access private datasets
A stand-alone service to pack a given CKAN resource in a ZIP file and email the link to a user.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.