Aaron Kempf's Projects
Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.
🗃 The open source self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
A curated list of awesome Python frameworks, libraries, software and resources
💿 Free software that works great, and also happens to be open-source Python.
[NOT MAINTAINED] Bubbles – Python ETL framework
Smart Automation inside your browser for free. Start earning and double your followers
.NET (WPF and Windows Forms) bindings for the Chromium Embedded Framework
Chinook is a sample database available for SQL Server, Oracle, MySQL, etc. It can be created by running a single SQL script.
Chocolatey package source for ssis-multiple-hash
CLR sql agent for SQL Server
SQL Server CLR function for running REST methods over HTTP
This is the Contoso University sample project to accompany the http://pleasereleaseme.net/continuous-delivery-with-tfs-vsts/ blog post series
A Microsoft SQL Server Integration Service plugin that enables Couchbase to be used as a source component in a data flow task
Mobile Application Development, Spring 2021 @ WSU
A Python library for building data applications: ETL, ML, Data Pipelines, and more.
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
A Python utility to crawl Microsoft SSIS ETL packages
Docker Image for Mautic
Examples of using DotNetBrowser
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Duty Scheduler application developed in Python Flask and MySQL
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )
GramBoard is world's first free and open source Desktop based Instagram marketing, management and analytics software.
Various Docker Compose examples of selfhosted FOSS and proprietary projects.
WordPress.org Plugin Mirror
Consolidating and Extending hosts files from several well-curated sources. You can optionally pick extensions to block Porn, Social Media, and other categories..