The scripts exist at varying levels of completeness (some have seen extensive use in many projects whereas others have been used little or have incomplete documentation and missing unit tests). In order to measure this, I have added in a confidence score for each:
Confidence Score
Description
5
Code has been used (without any observed failures) in multiple production environments (or large real world projects)
4
Code has been used (without any observed failures) in a production environment (or large real world project)
3
Code appears to work perfectly and passes a suite of unit tests but has not yet been used in a production environment or large real world project
2
The code appears to work perfectly but has not been thoroughly tested
1
Skeleton of function/class is present but the code does not work fully yet
Extracts the information (HTML) from a public LinkedIn page (e.g. person or company) using a virtual browser
4
ascii_density_histogram
Draws a histogram using only raw text symbols
2
conjugate_prior_beta_binomial
Calculates the posterior distribution of the success probability parameter [p] of a binomial distribution, from observed data and a user-specified beta prior
4
cosine_similarity
Calculates the cosine similarity between two 1-dimensional numpy arrays
2
create_gcloud_vm_docker_template
Creates a folder containing the files necessary to quickly build a python docker container to run on a google cloud Virtual Machine
4
create_parallel_google_cloud_run_job_template
Run a task in parallel using a Google Cloud Run job (code-generating function)
2
create_project_scope_doc
Creates a basic project scope document (markdown) by prompting the user for input
3
DataBatcher
Breaks a provided iterable up into batches according to a provided batching pattern
4
delete_file_in_gcloud_bucket
Deletes a file which is in a google cloud bucket
4
download_file_from_gcloud_bucket_to_python
Reads a file from a google cloud bucket into python memory
4
duckduckgo_search_multipage
Fetches search results from the DuckDuckGo Lite search engine
2
gcloud_vm_deletes_itself
Running this function on a google cloud Virtual Machine (VM) causes the VM to delete itself
4
list_all_python_imports
Searches every python script in a given folder and lists all python modules imported within those scripts
2
list_files_in_gcloud_bucket
Returns a list of the files present in a specified google cloud bucket
4
longest_common_substring
Identifies the longest substring appearing in both strings
3
longest_sentence_subsequence_plagiarism_detector
Finds phrases (sequences of consecutive words) common to 2 documents (e.g. to act as a naive plagiarism detector)
3
make_url_request
A convenience function for making API requests using the urllib library
3
move_or_rename_file_in_gcloud_bucket
Move or rename a file which is in a google cloud bucket (which includes moving it to a different bucket)
4
parse_mime_email_parts
Extracts parts from an email that is in MIME format
2
print_progress_bar
Prints a progress bar (to standard out) while code is running
3
PythonPlottingTutorials
Example code snippets for creating common data visualisations in python
4
query_bigquery_to_pandas_df
Runs a query on Google BigQuery and writes the result into a local pandas.DataFrame
4
RapidBinaryClassifier
Ultra rapid generation of binary classifier models in scikit-learn by abstracting away a lot of the decisions and model code
3
RegexRulesClassifier
A multi-class text classifier using manual regex rules
2
require_api_key
A decorator adding basic API key authentication to a flask route
3
retry_function_call
Retries function (if it fails) according to retry pattern
4
run_python_function_in_parallel
Runs a python function in parallel on multiple cores or threads
4
scrape_webpage_and_all_linked_webpages
Extracts HTML from given web page, and also follows all of the hyperlinks on that page and scrapes those too
1
StringCleaner
Performs common string-cleaning operations to a text string, also allowing them to be chained in sequence
1
upload_file_python_to_gcloud_bucket
Writes an object in python memory to a file (blob) on a google cloud bucket
4
url_to_filename_to_url_mapper
Converts a webpage URL into a useable filename, where the URL can be recovered directly from the filename
2
view_nested_dict_structure
Generates a simple printout for understanding the structure of a complex nested python dictionary
4
write_pandas_df_to_google_bigquery_table
Writes a pandas dataframe to a table on Google BigQuery
Calculates the posterior distribution of the success probability parameter [p] of a binomial distribution, from observed data and a user-specified beta prior