Supports Python >= 3.7.0
Place raw .html extract(s) in ./input_html
estc-parser can be installed and run via Poetry,
cd estc-parser
poetry install
poetry run chewfiles
cd estc-parser
pip3 install -r requirements.txt
python3 estc_parser/cli.py
Find .csv file with tabular results in ./output_csv
After submitting a query a researcher can export results using the Email/print/save button highlighted below.
Here is a sample of the raw unstructured .html export from our estc query.
Here we see the output of running the unstructured html above through estc-parser.