cul-it / archival-storage-interface Goto Github PK

View Code? Open in Web Editor NEW

0.0 0.0 2.0 267 KB

Search and discovery interface for CUL archival storage

Ruby 66.68% JavaScript 1.20% CSS 0.80% HTML 8.98% XSLT 22.34%

archival-storage-interface's People

Watchers

archival-storage-interface's Issues

See information about objects on CULAR, Overflow, Hathi Trust, etc

A long-term goal for the AS-IF project may be to be able to do discovery of archival resources on other archival systems other than Overflow.

For now, we are only looking at Overflow, and will not be attempting to integrate other systems.

As a possible integration method, this could involve writing indexers for the manifests from the other sources and updating what information we care about searching on and displaying.

Display aggregate sizes (in addition to counts) in facet displays.

The archivist can query for all items in a collection.

See Use Case 1 at https://confluence.cornell.edu/pages/viewpage.action?pageId=340904277

The archivist can edit a collection label.

See Use Case 1 of https://confluence.cornell.edu/pages/viewpage.action?pageId=340904277

The original use case document was uncertain as to the need for this. Offhand, it does not fit well with the overflow storage model, where the collection label is embedded in the storage system.

user authentication integration

We shouldn't open this up to the world.

Extract (and facet on) the archival share a resource is on.

The preservation planner can query for what is the aggregate size of each file type, and how much each has grown in size and number over a set period of time.

See Use Case 1 of https://confluence.cornell.edu/pages/viewpage.action?pageId=340904277

Continuous Deployment to bb233-dev

Configure Travis to automatically deploy every successful build to bb233-dev for testing and feedback.

Fix spelling of facet category label

"Colllection" should be changes to "Collection"

Link bibid to catalog

If bibid = 5991856 then catalog link is https://newcatalog.library.cornell.edu/catalog/5991856

Conversion of archive manifests from old format to new format.

Existing collections in archival storage have a manifest file in the "old" format. Once the "new" format it finalized, the new collections will follow that new format.

We need a utility that will convert old manifest files to the new format, suitable for ingestion into AS-IF.

Be able find an item by searching for a checksum (or stem of a checksum)

For example I might want to find a file by searching for the sha1 digest da39a3ee5e6b4b0d3255bfef95601890afd80709 or perhaps just the start da39a3ee5e6b4b

Move share facet to top of facet list.

The preservation planner can query across collections and see the number of files and aggregate size of each collection

See Use Case 2 of https://confluence.cornell.edu/pages/viewpage.action?pageId=340904277

As a depositor, I want to be able to see counts of deposited objects to verify against my expectations

This is basically to support the workflow used by the current CULAR ingests, where Michelle/Dianne/Mira/Erin/etc look at the tool tips in the CULAR admin interface to determine if everything was ingested.

Importation of archival manifests into AS-IF

For collections in archival storage, we need to be able to import the manifest into AS-IF so that the archived material can be discovered.

This needs to be done for each collection in archival storage after the ingest process is done (or part of the ingest process).

Unfortunately, the manifest does not contain type information about archived objects, so the importation process will have to walk the collection storage itself and test the type of each object to get that information.

Access/visibility restrictions

Some potential collections will have visibility constraints because of rights management. We need to handle this appropriately.

Replace initial screen with more relevant content

Create user accounts and lock down new account creation.

Search based on checksum prefix

Instead of having to know the whole SHA1 checksum, we should be able to search for just the prefix, similar to how it is handled in git.

Fix code coverage

Code coverage definitely isn't working as expected.

I want it to look at Rails code I have written, and right now I'm not sure it's running at all.

Fix problem with negative file sizes

Looks like there is a Solr datatype issue with file sizes that results in negative numbers showing

How big is Archival03? How will AS-IF tell me that?

How much occupied space, and how much free space?
Do we also want to know what collections are on it?

Dashboard of state of preservation storage.

The preservation planner can query across collections for aggregate sizes according to file type, and how much each has grown over a set period of time (monthly/yearly).

See use case 2 in https://confluence.cornell.edu/pages/viewpage.action?pageId=340904277

Growth should be able to be expressed in numbers of files
Growth should be able to be expressed in terms of aggregate size.
Being able to extract data as a TAB file for further analysis would be a plus.

cul-it / archival-storage-interface Goto Github PK

archival-storage-interface's People

Watchers

archival-storage-interface's Issues

Recommend Projects

Recommend Topics

Recommend Org