Comments (3)
@ghukill thanks for opening this; I think we may be adding some notes here soon with some other related issues and/or edge cases we've been running into.
from eulfedora.
I'm just adding some errors I encountered:
Error importing emory:d743q to dev: 400 <?xml version="1.0" encoding="UTF-8"?><management:validation xmlns:management="http://www.fedora.info/definitions/1/0/management/" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.fedora.info/definitions/1/0/management/ http://www.fedora.info/definitions/1/0/validation.xsd" pid="unknown" valid="true">
<management:contentModels>
</management:contentModels>
<management:problems>
<management:problem>Schematron validation failed:org.xml.sax.SAXParseException; lineNumber: 921; columnNumber: 2; The value of attribute "REF" associated with an element type "foxml:contentLocation" must not contain the '<' character.</management:problem>
</management:problems>
<management:datastreamProblems>
</management:datastreamProblems>
</management:validation>
ChecksumMismatch, even with --archive-xml and --requires-auth
e.g.:
repo-cp --archive-xml --requires-auth prod dev emory:pg3k9
Traceback (most recent call last):
  File "/home/jsvarn/eulf/bin/repo-cp", line 137, in <module>
    repo_copy()
  File "/home/jsvarn/eulf/bin/repo-cp", line 121, in repo_copy
    requires_auth=args.requires_auth)
  File "/home/jsvarn/eulf/lib/python2.7/site-packages/eulfedora/syncutil.py", line 104, in sync_object
    export_data = export.object_data().getvalue()
  File "/home/jsvarn/eulf/lib/python2.7/site-packages/eulfedora/syncutil.py", line 298, in object_data
    dsinfo = self.get_datastream_info(previous_section)
  File "/home/jsvarn/eulf/lib/python2.7/site-packages/eulfedora/syncutil.py", line 258, in get_datastream_info
    infomatch = self.dsinfo_regex.search(force_text(dsinfo))
  File "/home/jsvarn/eulf/lib/python2.7/site-packages/eulfedora/util.py", line 44, in force_text
    s = six.text_type(bytes(s), encoding)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xa9 in position 0: invalid start byte
When I actually catch the error for the one above, I get:
Unexpected error on emory:bcd79: <type 'exceptions.ValueError'> __len__() should return >= 0
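For reference, the UnicodeDecodeError in the traceback can be reproduced in isolation. This is only a minimal sketch, assuming the chunk handed to the decoder starts partway through binary or non-UTF-8 content; the sample bytes are made up for illustration and are not taken from the object, and the tolerant decode is one possible workaround, not what eulfedora does.

```python
# Hypothetical sample bytes: 0xa9 is a UTF-8 continuation byte, so it
# cannot begin a character and strict decoding fails at position 0.
chunk = b"\xa9 example bytes followed by text"

try:
    chunk.decode("utf-8")
    strict_error = None
except UnicodeDecodeError as err:
    strict_error = err

# A tolerant decode substitutes U+FFFD for the undecodable byte instead of
# raising, which may be enough when the text is only going to be searched.
lenient = chunk.decode("utf-8", errors="replace")
```

Whether replacing bad bytes is acceptable depends on what the decoded text is used for; it would not be appropriate for the datastream content itself.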
from eulfedora.
I think setting a larger size for the chunk used for datastream info should be fine, and it shouldn't cause an issue with the regex since we're splitting on datastream start and end - that chunk shouldn't ever include datastream info for a previous datastream. My testing indicated that it worked fine for objects that can be successfully synced (excepting the problem record mentioned above, which seems to have other issues).
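To illustrate the reasoning about chunk size, here is a rough sketch rather than the actual syncutil code. The regex, the `datastream_info` and `section` helpers, the end marker, and the sample export are all hypothetical stand-ins. It assumes the export is split on a datastream end marker and that only the tail of each section is searched.

```python
import re

# Hypothetical stand-in for the datastream-info pattern; the real regex
# used by eulfedora is not shown in this thread.
DSINFO_RE = re.compile(
    rb'<foxml:contentDigest TYPE="(?P<type>[^"]+)" DIGEST="(?P<digest>[^"]+)"'
)
END = b"</foxml:datastream>"


def datastream_info(section, chunk_size):
    """Search only the last chunk_size bytes of one section."""
    tail = section[-chunk_size:]
    match = DSINFO_RE.search(tail)
    return match.groupdict() if match else None


def section(dsid, digest, filler):
    """Build a fake section: datastream info followed by filler bytes."""
    return (
        b'<foxml:datastream ID="' + dsid + b'">'
        + b'<foxml:contentDigest TYPE="MD5" DIGEST="' + digest + b'"/>'
        + b" " * filler
    )


export = section(b"DS1", b"aaa", 10) + END + section(b"DS2", b"bbb", 500) + END
sections = [s for s in export.split(END) if s]

# A small chunk misses the info when the filler after it is long.
small = [datastream_info(s, 100) for s in sections]
# A larger chunk finds it, and cannot pick up the previous datastream's
# info because each section holds exactly one datastream.
large = [datastream_info(s, 4096) for s in sections]
```

In this sketch the larger chunk size only changes how far back the search looks within a single section, which is why it does not risk matching info from an earlier datastream.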
from eulfedora.