Comments (12)
Hi, Lisa. Thanks! If you could attach some XML here with the guilty texts, that would be great. Is it possible that these have been run through a script that removes Latin-script letters? In that case, an errant capital E in place of a capital epsilon would get stripped.
from lace2.
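A quick way to test the Latin-letter theory would be to flag any word that mixes Greek and Latin letters before it reaches a stripping script. The following is only a minimal sketch, not part of Lace: the function names script_of and mixed_script_tokens and the sample string are hypothetical, and the check relies on Unicode character names rather than a full script database.

```python
import unicodedata


def script_of(ch: str) -> str:
    """Classify a character as 'GREEK', 'LATIN', or 'OTHER' using its Unicode name."""
    if not ch.isalpha():
        return "OTHER"
    name = unicodedata.name(ch, "")
    if name.startswith("GREEK"):
        return "GREEK"
    if name.startswith("LATIN"):
        return "LATIN"
    return "OTHER"


def mixed_script_tokens(text: str) -> list[str]:
    """Return whitespace-delimited tokens that contain both Greek and Latin letters."""
    flagged = []
    for token in text.split():
        # NFC keeps accented Greek letters as single precomposed characters where possible.
        scripts = {script_of(ch) for ch in unicodedata.normalize("NFC", token)}
        if "GREEK" in scripts and "LATIN" in scripts:
            flagged.append(token)
    return flagged


# Hypothetical sample: the first word starts with LATIN CAPITAL LETTER E (U+0045)
# where GREEK CAPITAL LETTER EPSILON (U+0395) was intended.
sample = "\u0045\u1f30 \u03b4\u1f72"
print(mixed_script_tokens(sample))
```

Any token this reports would lose a letter if Latin-script characters were later removed wholesale.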
I forget which file it was yesterday, but today I saw it in
urn:cts:greekLit:tlg4026.tlg003.1st1K-grc1
(It's possible tidy cleaned it up, but these were capital pi and delta, so not the type of letters that would be confused.)
I wasn't tracking it carefully (I will from now on), but here is one passage:
[a 16] εῖ δὲ παρὰ ταῦτα μηδένα ἄλλον τρόπον ἐρωτήσεων τῶν
https://archive.org/details/commentariaina21pt12akaduoft/page/331/mode/1up?view=theater
Ok, this one was definitely tidy — as the original file was
<tei:div type="textpart" subtype="1" n="urn:cts:greekLit:tlg0557.tlg004.1st1K-grc1:2"><tei:p>2. Παρὰ θεῶν μὴ συνεχῆ
versus
<div type="textpart" subtype="section" xml:base="urn:cts:greekLit:tlg0557.tlg004.1st1K-grc1" n="2">
<p rend="indent">2. αρὰ θεῶν μὴ συνεχῆ
very interesting.
Phew. So it's unlikely to be a Lace problem, but if you can attach the original file here, I can check whether my macOS tidy makes this error. We should file a bug against tidy for sure. However, in the long run, we can use XSLT to do all your postprocessing, including indenting, and that should be more reliable.
I'll leave this open until we're certain it's not a Lace issue.
Ok, I freshly generated urn:cts:greekLit:tlg0557.tlg004.1st1K-grc1 and the uppercase pi is still there. I then processed the file on Linux with parallel tidy -xml -m -i {} ::: *xml, and the result also has the uppercase pi.
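To make this kind of before/after check repeatable, something like the following Python sketch could compare the text content of a file before and after tidy while ignoring whitespace, so that indentation changes do not count as differences. It is an illustration only: the helper names letters_only and first_difference are hypothetical, and the two sample strings are simplified from the snippets quoted earlier in this thread (namespace prefixes dropped, closing tags added).

```python
import xml.etree.ElementTree as ET


def letters_only(xml_string: str) -> str:
    """Join all text nodes and drop whitespace so re-indentation is ignored."""
    root = ET.fromstring(xml_string)
    return "".join("".join(root.itertext()).split())


def first_difference(before_xml: str, after_xml: str, context: int = 10):
    """Return (index, before_snippet, after_snippet) at the first mismatch, or None."""
    a, b = letters_only(before_xml), letters_only(after_xml)
    if a == b:
        return None
    i = 0
    while i < min(len(a), len(b)) and a[i] == b[i]:
        i += 1
    return i, a[i:i + context], b[i:i + context]


# Simplified stand-ins for the Lace export and the post-tidy file.
before = '<div n="2"><p>2. Παρὰ θεῶν μὴ συνεχῆ</p></div>'
after = '<div n="2">\n  <p rend="indent">2. αρὰ θεῶν μὴ συνεχῆ</p>\n</div>'

print(first_difference(before, after))
```

A result of None means the formatter changed only whitespace or markup; anything else points at the first place where characters were dropped or altered.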
This was a recent batch
ldpd_10922736_000.zip
Hi, Lisa. ldpd_10922736_000.zip is the set of files on which I based my comments above. When I generated them at heml.mta.ca/lace, I did not see these problems in the Lace output, but only after running tidy (on macOS).
As for the initial example, I find that the Δ is missing in the editing, as shown in this image
I've asked Charlotte to do a last scan of commentariaina21pt12akaduoft as it is up at heml.mta.ca/lace
@brobertson
Yes, I agree on this. I just could not be sure in the middle of the workflow (I would have had to go back, redownload, compare, etc.). Some of these were near those Aristotle brackets, so at first I thought those could have been creating some noise.
Issue resolved: the problem was not caused by Lace, but either by erroneous editing or by post-processing with tidy on macOS.