Git Product home page Git Product logo

Comments (12)

brobertson avatar brobertson commented on August 22, 2024

Hi, Lisa. Thanks! If you could append some xml here with the guilty texts, that would be great. Is it possible that these have been run through a script that removes Latin-script letters? In that case, errant capital-e in place of capital-epsilon would get stripped.

from lace2.

lcerrato avatar lcerrato commented on August 22, 2024

I forget which one it was yesterday because I wasn't tracking it but I saw it in
urn:cts:greekLit:tlg4026.tlg003.1st1K-grc1 today

(It's possible tidy cleaned it up but these were capital pi and delta so not the type of letters that would be confused.)

I wasn't tracking it carefully (I will from now on), but here is one passage
[a 16] εῖ δὲ παρὰ ταῦτα μηδένα ἄλλον τρόπον ἐρωτήσεων τῶν
https://archive.org/details/commentariaina21pt12akaduoft/page/331/mode/1up?view=theater

from lace2.

lcerrato avatar lcerrato commented on August 22, 2024

Ok, this one was definitely tidy — as the original file was

<tei:div type="textpart" subtype="1" n="urn:cts:greekLit:tlg0557.tlg004.1st1K-grc1:2"><tei:p>2. Παρὰ θεῶν μὴ συνεχῆ

versus

<div type="textpart" subtype="section" xml:base="urn:cts:greekLit:tlg0557.tlg004.1st1K-grc1" n="2">
        <p rend="indent">2.  αρὰ θεῶν μὴ συνεχῆ 

very interesting.

from lace2.

brobertson avatar brobertson commented on August 22, 2024

Phew. So unlikely to be a Lace problem, but if you can append the original file here, I can check if my MacOS tidy makes this error. We should file a bug against tidy for sure. However, in the long run, we can use XSLT to do all your postprocessing, including indenting and that should be more reliable.

from lace2.

brobertson avatar brobertson commented on August 22, 2024

I'll leave this open until we're certain it's not a Lace issue.

from lace2.

brobertson avatar brobertson commented on August 22, 2024

Ok, I freshly generated urn:cts:greekLit:tlg0557.tlg004.1st1K-grc1 and the uppercase pi is still there. I processed the file with Linux parallel tidy -xml -m -i {} ::: *xml and the result also has the uppercase pi.

from lace2.

lcerrato avatar lcerrato commented on August 22, 2024

This was a recent batch
ldpd_10922736_000.zip

from lace2.

brobertson avatar brobertson commented on August 22, 2024

Hi, Lisa. ldpd_10922736_000.zip is the set of files I based my comments above. When I generated them at heml.mta.ca/lace I did not see these problems in the Lace output, but rather post tidy (in macos).

from lace2.

brobertson avatar brobertson commented on August 22, 2024

As for the initial example, I find that the Δ is missing in the editing, as shown in this image
Screenshot from 2021-04-15 13-59-38

from lace2.

brobertson avatar brobertson commented on August 22, 2024

I've asked Charlotte to do a last scan of commentariaina21pt12akaduoft as up at heml.mta.ca/lace

from lace2.

lcerrato avatar lcerrato commented on August 22, 2024

@brobertson
Yes, I agree on this. I just could not be sure in the middle of the workflow (have to go back and redownload and compare, etc.). Some of these were near those Aristotle brackets so I thought that could have been creating some noise at first.

from lace2.

brobertson avatar brobertson commented on August 22, 2024

Issue resolved because it was not caused by Lace, but either by erroneous editing or post-processing in MacOS 'tidy'.

from lace2.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.