
Comments (11)

Merricx avatar Merricx commented on June 20, 2024

I could add it if I had a Latin quadgram dataset, but unfortunately I don't have one.
If you have one, you can send the quadgram data here and I will add it to the language support.

from blasto.

bi3mw avatar bi3mw commented on June 20, 2024

Unfortunately, I do not have a corresponding dataset either, and I do not know how one could be created from a Latin corpus. I have asked in a forum whether anyone knows a source, and I will get back to you if there is any feedback.
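
For reference, one way a quadgram dataset could be produced from a plain-text corpus is sketched below. This is only an illustration under assumptions: the function names are hypothetical, the `QUAD count` line format is a guess rather than the format blasto actually expects, and quadgrams are counted across word boundaries after stripping everything except A-Z.

```python
from collections import Counter
import re


def count_quadgrams(text):
    """Count overlapping 4-letter sequences in text, ignoring non-letters."""
    # Keep only A-Z so that spaces and punctuation do not interrupt quadgrams.
    # Assumption: quadgrams may span word boundaries.
    letters = re.sub(r"[^A-Z]", "", text.upper())
    return Counter(letters[i:i + 4] for i in range(len(letters) - 3))


def format_quadgrams(counts):
    """Render counts as 'QUAD count' lines, most frequent first.

    The line format is hypothetical; adapt it to whatever the solver reads.
    """
    return "\n".join(f"{quad} {n}" for quad, n in counts.most_common())


# Tiny sample; a real dataset would need a corpus of millions of words.
sample = "Gallia est omnis divisa in partes tres"
counts = count_quadgrams(sample)
print(format_quadgrams(counts))
```

With a real corpus, the same function would be run over the full text (or file by file, summing the resulting `Counter` objects).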


marcoponzi avatar marcoponzi commented on June 20, 2024

I put this together from a corpus of ~2.5M words. Could it be useful?

https://drive.google.com/file/d/1ZX0Fu3rWREViVayVat_1myvSJq8lk2u-/view


Merricx avatar Merricx commented on June 20, 2024

Nice one, let me check whether it can be implemented.


marcoponzi avatar marcoponzi commented on June 20, 2024

Something that occurred to me after I created that file: medieval Latin typically represented 'u' and 'v' with the same character. Should this be 'simulated' in the quadgrams by replacing u with v or vice-versa in at least part of the corpus? Maybe duplicating those lines so that they appear both with distinct u/v and a single character?
Similarly, in medieval Latin, combinations like 'ae' and 'oe' were often written as they are pronounced: just 'e'. Would it be useful to also manage this in the same way as proposed for u/v?


Merricx avatar Merricx commented on June 20, 2024

I think duplicating those lines with the alternative spellings is the easiest way, although the quadgram file will be bigger.
If we instead replace 'u' with 'v' or vice versa in the dataset, we would also have to apply the same replacement to every candidate plaintext derived from the ciphertext we are trying to crack, and that could reduce the accuracy of the recovered plaintext.
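
The trade-off can be illustrated with a common log-probability quadgram score. This is a sketch under assumptions, not blasto's actual scoring code: `make_scorer` is a hypothetical helper and the counts are invented for illustration. When both spellings are present in the table, a candidate scores the same whether it uses U or V, so no replacement step on the candidate is needed.

```python
import math


def make_scorer(counts):
    """Build a scoring function from a {quadgram: count} table."""
    total = sum(counts.values())
    # Penalty for quadgrams that never occur in the table.
    floor = math.log10(0.01 / total)
    log_prob = {quad: math.log10(n / total) for quad, n in counts.items()}

    def score(candidate):
        # Sum the log-probabilities of all overlapping quadgrams.
        letters = "".join(ch for ch in candidate.upper() if "A" <= ch <= "Z")
        return sum(
            log_prob.get(letters[i:i + 4], floor)
            for i in range(len(letters) - 3)
        )

    return score


# Invented toy counts containing both the U and the V spelling.
toy_counts = {"UENI": 5, "VENI": 5, "ENIT": 10}
score = make_scorer(toy_counts)

print(score("VENIT"), score("UENIT"), score("XENIT"))
```

Here `VENIT` and `UENIT` receive the same score, while `XENIT` is penalised because `XENI` is not in the table.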


marcoponzi avatar marcoponzi commented on June 20, 2024

An updated version of the file, where I added the replacement of AE/OE with E and of V with U. This of course results in additional quadgrams (about 1% more lines).

https://drive.google.com/file/d/1F3R1byY_63bS4H6TLssn3PieUthCNxwc/view?usp=sharing
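
A minimal sketch of how such spelling variants could be folded into the counts is shown below. It is not necessarily how the linked file was produced: the helper names are hypothetical, and it assumes the simplest reading of the proposal, namely counting the corpus once as written and once with the replacements applied, then summing both.

```python
from collections import Counter
import re


def quadgrams(text):
    """Return all overlapping 4-letter sequences, ignoring non-letters."""
    letters = re.sub(r"[^A-Z]", "", text.upper())
    return [letters[i:i + 4] for i in range(len(letters) - 3)]


def medieval_spelling(text):
    """Apply the replacements discussed in the thread: AE/OE -> E, V -> U.

    Naive by design: it also rewrites words where 'ae' or 'oe' is not a
    diphthong (e.g. 'poeta'), which a careful version would need to handle.
    """
    text = re.sub(r"AE|OE", "E", text.upper())
    return text.replace("V", "U")


def count_with_variants(text):
    """Count quadgrams from the original and the variant spelling together."""
    # Assumption: both versions of the corpus contribute equally.
    return Counter(quadgrams(text)) + Counter(quadgrams(medieval_spelling(text)))


print(medieval_spelling("Caesar venit"))
print(count_with_variants("caesar"))
```

The replacements are applied before spaces are stripped, so an 'a' at the end of one word and an 'e' at the start of the next are not merged.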


zbelanger avatar zbelanger commented on June 20, 2024

I am also interested in the implementation of Latin in Blasto. Does it look like this can happen?


bi3mw avatar bi3mw commented on June 20, 2024

Is there any progress yet on the implementation of Latin?


bi3mw avatar bi3mw commented on June 20, 2024

> I am also interested in the implementation of Latin in Blasto. Does it look like this can happen?

If you are still interested, here is the mini version with Latin support:
https://www.dropbox.com/scl/fi/y066ahjjsccnpc8z9knpu/subst_solver_latin.zip?rlkey=l5en2nl8lps3rgv1rjiv6ln84&dl=1


zbelanger avatar zbelanger commented on June 20, 2024

