Comments (11)
I could, if I had a Latin quadgram dataset, but unfortunately I don't have one.
If you have one, you can send the quadgram data here and I will add Latin to the supported languages.
from blasto.
Unfortunately, I do not have such a dataset either, and I do not know how one could be created from a Latin corpus. I have asked in a forum whether anyone knows of a source, and I will get back to you if there is any feedback.
from blasto.
I put this together from a corpus of ~2.5M words. Would it be useful?
https://drive.google.com/file/d/1ZX0Fu3rWREViVayVat_1myvSJq8lk2u-/view
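For anyone wondering how a quadgram table can be derived from a corpus, here is a minimal sketch. It is not necessarily how the file above was produced. The assumptions are mine: only the letters A-Z are kept (accented characters are dropped), quadgrams run across word boundaries, and the "QUAD count" output layout is hypothetical and may not match what Blasto expects.

```python
import re
from collections import Counter


def count_quadgrams(text: str) -> Counter:
    """Count overlapping 4-letter sequences in `text`."""
    # Assumption: keep A-Z only, so spaces, punctuation and accented
    # letters are removed and quadgrams span word boundaries.
    letters = re.sub(r"[^A-Z]", "", text.upper())
    return Counter(letters[i:i + 4] for i in range(len(letters) - 3))


def format_table(counts: Counter) -> str:
    """Render counts as 'QUAD count' lines, most frequent first."""
    # Hypothetical file layout, chosen only for illustration.
    return "\n".join(f"{quad} {n}" for quad, n in counts.most_common())


if __name__ == "__main__":
    sample = "Gallia est omnis divisa in partes tres"
    print(format_table(count_quadgrams(sample)))
```

For a real corpus, the same function can be applied file by file and the resulting `Counter` objects summed.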
from blasto.
Nice one, let me check whether it can be implemented.
from blasto.
Something that occurred to me after I created that file: medieval Latin typically represented 'u' and 'v' with the same character. Should this be 'simulated' in the quadgrams by replacing u with v, or vice versa, in at least part of the corpus? Perhaps by duplicating those lines, so that they appear both with distinct u/v and with a single character?
Similarly, in medieval Latin, combinations like 'ae' and 'oe' were often written as they are pronounced: just 'e'. Would it be useful to also manage this in the same way as proposed for u/v?
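The duplication idea could be sketched roughly as follows. This assumes upper-case corpus lines; the function names are made up for illustration, and folding V into U (rather than U into V) is an arbitrary choice. Note that a blanket replacement also rewrites sequences that are not diphthongs, such as the OE in POETA, so a real implementation might want a word list or exceptions.

```python
def medieval_variant(line: str) -> str:
    """Respell an upper-case line: AE/OE become E, V becomes U."""
    # Blanket string replacement; it does not check whether AE/OE
    # is actually a diphthong in the word concerned.
    return line.replace("AE", "E").replace("OE", "E").replace("V", "U")


def augment(lines):
    """Yield every line, followed by its respelled variant if it differs."""
    for line in lines:
        yield line
        variant = medieval_variant(line)
        if variant != line:
            yield variant


if __name__ == "__main__":
    for out in augment(["CAESAR VENIT", "ROMA", "POENA"]):
        print(out)
```

The augmented lines can then be fed to whatever quadgram counting step is used, so that both spellings contribute to the table.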
from blasto.
I think duplicating those lines with the alternative spellings is the easiest way, although the quadgram file will be bigger.
If we instead replace 'u' with 'v' (or vice versa) throughout, we would also have to apply the same replacement to every candidate plaintext derived from the ciphertext we are trying to crack, and that could reduce the accuracy of the recovered plaintext.
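To illustrate the second point, here is a rough sketch of quadgram scoring with an optional normalization step. This is not Blasto's actual scoring code: the log-probability scoring, the floor value for unseen quadgrams, and all names are assumptions for the example. If the table has been folded to a single character, each candidate has to be folded the same way before scoring, which means the scorer can no longer tell U and V apart.

```python
import math


def build_scorer(counts, normalize=lambda s: s):
    """Return a function that scores text by summed log10 quadgram probabilities."""
    total = sum(counts.values())
    # Assumed smoothing: unseen quadgrams get a small fixed probability.
    floor = math.log10(0.01 / total)
    logp = {quad: math.log10(n / total) for quad, n in counts.items()}

    def score(candidate: str) -> float:
        text = normalize(candidate)  # e.g. fold V into U before lookup
        return sum(logp.get(text[i:i + 4], floor) for i in range(len(text) - 3))

    return score


def fold_v(text: str) -> str:
    """Map V to U, matching a table built from a folded corpus."""
    return text.replace("V", "U")


if __name__ == "__main__":
    folded_table = {"UENI": 5, "ENIT": 5}  # toy counts, not real data
    with_fold = build_scorer(folded_table, fold_v)
    without_fold = build_scorer(folded_table)
    print(with_fold("VENIT"), without_fold("VENIT"))
```

With the folded toy table, "VENIT" and "UENIT" receive the same score once folding is applied, whereas without folding "VENIT" is penalized for the unseen quadgram "VENI".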
from blasto.
Here is an updated version of the file, to which I added the replacement of AE/OE with E and of V with U. This of course results in additional quadgrams (about 1% more lines).
https://drive.google.com/file/d/1F3R1byY_63bS4H6TLssn3PieUthCNxwc/view?usp=sharing
from blasto.
I am also interested in the implementation of Latin in Blasto. Does it look like this can happen?
from blasto.
Is there any progress yet on the implementation of Latin?
from blasto.
If you are still interested, here is the mini version with Latin support:
https://www.dropbox.com/scl/fi/y066ahjjsccnpc8z9knpu/subst_solver_latin.zip?rlkey=l5en2nl8lps3rgv1rjiv6ln84&dl=1
from blasto.