zeromus / jad Goto Github PK

License: zlib License

C 18.15% C++ 2.46% Objective-C 0.10% C# 79.29%

jad's Introduction

JAD

This project is pre-alpha WORK IN PROGRESS.

The JAD (Just-A-Disc) project is an implementation of the (new) JAD (and JAC) file formats dedicated for storing compressed optical discs in a random-access structure. Focus is on a healthy ecosystem and not just the format: tools and readily reusable libraries are provided.

JAD functions as a canonical representation of a disc. It should be superior to cue+bin, ccd, etc. in this respect. JAC functions as a conveniently compressed variant of a JAD.

It is designed to be simple and compact for embedded platforms; consequently the storage is not as efficient as it could be. We have opted for a 'least common denominator' approach, with no bulky external dependencies which might be difficult to implement on every device.

It is similar to but less powerful than MAME's CHD system.

These JAC files are not necessarily supposed to be distributed directly, although that would be nice. Rather, they are designed to to be a representation of the disc suitable for hardware emulation. Users would most likely convert other disc images to this format as an offline process.

The project is comprised of three parts: libjad, which solely implements the JAD/JAC formats; libjadvac which implements readers for other formats (cue,ccd) and conditionally encompasses much bulkier dependencies like libavcodec, etc. for decoding audio; and jadtool, which converts discs to JAD/JAC format. The intention is that libjad would be used on its own by most projects, libjadvac additionally by projects unwilling to require use of JAD/JAC files, and jadtool not used at all by other projects, but rather by end-users.

The libjadvac and libjad sources are written in beautiful archaic 1990-era C with no dependencies, to maximize portability. Jadtool is written in an archaic dialect of C++.

jad's People

Contributors

Watchers

jad's Issues

run more than 1 byte at a time through heatshrink (and other codecs?)

This is a performance optimization. Codecs that cant handle that can implement their own one-at-a-time loops

refine JACOP_COPY_SUBCODE_MASK

Define initial subcode buffer [96] state (based on a standard assortment of subcode)
Initial subcode state should include encoded MSF in subq
determine which channels need copying to adjust them from the initial buffer state and copy/encode whichever way only those channels
Consider how the P subchannel is all 0x00 or 0xFF and this is a good compression opportunity.

host media block size

hm i should also allow for a variable (specified in header) padding before the raw sector data begins, and possibly also a concept of host media block size, so that we can align disc sectors such that the number of host media block that need to be read can be bounded. for instance suppose the host reads 1MB at a time from its media; we would hate for any sector to straddle that boundary, else it would require two reads from host media.

metadata header specs

i want the header to have a metadata format version and/or registry version.

jadtool: add auditing tools

This would include such things as for instance referencing a redump database, and then "identifying" a disc by checking it's hash against the database. But... I guess the existing auditioning tools could just use libjad, huh? Maybe instead we should make an example/reference 'audition' tool which does that to show the way, but leave the bulk of the work to others.

exotic idea: gather encoding intelligence from back-tracking sectors from filesystem

Parse a filesystem and determine what types files are; track those back to sectors armed with knowledge of what type of file it is and choose the best compression filters.
i told you....this...is...exotic.

exotic idea: jadtool developer compression statistics report

calculate filters used and compression savings of each (how could you even isolate the effects of individual filters without rerunning the encoding on each sector with different sets of filters?) and print a report, so we can find out which filters are useful and which filters arent. Genererally, at least. Presumably someone would find at least one good test case before committing a filter...

analyze possibility of c# jad/jac implementation

This may be difficult due to the requirements of the flac precoder, but other than that it should be doable. I think whether libjad can be ported to other languages might be a condition for calling it a success. Therefore we might have to let this inform exactly how ruthlessly we strip down libflac when absorbing it (libflac being one of the larger slabs of external code that we'd incorporate)

Once done, implement in bizhawk

Add hashing mechanisms to jadtool

Actually, I'm not completely sure whether these belong in jadtool or libjad.

Calculating a hash would only be useful in libjad in case an emulator wanted to load a jad and check the hash itself. Since we're intending for hash memos/tags to be used instead for any purpose an emulator would want, overall I think the actual hashing methods belong in jadtool.

Hashes should include our custom PSX quick-hash, as well as some other suitable hash all over the disc contents, possibly based on what redump would do (although that may not be the best choice). Note that the idealized PSX quick-hash may depend on parsing the filesystem, but we haven't gone that far yet.

Add libavcodec/libavformat support to mednadisc

Depends on #2, naturally.

evaluate multi-threading encode/decode

it's conceivable that this library could work multi-threaded. each sector can encode in parallel as long as the results are queued to one thread for writing to the output file. even though it will complexify the encoding main loop, it might be worth doing this early on for speedier testing, and it might free us to try wilder ideas for selecting better sector encodings by brute forcing more possibilities.

Add flac-based precoder to coding pipeline

makefile building level 2

Add support for libmirage and libavcodec and libavformat to the makefile building. This is starting to get into the realm of bulkier dependencies but hopefully it won't hurt the makefile too badly. But at this point, we need a linux developer on-board.

Depends on #12

consider sector clustering

For better compression, we could cluster several sectors together. This would have to be optional. The size of clusters would be put in the header. Any given user of libjad would have to reject clusters larger than it's prepared to deal with (embedded systems may not have huge sector buffers)

Howevever, due to the concept for how the sector encoding works, this might be really difficult. Basically, all the filters are designed to operate on one sector at a time. Maybe we'd need a special encode stream command which said to extend the range of operation across several sectors worth. For instance a command like "just do basic decompression of 2340 bytes (that is, excluding sync)" would change to "just do basic decompression of 2340 bytes, twice (and skip both sync fields)"

makefile building level 3: mednadisc

Add support for mednadisc to makefile-based ports. (optional, of course).
This sounds like less work than #13 but may be harder due to mednafen codebase's surliness. Consider re-making mednadisc to be a patch on top of mednafen and then just add mednafen as a dependency, since mednafen's build surliness is already largely tamed.
It should be done earlier in windows simply because I already have it building in windows.
Depends on #12

add CCD support to libjadvac

Add libavcodec/libavformat to libjadvac

Should be done after #3 (since #3 is more value, letting us ingest a wider array of weird discs for testing on) (and also since it's no use until cue loading is somewhat respectable)

Another idea would be to add support for ffmpeg (maybe simpler to get rolling in windows than figuring out how to build the whole libav* stuff)

add proof of concept RLE filter to coder pipeline

Note, a ZEROES filter would be redundant, since the coder pipeline is defined to start with a zero-filled sector buffer anyway. And at any rate an RLE over a zero-filled sector would reduce to practically the same thing, just a few bytes.

Add mednadisc API

smaller index

allow format to specify whether index fields are 4byte or 8byte.. 8bytes would only be needed for dvd which im not supporting yet anyway

Add ECMtools-like functionality to coding pipeline

Add hash verify command to jadtool

Depends on #7. Depends on #19

Remember, a disc can have multiple hashes.

Add some general info options to libjad (header fields, metadata read/writing)

This is for spurring development of the basic architecture of the tool and making sure it can scale up to several commands. Also it will exercise the basic libjad apis.

Reminder: jadtool won't be able to print info about non-jad files. For instance jadtool info somefile.cue would not be allowed. All jadtool can do with cues is import (and later maybe export) them. Basically every jad command other than import and export operate on jads solely.

See also #19 (implementing metadata system)

install in libretro-common

This shouldn't be done for a long time, but it needs to be done eventually

add concept of mastering profiles

Since cue is a lossy format, occasionally assumptions must be made about how to reconstruct the original disc. I'm not sure where the appropriate point in the code is for these assumptions to be realized, but jadtool will need to receive them, and they need to be stored in metadata

add SBI support to jadtool (after ccd or cue are done)

Some thoughts.

We could have it only support SBI when importing cue/ccd
We could have a jadtool option to apply an SBI
In both cases, metadata should be added to indicate SBI (with a given hash ID perhaps) has been added

makefile building

Proof of concept makefile-based compiling of jadtool (and necessarily libjadvac and libjad).
This should not be hard since these are lightweight components. No autotools or cmake or anything is necessary. We'll update the makefile sources by hand (if we dont glob for inputs) when adding sources

evaluate png-like bitmap precoder

png has several precoders (most complex of which is 'paeth') which could be inspirational for compressing some sectors containing bitmaps. although come to think of it that paeth precoder needs some sense of bitmap pitch and that might be utterly impractical here. But we should be able to get mileage out of trying a few different bitplane configurations and seeing how well they compress.

But if you wanted to be really slick you could try various pitches, looking for the one that minimizes entropy from paeth's point of view, judging that to be the correct pitch. this would be a big job, but given that it's being applied a sector at a time, only a small range of pitches would need to be assessed (for instance, pitches of 4000 would not even fit in a sector; pitches of 2000 would not benefit much from the 2d precoding since it would really only be 2 lines. Therefore pitches between 2 and 1000 or so would be reasonable. This would be equivalent to trying to trial-compressing about 1000 45x45 pngs, or in other words, a single 450x450 png--not too bad, right?)

As mentioned in other tickets (#23), having a filesystem parser could help here with selecting the right precoder parameters to use by tracking sectors back to known media types.

implement metadata system

implement metadata system
see also #1

primary consideration of metadata system is that it should be updatable without rewriting the jadfile. This means it's in a table at the end of the file, like MP3 tags, you know. However, it's a little cleaner than that, since the jad header can point to the metadata table.

warning: old metadata concepts in docs are probably archaic

optional: jadvac other formats (MDS, NRG, ISO, CHD)

MDS or NRG would be good candidates, but this is mostly for the joy of it and the sake of completion--these aren't exactly extremely useful

also a good dumb ISO loader would be a good introductory task for someone (it would be largely based on the libmirage loader, which is not yet as graceful as it could be)

CHD might not make sense, but it's worth investigating. The suspected awkwardness of loading CHD was one of the motivating factors for libjad in the first place; but it would be nice if that awkwardness was resolved for other people once and for all within this pile of code, instead of just avoided (such that they could pluck the chd-reader out of libjadvac if all they wanted was a chd reader)

internal verification hash vs identification hash

clarify this concept. internal verification may be privileged--it may be stored in the header and hash the entire contents using a hash guaranteed to be available in libjad. that is, it could hash the sectors, the TOC, the metadata separately, and every single byte of the file. this is independent of identification hashes which libjad doesn't know anything about in detail (but can read out as comments)

zeromus / jad Goto Github PK

jad's Introduction

JAD

jad's People

Contributors

Watchers

jad's Issues

Recommend Projects

Recommend Topics

Recommend Org