Comments (3)
Any normal azw3/mobi even with drm has an unencrypted exth header where all the metadata is stored. See the DumpMobiHeader program for a python tool that will dump the metadata and the complete header as a text file you could collect and post process or modify the code yourself.
Sent from my iPad
On Aug 26, 2016, at 10:26 PM, vanabel [email protected] wrote:
Is it possible to extract metadata from mobi/azw3(non protected) files? It would be greate to return these metadata (author/title/category) as an array.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
from kindleunpack.
Thanks, very useful, I find the link. But could you please help me again to show an example of how to get the title by php
: I try to use
$fp = fopen('test.mobi', 'r');
echo fgets($fp,503);
it seems not work. I think there must be some end positon need be set. But how?
from kindleunpack.
Sorry but It is not simple. Please get the DumpMobiHeader python program from here and try to replicate what it does to find and read the exth section from each header record. Or alternatively, modify that python program to only dump the metadata and then use php to parse its text output.
from kindleunpack.
Related Issues (20)
- unpack awz4 is grayed out HOT 2
- UnicodeEncodeError HOT 4
- Crash unpacking calibre-generated 'both' MOBI files HOT 29
- Unable to unpack azw file with .opf content HOT 5
- Unpacked files size doubled HOT 7
- Script throws AttributeError when parsing NCX navList HOT 5
- except error: Processing ncx / toc HOT 7
- Fix simple typo: specifc -> specific
- Re-packing to .mobi HOT 1
- Have considered package kindle ebooks parser into a pip package? HOT 5
- Incorrect language codes HOT 5
- Some Inflection data lost in the html file produced from kindle dictionary by KindleUnpack HOT 1
- KindleUnpack_ReadMe.htm references incorrect Python version HOT 2
- Incorrect version displayed on screen in DumpMobiHeader_v023.py HOT 1
- Convert `original-resolution` to `rendition:viewport` HOT 20
- encoding issue with source code under Microsoft Windows and default codepage HOT 1
- Book title not escaped in NCX HOT 8
- AttributeError: 'array.array' object has no attribute 'tostring' proposed fix HOT 4
- imghdr used in cover will be removed in python 3.13 HOT 3
- Named character references are not allowed in ePub HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kindleunpack.