madewild / camille Goto Github PK
View Code? Open in Web Editor NEWCentre for Archives on the Media and Information
License: MIT License
Centre for Archives on the Media and Information
License: MIT License
Allow to perform case-sensitive searches, e.g. only "jambes" without including the city of "Jambes"
Elasticsearch managed service too expersive with 3 nodes
An idea to be discussed later: we could add a "(Un)Select all" button, which could allow to go faster if the user only wants to (un)select one or two newspaper(s) (the same could be done for other filters with checkboxes). But this "(Un)Select all" button should work without activating the filter, otherwise it could be problematic.
Originally posted by @anjacque in #36 (comment)
The font for the text containing the selected years in the "Year" filter is not the same as the one for the rest of the page and the other filters.
With some explanations about the project
Allow to override default ranking (by relevance) to rank by date for instance
Tweak css for font size/style
Use universal calendar??
To allow larger page numbers (e.g. 500) which might be inconvenient in a slider.
Parse id and reindex ES
Sort by date asc/desc automatically on select
Define distance between two terms with AND
Explain Lucene syntax, results...
Allow to search only with filters with no keyword (only pages of some journal/year/date etc)
Use * for that ?
So that it will be by default the first file in the list, which is potentially 500 files long
To 60GB/node
CAMille will provide users with a set of external online resources related to the history of journalism in Belgium.
For the moment, only write "Under construction"
It seems to work properly! Two things if we want to improve it even more:
Originally posted by @anjacque in #30 (comment)
Add language field in ES to anticipate for NL docs
Cron job?
In JB838 and JB1051
Parse id and reindex ES
Add basic stats in the zip (number of files, breakdown by journal, etc.)
Parse id and reindex ES
and booleans
Cf. IDLab demonstrator https://tw06v072.ugent.be/kbr/ and https://github.com/filipradenovic/cnnimageretrieval-pytorch
Using spaCy?
To show more than 10 hits
track_total_hits=true for count api
input field with calendar (from - to)
Exporting XML or TXT ? Limit ?
For overview of hits across years
In addition to modal
Like newseye, needed?
<S interpreted as strikethrough in https://dev.camille-ulb-kbr.be/?query=*&paper=JB421&sortcrit=relevance&year_from=1930&year_to=1930
Add README file (+ basic LICENSE?) with some explanations about the contents (limit of 500 files, format, etc.)
Add journal name in natural language etc
On newspaper, year (cursor)...
Parse id and reindex ES
Currently only possible one by one (newseye style), to replace by checkboxes or multiselect?
To save money once all XML are there. Hoping it will not crash...
@anjacque can you provide them?
Possible with route53?
Quotes should be escaped in return form
Waiting for domain name...
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.