Comments (7)
See also #56
from analytics.smgov.net.
@allejo: related to what I mentioned on the closure of #56. When I made ed16d7d and removed an old site, the aggregate WebJob started failing.
Removing the site from the _websites collection removes the key from the Jekyll-generated reports/variables.json file. Subsequent runs of the other WebJobs won't generate new data for the removed site. This is all expected 👍
However, data generated prior to removing the site is never cleaned up. When aggregate runs following the removal, it uses the contents of the data directory (a subdirectory for each agency) and the keys in reports/variables.json; since there is a mismatch, we get the error.
Two options I can think of: either use the keys from reports/variables.json exclusively, or have a separate cleanup WebJob that continuously deletes subdirectories of data that don't exist as keys in reports/variables.json. (I kind of like the former approach better than the latter.) Your thoughts?
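To make the mismatch concrete, here is a minimal Python sketch of comparing the two sources. It assumes reports/variables.json is a JSON object with one top-level key per site and that each subdirectory of data is named after such a key; neither detail is confirmed in this thread, and the function names are hypothetical, not taken from the aggregate script.

```python
import json
from pathlib import Path


def load_site_keys(variables_path):
    """Return the top-level keys of the variables file.

    Assumption: the file is a JSON object with one key per tracked site.
    """
    with open(variables_path, encoding="utf-8") as fh:
        return set(json.load(fh))


def split_data_dirs(data_dir, site_keys):
    """Partition the subdirectories of data_dir into (tracked, orphaned) names.

    Assumption: each subdirectory is named after a key in the variables file.
    Plain files inside data_dir are ignored.
    """
    names = sorted(p.name for p in Path(data_dir).iterdir() if p.is_dir())
    tracked = [name for name in names if name in site_keys]
    orphaned = [name for name in names if name not in site_keys]
    return tracked, orphaned
```

With the first option, aggregate would only ever process the tracked list, so leftover directories for removed sites could no longer trigger the error.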
Ahhh that would make a lot of sense... Yea, I'm in favor of using reports/variables.json exclusively in the aggregate WebJob.
As for cleaning up old data, we could have a manual WebJob that deletes any old data, which we'd run every so often? Or we could set that WebJob/script up to run on deployment as well.
Oh I like the idea of doing a clean on deployment! That plus moving aggregate to key off the reports/variables.json file should solve our current issue with removing sites and prevent stale data from sitting around forever.
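A rough Python sketch of what such a cleanup step might look like, runnable manually or as part of a deployment. The paths, the function name, and the assumption that reports/variables.json is a JSON object keyed by site are all illustrative and not taken from the repository.

```python
import json
import shutil
from pathlib import Path


def remove_stale_site_data(data_dir, variables_path, dry_run=True):
    """Delete subdirectories of data_dir that have no key in the variables file.

    Assumption: variables_path is a JSON object with one key per tracked site.
    Returns the names of the stale directories; deletes them unless dry_run.
    """
    with open(variables_path, encoding="utf-8") as fh:
        site_keys = set(json.load(fh))

    # Refuse to run against an empty key set, which would wipe every directory.
    if not site_keys:
        raise ValueError("no site keys found; refusing to clean up")

    stale = sorted(
        p for p in Path(data_dir).iterdir()
        if p.is_dir() and p.name not in site_keys
    )
    if not dry_run:
        for path in stale:
            shutil.rmtree(path)
    return [p.name for p in stale]
```

Defaulting to a dry run lets the list of directories be reviewed before anything is deleted, which seems prudent for a job that runs unattended.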
Should the change go into the feature/aggregate-script-46 branch so that can be revived/merged? Or should we do it in both branches (rewrite + master)?
Let's revive that thing and get it merged! I think I was supposed to review your changes, right?
Yea, and I just need to confirm that the generated data is the same as with the current script.