- Clean & refactor `src/app.py`
- Implement known schema detection
- Try different parameters for feature generation/selection
- Implement smart epsilon detection for DBSCAN (clustering based on margins between text blocks)
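
One possible direction for the epsilon item above, as a minimal sketch and not the project's actual implementation: take each text block's edge-to-edge gap to its nearest neighbour, sort those gaps, and place epsilon in the middle of the largest jump, on the assumption that small gaps are margins inside a group and large gaps are margins between groups. The names `box_gap` and `estimate_eps`, the `(x0, y0, x1, y1)` box format, and the `fallback` parameter are all hypothetical.

```python
from math import hypot

def box_gap(a, b):
    """Edge-to-edge distance between two boxes (x0, y0, x1, y1); 0 if they touch or overlap."""
    dx = max(0.0, max(a[0], b[0]) - min(a[2], b[2]))
    dy = max(0.0, max(a[1], b[1]) - min(a[3], b[3]))
    return hypot(dx, dy)

def estimate_eps(boxes, fallback=1.0):
    """Heuristic epsilon: midpoint of the largest jump in sorted nearest-neighbour gaps."""
    if len(boxes) < 2:
        return fallback
    # Nearest-neighbour margin for every block (O(n^2), fine for a single page).
    nearest = sorted(
        min(box_gap(a, b) for j, b in enumerate(boxes) if j != i)
        for i, a in enumerate(boxes)
    )
    # Largest difference between consecutive sorted margins.
    size, k = max((nearest[i + 1] - nearest[i], i) for i in range(len(nearest) - 1))
    if size == 0:
        # All margins are equal, so there is no jump to split on.
        return nearest[-1] or fallback
    return (nearest[k] + nearest[k + 1]) / 2

# Three tightly stacked lines and one isolated block further down the page.
boxes = [(0, 0, 100, 10), (0, 12, 100, 22), (0, 24, 100, 34), (0, 64, 100, 74)]
eps = estimate_eps(boxes)
```

The resulting value could then be passed as `eps` to a DBSCAN run over the same pairwise `box_gap` distances (for example scikit-learn's `DBSCAN` with `metric="precomputed"`). Whether the largest-jump rule suits real page layouts is untested here and would need checking against the project's data.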
josephcappadona / cis520-final-project
Document Understanding Using Density-Based Spatial Clustering and Layout Analysis