Comments (1)
A normalize()
function which parallels the existing fingerprint()
function would be useful to allow access to the algorithm that fingerprint
uses internally, but we probably want to include some other algorithms as well, such as the four Unicode normalization forms. The java.text.Normalizer
class provides access to this functionality.
Separately, we probably also want to enable access to locale sensitive string comparisons of various strengths as provided by java.text.Collator
. The associated CollationKeys
aren't really usable directly by themselves except for collating and testing whether two strings are equivalent at a given strength (e.g. for Western European languages, tertiary = ignore strength, secondary = ignore diacritics). Because this needs to have a locale specified, it probably needs a separate function, since normalize()
is locale independent.
from openrefine.
Related Issues (20)
- Column menus: select submenu item by moving mouse diagonally
- When checking for a running open refine localhost should be included in the no proxy list
- Trying to load a 3.4.1 (or 3.6?) project using OpenRefine v3.8.1 HOT 1
- Introduce new top level GREL variable `record` HOT 2
- Fix OSSRH upload in release pipeline HOT 6
- join() of array with nulls throws NullPointerException HOT 4
- toTitleCase() second argument (delimiters) undocumented HOT 1
- Permanent logging to file HOT 1
- Carriage Return added to cell value incorrectly during import
- Null values in rows are not parsed correctly during import preview
- Reconciliation candidates discarded when matching multiple cells to a single new item
- Filter() control returns text string not error object when first argument is not an array HOT 6
- Regression: matching a single cell to a reconciliation candidate resets the grid to the first page HOT 2
- Recon suggest widget: enable middle-click to open URL in new tab without selecting HOT 6
- Introduce Shared Functionality for Managing Clustering Functions
- Snapshot build shouldn't try to sort imports
- Failure of Cypress scatterplot facet test HOT 1
- OpenRefine 3.8.2 Mac Application Not Notarized Correctly HOT 1
- Issue With Best Candidate's Score Filtering
- Exception error raised when opening a CSV with large number of columns HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from openrefine.