Comments (4)
I've made a sample page for this. Using Spanish, if you search for jubilación
or jubilacion
(wrongly-written version of the first one), lunr is giving different results - something that shouldn't really happen, lunr being a full-text search engine.
We've discussed a little bit about this in manastech/middleman-search#23 (that's were the example comes from), and I think this should be solved by lunr-languages rather than the user having to load lunr.unicodeNormalizer by itself.
If lunr-languages loads lunr.unicodeNormalizer or if it does a different thing, I'm not sure. But if I'm enabling spanish full-text search, I definitely want accented words to yield the exact same results than a non-accented version of the word.
I can totally try to fix lunr-languages if you give me some pointers about how to do it. It's just that I'm not sure where/how should I do it.
I'm pretty much sure @eemi wants to know about this issue.
from lunr-languages.
about handling accent, see fortnightlabs/snowball-js#2
from lunr-languages.
and back to snowballstem/snowball#55
from lunr-languages.
Hi, any news about that issue?
I'm currently working on an offline & multi-language search client with pouchdb-quick-search and I face the same limitations.
But if I'm enabling spanish full-text search, I definitely want accented words to yield the exact same results than a non-accented version of the word.
I completely agree with @matiasgarciaisaia.
Ignoring all diacritical mark (à, ñ, ç, é, ...) will highly improve the relevancy of the results.
Right now, the only workaround I can think of would be to strip all the diacritical mark before indexing the data.
from lunr-languages.
Related Issues (20)
- Error when initializing `lunr.th.js` HOT 1
- Is this repository dead? HOT 1
- Add language support for Catalan
- How should I use it in ES6?
- lunr.zh.js can`t search 'database' HOT 4
- React JS
- Accented letter ê should be replaced by e in the french stemmer HOT 1
- Usage of Lunr js with Turkish language
- Minified version of the Thai language is missing HOT 2
- lunr-languages/lunr.fr.js fails to find common words like "équipement" HOT 3
- Can "nodejieba" be as "devDependency" or "peerDependency" in lunr-languages? HOT 4
- Update nodejs version in CI
- nodejieba (Chinese) is not working with webpack HOT 1
- lunr.de fails with umlaute in wildcard search
- Arabic HOT 1
- Search indexing for Chinese language (lunr.zh) does not work with multi-language HOT 4
- Idea: Use Intl.Segmenter to reduce bundle size in browsers
- Error when using lunr.zh.js 'nodejieba.cut is not a function' HOT 2
- Cannot read properties of undefined (reading 'registerFunction') HOT 1
- Add language support for Sanskrit, Kannada & Telugu HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lunr-languages.