Comments (7)
Here are some performance comparisons between SQLite, a couple of flavors of Lucene, and one I hadn't heard of called Tracker. SQLite and CLucene are roughly equivalent, and since SQLite's effective code size is nearly zero, it seems like the best choice.
from cbforest.
SQLite's FTS comes with a pretty basic tokenizer that doesn't handle stop-words or stemming. We've been using the fts3-unicodesn library for that, which incorporates Snowball stemmers for a dozen languages, plus a custom list of stop-words.
It would be good to make this pluggable, so we can use OS facilities instead. For example, the Cocoa Foundation framework has classes for doing tokenization and stemming, which I believe support many more languages, including Asian ones which are very difficult to work with.
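To make the "pluggable" idea concrete, here is a rough sketch of what I mean. It is not cbforest code: `Tokenizer`, `FullTextIndexer`, `naive_stem` and the tiny stop-word list are all made up for illustration, and the toy stemmer only stands in for a real Snowball stemmer or an OS-provided one.

```python
import re
from typing import Callable, Dict, List, Set

# Hypothetical plug-in point: a tokenizer is any callable that
# turns raw text into a list of index terms.
Tokenizer = Callable[[str], List[str]]

# Illustrative stop-word list only; a real one would be per-language.
STOP_WORDS = {"a", "an", "and", "the", "of", "to"}


def naive_stem(word: str) -> str:
    """Toy suffix stripper standing in for a real stemmer."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def default_tokenizer(text: str) -> List[str]:
    """Lowercase, split on non-word characters, drop stop-words, stem."""
    words = re.findall(r"\w+", text.lower())
    return [naive_stem(w) for w in words if w not in STOP_WORDS]


class FullTextIndexer:
    """Minimal in-memory inverted index with a swappable tokenizer."""

    def __init__(self, tokenizer: Tokenizer = default_tokenizer) -> None:
        self.tokenizer = tokenizer
        self.postings: Dict[str, Set[int]] = {}  # term -> doc ids

    def add(self, doc_id: int, text: str) -> None:
        for term in self.tokenizer(text):
            self.postings.setdefault(term, set()).add(doc_id)

    def search(self, query: str) -> Set[int]:
        """Return ids of docs containing every term of the query."""
        terms = self.tokenizer(query)
        if not terms:
            return set()
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return result
```

The same tokenizer runs over documents and queries, so swapping it (for example `FullTextIndexer(tokenizer=str.split)`, or a wrapper around a platform tokenizer) changes both sides consistently without touching the index itself.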
from cbforest.
There's also Xapian, which has a nice feature set.
from cbforest.
> the Cocoa Foundation framework has classes for doing tokenization and stemming, which I believe support many more languages, including Asian ones which are very difficult to work with.
This would be amazing to have as I have a lot of Asian customers.
from cbforest.
I'm very interested in this feature, since I'm using Couchbase with ForestDB and word-prefix matching is currently not supported. Is there any time frame for implementing it?
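For anyone unsure what I mean by word-prefix matching, here is a small sketch. It is only an illustration and says nothing about how cbforest or ForestDB would implement it; `PrefixTermIndex` and its methods are hypothetical names. It relies only on the indexed terms being kept in sorted order, so every term sharing a prefix sits in one contiguous run.

```python
import bisect
from typing import Dict, List, Set


class PrefixTermIndex:
    """Sorted term list plus postings, supporting prefix lookups."""

    def __init__(self) -> None:
        self._terms: List[str] = []            # unique terms, kept sorted
        self._postings: Dict[str, Set[int]] = {}  # term -> doc ids

    def add(self, term: str, doc_id: int) -> None:
        if term not in self._postings:
            bisect.insort(self._terms, term)   # keep the list sorted
            self._postings[term] = set()
        self._postings[term].add(doc_id)

    def match_prefix(self, prefix: str) -> Set[int]:
        """Return ids of docs containing any term starting with prefix."""
        docs: Set[int] = set()
        # Binary-search to the first term >= prefix, then walk forward
        # while terms still start with the prefix.
        i = bisect.bisect_left(self._terms, prefix)
        while i < len(self._terms) and self._terms[i].startswith(prefix):
            docs |= self._postings[self._terms[i]]
            i += 1
        return docs
```

With terms `couch`, `couchbase`, `count` and `forest` indexed, a query for `couch` would return the documents for the first two, which is the behavior I'm after.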
from cbforest.
No, it's not on our roadmap so far. Of course I'd be happy to advise & assist someone else who wanted to work on it…
from cbforest.
@lucatorella Word-prefix matching is the only reason why I haven't switched to ForestDB just yet.
from cbforest.