ikegami-yukino / shellinford-python Goto Github PK
View Code? Open in Web Editor NEWWavelet Matrix/Tree succinct data structure for full text search (based on shellinford C++ library)
License: BSD 3-Clause "New" or "Revised" License
Wavelet Matrix/Tree succinct data structure for full text search (based on shellinford C++ library)
License: BSD 3-Clause "New" or "Revised" License
I see the 0.3.2
release on PyPI, but running into trouble installing:
~ > pip install shellinford==0.3.2
Collecting shellinford==0.3.2
Could not find a version that satisfies the requirement shellinford==0.3.2 (from versions: 0.3.1)
No matching distribution found for shellinford==0.3.2
~ > pip search shellinford
shellinford (0.3.2) - Wavelet Matrix/Tree succinct data structure for full text
search (using shellinford C++ library)
Dear @ikegami-yukino,
I was wondering if there was a way to find maximal substring of query string given the FM index?
For example,
If the query is "IGPIVEGAGRE", and the FM Index contains "TIVEGAGRT", then it would return "IVEGAGR" and location (1)?
I am thinking this would involve building a bwt/fm index of both query and db and comparing, but not sure how to do this?
Thank you very much!
We're running into this error when reading a save FM index in Python 3 (it works well in Python 2!)
index_contains_kmer
for _ in fm.search(kmer):
SystemError: <built-in function delete_MapIntInt> returned a result with an error set
Hello,
I am building the library with VC++2019 build tools and it crashes on a undefined identifier 'time' when compiling shellinford_fm_index.cc
Including to the source solved the problem
As you know, FM-index allows counting of occurrences of a pattern and finding the locations of these occurrences as separate operations. Because counting is very fast (while locating and extracting contents are not), it would be more reasonable to have "count", "locate" and "extract" as separate APIs.
Sorry for not creating PR. I'm not very knowledgeable to SWIG and etc.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.