Git Product home page Git Product logo

elasticsearch-skroutz-greekstemmer's People

Contributors

astathopoulos avatar bill-kolokithas avatar chief avatar greenonion avatar lovemeblender avatar m-peter avatar ptheof avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

elasticsearch-skroutz-greekstemmer's Issues

Usage sample

Hi,

First of all I would like to congratulate you guys for the enhanced greek stemmer you worked on for the elasticsearch platform. I believe that Usage example is needed as well as a test case scenario to be sure that we have done the correct configuration.

Can't install plugin

I can't install the plugin, can you please help?

cd /usr/share/elasticsearch && sudo bin/plugin --install skroutz/elasticsearch-skroutz-greekstemmer/2.4.4.1
-> Installing skroutz/elasticsearch-skroutz-greekstemmer/2.4.4.1...
Trying http://download.elasticsearch.org/skroutz/elasticsearch-skroutz-greekstemmer/elasticsearch-skroutz-greekstemmer-2.4.4.1.zip...
Trying http://search.maven.org/remotecontent?filepath=skroutz/elasticsearch-skroutz-greekstemmer/2.4.4.1/elasticsearch-skroutz-greekstemmer-2.4.4.1.zip...
Trying https://oss.sonatype.org/service/local/repositories/releases/content/skroutz/elasticsearch-skroutz-greekstemmer/2.4.4.1/elasticsearch-skroutz-greekstemmer-2.4.4.1.zip...
Trying https://github.com/skroutz/elasticsearch-skroutz-greekstemmer/archive/2.4.4.1.zip...
Trying https://github.com/skroutz/elasticsearch-skroutz-greekstemmer/archive/master.zip...
Failed to install skroutz/elasticsearch-skroutz-greekstemmer/2.4.4.1, reason: failed to download out of all possible locations..., use --verbose to get detailed information

Building or Testing on a system with default encoding other than UTF-8 breaks file "stemming_samples.txt"

The problem is that UpdateStemmingSamples.java reads the file with UTF-8 encoding and replaces it with a file using the default encoding of the building computer. Subsequent builds fail.

Proposed changes (lines 27, 28):
FileOutputStream fileWriter = new FileOutputStream(file.getAbsoluteFile());
BufferedWriter writer = new BufferedWriter(new OutputStreamWriter(fileWriter, StandardCharsets.UTF_8));

elasticsearch 2

Αποτυχία εγκατάστασης σε elasticsearch 2
ERROR: Could not find plugin descriptor 'plugin-descriptor.properties' in plugin zip

Branch for 5.5.2

Hi!

Could you compile a new branch for ES 5.5.2?

Thank you :)

Installation on 6.x ES

Cannot install on latest ES due to error:

sudo bin/elasticsearch-plugin install gr.skroutz:elasticsearch-skroutz-greekstemmer:5.4.2.1
-> Downloading gr.skroutz:elasticsearch-skroutz-greekstemmer:5.4.2.1 from maven central
[=================================================] 100%   
Warning: sha512 not found, falling back to sha1. This behavior is deprecated and will be removed in a future release. Please update the plugin to use a sha512 checksum.
ERROR: This plugin was built with an older plugin structure. Contact the plugin author to remove the intermediate "elasticsearch" directory within the plugin zip.

Any chance for an update here?

ElasticsearchIllegalArgumentException[failed to find token filter type [skroutz_stem_greek] for [stem_greek]];

Version 1.1, Index :

    "index":{
        "analysis":{
            "analyzer":{
                "analyzer_startswith":{
                    "tokenizer":"keyword",
                    "filter":"lowercase"
                },
                "prefix-test-analyzer": {
                    "type": "custom",
                    "tokenizer": "standard",
                    "filter" : ["lowercase","stem_greek"]
                }
            },
            "filter" : {
                "mynGram" : {
                    "type" : "nGram",
                    "min_gram" : 2,
                    "max_gram" : 50
                },
                "stem_greek": {
                    "type":"skroutz_stem_greek"
                }
            },
            "tokenizer": {
                "prefix-test-tokenizer": {
                    "type": "path_hierarchy",
                    "delimiter": "."
                }
            }
        }
    }

Can't install plugin

Here is what I get:

sudo /usr/share/elasticsearch/bin/plugin -install skroutz/elasticsearch-skroutz-greekstemmer/0.0.1-> Installing skroutz/elasticsearch-skroutz-greekstemmer/0.0.1... Trying http://download.elasticsearch.org/skroutz/elasticsearch-skroutz-greekstemmer/elasticsearch-skroutz-greekstemmer-0.0.1.zip... Trying http://search.maven.org/remotecontent?filepath=skroutz/elasticsearch-skroutz-greekstemmer/0.0.1/elasticsearch-skroutz-greekstemmer-0.0.1.zip... Trying https://oss.sonatype.org/service/local/repositories/releases/content/skroutz/elasticsearch-skroutz-greekstemmer/0.0.1/elasticsearch-skroutz-greekstemmer-0.0.1.zip... Trying https://github.com/skroutz/elasticsearch-skroutz-greekstemmer/zipball/v0.0.1... (assuming site plugin) Failed to install skroutz/elasticsearch-skroutz-greekstemmer/0.0.1, reason: failed to download out of all possible locations..., use -verbose to get detailed information

rule0, exceptions handling

rule0 of SkroutzGreekStemmer.java tries to handle special cases for specific word endings.
However, most of those cases concern whole words, rather than endings.
Eg. the word περατοσ is handled as an ending, and will also match υδατοπερατοσ and stem it as υδατοπερ, σαφωσ will match φωσ, etc.
Those case are false positive matches.

Most of the cases should be handled with string equality (rather than string suffix matching).
This should happen in an extra step before what now is rule0 and rule0 should have less special cases to handle

Stem Exception Handling

The exceptional cases of the various analysis steps are not uniformly handled.
Some are static variables and some are coded into if clauses.
All of them are hardcoded and can only change by altering the source files.
We can make an effort to

  1. handle them uniformly
  2. load them from a resource file

How to install it in elasticsearch 7.17.6

Hi, I want to install it in my current elasticsearch which is v.7.17.6

Installation fails with the following message:

Exception in thread "main" java.lang.IllegalArgumentException: Plugin [elasticsearch-skroutz-greekstemmer] was built for Elasticsearch version 7.7.0 but version 7.17.6 is running

How can I update the code for my current elasticsearch version?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.