Git Product home page Git Product logo

elasticsearch-analysis-korean's Introduction

Korean Analysis for ElasticSearch

The Korean Analysis plugin integrates Lucene Korean analysis module into elasticsearch.

In order to install the plugin, simply run: bin/plugin -install chanil1218/elasticsearch-analysis-korean/1.3.0

if above command is not working then try this:

bin/plugin -url https://dl-web.dropbox.com/spa/grpekzky9x5y6mc/elastic-analysis-korean/public/elasticsearch-analysis-korean-1.3.0.zip -install analysis-korean

Or you can clone this git repository, set correct version, build and just copy the jar file to plugins/analysis-korean/ directory.

--------------------------------------------------
| Korean Analysis Plugin      | ElasticSearch    |
--------------------------------------------------
| master                      | 0.90.1 -> master |
--------------------------------------------------
| 1.3.0                       | 0.90.1 -> master |
--------------------------------------------------
| 1.2.0                       | 0.90.0           |
--------------------------------------------------
| 1.1.0                       | 0.19.9           |
--------------------------------------------------
| 1.0.0                       | 0.19.2           |
--------------------------------------------------

The plugin includes the kr_analyzer analyzer, kr_tokenizer tokenizer, and kr_filter token filter.

Lucene Korean Analysis Module

http://cafe.naver.com/korlucene

elasticsearch-analysis-korean's People

Contributors

chanil1218 avatar xissy avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

elasticsearch-analysis-korean's Issues

Support ES 1.5

Hi,
Do you plan to support future version of elasticsearch?

When i download newest source, it doesn't build. display error

[INFO] Scanning for projects...
[INFO]
[INFO] ------------------------------------------------------------------------
[INFO] Building elasticsearch-analysis-korean 1.3.0
[INFO] ------------------------------------------------------------------------
[INFO]
[INFO] --- maven-enforcer-plugin:1.0:enforce (enforce-maven) @ elasticsearch-analysis-korean ---
[INFO]
[INFO] --- maven-jflex-plugin:1.4.3-r1:generate (default) @ elasticsearch-analysis-korean ---
[INFO] generated D:\elasticsearch-analysis-korean\src\main\java\org\apache\lucene\analysis\kr\KoreanTokenizerImpl.java
[INFO]
[INFO] --- maven-resources-plugin:2.4.3:resources (default-resources) @ elasticsearch-analysis-korean ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 1 resource
[INFO] Copying 14 resources
[INFO]
[INFO] --- maven-compiler-plugin:2.3.2:compile (default-compile) @ elasticsearch-analysis-korean ---
[INFO] Compiling 49 source files to D:\elasticsearch-analysis-korean\target\classes
[INFO] -------------------------------------------------------------
[ERROR] COMPILATION ERROR :
[INFO] -------------------------------------------------------------
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[11,7] cannot find symbol
symbol : constructor TokenFilterFactory()
location: class org.apache.lucene.analysis.util.TokenFilterFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[22,10] cannot find symbol
symbol : method init(java.util.Map<java.lang.String,java.lang.String>)
location: class org.apache.lucene.analysis.util.TokenFilterFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[23,19] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[24,17] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[25,18] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[26,16] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[11,7] org.apache.solr.analysis.kr.KoreanTokenizerFactory is not abstract and does not override abstract method create(org.apache.lucene.util.AttributeSource.AttributeFactory,java.io.Reader) in org.apache.lucene.analysis.util.TokenizerFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[15,33] cannot find symbol
symbol : constructor TokenizerFactory()
location: class org.apache.lucene.analysis.util.TokenizerFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[19,42] cannot find symbol
symbol : constructor TokenizerFactory()
location: class org.apache.lucene.analysis.util.TokenizerFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[23,18] create(java.io.Reader) in org.apache.solr.analysis.kr.KoreanTokenizerFactory cannot override create(java.io.Reader) in org.apache.lucene.analysis.util.TokenizerFactory; overridden method is final
[INFO] 10 errors
[INFO] -------------------------------------------------------------
[INFO] ------------------------------------------------------------------------
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 8.232s
[INFO] Finished at: Tue Jun 18 22:24:13 KST 2013
[INFO] Final Memory: 15M/308M
[INFO] ------------------------------------------------------------------------
[ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:2.3.2:compile (default-compile) on project elasticsearch-analysis-korean: Compilation failure: Compilation failure:
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[11,7] cannot find symbol
[ERROR] symbol : constructor TokenFilterFactory()
[ERROR] location: class org.apache.lucene.analysis.util.TokenFilterFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[22,10] cannot find symbol
[ERROR] symbol : method init(java.util.Map<java.lang.String,java.lang.String>)
[ERROR] location: class org.apache.lucene.analysis.util.TokenFilterFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[23,19] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[24,17] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[25,18] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanFilterFactory.java:[26,16] getBoolean(java.util.Map<java.lang.String,java.lang.String>,java.lang.String,boolean) in org.apache.lucene.analysis.util.AbstractAnalysisFactory cannot be applied to (java.lang.String,boolean)
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[11,7] org.apache.solr.analysis.kr.KoreanTokenizerFactory is not abstract and does not override abstract method create(org.apache.lucene.util.AttributeSource.AttributeFactory,java.io.Reader) in org.apache.lucene.analysis.util.TokenizerFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[15,33] cannot find symbol
[ERROR] symbol : constructor TokenizerFactory()
[ERROR] location: class org.apache.lucene.analysis.util.TokenizerFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[19,42] cannot find symbol
[ERROR] symbol : constructor TokenizerFactory()
[ERROR] location: class org.apache.lucene.analysis.util.TokenizerFactory
[ERROR] \elasticsearch-analysis-korean\src\main\java\org\apache\solr\analysis\kr\KoreanTokenizerFactory.java:[23,18] create(java.io.Reader) in org.apache.solr.analysis.kr.KoreanTokenizerFactory cannot override create(java.io.Reader) in org.apache.lucene.analysis.util.TokenizerFactory; overridden method is final
[ERROR] -> [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please read the following articles:
[ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException

Request: Code examples

In the readme.md, would it be possible to put a code example of how to implement the analyzer, filter and tokenizer?

In the ES customer analyzer documentation there are options to set.
What options are available to be set? What are some sensible default options?

For example, I have this configuration;

{
    "template": "template_ko",
    "settings": {
        "analysis": {
            "analyzer": {
                "kr_analyzer": {
                    "type": "custom",
                    "tokenizer": "kr_tokenizer",
                    "filter": [
                        "kr_filter"
                    ]
                }
            },
            "tokenizer": {
                "kr_tokenizer": {

                }
            },
            "filter": {
                "kr_filter": {}
            }
        }
    }
}

What should I be putting in tokenizer.kr_tokenizer and filter.kr_filter??

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.