Comments (3)
@janorivera If you look at our stop words, especially here you'll see that we do have Spanish stop words.
Look at how the classifier is initialized, English, 'en' is the default.
def initialize(*args)
@categories = Hash.new
options = { language: 'en', auto_categorize: false }
args.flatten.each do |arg|
if arg.kind_of?(Hash)
options.merge!(arg)
else
add_category(arg)
end
end
@total_words = 0
@category_counts = Hash.new(0)
@category_word_count = Hash.new(0)
@language = options[:language]
@auto_categorize = options[:auto_categorize]
end
Try something like the following:
> classifier = ClassifierReborn::Bayes.new(language: 'es', 'bueno', 'malo')
SyntaxError: unexpected ')', expecting end-of-input
> classifier = ClassifierReborn::Bayes.new('bueno', 'malo', language: 'es')
=> #<ClassifierReborn::Bayes:0x007f85358ca200 @categories={:Bueno=>{}, :Malo=>{}}, @total_words=0, @category_counts={}, @category_word_count={}, @language="es", @auto_categorize=false, @enable_threshold=false, @threshold=0.0>
> classifier.train_bueno "pastel ron gatos"
=> [
[0] "pastel ron gatos"
]
> classifier.train_malo "cosas malas como lluvia, dolor, enfermidad"
=> [
[0] "cosas malas como lluvia, dolor, enfermidad"
]
> classifier.classify "lluvia"
=> "Malo"
Hope this helps!
-Chase
from classifier-reborn.
I'll try to add to the docs soon.
from classifier-reborn.
Hey thanks Chase for the answer!
from classifier-reborn.
Related Issues (20)
- whan i add a utf8 chars HOT 1
- In some languages like Chinese, a word of length not bigger than 2 is very common, so I suppose this is a very strong(sometimes wrong in other languages) assumption. HOT 2
- How to install via jruby HOT 1
- ability to serialize model? HOT 1
- "ArgumentError: comparison of Float with NaN failed" if trying to search a corpus with an item that lacks common words HOT 3
- HTTPS for static site HOT 4
- Deprecated Gem::Specification#has_rdoc HOT 4
- 2.3.0 not released to Rubygems HOT 4
- broken links to docs (domain name not resolving) HOT 6
- TypeError: no implicit conversion from nil to integer in /classifier-reborn-2.2.0/lib/classifier-reborn/lsi.rb:313:in `sort' HOT 2
- Multiple separate bayes classifiers with single redis database HOT 1
- Documentation at classifier-reborn.com in inaccessible HOT 6
- Allow redis connection to be injected HOT 1
- Can classifier-reborn work with Numo::NArray / Numo::GSL ? Is that a better choice than nmatrix? HOT 9
- Is this project still actively maintained, or is it abandoned? HOT 3
- Problem with certain characters?
- [JRuby] Tests fail with jar-dependencies version mismatch
- Add prefix to the Redis keys
- Jekyll LSI not calculated on localized blog posts HOT 1
- Wijiji10
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from classifier-reborn.