Git Product home page Git Product logo

lamapacos's Introduction

lamapacos is a system for opinion mining on web data, it is base on hadoop.The goal is to
provide a general tool for opinion mining. 

lamapacos's People

Watchers

James Cloos avatar

lamapacos's Issues

code review sample

sample....
Branch name:

Purpose of code changes on this branch:


When reviewing my code changes, please focus on:


After the review, I'll merge this branch into:
/trunk


Original issue reported on code.google.com by [email protected] on 2 Mar 2013 at 11:04

integrate nutch, add parse-html-dangdang to nutch

Branch name:0.0.1

Purpose of code changes on this branch:
   1:Integrate nutch, we use nutch to crawl data from internet.
     1>add dependencies to pom.xml
     2>configuration in conf/
     3>plugins of nutch in plugins/
     4>lib
  2:NutchRunner is a try to call nutch's crawl API;

When reviewing my code changes, please focus on:
  NutchRunner.java & pom.xml & conf/ & lib/ & plugins/

After the review, I'll merge this branch into:
/trunk


Original issue reported on code.google.com by [email protected] on 13 Mar 2013 at 4:17

Attachments:

java.lang.UnsatisfiedLinkError: no NLPIR_JNI in java.library.path

What steps will reproduce the problem?
1. Run NLPIRTokenizerTest
2. You may encounter the following error:
java.lang.UnsatisfiedLinkError: no NLPIR_JNI in java.library.path
    at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1860)
    at java.lang.Runtime.loadLibrary0(Runtime.java:845)
    at java.lang.System.loadLibrary(System.java:1084)
    at kevin.zhang.NLPIR.<clinit>(NLPIR.java:126)
    at org.lamapacos.preprocessor.extraction.NLPIRTokenizer.segment(NLPIRTokenizer.java:18)
    at org.lamapacos.prepocessor.extraction.NLPIRTokenizerTest.testSegment(NLPIRTokenizerTest.java:31)
        …………………………
3. Or you may go through this test(as ShaoLin Xiao does)


It is strange if "java.library.path" is computer-different, since if so we have 
to set this Path properly computer by computer.


You may temporarily solve this problem by setting the VM arguments:
  -Djava.library.path=/home/hadoop/workspace/lamapacos/lib  #the correct path

FYI:http://blog.csdn.net/larrylgq/article/details/7515362

Original issue reported on code.google.com by [email protected] on 4 Apr 2013 at 3:31

part of speech

Though it's unnecessary to distinguish all tokens from each other, we must 
detect punctuations from a record so as to analyse its intensity properly.

FYI:
   [1]:http://hi.baidu.com/sing520/item/150f385de1ce0511db1635d4
   [2]:http://www.docin.com/p-381420600.html

Original issue reported on code.google.com by [email protected] on 4 Apr 2013 at 9:27

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.