Git Product home page Git Product logo

Comments (9)

MBoustani avatar MBoustani commented on August 21, 2024 1

@smadha I am currently working on using Docker.

from geoparser.

MBoustani avatar MBoustani commented on August 21, 2024 1

@smadha @chrismattmann I have both these steps along with other steps in Dockerfile, still testing and would be out soon.

from geoparser.

AravindRam avatar AravindRam commented on August 21, 2024 1

Hey @MBoustani

I tried the installation through docker and worked fine. Apart from few warnings, did not face any issues during the steps of installation.

from geoparser.

smadha avatar smadha commented on August 21, 2024

What will be our approach for installing tika and lucene-geo-gazetteer? Are we committing jar files into repo or will we be downloading them from mirror while installing?

I think we should also look for integrating geotopic parser with tika-python. Like not through a separate instance of tika-server (like we do now), but forming a separate object of tika-python which can directly parse using geotopic parser instead of default parser. I think we will need to append some files in class path and then we should be good.

@chrismattmann - thoughts?

from geoparser.

chrismattmann avatar chrismattmann commented on August 21, 2024

Use pip for Tika and use maven for lgg

from geoparser.

smadha avatar smadha commented on August 21, 2024

@chrismattmann I was suggesting to use tika python with location-ner-model , geotopic-mime in tika-server classpath.

from geoparser.

MBoustani avatar MBoustani commented on August 21, 2024

@chrismattmann @smadha
Here is my dockefile content, all steps go fine except last line I'm getting the error

#FROM debian:jessie
FROM ubuntu:14.04
#RUN rm /bin/sh && ln -s /bin/bash /bin/sh

RUN apt-get update && apt-get install -y \
    openjdk-7-jdk \
    curl \
    git \
    libxml2-dev \
    python \
    build-essential \
    python-dev \
    python-pip \
    python-numpy \
    python-scipy \
    maven

ENV JAVA_HOME /usr/lib/jvm/java-7-openjdk-amd64/jre/
ENV PATH $JAVA_HOME:$PATH

RUN echo $PATH
RUN echo $JAVA_HOME


RUN git clone https://github.com/chrismattmann/lucene-geo-gazetteer.git
ENV LGG /lucene-geo-gazetteer/src/main/bin/
ENV PATH $LGG:$PATH

WORKDIR lucene-geo-gazetteer
RUN mvn install assembly:assembly -DskipTests

RUN mkdir -p location-ner-model
WORKDIR location-ner-model
RUN curl -O http://opennlp.sourceforge.net/models-1.5/en-ner-location.bin
RUN mkdir -p org/apache/tika/parser/geo/topic
RUN mv en-ner-location.bin org/apache/tika/parser/geo/topic

WORKDIR /lucene-geo-gazetteer

RUN mkdir -p geotopic-mime
WORKDIR  geotopic-mime
RUN curl -O https://raw.githubusercontent.com/chrismattmann/geotopicparser-utils/master/mime/org/apache/tika/mime/custom-mimetypes.xml
RUN mkdir -p org/apache/tika/mime
RUN mv custom-mimetypes.xml org/apache/tika/mime

WORKDIR /lucene-geo-gazetteer

RUN curl -O http://download.geonames.org/export/dump/allCountries.zip
RUN unzip allCountries.zip
RUN mkdir -p geoIndex
RUN ["/bin/bash", "-c", "lucene-geo-gazetteer -i geoIndex -b allCountries.txt"]

Error:

WARNING: Building Finished
Sub command not recognised
The command '/bin/bash -c lucene-geo-gazetteer -i geoIndex -b allCountries.txt' returned a non-zero code: 255

from geoparser.

MBoustani avatar MBoustani commented on August 21, 2024

39189c7

from geoparser.

MBoustani avatar MBoustani commented on August 21, 2024

@smadha @chrismattmann @AravindRam

Can you please try docker installation (instruction in Readme) and let me know if there is any issue?

from geoparser.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.