Git Product home page Git Product logo

ambar's People

Contributors

bkanuka avatar chazu avatar fr2019 avatar isido993 avatar jwsy avatar sgarwood avatar sochix avatar temberature avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ambar's Issues

Missing trailing slash on proxy_pass to frontend

Your original nginx config produced by default installation process.

server {
  listen 80;
  server_name ambar-fe;
  client_max_body_size 1024m;
  location /api/ {
    proxy_pass http://webapi:8080/api/;
  }
  location / {
    proxy_pass http://frontend:80;
  }
}

The last directive should be proxy_pass http://frontend:80/;

Invalid port specification: "None"

[root@searchbox ambar]# ./ambar.py start
 

______           ____     ______  ____       
/\  _  \  /'\_/`\/\  _`\  /\  _  \/\  _`\    
\ \ \L\ \/\      \ \ \L\ \ \ \L\ \ \ \L\ \  
 \ \  __ \ \ \__\ \ \  _ <'\ \  __ \ \ ,  /   
  \ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \  
   \ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
    \/_/\/_/\/_/ \/_/\/___/   \/_/\/_/\/_/\/ /


                                              
Docker version 1.12.1, build 23cf638
docker-compose version 1.12.0, build b31ff33
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
Creating ambar_db_1
Creating ambar_rabbit_1
Creating ambar_es_1
Creating ambar_frontend_1

ERROR: for es  Cannot create container for service es: b'Invalid port specification: "None"'

ERROR: for db  Cannot create container for service db: b'Invalid port specification: "None"'

ERROR: for rabbit  Cannot create container for service rabbit: b'Invalid port specification: "None"'

ERROR: for frontend  Cannot create container for service frontend: b'Invalid port specification: "None"'
ERROR: Encountered errors while bringing up the project.
Traceback (most recent call last):
  File "./ambar.py", line 218, in <module>
    start(configuration)
  File "./ambar.py", line 187, in start
    runShellCommandStrict('docker-compose -f {0}/docker-compose.yml -p ambar up -d'.format(PATH))
  File "./ambar.py", line 45, in runShellCommandStrict
    subprocess.check_call(command, shell = True)
  File "/usr/local/lib/python3.5/subprocess.py", line 584, in check_call
    raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'docker-compose -f /root/ambar/docker-compose.yml -p ambar up -d' returned non-zero exit status 1

Help with installing ambar on Ubuntu (says it cannot find docker-compose)

So I installed a 64-bit version of Ubuntu server inside VMware with the default settings. The only thing I changed was to install a few additional things (Samba file server and OpenSSH server). Next I run sudo apt-get update and then follow the instructions to install Docker CE. Adding the repository seems to go well, and then I install docker, and I follow step 2 (sudo apt-get install docker-ce). I am able to run the hello-world container that outputs a successful message. docker --version gives me v17.03.1-ce, build c6d412e.

Next I try to install docker-compose. pip is not installed by default it seems, so I run sudo apt install python-pip. pip version is 8.1.1 (python 2.7) if that matters. Next I run pip install docker-compose. It completes and mentions a newer version of pip is available. I execute the wget command on the blog, followed by sudo ./ambar.py install. Here I run into an issue- docker-compose not found. But docker-compose --version returns version 1.12.0, build b31ff33.

Yesterday I had a different error (ERROR: In file 'docker-compose.yml' service 'version' doesn't have any configuration options), but today is different. I'm not proficient with linux and all its permissions and various directories, so if it's obvious something is wrong, I won't see it. Thanks for any help!

Page not loading, "Oops... Something went wrong"

I appear to be having the same issue (or similar) as the OP (agreenfield1) did here. Any advise or assist based on the data below (PS I aready tried changing the http://es:9200 to the actual ip address but it didn't help?

./ambar.py start

______           ____     ______  ____
/\  _  \  /'\_/`\/\  _`\  /\  _  \/\  _`\
\ \ \L\ \/\      \ \ \L\ \ \ \L\ \ \ \L\ \
 \ \  __ \ \ \__\ \ \  _ <'\ \  __ \ \ ,  /
  \ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \
   \ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
    \/_/\/_/\/_/ \/_/\/___/   \/_/\/_/\/_/\/ /



Docker version 17.03.1-ce, build c6d412e
docker-compose version 1.13.0, build 1719ceb
sysctl: setting key "vm.max_map_count": Read-only file system
net.ipv4.ip_local_port_range = 15000 61000
net.core.somaxconn = 1024
ambar_proxy_1 is up-to-date
ambar_db_1 is up-to-date
Starting ambar_rabbit_1 ...
ambar_es_1 is up-to-date
Starting ambar_rabbit_1
Starting ambar_rabbit_1 ... done
ambar_webapi_1 is up-to-date
ambar_frontend_1 is up-to-date
Waiting for Ambar to start...
Ambar is running on http://192.168.5.122:80

http://192.168.5.122:80/_cat/indices
Just returns a blank page

website console errors

Failed to load resource: the server responded with a status of 502 (Bad Gateway)
     _/api/_
Failed to start. Error: SyntaxError: Unexpected token < in JSON at position 0                
     _app.5800ae896ed146f4e02a.js:3_ 
SyntaxError: Unexpected token < in JSON at position 0
     _app.5800ae896ed146f4e02a.js:3_
Failed to load resource: the server responded with a status of 502 (Bad Gateway)
     _/api/users/check_
js:6 Failed to auth. Error: [object Response]
     _app.5800ae896ed146f4e02a_

docker logs ambar_webapi_1

2017/06/21 02:11:12 Waiting for host:
2017/06/21 02:11:12 Waiting for host: 192.168.5.122:9200
2017/06/21 02:11:12 Connected to unix:///var/run/docker.sock
2017/06/21 02:32:25 Waiting for host:
2017/06/21 02:32:25 Waiting for host: 192.168.5.122:9200
2017/06/21 02:32:25 Connected to unix:///var/run/docker.sock
2017/06/21 02:37:25 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://192.168.5.122:9200]
2017/06/21 02:37:25 Waiting for host:
2017/06/21 02:37:25 Waiting for host: 192.168.5.122:9200
2017/06/21 02:37:25 Connected to unix:///var/run/docker.sock

docker logs ambar_es_1

[2017-06-21T02:50:42,752][INFO ][o.e.n.Node               ] [] initializing ...
[2017-06-21T02:50:42,855][INFO ][o.e.e.NodeEnvironment    ] [Co7MJEJ] using [1] data paths, mounts [[/ (/dev/mapper/pve-vm--210--disk--1)]], net usable_space [7.1gb], net total_space [58.9gb], spins? [possibly], types [ext4]
[2017-06-21T02:50:42,855][INFO ][o.e.e.NodeEnvironment    ] [Co7MJEJ] heap size [989.8mb], compressed ordinary object pointers [true]
[2017-06-21T02:50:42,857][INFO ][o.e.n.Node               ] node name [Co7MJEJ] derived from node ID [Co7MJEJKRxu9qhWVmwm8OQ]; set [node.name] to override
[2017-06-21T02:50:42,857][INFO ][o.e.n.Node               ] version[5.3.2], pid[1], build[3068195/2017-04-24T16:15:59.481Z], OS[Linux/4.4.62-1-pve/amd64], JVM[Oracle Corporation/OpenJDK 64-Bit Server VM/1.8.0_121/25.121-b13]
[2017-06-21T02:50:46,938][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [aggs-matrix-stats]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [ingest-common]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [lang-expression]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [lang-groovy]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [lang-mustache]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [lang-painless]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [percolator]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [reindex]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [transport-netty3]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded module [transport-netty4]
[2017-06-21T02:50:46,954][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded plugin [analysis-morphology]
[2017-06-21T02:50:46,954][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded plugin [analysis-smartcn]
[2017-06-21T02:50:46,955][INFO ][o.e.p.PluginsService     ] [Co7MJEJ] loaded plugin [analysis-stempel]
[2017-06-21T02:50:53,487][INFO ][o.e.n.Node               ] initialized
[2017-06-21T02:50:53,497][INFO ][o.e.n.Node               ] [Co7MJEJ] starting ...
[2017-06-21T02:50:53,741][WARN ][i.n.u.i.MacAddressUtil   ] Failed to find a usable hardware address from the network interfaces; using random bytes: f8:d1:3f:ba:9a:40:6b:8e
[2017-06-21T02:50:53,935][INFO ][o.e.t.TransportService   ] [Co7MJEJ] publish_address {172.18.0.5:9300}, bound_addresses {[::]:9300}
[2017-06-21T02:50:53,946][INFO ][o.e.b.BootstrapChecks    ] [Co7MJEJ] bound or publishing to a non-loopback or non-link-local address, enforcing bootstrap checks
ERROR: bootstrap checks failed
max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144]
[2017-06-21T02:50:53,982][INFO ][o.e.n.Node               ] [Co7MJEJ] stopping ...
[2017-06-21T02:50:54,789][INFO ][o.e.n.Node               ] [Co7MJEJ] stopped
[2017-06-21T02:50:54,789][INFO ][o.e.n.Node               ] [Co7MJEJ] closing ...
[2017-06-21T02:50:54,805][INFO ][o.e.n.Node               ] [Co7MJEJ] closed

File: config.json

{
    "dataPath": "/opt/ambar",
    "dropbox": {
        "clientId": "",
        "redirectUri": ""
    },
    "es": {
        "containerSize": "2g",
        "heapSize": "1g"
    },
    "ocr": {
        "pdfMaxPageCount": 5000,
        "pdfSymbolsPerPageThreshold": 100
    },
    "dockerComposeTemplate": "https://static.ambar.cloud/docker-compose.template.yml",
    "db": {
        "cacheSizeGb": 2
    },
    "fe": {
        "external": {
            "host": "192.168.5.122",
            "port": "80",
            "protocol": "http"
        }
    },
    "dockerRepo": "ambar",
    "api": {
        "auth": "basic",
        "external": {
            "host": "192.168.5.122",
            "port": "80",
            "protocol": "http"
        },
        "defaultLangAnalyzer": "ambar_en",
        "analyticsToken": "cda4b0bb11a1f32aed7564b08c455992",
        "mode": "ce",
        "crawlerCount": 1,
        "cacheSize": "1g",
        "pipelineCount": 1,
        "showFilePreview": "false"
    }
}

File: docker-compose.yml

version: "2"
networks:
  internal_network:
services:
  webapi:
    restart: always
    networks:
      - internal_network
    image: ambar/ambar-webapi:latest
    expose:
      - "8080"
    environment:
      - db=mongodb://db:27017/ambar_data
      - fe=http://192.168.5.122:80
      - api=http://webapi:8080
      - es=http://192.168.5.122:9200
      - redis=webapi-cache
      - rabbit=amqp://rabbit
      - mode=ce
      - pipelineCount=1
      - crawlerCount=1
      - dropboxClientId=
      - dropboxRedirectUri=
      - defaultLangAnalyzer=ambar_en
      - analyticsToken=cda4b0bb11a1f32aed7564b08c455992
      - auth=basic
      - ocrPdfMaxPageCount=5000
      - ocrPdfSymbolsPerPageThreshold=100
      - showFilePreview=false
    depends_on:
      - db
      - es
      - webapi-cache
      - proxy
      - rabbit
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock
  webapi-cache:
    restart: always
    image: redis:alpine   
    networks:
      - internal_network    
    expose:
      - "6379"
    ports:
      - "6379:6379" 
    mem_limit: 1g 
  proxy:
    image: ambar/ambar-proxy:latest
    networks:
      - internal_network
    restart: always
    environment:
      - API_EXT_PORT=80
      - FE_EXT_PORT=80
    expose:
      - 80
      
    ports:
      - "80:80"
      
  frontend:
    image: ambar/ambar-frontend:latest
    networks:
      - internal_network
    depends_on:
      - webapi
    expose:
      - "80"
    restart: always
    environment:
      - api=http://192.168.5.122:80
  db:
    restart: always
    image: ambar/ambar-mongodb:latest
    networks:
      - internal_network
    environment:
      - cacheSizeGB=2
    volumes:
      - /opt/ambar/db:/data/db
    expose:
      - "27017"
    ports:
      - "27017:27017"
  es:
    image: ambar/ambar-es:latest
    networks:
      - internal_network
    restart: always
    expose:
      - "9200"
    ports:
      - "9200:9200"
    environment:
      - cluster.name=ambar-es
      - bootstrap.memory_lock=true
      - xpack.security.enabled=false
      - security.manager.enabled=false
      - "ES_JAVA_OPTS=-Xms1g -Xmx1g"
    ulimits:
      memlock:
        soft: -1
        hard: -1  
      nofile:
        soft: 65536
        hard: 65536
    mem_limit: 2g
    cap_add:
      - IPC_LOCK
    volumes:
      - /opt/ambar/es:/usr/share/elasticsearch/data
  rabbit:
    image: ambar/ambar-rabbit:latest
    networks:
      - internal_network
    hostname: rabbit
    expose:
      - "15672"
      - "5672"
    ports:
      - "15672:15672"
      - "5672:5672"
    volumes:
      - /opt/ambar/rabbit:/var/lib/rabbitmq

Ambar wont Autostart

Hello

When restarting Ubuntu this is the output of docker ps

CONTAINER ID IMAGE COMMAND CREAT ED STATUS PORTS NAMES
5ec43d188c10 ambar/ambar-pipeline:latest "/bin/sh -c 'pytho..." 37 mi nutes ago Restarting (139) 49 seconds ago ambar_pipeline_p0
2ed6b189aebd ambar/ambar-crawler:latest "/bin/sh -c 'pytho..." 37 mi nutes ago Restarting (1) 36 seconds ago ambar_crawler_c0
baac19554af4 ambar/ambar-frontend:latest "/bin/sh -c 'echo ..." 38 mi nutes ago Up 3 minutes 80/tcp ambar_frontend_1
d1ef1edac298 ambar/ambar-webapi:latest "/bin/sh -c 'docke..." 38 mi nutes ago Restarting (1) 28 seconds ago ambar_webapi_1
53931fb03307 ambar/ambar-proxy:latest "/entrypoint.sh" 38 mi nutes ago Up 3 minutes 0.0.0.0:889-890->889-890/tcp ambar_proxy_1
0cd5938e8bfa ambar/ambar-es:latest "/docker-entrypoin..." 38 mi nutes ago Up 3 minutes 0.0.0.0:9200->9200/tcp, 9300/ tcp ambar_es_1
99ecced7bd96 redis:alpine "docker-entrypoint..." 38 mi nutes ago Up 3 minutes 0.0.0.0:6379->6379/tcp ambar_webapi-cache_1
7971cf6807cd ambar/ambar-mongodb:latest "/entrypoint.sh" 38 mi nutes ago Restarting (2) About a minute ago

Pipeline, crawler, and webapi are in an endless restart loop.

If I manually run ./ambar.py start then they work fine and come up as expected.

I would like this to autostart though.

Bug?

How to use ambar-ES for document search

Hello,
I installed ambar in my machine.But when I start it, sometime UI not get loading.Giving 502 Bad gate way error.
& When It start I am not able to understand where to write search query for ES indices.How to check whether document is present in ES or not..

Errors while processing data

Hi,

I am seeing allot of the following errors on the pipeline for all the files being used via SMB Crawler .

2017-05-11 14:59:05.470: [error] [p0] error submiting parsed text to Api, unexpected response code //windowsxptools/g/docs/file.html 500 {"message":"Invalid non-string/buffer chunk\n TypeError: Invalid non-string/buffer chunk\n at validChunk (_stream_writable.js:186:14)\n at GridWriteStream.Writable.write (_stream_writable.js:213:12)\n at ondata (_stream_readable.js:542:20)\n at emitOne (events.js:77:13)\n at emit (events.js:169:7)\n at Readable.read (_stream_readable.js:368:10)\n at flow (stream_readable.js:759:26)\n at resume (_stream_readable.js:739:3)\n at nextTickCallbackWith2Args (node.js:511:9)\n at process._tickCallback (node.js:425:17)"}

This is happening to all the files.

Any ideas ?

Thanks

Install issue on proxmox lxc

The install process was easy and straightforward. I was excited to get started with ambar, but at the very end I received a message in the console

sysctl: cannot stat /proc/sys/net/ipv4/tcp_fin_timeout: No such file or directory

I know lxc containers have an issue running docker (which I was able to get around by modifying the lxc conf file to include some extra parameters. I realize this is probably not the safest practice but the container would only be accessible on a local network. Since I'm still having issues with the container I was hoping to see if anyone had a solution I missed.

'utf-16-le' codec error

Hi!

I'm getting the following error for some of my files when crawling them using the SMB crawler. The same file can be uploaded manually and works ok.

[error] 'utf-16-le' codec can't decode bytes in position 18-19: illegal encoding.

What to do?

Github repository for Webapi and Pipeline

I am trying to get Ambar running on OSX.
It works so far, however the pipeline seems to be not able to connect to the Rabbit server. Thus, no documents are being processed and nothing is added to the search index.

I am trying to understand what's going on behind the scenes in order to find a solution for this problem. Unfortunately, the code for the Webapi and Pipeline docker images are not public.

Do you plan to release these soon?

Ambar behind a HTTP proxy

Hi,

I seem to be having problems with the ambar_webapi docker not using the system HTTP proxy correctly.

I have installed ambar self-hosted community edition onto Centos 7, which is behind a HTTP proxy. I have setup systemd for docker to define the HTTP_PROXY and HTTPS_PROXY correctly. ie I can download/book amabar ok.

I can then access the web front end ok, (changed to port 8005) but everything else is standard.. however I can't login, signup or anything - get an 'opps something went wrong message'.

Inspecting the docker log for ambar_webapi seems to show attempts to access a remote host (52.64.9.77) (and amazonaws.com host - mandrillapp.com??) without using the HTTPS proxy
`
[root@kgs-sts-fusion ambar]# ./ambar.py start


/\ _ \ /'_/\/\ _\ /\ _ /\ `\
\ \ \L\ /\ \ \ \L\ \ \ \L\ \ \ \L\ \
\ \ __ \ \ _
\ \ \ _ <'\ \ __ \ \ , /
\ \ /\ \ \ _/\ \ \ \L\ \ \ /\ \ \ \ \
\ _\ _\ _\ _\ _/ \ _\ _\ _\ _
/
//
//
/ //// /////_// /

Docker version 17.04.0-ce, build 4845c56
docker-compose version 1.13.0, build 1719ceb
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
Creating network "ambar_internal_network" with the default driver
Creating ambar_db_1 ...
Creating ambar_rabbit_1 ...
Creating ambar_proxy_1 ...
Creating ambar_es_1 ...
Creating ambar_db_1
Creating ambar_webapi-cache_1 ...
Creating ambar_rabbit_1
Creating ambar_es_1
Creating ambar_webapi-cache_1
Creating ambar_es_1 ... done
Creating ambar_webapi_1 ...
Creating ambar_webapi_1 ... done
Creating ambar_frontend_1 ...
Creating ambar_frontend_1 ... done
Waiting for Ambar to start...
Ambar is running on http://147.66.12.53:8005

[root@kgs-sts-fusion ambar]# cat docker inspect --format='{{.LogPath}}' ambar_webapi_1
{"log":"2017/05/03 05:44:08 Waiting for host: \n","stream":"stderr","time":"2017-05-03T05:44:08.385748978Z"}
{"log":"2017/05/03 05:44:08 Waiting for host: es:9200\n","stream":"stderr","time":"2017-05-03T05:44:08.385884331Z"}
{"log":"2017/05/03 05:44:08 Connected to unix:///var/run/docker.sock\n","stream":"stderr","time":"2017-05-03T05:44:08.388017144Z"}
{"log":"2017/05/03 05:44:22 Received 200 from http://es:9200\n","stream":"stderr","time":"2017-05-03T05:44:22.302769292Z"}
{"log":"Crawler schedule service initialized\n","stream":"stdout","time":"2017-05-03T05:44:24.380922736Z"}
{"log":"Pipeline initialized\n","stream":"stdout","time":"2017-05-03T05:44:24.71064609Z"}
{"log":"Started on :::8080\n","stream":"stdout","time":"2017-05-03T05:44:24.720793191Z"}
{"log":"{ [Error: connect ECONNREFUSED 52.64.27.232:443]\n","stream":"stderr","time":"2017-05-03T06:53:42.270438821Z"}
{"log":" code: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:42.270489177Z"}
{"log":" errno: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:42.270497139Z"}
{"log":" syscall: 'connect',\n","stream":"stderr","time":"2017-05-03T06:53:42.270503494Z"}
{"log":" address: '52.64.27.232',\n","stream":"stderr","time":"2017-05-03T06:53:42.27050999Z"}
{"log":" port: 443 }\n","stream":"stderr","time":"2017-05-03T06:53:42.270516275Z"}
{"log":"{ [Error: connect ECONNREFUSED 52.64.9.77:443]\n","stream":"stderr","time":"2017-05-03T06:53:43.182362118Z"}
{"log":" code: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:43.18240549Z"}
{"log":" errno: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:43.182413382Z"}
{"log":" syscall: 'connect',\n","stream":"stderr","time":"2017-05-03T06:53:43.182444112Z"}
{"log":" address: '52.64.9.77',\n","stream":"stderr","time":"2017-05-03T06:53:43.182451306Z"}
{"log":" port: 443 }\n","stream":"stderr","time":"2017-05-03T06:53:43.182457382Z"}
`
it seems that when I try to recover my password I type my email, and hit 'recover password' causes a new entry in the ambar_webapi docker log which looks like our http proxy (see above).

I have not yet been able to login at all to the Ambar web front end.

any ideas?

Regards
Kym

Feature Request: Provide Email Preview or let users customize the preview

Hi there,

I was really happy to find this project after spending quite some time for Solr, Datafari, manifoldcf, etc. PoCs. Finally something that works out of the box and allows me to crawl my local documents (using smb) and my emails.

I think it would be really helpful to have a preview functionality for .eml-Files as well. As I'm not sure if there are any suitable JS library to achieve that another helpful improvement could be to let the ambar admin customize the preview url. The approach to let google document viewer preview everything helps to get things up quickly, but fails if you want to show previews/documents using another software.

Cheers,
Alex

Feature Request: Webdav crawler

Hi all,

i'd like to have a new crawler available to access webdav folders.

Please let me know if you need any additional information about what i'd like to see, but i hope the general "webdav crawler" is explicit enough to understand what is needed.

Thank you very much

Page not loading - "Something went wrong" when loading from a dydns server

Hi there,

I am trying to get external access. I have a dydns web address that points to my computer that is hosting Ambar. What ports need forwarded and where do I configure this info?

I have attempted to do it now, but keep getting oops something went wrong. I am trying to use port 889 and 890.

I have forwarded the ports correctly (I run about 14 servers in my homelab) but I am a linux noob, so here is an image of the config (not sure how to access the text)
capture

Please advise

Additional info: If I access it through local IP address of computer it works fine. That negates external access though.

Additional info 2: If I access with my external IP instead of the dydns hostname it works.

Additional info 3: If I access it using my external IP:port the web interface works fine. When I download a file it is downloading using the dydns address and works fine.

UPDATE: This was a browser and DNS cache issue. Cleared the browser cache and DNS cache and it worked. I am leaving this here for anyone that Googles this.

Feature request: Hancom Office file(.hwp) support

Hancom Office or (aka Hangul Word Processor) file (.hwp) is a de facto standard in Korea.
It is widely used by the government and courts, so it is a must-have option in Korea.
But Dtsearch and Windows search does not support it. Ifilters are commercially available at a high price.
Hwp support would make Ambar very competitive in Korea.

Hwp format was adopted as a standard and its specification was made public but only in Korean.
https://www.hancom.com/board/noticeView.do?artcl_seq=6453

But there are some hwp parsing project you might want to reference.
https://sites.google.com/site/h2tlib/
https://github.com/ddoleye/java-hwp

Feature Request: On-line PDF Viewer

Hello,

Like other users here, I have tried other open source document management software but have found it overly complicated and bloated for my needs. I'm most impressed with your product so far, but there is some functionality which I have seen elsewhere which I think would make your product even better.

It is great that you have made all of the text in the PDF searchable and I like that it shows the paragraph where the key word is found. When I was testing it with some files, I though it would be very useful to be able to view the original PDF file on-line to see the original formatting and graphics without having to download the file and view it in a local viewer. This would be especially useful when using your product on a tablet.

I guess you could include viewers for other file types if they are available, but I feel that a PDF viewer would be most useful.

Thanks for the excellent software - I can't wait to see what new features are added with the next update!

Error when processing file.

I have loaded several pdf reports. Some of them, Ambar can't parse.
But he don't delete malformed files from queue.
After some time, Ambar try to parse them, again.

2017-04-07 13:42:17.512: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ the _wizzards_ of adware.pdf 'utf-8' codec can't decode byte 0xed in position 1605: invalid continuation byte 2017-04-07 13:42:19.413: [verbose] [p0] task received 7609b6aac3bff0e4bd067824dfc2925100797d2e72a1712bcf3edb1266660131 2017-04-07 13:42:19.450: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat round-up for the week of mar 6 - mar 10.pdf 2017-04-07 13:42:19.458: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat round-up for the week of mar 6 - mar 10.pdf 2017-04-07 13:42:19.778: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat round-up for the week of mar 6 - mar 10.pdf 'utf-8' codec can't decode byte 0xed in position 1075: invalid continuation byte 2017-04-07 13:42:33.929: [verbose] [p0] task received e0d13847f3079e244c9b4503635d59f0ce8840f37d410e9d6d8a0160895eb62a 2017-04-07 13:42:33.966: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat spotlight_ holiday greetings from pro pos – is your payment card data someone else’s christmas present_.pdf 2017-04-07 13:42:33.974: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat spotlight_ holiday greetings from pro pos – is your payment card data someone else’s christmas present_.pdf 2017-04-07 13:42:34.184: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat spotlight_ holiday greetings from pro pos – is your payment card data someone else’s christmas present_.pdf 'utf-8' codec can't decode byte 0xed in position 1307: invalid continuation byte 2017-04-07 13:42:34.200: [verbose] [p0] task received 7c8c073ba712de285d29877e3d01de448f6d7f0c0a4b061f700edfac0fa4d67b 2017-04-07 13:42:34.223: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat spotlight_ dyre_dyreza_ an analysis to discover the dga.pdf 2017-04-07 13:42:34.239: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat spotlight_ dyre_dyreza_ an analysis to discover the dga.pdf 2017-04-07 13:42:34.495: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat spotlight_ dyre_dyreza_ an analysis to discover the dga.pdf 'utf-8' codec can't decode byte 0xed in position 1697: invalid continuation byte 2017-04-07 13:42:35.406: [verbose] [p0] task received 073d9dde2cc486865db972a40f077660904439f0a00195bac24e33a3ebeecf72 2017-04-07 13:42:35.427: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ vulnerability deep dive - ichitaro office excel file code execution vulnerability.pdf 2017-04-07 13:42:35.436: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ vulnerability deep dive - ichitaro office excel file code execution vulnerability.pdf 2017-04-07 13:42:35.615: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ vulnerability deep dive - ichitaro office excel file code execution vulnerability.pdf 'utf-8' codec can't decode byte 0xed in position 222: invalid continuation byte 2017-04-07 13:42:38.804: [verbose] [p0] task received 7ecec8cba21bd5e0b0ee831045de5815a479f7dd20ccaaeb5ce48c30869f93ed 2017-04-07 13:42:38.840: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat spotlight_ teslacrypt - decrypt it yourself.pdf 2017-04-07 13:42:38.853: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat spotlight_ teslacrypt - decrypt it yourself.pdf 2017-04-07 13:42:39.089: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat spotlight_ teslacrypt - decrypt it yourself.pdf 'utf-8' codec can't decode byte 0xed in position 809: invalid continuation byte 2017-04-07 13:42:41.151: [verbose] [p0] task received 7fdb77feff9667babbe327a33f27db33c28b057c7032134a9b033b5ade09904b 2017-04-07 13:42:41.183: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ want tofsee my pictures_ a botnet gets aggressive.pdf 2017-04-07 13:42:41.194: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ want tofsee my pictures_ a botnet gets aggressive.pdf 2017-04-07 13:42:41.404: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ want tofsee my pictures_ a botnet gets aggressive.pdf 'utf-8' codec can't decode byte 0xed in position 544: invalid continuation byte 2017-04-07 13:42:43.720: [verbose] [p0] task received 8efde018b5242043d68dba573d67456712c74802c7b2ba64465793ec7daad1ff 2017-04-07 13:42:43.743: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ your files are encrypted with a _windows 10 upgrade_.pdf 2017-04-07 13:42:43.749: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ your files are encrypted with a _windows 10 upgrade_.pdf 2017-04-07 13:42:44.011: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ your files are encrypted with a _windows 10 upgrade_.pdf

Feature Request: Chinese (Simplified and Traditional) and Japanese Language Analyzer

Hi there,

I found this great app on reddit and am trying it out, both on your public cloud and on local dev machine using docker. Now I 'd like to ask if you can add the language analyzer for Chinese (Simplified and Traditional) and Japanese?

The CJK are some tricky languages to deal with and here's what I found on stackoverflow regarding the Chinese and Japanese language analyzers: http://stackoverflow.com/questions/29098347/elasticsearch-cjk-language-analyser. Sadly there isn't one for Korean yet.

I also want to curious about how well the current Tesseract OCR tuning included in ambar works with CJK languages.

Thanks.

configuration problems

I want to deploy the application in the private network and get accessed from the public network. To make it safe, I want to deploy a Nginx before it, So how can i config to make it works?
Just guess the machine hold the the ambar application has no public IP, and the machine that hold the Nginx has a Public IP.
It's interesting when I find the api.local.host.port desn't work. However the external configuration works.
How can I solve it ?

Feature Request: Delete Document Button

Hi,

I hope you don't mind me giving another suggestion, but I think it is essential that there is a clean and easy way to delete files from the database within the GUI.

I am planning to scan documents directly to the share that Ambar will import from, so there may be occasions where the document has not scanned correctly and I will have to rescan it. This means I need to delete the old scan from the Ambar database to keep it tidy.

A delete button on the search results screen next to each document would be sufficient so the incorrect document can be removed with a click.

Thanks again for all your hard work!

ERROR: In file 'docker-compose.yml' service 'version' doesn't have any configuration options

Trying to follow the blog post's instructions. Failed here:

ERROR: In file '/home/guy/.ambar/docker-compose.yml' service 'version' doesn't have any configuration options. All top level keys in your docker-compose.yml must map to a dictionary of configuration options.
Traceback (most recent call last):
  File "./ambar.py", line 224, in <module>
    install(configuration)
  File "./ambar.py", line 167, in install
    pullImages(configuration)
  File "./ambar.py", line 100, in pullImages
    runShellCommandStrict("docker-compose -f {0}/docker-compose.yml pull".format(PATH))
  File "./ambar.py", line 45, in runShellCommandStrict
    subprocess.check_call(command, shell = True)
  File "/usr/lib/python3.5/subprocess.py", line 581, in check_call
    raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'docker-compose -f /home/guy/.ambar/docker-compose.yml pull' returned non-zero exit status 1

"Oops.... Something went wrong" during loading

It seems like the api is not accessible, even though installation went without any apparent issue. During loading of the page, I get the error "Oops.... Something went wrong" at the bottom. It looks like the ambar-webapi container is restarting every 5 minutes due to not connecting to the ambar-es container?

andrew@onlyoffice:~$ sudo ./ambar.py start


______           ____     ______  ____
/\  _  \  /'\_/`\/\  _`\  /\  _  \/\  _`\
\ \ \L\ \/\      \ \ \L\ \ \ \L\ \ \ \L\ \
 \ \  __ \ \ \__\ \ \  _ <'\ \  __ \ \ ,  /
  \ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \
   \ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
    \/_/\/_/\/_/ \/_/\/___/   \/_/\/_/\/_/\/ /



Docker version 17.03.1-ce, build c6d412e
docker-compose version 1.11.2, build dfed245
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
ambar_db_1 is up-to-date
ambar_es_1 is up-to-date
ambar_rabbit_1 is up-to-date
ambar_frontend_1 is up-to-date
ambar_webapi_1 is up-to-date
ambar_webapi-cache_1 is up-to-date
Waiting for Ambar to start...
Ambar is running on http://10.20.30.13:80

ambar-webapi container log output:

2017/04/07 05:08:51 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://es:9200]
2017/04/07 05:08:52 Waiting for host:
2017/04/07 05:08:52 Waiting for host: es:9200
2017/04/07 05:08:52 Connected to unix:///var/run/docker.sock
2017/04/07 05:13:52 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://es:9200]
2017/04/07 05:13:52 Waiting for host:
2017/04/07 05:13:52 Waiting for host: es:9200
2017/04/07 05:13:52 Connected to unix:///var/run/docker.sock
2017/04/07 05:18:52 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://es:9200]
2017/04/07 05:18:52 Waiting for host:
2017/04/07 05:18:52 Waiting for host: es:9200
2017/04/07 05:18:52 Connected to unix:///var/run/docker.sock

ambar-es container logs:

[2017-04-07T05:22:01,567][INFO ][o.e.n.Node               ] [BtkYnk-] stopping ...
[2017-04-07T05:22:01,633][INFO ][o.e.n.Node               ] [BtkYnk-] stopped
[2017-04-07T05:22:01,633][INFO ][o.e.n.Node               ] [BtkYnk-] closing ...
[2017-04-07T05:22:01,646][INFO ][o.e.n.Node               ] [BtkYnk-] closed
[2017-04-07T05:22:03,494][INFO ][o.e.n.Node               ] [] initializing ...
[2017-04-07T05:22:03,612][INFO ][o.e.e.NodeEnvironment    ] [BtkYnk-] using [1] data paths, mounts [[/usr/share/elasticsearch/data (/dev/mapper/onlyoffice--vg-root)]], net usable_space [34.7gb], net total_space [46.6gb], spins? [possibly], types [ext4]
[2017-04-07T05:22:03,612][INFO ][o.e.e.NodeEnvironment    ] [BtkYnk-] heap size [1007.3mb], compressed ordinary object pointers [true]
[2017-04-07T05:22:03,660][INFO ][o.e.n.Node               ] node name [BtkYnk-] derived from node ID [BtkYnk-rRXGLNCk4JZeisA]; set [node.name] to override
[2017-04-07T05:22:03,665][INFO ][o.e.n.Node               ] version[5.2.2], pid[1], build[f9d9b74/2017-02-24T17:26:45.835Z], OS[Linux/4.4.0-72-generic/amd64], JVM[Oracle Corporation/OpenJDK 64-Bit Server VM/1.8.0_121/25.121-b13]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [aggs-matrix-stats]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [ingest-common]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [lang-expression]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [lang-groovy]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [lang-mustache]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [lang-painless]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [percolator]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [reindex]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [transport-netty3]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded module [transport-netty4]
[2017-04-07T05:22:05,242][INFO ][o.e.p.PluginsService     ] [BtkYnk-] loaded plugin [analysis-morphology]
[2017-04-07T05:22:05,395][WARN ][o.e.d.s.g.GroovyScriptEngineService] [groovy] scripts are deprecated, use [painless] scripts instead
[2017-04-07T05:22:08,149][INFO ][o.e.n.Node               ] initialized
[2017-04-07T05:22:08,150][INFO ][o.e.n.Node               ] [BtkYnk-] starting ...
[2017-04-07T05:22:08,258][WARN ][i.n.u.i.MacAddressUtil   ] Failed to find a usable hardware address from the network interfaces; using random bytes: f5:84:67:88:74:e6:c5:b2
[2017-04-07T05:22:08,326][INFO ][o.e.t.TransportService   ] [BtkYnk-] publish_address {172.19.0.3:9300}, bound_addresses {[::]:9300}
[2017-04-07T05:22:08,335][INFO ][o.e.b.BootstrapChecks    ] [BtkYnk-] bound or publishing to a non-loopback or non-link-local address, enforcing bootstrap checks
[2017-04-07T05:22:11,400][INFO ][o.e.c.s.ClusterService   ] [BtkYnk-] new_master {BtkYnk-}{BtkYnk-rRXGLNCk4JZeisA}{bcr5fJbTS6WeNLWTn3-wbg}{172.19.0.3}{172.19.0.3:9300}, reason: zen-disco-elected-as-master ([0] nodes joined)
[2017-04-07T05:22:11,419][INFO ][o.e.h.HttpServer         ] [BtkYnk-] publish_address {172.19.0.3:9200}, bound_addresses {[::]:9200}
[2017-04-07T05:22:11,419][INFO ][o.e.n.Node               ] [BtkYnk-] started
[2017-04-07T05:22:11,669][INFO ][o.e.g.GatewayService     ] [BtkYnk-] recovered [2] indices into cluster_state
[2017-04-07T05:22:12,231][INFO ][o.e.c.r.a.AllocationService] [BtkYnk-] Cluster health status changed from [RED] to [YELLOW] (reason: [shards started [[ambar_log_record_data][7]] ...]).

Setting up SMB crawler -> Error: undefined

Attempting to setup the SMB crawler following the guide in the blog. Problem is I keep getting this red message at the bottom saying Error: undefined. There are no "i" markers on the left or highlighting to tell me where this may be or what it means. Below is my config with some slight edits.

Edits made: IP address, location, login, and password.

{
"id": "Share",
"description": "Share HDD",
"type": "smb",
"locations": [
{
"host_name": "EDITED",
"ip_address": "192.168.1.xxx",
"location": "EDITED"
}
],
"file_regex": "(\.doc[a-z]$)|(\.xls[a-z]$)|(\.txt$)|(\.csv$)|(\.htm[a-z]$)|(\.ppt[a-z]$)|(\.pdf$)|(\.msg$)|(\.zip$)|(\.eml$)|(\.rtf$)|(\.md$)|(\.png$)|(\.bmp$)|(\.tif[f]*$)|(\.jp[e]*g$)",
"credentials": {
"auth_type": "ntlm",
"login": "EDITED",
"password": "EDITED"

},
"schedule": {
"is_active": true,
"cron_schedule": "*/15 * * * *"
},
"max_file_size_bytes": 30000000,
"verbose": true
}

Could the issue be related to the location? I've tried:
"\COMPUTERNAME\SHARE"
"//COMPUTERNAME/SHARE"
"SHARE"

All top level keys in your docker-compose.yml must map to a dictionary of configuration options.

mkdir ambar
cd ambar/

wget -O ambar.py https://static.ambar.cloud/ambar.py && chmod +x ./ambar.py
sudo ./ambar.py install

ERROR: In file '/home/username/ambar/docker-compose.yml' service 'version' doesn't have any configuration options. All top level keys in your docker-compose.yml must map to a dictionary of configuration options.
Traceback (most recent call last):
File "./ambar.py", line 237, in
install(configuration)
File "./ambar.py", line 180, in install
pullImages(configuration)
File "./ambar.py", line 100, in pullImages
runShellCommandStrict("docker-compose -f {0}/docker-compose.yml pull".format(PATH))
File "./ambar.py", line 45, in runShellCommandStrict
subprocess.check_call(command, shell = True)
File "/usr/lib/python3.5/subprocess.py", line 581, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'docker-compose -f /home/username/ambar/docker-compose.yml pull' returned non-zero exit status 1

Not sure what I have done to deserve this treatment.

Remove an uploaded File

Hey there,
is it possible to remove (not just hide) an uploaded file (pdf) from the database? I need it, because I uploaded a file which is protected by copyright.

SSH Credentials for VirtualBox Image

Hi, I was able to download the VirtualBox image for Ambar but ran into some issues configuring it and getting it to work.

I'm putting it on my server behind an Apache web server so it's hosted with a proxypass directive on a custom hostname.

However, it's trying to hit the API on port 8004 with it's IP address, which looks like it's configurable by editing the apiURL.txt file on the container.

However, I can't gain access to the container to edit the apiURL file. Easiest would be the SSH credentials.

Unable to install ambar

I'm unable to start ambar at the moment
This is all I did:

# Pull ambar.py from github
$ sudo ./ambar.py install
$ sudo ./ambar.py start

 sudo ./ambar.py start  
 

______           ____     ______  ____       
/\  _  \  /'\_/`\/\  _`\  /\  _  \/\  _`\    
\ \ \L\ \/\      \ \ \L\ \ \ \L\ \ \ \L\ \  
 \ \  __ \ \ \__\ \ \  _ <'\ \  __ \ \ ,  /   
  \ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \  
   \ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
    \/_/\/_/\/_/ \/_/\/___/   \/_/\/_/\/_/\/ /


                                              
Docker version 17.05.0-ce, build 89658be
docker-compose version 1.10.0, build 4bd6f1a
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
Creating network "ambar_internal_network" with the default driver
Creating ambar_es_1
Creating ambar_proxy_1
Creating ambar_webapi-cache_1
Creating ambar_rabbit_1
Creating ambar_db_1
Creating ambar_webapi_1
Creating ambar_frontend_1
Waiting for Ambar to start...
Ambar is running on http://192.168.1.5:80

But accessing http://192.168.1.5:80 results in a DNS Error (I think, since I'm receving a page from my ISP that said can't find this page)

Accessing http://192.168.1.5:80/api results in a 504 Gateway timeout error from Nginx

Logs:

Attaching to ambar_webapi_1
webapi_1        | 2017/06/30 15:25:00 Waiting for host: 
webapi_1        | 2017/06/30 15:25:00 Waiting for host: es:9200
webapi_1        | 2017/06/30 15:25:00 Connected to unix:///var/run/docker.sock
webapi_1        | 2017/06/30 15:25:31 Received 200 from http://es:9200
webapi_1        | Crawler schedule service initialized
webapi_1        | Pipeline initialized
webapi_1        | Started on :::8080
Attaching to ambar_proxy_1
proxy_1         | Starting NGINX...
proxy_1         | 192.168.1.5 - - [30/Jun/2017:15:25:36 +0000] "GET / HTTP/1.1" 403 612 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36" "-"
proxy_1         | 2017/06/30 15:26:43 [error] 7#7: *1 upstream timed out (110: Connection timed out) while connecting to upstream, client: 192.168.1.5, server: ambar-fe, request: "GET /api/ HTTP/1.1", upstream: "http://125.235.4.59:8080/api/", host: "192.168.1.5"
proxy_1         | 192.168.1.5 - - [30/Jun/2017:15:26:43 +0000] "GET /api/ HTTP/1.1" 504 585 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36" "-"

All docker images are up to date

Web API hangs

Error: Web API hangs. Container still shows running.
I'm seeing In the browser, "a spinning spinner"

  1. VMWare running Ubuntu Xenial 16.04.
  2. Kernel 4.4.0-72-generic
  3. Docker version 17.03.1-ce, build c6d412e
  4. docker-compose version 1.11.2, build dfed245

docker logs -f ambar-webapi

/usr/src/app/dist/services/QueueProxy.js:91
			throw err;
			^

Error: Unexpected close
    at succeed (/usr/src/app/node_modules/amqplib/lib/connection.js:270:13)
    at onOpenOk (/usr/src/app/node_modules/amqplib/lib/connection.js:252:5)
    at /usr/src/app/node_modules/amqplib/lib/connection.js:165:32
    at /usr/src/app/node_modules/amqplib/lib/connection.js:159:12
    at Socket.recv (/usr/src/app/node_modules/amqplib/lib/connection.js:497:12)
    at Socket.g (events.js:260:16)
    at emitNone (events.js:67:13)
    at Socket.emit (events.js:166:7)
    at emitReadable_ (_stream_readable.js:419:10)
    at emitReadable (_stream_readable.js:413:7)
    at readableAddChunk (_stream_readable.js:164:13)
    at Socket.Readable.push (_stream_readable.js:111:10)
    at TCP.onread (net.js:540:20)
2017/04/17 23:13:42 Command exited with error: exit status 1

docker logs -f ambar-rabbbit

=ERROR REPORT==== 17-Apr-2017::23:06:18 ===
closing AMQP connection <0.551.0> (172.19.0.1:36775 -> 172.19.0.4:5672):
missed heartbeats from client, timeout: 60s

Feature Request: Manipulate items via API or Frontend

It would be nice to be able to manipulate or remove items from the index either via API or frontend. It does look like perhaps we can work with elasticsearch API, but through the ambar-provided API would be desirable.

Feature Request: Admin-view of individual documents

I would like to see an organized and browsable view of documents available in the frontend view. Perhaps some extended statistics (time spent processing?) on an individual document, and a view of the document itself and the extracted text (without having to search for a document).

Log-In page and "API":"AUTH" to none

Hi!

Seems like a possible duplicate to #28.

After registering an account i wanted to switch the login off. So that that no authentication is needed to reach the search page. That is my goal ... Should be possible with "auth" to "none" in the config file.

But i still get the login page when connecting to the root url.

Thank you!

New language(s)?

Hello!
I was thinking about adding Polish language and support for users from Poland (separate documentation). I don't know if I can help with first thing, but I can definitely help with the second one.

Cheers.

Feature Request: Introduce Tags

Hi there!

I just stumbled upon Ambar and really like the idea as well as the frontend.
Is there any chance that one could add tags to uploaded documents?
That would open up so much more options in management and queries.

(Maybe adding a freetext / transform to tag field right next to the Download button?)
If you're interested, I would be happy to assist in the realization of features; simply because I see some great potential in Ambar compared to other DMS solutions out there.

Thanks!

Matthias

SMB crawler not working, share verified working

Installed clean today on clean Ubuntu 16.04 install. Verified I can connect to the share from Windows and Linux using mount -t cifs. Crawler config:

{
"id": "data",
"uid": "data_d033e22ae348aeb5660fc2140aec35850c4da997",
"description": "nas crawler",
"type": "smb",
"locations": [
{
"host_name": "nas",
"ip_address": "10.0.0.100",
"location": "data"
}
],
"file_regex": "(\.doc[a-z]$)|(\.xls[a-z]$)|(\.txt$)|(\.csv$)|(\.htm[a-z]$)|(\.ppt[a-z]$)|(\.pdf$)|(\.msg$)|(\.eml$)|(\.rtf$)|(\.md$)|(\.png$)|(\.bmp$)|(\.tif[f]$)|(\.jp[e]g$)|(\.hwp$)",
"credentials": {
"auth_type": "ntlm",
"login": "jes",
"password": "
***",
"token": ""
},
"schedule": {
"is_active": true,
"cron_schedule": "
/15 * * * *"
},
"max_file_size_bytes": 30000000,
"verbose": true
}

Error:
2017-07-14 11:15:00.688: [info] filecrawler initialized
2017-07-14 11:15:00.695: [error]
2017-07-14 11:15:00.700: [error] error connecting to Smb share on nas

Notice that there is nothing by the error at all.

Also, how do I get to the logs for this system? I looked at docker logs but they said nothing about this issue. Thank you.

File Upload error when using Firefox

Hello there, it seems to be some problems when uploading files using firefox...

Here is the log file:

POST 
XHR 
https://mc.yandex.ru/webvisor/42850839 [HTTP/1.1 200 OK 637 ms]
OPTIONS 
https://app.ambar.cloud:8004/api/files/Default/Biblia%20de%20Jerusalem.pdf [HTTP/2.0 204 No Content 250 ms]
POST 
https://app.ambar.cloud:8004/api/files/Default/Biblia%20de%20Jerusalem.pdf [HTTP/2.0 400 Bad Request 255 ms]
Response { type: "cors", url: "https://app.ambar.cloud:8004/api/fi…", status: 400, ok: false, statusText: "Bad Request", headers: Headers, bodyUsed: false } app.b263d6913c2e577ef05e.js:3:6732

uploadFile Response { type: "cors", url: "https://app.ambar.cloud:8004/api/fi…", status: 400, ok: false, statusText: "Bad Request", headers: Headers, bodyUsed: false } app.b263d6913c2e577ef05e.js:7:26205

GET 
XHR 
https://api.mixpanel.com/track/ [HTTP/1.1 200 OK 485 ms]
GET 
XHR 
https://api.mixpanel.com/track/ [HTTP/1.1 200 OK 343 ms]
````

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.