rd17 / ambar Goto Github PK
View Code? Open in Web Editor NEW:mag: Ambar: Document Search Engine
Home Page: https://ambar.cloud/
License: MIT License
:mag: Ambar: Document Search Engine
Home Page: https://ambar.cloud/
License: MIT License
Your original nginx config produced by default installation process.
server {
listen 80;
server_name ambar-fe;
client_max_body_size 1024m;
location /api/ {
proxy_pass http://webapi:8080/api/;
}
location / {
proxy_pass http://frontend:80;
}
}
The last directive should be proxy_pass http://frontend:80/;
[root@searchbox ambar]# ./ambar.py start
______ ____ ______ ____
/\ _ \ /'\_/`\/\ _`\ /\ _ \/\ _`\
\ \ \L\ \/\ \ \ \L\ \ \ \L\ \ \ \L\ \
\ \ __ \ \ \__\ \ \ _ <'\ \ __ \ \ , /
\ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \
\ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
\/_/\/_/\/_/ \/_/\/___/ \/_/\/_/\/_/\/ /
Docker version 1.12.1, build 23cf638
docker-compose version 1.12.0, build b31ff33
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
Creating ambar_db_1
Creating ambar_rabbit_1
Creating ambar_es_1
Creating ambar_frontend_1
ERROR: for es Cannot create container for service es: b'Invalid port specification: "None"'
ERROR: for db Cannot create container for service db: b'Invalid port specification: "None"'
ERROR: for rabbit Cannot create container for service rabbit: b'Invalid port specification: "None"'
ERROR: for frontend Cannot create container for service frontend: b'Invalid port specification: "None"'
ERROR: Encountered errors while bringing up the project.
Traceback (most recent call last):
File "./ambar.py", line 218, in <module>
start(configuration)
File "./ambar.py", line 187, in start
runShellCommandStrict('docker-compose -f {0}/docker-compose.yml -p ambar up -d'.format(PATH))
File "./ambar.py", line 45, in runShellCommandStrict
subprocess.check_call(command, shell = True)
File "/usr/local/lib/python3.5/subprocess.py", line 584, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'docker-compose -f /root/ambar/docker-compose.yml -p ambar up -d' returned non-zero exit status 1
Hi,
First of all thanks for your works, i've a request:
Can you add italian as a language analyzer?
Thanks
Thank you for building such an awesome project like this. I hope you guys can add support for Vietnamese. There is a tokenizer/analyzer for Elasticsearch for Vietnamese available at https://github.com/duydo/elasticsearch-analysis-vietnamese
Hi,
As stated in the title, I would like to ask if its possible to add German as language to the analyzer.
I can support you with documents to test if necessary.
Regards
Jan
So I installed a 64-bit version of Ubuntu server inside VMware with the default settings. The only thing I changed was to install a few additional things (Samba file server and OpenSSH server). Next I run sudo apt-get update
and then follow the instructions to install Docker CE. Adding the repository seems to go well, and then I install docker, and I follow step 2 (sudo apt-get install docker-ce
). I am able to run the hello-world container that outputs a successful message. docker --version
gives me v17.03.1-ce, build c6d412e
.
Next I try to install docker-compose. pip is not installed by default it seems, so I run sudo apt install python-pip
. pip version is 8.1.1 (python 2.7) if that matters. Next I run pip install docker-compose
. It completes and mentions a newer version of pip is available. I execute the wget command on the blog, followed by sudo ./ambar.py install
. Here I run into an issue- docker-compose not found. But docker-compose --version
returns version 1.12.0, build b31ff33.
Yesterday I had a different error (ERROR: In file 'docker-compose.yml' service 'version' doesn't have any configuration options), but today is different. I'm not proficient with linux and all its permissions and various directories, so if it's obvious something is wrong, I won't see it. Thanks for any help!
I appear to be having the same issue (or similar) as the OP (agreenfield1) did here. Any advise or assist based on the data below (PS I aready tried changing the http://es:9200 to the actual ip address but it didn't help?
./ambar.py start
______ ____ ______ ____
/\ _ \ /'\_/`\/\ _`\ /\ _ \/\ _`\
\ \ \L\ \/\ \ \ \L\ \ \ \L\ \ \ \L\ \
\ \ __ \ \ \__\ \ \ _ <'\ \ __ \ \ , /
\ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \
\ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
\/_/\/_/\/_/ \/_/\/___/ \/_/\/_/\/_/\/ /
Docker version 17.03.1-ce, build c6d412e
docker-compose version 1.13.0, build 1719ceb
sysctl: setting key "vm.max_map_count": Read-only file system
net.ipv4.ip_local_port_range = 15000 61000
net.core.somaxconn = 1024
ambar_proxy_1 is up-to-date
ambar_db_1 is up-to-date
Starting ambar_rabbit_1 ...
ambar_es_1 is up-to-date
Starting ambar_rabbit_1
Starting ambar_rabbit_1 ... done
ambar_webapi_1 is up-to-date
ambar_frontend_1 is up-to-date
Waiting for Ambar to start...
Ambar is running on http://192.168.5.122:80
http://192.168.5.122:80/_cat/indices
Just returns a blank page
website console errors
Failed to load resource: the server responded with a status of 502 (Bad Gateway)
_/api/_
Failed to start. Error: SyntaxError: Unexpected token < in JSON at position 0
_app.5800ae896ed146f4e02a.js:3_
SyntaxError: Unexpected token < in JSON at position 0
_app.5800ae896ed146f4e02a.js:3_
Failed to load resource: the server responded with a status of 502 (Bad Gateway)
_/api/users/check_
js:6 Failed to auth. Error: [object Response]
_app.5800ae896ed146f4e02a_
docker logs ambar_webapi_1
2017/06/21 02:11:12 Waiting for host:
2017/06/21 02:11:12 Waiting for host: 192.168.5.122:9200
2017/06/21 02:11:12 Connected to unix:///var/run/docker.sock
2017/06/21 02:32:25 Waiting for host:
2017/06/21 02:32:25 Waiting for host: 192.168.5.122:9200
2017/06/21 02:32:25 Connected to unix:///var/run/docker.sock
2017/06/21 02:37:25 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://192.168.5.122:9200]
2017/06/21 02:37:25 Waiting for host:
2017/06/21 02:37:25 Waiting for host: 192.168.5.122:9200
2017/06/21 02:37:25 Connected to unix:///var/run/docker.sock
docker logs ambar_es_1
[2017-06-21T02:50:42,752][INFO ][o.e.n.Node ] [] initializing ...
[2017-06-21T02:50:42,855][INFO ][o.e.e.NodeEnvironment ] [Co7MJEJ] using [1] data paths, mounts [[/ (/dev/mapper/pve-vm--210--disk--1)]], net usable_space [7.1gb], net total_space [58.9gb], spins? [possibly], types [ext4]
[2017-06-21T02:50:42,855][INFO ][o.e.e.NodeEnvironment ] [Co7MJEJ] heap size [989.8mb], compressed ordinary object pointers [true]
[2017-06-21T02:50:42,857][INFO ][o.e.n.Node ] node name [Co7MJEJ] derived from node ID [Co7MJEJKRxu9qhWVmwm8OQ]; set [node.name] to override
[2017-06-21T02:50:42,857][INFO ][o.e.n.Node ] version[5.3.2], pid[1], build[3068195/2017-04-24T16:15:59.481Z], OS[Linux/4.4.62-1-pve/amd64], JVM[Oracle Corporation/OpenJDK 64-Bit Server VM/1.8.0_121/25.121-b13]
[2017-06-21T02:50:46,938][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [aggs-matrix-stats]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [ingest-common]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [lang-expression]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [lang-groovy]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [lang-mustache]
[2017-06-21T02:50:46,940][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [lang-painless]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [percolator]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [reindex]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [transport-netty3]
[2017-06-21T02:50:46,941][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded module [transport-netty4]
[2017-06-21T02:50:46,954][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded plugin [analysis-morphology]
[2017-06-21T02:50:46,954][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded plugin [analysis-smartcn]
[2017-06-21T02:50:46,955][INFO ][o.e.p.PluginsService ] [Co7MJEJ] loaded plugin [analysis-stempel]
[2017-06-21T02:50:53,487][INFO ][o.e.n.Node ] initialized
[2017-06-21T02:50:53,497][INFO ][o.e.n.Node ] [Co7MJEJ] starting ...
[2017-06-21T02:50:53,741][WARN ][i.n.u.i.MacAddressUtil ] Failed to find a usable hardware address from the network interfaces; using random bytes: f8:d1:3f:ba:9a:40:6b:8e
[2017-06-21T02:50:53,935][INFO ][o.e.t.TransportService ] [Co7MJEJ] publish_address {172.18.0.5:9300}, bound_addresses {[::]:9300}
[2017-06-21T02:50:53,946][INFO ][o.e.b.BootstrapChecks ] [Co7MJEJ] bound or publishing to a non-loopback or non-link-local address, enforcing bootstrap checks
ERROR: bootstrap checks failed
max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144]
[2017-06-21T02:50:53,982][INFO ][o.e.n.Node ] [Co7MJEJ] stopping ...
[2017-06-21T02:50:54,789][INFO ][o.e.n.Node ] [Co7MJEJ] stopped
[2017-06-21T02:50:54,789][INFO ][o.e.n.Node ] [Co7MJEJ] closing ...
[2017-06-21T02:50:54,805][INFO ][o.e.n.Node ] [Co7MJEJ] closed
File: config.json
{
"dataPath": "/opt/ambar",
"dropbox": {
"clientId": "",
"redirectUri": ""
},
"es": {
"containerSize": "2g",
"heapSize": "1g"
},
"ocr": {
"pdfMaxPageCount": 5000,
"pdfSymbolsPerPageThreshold": 100
},
"dockerComposeTemplate": "https://static.ambar.cloud/docker-compose.template.yml",
"db": {
"cacheSizeGb": 2
},
"fe": {
"external": {
"host": "192.168.5.122",
"port": "80",
"protocol": "http"
}
},
"dockerRepo": "ambar",
"api": {
"auth": "basic",
"external": {
"host": "192.168.5.122",
"port": "80",
"protocol": "http"
},
"defaultLangAnalyzer": "ambar_en",
"analyticsToken": "cda4b0bb11a1f32aed7564b08c455992",
"mode": "ce",
"crawlerCount": 1,
"cacheSize": "1g",
"pipelineCount": 1,
"showFilePreview": "false"
}
}
File: docker-compose.yml
version: "2"
networks:
internal_network:
services:
webapi:
restart: always
networks:
- internal_network
image: ambar/ambar-webapi:latest
expose:
- "8080"
environment:
- db=mongodb://db:27017/ambar_data
- fe=http://192.168.5.122:80
- api=http://webapi:8080
- es=http://192.168.5.122:9200
- redis=webapi-cache
- rabbit=amqp://rabbit
- mode=ce
- pipelineCount=1
- crawlerCount=1
- dropboxClientId=
- dropboxRedirectUri=
- defaultLangAnalyzer=ambar_en
- analyticsToken=cda4b0bb11a1f32aed7564b08c455992
- auth=basic
- ocrPdfMaxPageCount=5000
- ocrPdfSymbolsPerPageThreshold=100
- showFilePreview=false
depends_on:
- db
- es
- webapi-cache
- proxy
- rabbit
volumes:
- /var/run/docker.sock:/var/run/docker.sock
webapi-cache:
restart: always
image: redis:alpine
networks:
- internal_network
expose:
- "6379"
ports:
- "6379:6379"
mem_limit: 1g
proxy:
image: ambar/ambar-proxy:latest
networks:
- internal_network
restart: always
environment:
- API_EXT_PORT=80
- FE_EXT_PORT=80
expose:
- 80
ports:
- "80:80"
frontend:
image: ambar/ambar-frontend:latest
networks:
- internal_network
depends_on:
- webapi
expose:
- "80"
restart: always
environment:
- api=http://192.168.5.122:80
db:
restart: always
image: ambar/ambar-mongodb:latest
networks:
- internal_network
environment:
- cacheSizeGB=2
volumes:
- /opt/ambar/db:/data/db
expose:
- "27017"
ports:
- "27017:27017"
es:
image: ambar/ambar-es:latest
networks:
- internal_network
restart: always
expose:
- "9200"
ports:
- "9200:9200"
environment:
- cluster.name=ambar-es
- bootstrap.memory_lock=true
- xpack.security.enabled=false
- security.manager.enabled=false
- "ES_JAVA_OPTS=-Xms1g -Xmx1g"
ulimits:
memlock:
soft: -1
hard: -1
nofile:
soft: 65536
hard: 65536
mem_limit: 2g
cap_add:
- IPC_LOCK
volumes:
- /opt/ambar/es:/usr/share/elasticsearch/data
rabbit:
image: ambar/ambar-rabbit:latest
networks:
- internal_network
hostname: rabbit
expose:
- "15672"
- "5672"
ports:
- "15672:15672"
- "5672:5672"
volumes:
- /opt/ambar/rabbit:/var/lib/rabbitmq
Can you add search support for chinese language file ? Thanks!
When I try to download a large file, nothing happens. Upload button inactive.
I upload pdf in ambar but not geting the image text extracted in output .txt file
CLASS-1-6.pdf
Hello
When restarting Ubuntu this is the output of docker ps
CONTAINER ID IMAGE COMMAND CREAT ED STATUS PORTS NAMES
5ec43d188c10 ambar/ambar-pipeline:latest "/bin/sh -c 'pytho..." 37 mi nutes ago Restarting (139) 49 seconds ago ambar_pipeline_p0
2ed6b189aebd ambar/ambar-crawler:latest "/bin/sh -c 'pytho..." 37 mi nutes ago Restarting (1) 36 seconds ago ambar_crawler_c0
baac19554af4 ambar/ambar-frontend:latest "/bin/sh -c 'echo ..." 38 mi nutes ago Up 3 minutes 80/tcp ambar_frontend_1
d1ef1edac298 ambar/ambar-webapi:latest "/bin/sh -c 'docke..." 38 mi nutes ago Restarting (1) 28 seconds ago ambar_webapi_1
53931fb03307 ambar/ambar-proxy:latest "/entrypoint.sh" 38 mi nutes ago Up 3 minutes 0.0.0.0:889-890->889-890/tcp ambar_proxy_1
0cd5938e8bfa ambar/ambar-es:latest "/docker-entrypoin..." 38 mi nutes ago Up 3 minutes 0.0.0.0:9200->9200/tcp, 9300/ tcp ambar_es_1
99ecced7bd96 redis:alpine "docker-entrypoint..." 38 mi nutes ago Up 3 minutes 0.0.0.0:6379->6379/tcp ambar_webapi-cache_1
7971cf6807cd ambar/ambar-mongodb:latest "/entrypoint.sh" 38 mi nutes ago Restarting (2) About a minute ago
Pipeline, crawler, and webapi are in an endless restart loop.
If I manually run ./ambar.py start then they work fine and come up as expected.
I would like this to autostart though.
Bug?
what is the name of index is created after uploading a pdf file on ambar???
also the text file which I got in that file image text is not present which is my main aim.
Hello,
I installed ambar in my machine.But when I start it, sometime UI not get loading.Giving 502 Bad gate way error.
& When It start I am not able to understand where to write search query for ES indices.How to check whether document is present in ES or not..
Hi,
I am seeing allot of the following errors on the pipeline for all the files being used via SMB Crawler .
2017-05-11 14:59:05.470: [error] [p0] error submiting parsed text to Api, unexpected response code //windowsxptools/g/docs/file.html 500 {"message":"Invalid non-string/buffer chunk\n TypeError: Invalid non-string/buffer chunk\n at validChunk (_stream_writable.js:186:14)\n at GridWriteStream.Writable.write (_stream_writable.js:213:12)\n at ondata (_stream_readable.js:542:20)\n at emitOne (events.js:77:13)\n at emit (events.js:169:7)\n at Readable.read (_stream_readable.js:368:10)\n at flow (stream_readable.js:759:26)\n at resume (_stream_readable.js:739:3)\n at nextTickCallbackWith2Args (node.js:511:9)\n at process._tickCallback (node.js:425:17)"}
This is happening to all the files.
Any ideas ?
Thanks
The install process was easy and straightforward. I was excited to get started with ambar, but at the very end I received a message in the console
sysctl: cannot stat /proc/sys/net/ipv4/tcp_fin_timeout: No such file or directory
I know lxc containers have an issue running docker (which I was able to get around by modifying the lxc conf file to include some extra parameters. I realize this is probably not the safest practice but the container would only be accessible on a local network. Since I'm still having issues with the container I was hoping to see if anyone had a solution I missed.
Hi!
I'm getting the following error for some of my files when crawling them using the SMB crawler. The same file can be uploaded manually and works ok.
[error] 'utf-16-le' codec can't decode bytes in position 18-19: illegal encoding.
What to do?
I am trying to get Ambar running on OSX.
It works so far, however the pipeline seems to be not able to connect to the Rabbit server. Thus, no documents are being processed and nothing is added to the search index.
I am trying to understand what's going on behind the scenes in order to find a solution for this problem. Unfortunately, the code for the Webapi and Pipeline docker images are not public.
Do you plan to release these soon?
Hi,
I seem to be having problems with the ambar_webapi docker not using the system HTTP proxy correctly.
I have installed ambar self-hosted community edition onto Centos 7, which is behind a HTTP proxy. I have setup systemd for docker to define the HTTP_PROXY and HTTPS_PROXY correctly. ie I can download/book amabar ok.
I can then access the web front end ok, (changed to port 8005) but everything else is standard.. however I can't login, signup or anything - get an 'opps something went wrong message'.
Inspecting the docker log for ambar_webapi seems to show attempts to access a remote host (52.64.9.77) (and amazonaws.com host - mandrillapp.com??) without using the HTTPS proxy
`
[root@kgs-sts-fusion ambar]# ./ambar.py start
/\ _ \ /'_/\/\ _
\ /\ _ /\ `\
\ \ \L\ /\ \ \ \L\ \ \ \L\ \ \ \L\ \
\ \ __ \ \ _\ \ \ _ <'\ \ __ \ \ , /
\ \ /\ \ \ _/\ \ \ \L\ \ \ /\ \ \ \ \
\ _\ _\ _\ _\ _/ \ _\ _\ _\ _
////// //// /////_// /
Docker version 17.04.0-ce, build 4845c56
docker-compose version 1.13.0, build 1719ceb
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
Creating network "ambar_internal_network" with the default driver
Creating ambar_db_1 ...
Creating ambar_rabbit_1 ...
Creating ambar_proxy_1 ...
Creating ambar_es_1 ...
Creating ambar_db_1
Creating ambar_webapi-cache_1 ...
Creating ambar_rabbit_1
Creating ambar_es_1
Creating ambar_webapi-cache_1
Creating ambar_es_1 ... done
Creating ambar_webapi_1 ...
Creating ambar_webapi_1 ... done
Creating ambar_frontend_1 ...
Creating ambar_frontend_1 ... done
Waiting for Ambar to start...
Ambar is running on http://147.66.12.53:8005
[root@kgs-sts-fusion ambar]# cat docker inspect --format='{{.LogPath}}' ambar_webapi_1
{"log":"2017/05/03 05:44:08 Waiting for host: \n","stream":"stderr","time":"2017-05-03T05:44:08.385748978Z"}
{"log":"2017/05/03 05:44:08 Waiting for host: es:9200\n","stream":"stderr","time":"2017-05-03T05:44:08.385884331Z"}
{"log":"2017/05/03 05:44:08 Connected to unix:///var/run/docker.sock\n","stream":"stderr","time":"2017-05-03T05:44:08.388017144Z"}
{"log":"2017/05/03 05:44:22 Received 200 from http://es:9200\n","stream":"stderr","time":"2017-05-03T05:44:22.302769292Z"}
{"log":"Crawler schedule service initialized\n","stream":"stdout","time":"2017-05-03T05:44:24.380922736Z"}
{"log":"Pipeline initialized\n","stream":"stdout","time":"2017-05-03T05:44:24.71064609Z"}
{"log":"Started on :::8080\n","stream":"stdout","time":"2017-05-03T05:44:24.720793191Z"}
{"log":"{ [Error: connect ECONNREFUSED 52.64.27.232:443]\n","stream":"stderr","time":"2017-05-03T06:53:42.270438821Z"}
{"log":" code: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:42.270489177Z"}
{"log":" errno: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:42.270497139Z"}
{"log":" syscall: 'connect',\n","stream":"stderr","time":"2017-05-03T06:53:42.270503494Z"}
{"log":" address: '52.64.27.232',\n","stream":"stderr","time":"2017-05-03T06:53:42.27050999Z"}
{"log":" port: 443 }\n","stream":"stderr","time":"2017-05-03T06:53:42.270516275Z"}
{"log":"{ [Error: connect ECONNREFUSED 52.64.9.77:443]\n","stream":"stderr","time":"2017-05-03T06:53:43.182362118Z"}
{"log":" code: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:43.18240549Z"}
{"log":" errno: 'ECONNREFUSED',\n","stream":"stderr","time":"2017-05-03T06:53:43.182413382Z"}
{"log":" syscall: 'connect',\n","stream":"stderr","time":"2017-05-03T06:53:43.182444112Z"}
{"log":" address: '52.64.9.77',\n","stream":"stderr","time":"2017-05-03T06:53:43.182451306Z"}
{"log":" port: 443 }\n","stream":"stderr","time":"2017-05-03T06:53:43.182457382Z"}
`
it seems that when I try to recover my password I type my email, and hit 'recover password' causes a new entry in the ambar_webapi docker log which looks like our http proxy (see above).
I have not yet been able to login at all to the Ambar web front end.
any ideas?
Regards
Kym
Cloudron is a popular way to selfhost services easily. Support for cloudron would be awesome!
Hi there,
I was really happy to find this project after spending quite some time for Solr, Datafari, manifoldcf, etc. PoCs. Finally something that works out of the box and allows me to crawl my local documents (using smb) and my emails.
I think it would be really helpful to have a preview functionality for .eml-Files as well. As I'm not sure if there are any suitable JS library to achieve that another helpful improvement could be to let the ambar admin customize the preview url. The approach to let google document viewer preview everything helps to get things up quickly, but fails if you want to show previews/documents using another software.
Cheers,
Alex
Hi all,
i'd like to have a new crawler available to access webdav folders.
Please let me know if you need any additional information about what i'd like to see, but i hope the general "webdav crawler" is explicit enough to understand what is needed.
Thank you very much
Hi there,
I am trying to get external access. I have a dydns web address that points to my computer that is hosting Ambar. What ports need forwarded and where do I configure this info?
I have attempted to do it now, but keep getting oops something went wrong. I am trying to use port 889 and 890.
I have forwarded the ports correctly (I run about 14 servers in my homelab) but I am a linux noob, so here is an image of the config (not sure how to access the text)
Please advise
Additional info: If I access it through local IP address of computer it works fine. That negates external access though.
Additional info 2: If I access with my external IP instead of the dydns hostname it works.
Additional info 3: If I access it using my external IP:port the web interface works fine. When I download a file it is downloading using the dydns address and works fine.
UPDATE: This was a browser and DNS cache issue. Cleared the browser cache and DNS cache and it worked. I am leaving this here for anyone that Googles this.
Hancom Office or (aka Hangul Word Processor) file (.hwp) is a de facto standard in Korea.
It is widely used by the government and courts, so it is a must-have option in Korea.
But Dtsearch and Windows search does not support it. Ifilters are commercially available at a high price.
Hwp support would make Ambar very competitive in Korea.
Hwp format was adopted as a standard and its specification was made public but only in Korean.
https://www.hancom.com/board/noticeView.do?artcl_seq=6453
But there are some hwp parsing project you might want to reference.
https://sites.google.com/site/h2tlib/
https://github.com/ddoleye/java-hwp
Hello,
Like other users here, I have tried other open source document management software but have found it overly complicated and bloated for my needs. I'm most impressed with your product so far, but there is some functionality which I have seen elsewhere which I think would make your product even better.
It is great that you have made all of the text in the PDF searchable and I like that it shows the paragraph where the key word is found. When I was testing it with some files, I though it would be very useful to be able to view the original PDF file on-line to see the original formatting and graphics without having to download the file and view it in a local viewer. This would be especially useful when using your product on a tablet.
I guess you could include viewers for other file types if they are available, but I feel that a PDF viewer would be most useful.
Thanks for the excellent software - I can't wait to see what new features are added with the next update!
I have loaded several pdf reports. Some of them, Ambar can't parse.
But he don't delete malformed files from queue.
After some time, Ambar try to parse them, again.
2017-04-07 13:42:17.512: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ the _wizzards_ of adware.pdf 'utf-8' codec can't decode byte 0xed in position 1605: invalid continuation byte 2017-04-07 13:42:19.413: [verbose] [p0] task received 7609b6aac3bff0e4bd067824dfc2925100797d2e72a1712bcf3edb1266660131 2017-04-07 13:42:19.450: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat round-up for the week of mar 6 - mar 10.pdf 2017-04-07 13:42:19.458: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat round-up for the week of mar 6 - mar 10.pdf 2017-04-07 13:42:19.778: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat round-up for the week of mar 6 - mar 10.pdf 'utf-8' codec can't decode byte 0xed in position 1075: invalid continuation byte 2017-04-07 13:42:33.929: [verbose] [p0] task received e0d13847f3079e244c9b4503635d59f0ce8840f37d410e9d6d8a0160895eb62a 2017-04-07 13:42:33.966: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat spotlight_ holiday greetings from pro pos – is your payment card data someone else’s christmas present_.pdf 2017-04-07 13:42:33.974: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat spotlight_ holiday greetings from pro pos – is your payment card data someone else’s christmas present_.pdf 2017-04-07 13:42:34.184: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat spotlight_ holiday greetings from pro pos – is your payment card data someone else’s christmas present_.pdf 'utf-8' codec can't decode byte 0xed in position 1307: invalid continuation byte 2017-04-07 13:42:34.200: [verbose] [p0] task received 7c8c073ba712de285d29877e3d01de448f6d7f0c0a4b061f700edfac0fa4d67b 2017-04-07 13:42:34.223: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat spotlight_ dyre_dyreza_ an analysis to discover the dga.pdf 2017-04-07 13:42:34.239: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat spotlight_ dyre_dyreza_ an analysis to discover the dga.pdf 2017-04-07 13:42:34.495: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat spotlight_ dyre_dyreza_ an analysis to discover the dga.pdf 'utf-8' codec can't decode byte 0xed in position 1697: invalid continuation byte 2017-04-07 13:42:35.406: [verbose] [p0] task received 073d9dde2cc486865db972a40f077660904439f0a00195bac24e33a3ebeecf72 2017-04-07 13:42:35.427: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ vulnerability deep dive - ichitaro office excel file code execution vulnerability.pdf 2017-04-07 13:42:35.436: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ vulnerability deep dive - ichitaro office excel file code execution vulnerability.pdf 2017-04-07 13:42:35.615: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ vulnerability deep dive - ichitaro office excel file code execution vulnerability.pdf 'utf-8' codec can't decode byte 0xed in position 222: invalid continuation byte 2017-04-07 13:42:38.804: [verbose] [p0] task received 7ecec8cba21bd5e0b0ee831045de5815a479f7dd20ccaaeb5ce48c30869f93ed 2017-04-07 13:42:38.840: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ threat spotlight_ teslacrypt - decrypt it yourself.pdf 2017-04-07 13:42:38.853: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ threat spotlight_ teslacrypt - decrypt it yourself.pdf 2017-04-07 13:42:39.089: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ threat spotlight_ teslacrypt - decrypt it yourself.pdf 'utf-8' codec can't decode byte 0xed in position 809: invalid continuation byte 2017-04-07 13:42:41.151: [verbose] [p0] task received 7fdb77feff9667babbe327a33f27db33c28b057c7032134a9b033b5ade09904b 2017-04-07 13:42:41.183: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ want tofsee my pictures_ a botnet gets aggressive.pdf 2017-04-07 13:42:41.194: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ want tofsee my pictures_ a botnet gets aggressive.pdf 2017-04-07 13:42:41.404: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ want tofsee my pictures_ a botnet gets aggressive.pdf 'utf-8' codec can't decode byte 0xed in position 544: invalid continuation byte 2017-04-07 13:42:43.720: [verbose] [p0] task received 8efde018b5242043d68dba573d67456712c74802c7b2ba64465793ec7daad1ff 2017-04-07 13:42:43.743: [verbose] [p0] file content received //default/cisco's talos intelligence group blog_ your files are encrypted with a _windows 10 upgrade_.pdf 2017-04-07 13:42:43.749: [verbose] [p0] parsing //default/cisco's talos intelligence group blog_ your files are encrypted with a _windows 10 upgrade_.pdf 2017-04-07 13:42:44.011: [error] [p0] error parsing //default/cisco's talos intelligence group blog_ your files are encrypted with a _windows 10 upgrade_.pdf
Hi there,
I found this great app on reddit and am trying it out, both on your public cloud and on local dev machine using docker. Now I 'd like to ask if you can add the language analyzer for Chinese (Simplified and Traditional) and Japanese?
The CJK are some tricky languages to deal with and here's what I found on stackoverflow regarding the Chinese and Japanese language analyzers: http://stackoverflow.com/questions/29098347/elasticsearch-cjk-language-analyser. Sadly there isn't one for Korean yet.
I also want to curious about how well the current Tesseract OCR tuning included in ambar works with CJK languages.
Thanks.
I want to deploy the application in the private network and get accessed from the public network. To make it safe, I want to deploy a Nginx before it, So how can i config to make it works?
Just guess the machine hold the the ambar application has no public IP, and the machine that hold the Nginx has a Public IP.
It's interesting when I find the api.local.host.port desn't work. However the external configuration works.
How can I solve it ?
Hi,
I hope you don't mind me giving another suggestion, but I think it is essential that there is a clean and easy way to delete files from the database within the GUI.
I am planning to scan documents directly to the share that Ambar will import from, so there may be occasions where the document has not scanned correctly and I will have to rescan it. This means I need to delete the old scan from the Ambar database to keep it tidy.
A delete button on the search results screen next to each document would be sufficient so the incorrect document can be removed with a click.
Thanks again for all your hard work!
I have multiple cores.... use them all!
Trying to follow the blog post's instructions. Failed here:
ERROR: In file '/home/guy/.ambar/docker-compose.yml' service 'version' doesn't have any configuration options. All top level keys in your docker-compose.yml must map to a dictionary of configuration options.
Traceback (most recent call last):
File "./ambar.py", line 224, in <module>
install(configuration)
File "./ambar.py", line 167, in install
pullImages(configuration)
File "./ambar.py", line 100, in pullImages
runShellCommandStrict("docker-compose -f {0}/docker-compose.yml pull".format(PATH))
File "./ambar.py", line 45, in runShellCommandStrict
subprocess.check_call(command, shell = True)
File "/usr/lib/python3.5/subprocess.py", line 581, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'docker-compose -f /home/guy/.ambar/docker-compose.yml pull' returned non-zero exit status 1
It seems like the api is not accessible, even though installation went without any apparent issue. During loading of the page, I get the error "Oops.... Something went wrong" at the bottom. It looks like the ambar-webapi container is restarting every 5 minutes due to not connecting to the ambar-es container?
andrew@onlyoffice:~$ sudo ./ambar.py start
______ ____ ______ ____
/\ _ \ /'\_/`\/\ _`\ /\ _ \/\ _`\
\ \ \L\ \/\ \ \ \L\ \ \ \L\ \ \ \L\ \
\ \ __ \ \ \__\ \ \ _ <'\ \ __ \ \ , /
\ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \
\ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
\/_/\/_/\/_/ \/_/\/___/ \/_/\/_/\/_/\/ /
Docker version 17.03.1-ce, build c6d412e
docker-compose version 1.11.2, build dfed245
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
ambar_db_1 is up-to-date
ambar_es_1 is up-to-date
ambar_rabbit_1 is up-to-date
ambar_frontend_1 is up-to-date
ambar_webapi_1 is up-to-date
ambar_webapi-cache_1 is up-to-date
Waiting for Ambar to start...
Ambar is running on http://10.20.30.13:80
ambar-webapi container log output:
2017/04/07 05:08:51 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://es:9200]
2017/04/07 05:08:52 Waiting for host:
2017/04/07 05:08:52 Waiting for host: es:9200
2017/04/07 05:08:52 Connected to unix:///var/run/docker.sock
2017/04/07 05:13:52 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://es:9200]
2017/04/07 05:13:52 Waiting for host:
2017/04/07 05:13:52 Waiting for host: es:9200
2017/04/07 05:13:52 Connected to unix:///var/run/docker.sock
2017/04/07 05:18:52 Timeout after 5m0s waiting on dependencies to become available: [unix:///var/run/docker.sock http://es:9200]
2017/04/07 05:18:52 Waiting for host:
2017/04/07 05:18:52 Waiting for host: es:9200
2017/04/07 05:18:52 Connected to unix:///var/run/docker.sock
ambar-es container logs:
[2017-04-07T05:22:01,567][INFO ][o.e.n.Node ] [BtkYnk-] stopping ...
[2017-04-07T05:22:01,633][INFO ][o.e.n.Node ] [BtkYnk-] stopped
[2017-04-07T05:22:01,633][INFO ][o.e.n.Node ] [BtkYnk-] closing ...
[2017-04-07T05:22:01,646][INFO ][o.e.n.Node ] [BtkYnk-] closed
[2017-04-07T05:22:03,494][INFO ][o.e.n.Node ] [] initializing ...
[2017-04-07T05:22:03,612][INFO ][o.e.e.NodeEnvironment ] [BtkYnk-] using [1] data paths, mounts [[/usr/share/elasticsearch/data (/dev/mapper/onlyoffice--vg-root)]], net usable_space [34.7gb], net total_space [46.6gb], spins? [possibly], types [ext4]
[2017-04-07T05:22:03,612][INFO ][o.e.e.NodeEnvironment ] [BtkYnk-] heap size [1007.3mb], compressed ordinary object pointers [true]
[2017-04-07T05:22:03,660][INFO ][o.e.n.Node ] node name [BtkYnk-] derived from node ID [BtkYnk-rRXGLNCk4JZeisA]; set [node.name] to override
[2017-04-07T05:22:03,665][INFO ][o.e.n.Node ] version[5.2.2], pid[1], build[f9d9b74/2017-02-24T17:26:45.835Z], OS[Linux/4.4.0-72-generic/amd64], JVM[Oracle Corporation/OpenJDK 64-Bit Server VM/1.8.0_121/25.121-b13]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [aggs-matrix-stats]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [ingest-common]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [lang-expression]
[2017-04-07T05:22:05,239][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [lang-groovy]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [lang-mustache]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [lang-painless]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [percolator]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [reindex]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [transport-netty3]
[2017-04-07T05:22:05,240][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded module [transport-netty4]
[2017-04-07T05:22:05,242][INFO ][o.e.p.PluginsService ] [BtkYnk-] loaded plugin [analysis-morphology]
[2017-04-07T05:22:05,395][WARN ][o.e.d.s.g.GroovyScriptEngineService] [groovy] scripts are deprecated, use [painless] scripts instead
[2017-04-07T05:22:08,149][INFO ][o.e.n.Node ] initialized
[2017-04-07T05:22:08,150][INFO ][o.e.n.Node ] [BtkYnk-] starting ...
[2017-04-07T05:22:08,258][WARN ][i.n.u.i.MacAddressUtil ] Failed to find a usable hardware address from the network interfaces; using random bytes: f5:84:67:88:74:e6:c5:b2
[2017-04-07T05:22:08,326][INFO ][o.e.t.TransportService ] [BtkYnk-] publish_address {172.19.0.3:9300}, bound_addresses {[::]:9300}
[2017-04-07T05:22:08,335][INFO ][o.e.b.BootstrapChecks ] [BtkYnk-] bound or publishing to a non-loopback or non-link-local address, enforcing bootstrap checks
[2017-04-07T05:22:11,400][INFO ][o.e.c.s.ClusterService ] [BtkYnk-] new_master {BtkYnk-}{BtkYnk-rRXGLNCk4JZeisA}{bcr5fJbTS6WeNLWTn3-wbg}{172.19.0.3}{172.19.0.3:9300}, reason: zen-disco-elected-as-master ([0] nodes joined)
[2017-04-07T05:22:11,419][INFO ][o.e.h.HttpServer ] [BtkYnk-] publish_address {172.19.0.3:9200}, bound_addresses {[::]:9200}
[2017-04-07T05:22:11,419][INFO ][o.e.n.Node ] [BtkYnk-] started
[2017-04-07T05:22:11,669][INFO ][o.e.g.GatewayService ] [BtkYnk-] recovered [2] indices into cluster_state
[2017-04-07T05:22:12,231][INFO ][o.e.c.r.a.AllocationService] [BtkYnk-] Cluster health status changed from [RED] to [YELLOW] (reason: [shards started [[ambar_log_record_data][7]] ...]).
I'd really like to be able to crawl an S3 bucket. I feel sketchy serving web traffic out of my Dropbox!
Attempting to setup the SMB crawler following the guide in the blog. Problem is I keep getting this red message at the bottom saying Error: undefined. There are no "i" markers on the left or highlighting to tell me where this may be or what it means. Below is my config with some slight edits.
Edits made: IP address, location, login, and password.
{
"id": "Share",
"description": "Share HDD",
"type": "smb",
"locations": [
{
"host_name": "EDITED",
"ip_address": "192.168.1.xxx",
"location": "EDITED"
}
],
"file_regex": "(\.doc[a-z]$)|(\.xls[a-z]
"credentials": {
"auth_type": "ntlm",
"login": "EDITED",
"password": "EDITED"
},
"schedule": {
"is_active": true,
"cron_schedule": "*/15 * * * *"
},
"max_file_size_bytes": 30000000,
"verbose": true
}
Could the issue be related to the location? I've tried:
"\COMPUTERNAME\SHARE"
"//COMPUTERNAME/SHARE"
"SHARE"
mkdir ambar
cd ambar/
wget -O ambar.py https://static.ambar.cloud/ambar.py && chmod +x ./ambar.py
sudo ./ambar.py install
ERROR: In file '/home/username/ambar/docker-compose.yml' service 'version' doesn't have any configuration options. All top level keys in your docker-compose.yml must map to a dictionary of configuration options.
Traceback (most recent call last):
File "./ambar.py", line 237, in
install(configuration)
File "./ambar.py", line 180, in install
pullImages(configuration)
File "./ambar.py", line 100, in pullImages
runShellCommandStrict("docker-compose -f {0}/docker-compose.yml pull".format(PATH))
File "./ambar.py", line 45, in runShellCommandStrict
subprocess.check_call(command, shell = True)
File "/usr/lib/python3.5/subprocess.py", line 581, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'docker-compose -f /home/username/ambar/docker-compose.yml pull' returned non-zero exit status 1
Not sure what I have done to deserve this treatment.
Dear Ambar,
Would it be possible to add support for Dutch (nl-NL)?
Hey there,
is it possible to remove (not just hide) an uploaded file (pdf) from the database? I need it, because I uploaded a file which is protected by copyright.
I would like to be able to specify larger thumbnails for custom API use.
Hi, I was able to download the VirtualBox image for Ambar but ran into some issues configuring it and getting it to work.
I'm putting it on my server behind an Apache web server so it's hosted with a proxypass directive on a custom hostname.
However, it's trying to hit the API on port 8004 with it's IP address, which looks like it's configurable by editing the apiURL.txt file on the container.
However, I can't gain access to the container to edit the apiURL file. Easiest would be the SSH credentials.
I'm unable to start ambar at the moment
This is all I did:
# Pull ambar.py from github
$ sudo ./ambar.py install
$ sudo ./ambar.py start
sudo ./ambar.py start
______ ____ ______ ____
/\ _ \ /'\_/`\/\ _`\ /\ _ \/\ _`\
\ \ \L\ \/\ \ \ \L\ \ \ \L\ \ \ \L\ \
\ \ __ \ \ \__\ \ \ _ <'\ \ __ \ \ , /
\ \ \/\ \ \ \_/\ \ \ \L\ \ \ \/\ \ \ \ \
\ \_\ \_\ \_\ \_\ \____/ \ \_\ \_\ \_\ \_\
\/_/\/_/\/_/ \/_/\/___/ \/_/\/_/\/_/\/ /
Docker version 17.05.0-ce, build 89658be
docker-compose version 1.10.0, build 4bd6f1a
vm.max_map_count = 262144
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_fin_timeout = 30
net.core.somaxconn = 1024
net.core.netdev_max_backlog = 2000
net.ipv4.tcp_max_syn_backlog = 2048
Creating network "ambar_internal_network" with the default driver
Creating ambar_es_1
Creating ambar_proxy_1
Creating ambar_webapi-cache_1
Creating ambar_rabbit_1
Creating ambar_db_1
Creating ambar_webapi_1
Creating ambar_frontend_1
Waiting for Ambar to start...
Ambar is running on http://192.168.1.5:80
But accessing http://192.168.1.5:80 results in a DNS Error (I think, since I'm receving a page from my ISP that said can't find this page)
Accessing http://192.168.1.5:80/api results in a 504 Gateway timeout error from Nginx
Logs:
Attaching to ambar_webapi_1
webapi_1 | 2017/06/30 15:25:00 Waiting for host:
webapi_1 | 2017/06/30 15:25:00 Waiting for host: es:9200
webapi_1 | 2017/06/30 15:25:00 Connected to unix:///var/run/docker.sock
webapi_1 | 2017/06/30 15:25:31 Received 200 from http://es:9200
webapi_1 | Crawler schedule service initialized
webapi_1 | Pipeline initialized
webapi_1 | Started on :::8080
Attaching to ambar_proxy_1
proxy_1 | Starting NGINX...
proxy_1 | 192.168.1.5 - - [30/Jun/2017:15:25:36 +0000] "GET / HTTP/1.1" 403 612 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36" "-"
proxy_1 | 2017/06/30 15:26:43 [error] 7#7: *1 upstream timed out (110: Connection timed out) while connecting to upstream, client: 192.168.1.5, server: ambar-fe, request: "GET /api/ HTTP/1.1", upstream: "http://125.235.4.59:8080/api/", host: "192.168.1.5"
proxy_1 | 192.168.1.5 - - [30/Jun/2017:15:26:43 +0000] "GET /api/ HTTP/1.1" 504 585 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36" "-"
All docker images are up to date
Error: Web API hangs. Container still shows running.
I'm seeing In the browser, "a spinning spinner"
docker logs -f ambar-webapi
/usr/src/app/dist/services/QueueProxy.js:91
throw err;
^
Error: Unexpected close
at succeed (/usr/src/app/node_modules/amqplib/lib/connection.js:270:13)
at onOpenOk (/usr/src/app/node_modules/amqplib/lib/connection.js:252:5)
at /usr/src/app/node_modules/amqplib/lib/connection.js:165:32
at /usr/src/app/node_modules/amqplib/lib/connection.js:159:12
at Socket.recv (/usr/src/app/node_modules/amqplib/lib/connection.js:497:12)
at Socket.g (events.js:260:16)
at emitNone (events.js:67:13)
at Socket.emit (events.js:166:7)
at emitReadable_ (_stream_readable.js:419:10)
at emitReadable (_stream_readable.js:413:7)
at readableAddChunk (_stream_readable.js:164:13)
at Socket.Readable.push (_stream_readable.js:111:10)
at TCP.onread (net.js:540:20)
2017/04/17 23:13:42 Command exited with error: exit status 1
docker logs -f ambar-rabbbit
=ERROR REPORT==== 17-Apr-2017::23:06:18 ===
closing AMQP connection <0.551.0> (172.19.0.1:36775 -> 172.19.0.4:5672):
missed heartbeats from client, timeout: 60s
It would be nice to be able to manipulate or remove items from the index either via API or frontend. It does look like perhaps we can work with elasticsearch API, but through the ambar-provided API would be desirable.
I would like to see an organized and browsable view of documents available in the frontend view. Perhaps some extended statistics (time spent processing?) on an individual document, and a view of the document itself and the extracted text (without having to search for a document).
Hi!
Seems like a possible duplicate to #28.
After registering an account i wanted to switch the login off. So that that no authentication is needed to reach the search page. That is my goal ... Should be possible with "auth" to "none" in the config file.
But i still get the login page when connecting to the root url.
Thank you!
Hello!
I was thinking about adding Polish language and support for users from Poland (separate documentation). I don't know if I can help with first thing, but I can definitely help with the second one.
Cheers.
Hi there!
I just stumbled upon Ambar and really like the idea as well as the frontend.
Is there any chance that one could add tags to uploaded documents?
That would open up so much more options in management and queries.
(Maybe adding a freetext / transform to tag field right next to the Download button?)
If you're interested, I would be happy to assist in the realization of features; simply because I see some great potential in Ambar compared to other DMS solutions out there.
Thanks!
Matthias
Installed clean today on clean Ubuntu 16.04 install. Verified I can connect to the share from Windows and Linux using mount -t cifs. Crawler config:
{
"id": "data",
"uid": "data_d033e22ae348aeb5660fc2140aec35850c4da997",
"description": "nas crawler",
"type": "smb",
"locations": [
{
"host_name": "nas",
"ip_address": "10.0.0.100",
"location": "data"
}
],
"file_regex": "(\.doc[a-z]$)|(\.xls[a-z]
"credentials": {
"auth_type": "ntlm",
"login": "jes",
"password": "***",
"token": ""
},
"schedule": {
"is_active": true,
"cron_schedule": "/15 * * * *"
},
"max_file_size_bytes": 30000000,
"verbose": true
}
Error:
2017-07-14 11:15:00.688: [info] filecrawler initialized
2017-07-14 11:15:00.695: [error]
2017-07-14 11:15:00.700: [error] error connecting to Smb share on nas
Notice that there is nothing by the error at all.
Also, how do I get to the logs for this system? I looked at docker logs but they said nothing about this issue. Thank you.
When items contain enclosures linking to available documents.
Hello there, it seems to be some problems when uploading files using firefox...
Here is the log file:
POST
XHR
https://mc.yandex.ru/webvisor/42850839 [HTTP/1.1 200 OK 637 ms]
OPTIONS
https://app.ambar.cloud:8004/api/files/Default/Biblia%20de%20Jerusalem.pdf [HTTP/2.0 204 No Content 250 ms]
POST
https://app.ambar.cloud:8004/api/files/Default/Biblia%20de%20Jerusalem.pdf [HTTP/2.0 400 Bad Request 255 ms]
Response { type: "cors", url: "https://app.ambar.cloud:8004/api/fi…", status: 400, ok: false, statusText: "Bad Request", headers: Headers, bodyUsed: false } app.b263d6913c2e577ef05e.js:3:6732
uploadFile Response { type: "cors", url: "https://app.ambar.cloud:8004/api/fi…", status: 400, ok: false, statusText: "Bad Request", headers: Headers, bodyUsed: false } app.b263d6913c2e577ef05e.js:7:26205
GET
XHR
https://api.mixpanel.com/track/ [HTTP/1.1 200 OK 485 ms]
GET
XHR
https://api.mixpanel.com/track/ [HTTP/1.1 200 OK 343 ms]
````
Thanks!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.