slamby / slamby-api
Slamby API under AGPL-3.0 license
Home Page: https://www.slamby.com/api
License: GNU Affero General Public License v3.0
I experienced it at GET /tags endpoint
Smart search engine as a service. Optional service pairing + smart result. Elasticsearch with improved settings + smarter search using our technology + related services such as PRC for keyword extraction + related products + classifier service suggesting categories.
Features:
default
Server: Skye
Api version: 1.4.17030.02
Insight version: 1.3.17026.01
I created two new services, one classifier and one search service.
After activating the classifier service, I try to activate the search service. The search service settings contain the related classifier service settings as well. A fatal error occurs.
Hi,
Could we check the geo_point field type during dataset creation using a schema?
This would be a general check of creating and using the geo_point field type.
Further documentation: https://www.elastic.co/guide/en/elasticsearch/reference/current/geo-point.html
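As a rough illustration of what such a check might cover, the sketch below validates three of the geo_point input formats described in the Elasticsearch documentation: an object with lat/lon, a "lat,lon" string, and a [lon, lat] array. The function name is_valid_geo_point is hypothetical and not part of the Slamby API; geohash and WKT inputs are deliberately not handled here.

```python
from numbers import Real


def _in_range(lat, lon):
    """Latitude must lie in [-90, 90] and longitude in [-180, 180]."""
    return -90 <= lat <= 90 and -180 <= lon <= 180


def _is_number(value):
    # bool is a subclass of int, so exclude it explicitly.
    return isinstance(value, Real) and not isinstance(value, bool)


def is_valid_geo_point(value):
    """Hypothetical helper: accept three common geo_point representations."""
    if isinstance(value, dict):
        # Object form: {"lat": 41.12, "lon": -71.34}
        lat, lon = value.get("lat"), value.get("lon")
        return _is_number(lat) and _is_number(lon) and _in_range(lat, lon)
    if isinstance(value, str):
        # String form: "lat,lon"
        parts = value.split(",")
        if len(parts) != 2:
            return False
        try:
            lat, lon = float(parts[0]), float(parts[1])
        except ValueError:
            return False
        return _in_range(lat, lon)
    if isinstance(value, (list, tuple)) and len(value) == 2:
        # Array form uses GeoJSON order: [lon, lat]
        lon, lat = value
        return _is_number(lat) and _is_number(lon) and _in_range(lat, lon)
    return False
```

Note that the array form reverses the order of the string form, which is an easy mistake to make when testing the field type by hand.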
Add a middleware or action filter: if the HDD does not have enough free capacity, some requests (document add, document bulk add, tag add, service prepare, etc.) should return a human-readable error message.
Under the e-mail input tag, please add a small section with a checkbox and a short text about accepting our licenses.
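The core of such a check could look like the sketch below. It is written in Python for brevity rather than as an actual middleware for the API; ensure_free_space, InsufficientDiskSpaceError, and the threshold parameter are all hypothetical names chosen for illustration.

```python
import shutil


class InsufficientDiskSpaceError(Exception):
    """Raised when a write request is rejected because the disk is nearly full."""


def ensure_free_space(path, min_free_bytes, disk_usage=shutil.disk_usage):
    """Raise a readable error if free space at `path` is below the threshold.

    `disk_usage` is injectable so the check can be tested without a real disk.
    """
    free = disk_usage(path).free
    if free < min_free_bytes:
        raise InsufficientDiskSpaceError(
            f"Not enough free disk space: {free} bytes available, "
            f"at least {min_free_bytes} bytes required."
        )
    return free
```

A request filter would call this before any write operation and translate the exception into an error response, leaving read-only requests unaffected.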
E.g.:
Copy to, move to, copy all to, and move all to should be organized into background processes.
If a service prepare (or activate) is called twice within a short time, the process can start twice.
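One possible way to avoid the double start is to make the "is it already running?" check and the "mark as running" step a single atomic operation. The sketch below shows the idea with an in-memory lock; ProcessRegistry and its methods are hypothetical, and a deployment with several API instances would need a shared store instead of process-local state.

```python
import threading


class ProcessRegistry:
    """Hypothetical in-memory guard that lets only one run per service start."""

    def __init__(self):
        self._lock = threading.Lock()
        self._running = set()

    def try_start(self, service_id):
        """Return True if the caller may start the process, False if one is running."""
        with self._lock:
            if service_id in self._running:
                return False
            self._running.add(service_id)
            return True

    def finish(self, service_id):
        """Mark the process as finished so a later request can start it again."""
        with self._lock:
            self._running.discard(service_id)
```

The second of two near-simultaneous prepare calls would get False from try_start and could then return a conflict-style error instead of launching a duplicate process.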
The API stores the endpoint statistics (hit numbers) in Redis.
There could be an endpoint that shows these statistics to the user.
Over time, a user can accumulate hundreds or thousands of processes, and requesting all of them can produce a large (and slow) response.
A "delete all processes" endpoint might be useful.
When we create a PRC service on dataset A, the service can only recommend documents from dataset A. It would be useful if the service could recommend documents across datasets.
Let the user create multiple API instances (mainly 2) and make them work together in a cluster. The main goal is high availability.
Grouping datasets by labels. Create labels and assign datasets to labels. Display datasets grouped by labels in Insight. Visual help for better dataset management.
The process list returns all processes from the last 30 days by default. There is a new settings option to set the process time interval.
If the ParentIdList contains a tag that is not prepared (which is a usual case) then it says:
"There is at least one parent tag in the ParentTagIdList which is not a parent of any activated tags!"
Basic market research about the available SLA solutions. Preparing for SLA development capability built into the Slamby API server.
Estimated time needed: 3 days.
Storing and managing files on the Slamby API server. List files and folders, create folders, delete files and folders, move and copy files, download files.
JMeter test file + test process + build process integration. Adding tests and tutorials to the API.
Statistics endpoint for local statistics. Basic usage statistics with aggregated endpoint data.
Built-in filter option for classifier service recommend. Filter by tags during the recommendation.
The api/Documents/Copy and api/Documents/Move endpoints return only a simple HTTP status code.
They should return BulkResults, just like the api/Documents/Bulk endpoint, so that the caller gets information about the documents that failed to copy or move.
When there is a new PRC request, the tagId field is no longer required. When tagId is empty, PRC predicts the most suitable tagId and uses it.
Wherever the Query property is used, the API passes the query to Elasticsearch as it is, so the query can fail. However, the API does not return the error message from Elasticsearch (which is usually human-readable); it just returns a 500.
Would it be better to return something like "QueryParserException" together with the error message from Elasticsearch?
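The proposal can be sketched as a thin wrapper around the search call. Everything here is hypothetical: UpstreamQueryError stands in for whatever error the Elasticsearch client raises on a rejected query, and the choice of a 400 status and the response shape are assumptions, not the API's actual behavior.

```python
class UpstreamQueryError(Exception):
    """Hypothetical stand-in for the error raised when Elasticsearch rejects a query."""

    def __init__(self, reason):
        super().__init__(reason)
        self.reason = reason


def run_query(search, query):
    """Run `search(query)` and turn a rejected query into a 400-style response."""
    try:
        return {"status": 200, "body": search(query)}
    except UpstreamQueryError as err:
        # Pass the upstream reason through instead of hiding it behind a 500.
        return {
            "status": 400,
            "body": {"error": "QueryParserException", "message": err.reason},
        }
```

Only the query-related error is caught, so unrelated failures would still surface as server errors.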
500 Internal Server Error handling when a bad request is sent to Elasticsearch.
Manage logs into files, and make them available via file manager. + log settings with log levels.
Elasticsearch and NEST 5.0 have been officially released. We should upgrade and refactor the code accordingly.
Release notes:
https://www.elastic.co/blog/nest-5-0-released
https://www.elastic.co/guide/en/elasticsearch/reference/5.0/release-notes-5.0.0.html
The API validates whether the given field is a valid field in the dataset.
However, the validation fails for boosted fields, e.g. title^2, even though (in this case) title is a valid field.
Validation should allow boosted fields as well.
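A minimal sketch of the fix, assuming the boost is always a trailing ^ followed by a number: strip the suffix before comparing against the dataset's field names. The helper names strip_boost and invalid_fields are hypothetical.

```python
import re

# Matches an optional boost suffix such as "^2" or "^1.5" at the end of a field name.
_BOOST_SUFFIX = re.compile(r"\^\d+(\.\d+)?$")


def strip_boost(field):
    """Return the field name without its boost suffix, if any."""
    return _BOOST_SUFFIX.sub("", field)


def invalid_fields(requested, dataset_fields):
    """Return requested fields that do not exist in the dataset, ignoring boosts."""
    known = set(dataset_fields)
    return [f for f in requested if strip_boost(f) not in known]
```

With this approach title^2 validates against title, while a boosted name that does not exist in the dataset is still reported in its original form.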
Basic research about the maximum number of connections. What happens when 1000 requests hit the API server? Are there any relevant settings?
Quality measurement during/after the training process. Top result measurement, + preciseness measurement. Integrating preciseness value in classifier service recommend. Visualize result data.