Git Product home page Git Product logo

slamby-api's People

Contributors

attilaersek avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

slamby-api's Issues

Test

Document / Copy

Document / Bulk add

Document / Filter

Document / Sample

  • Sample test with percentage for status code
  • Sample test with percentage for size check
  • Sample test with fix fix size for status code
  • Sample test with fix size for size check
  • Sample test for different fields variation
  • Sample test for tag field test. Status code and size check

Statistics / Get

Process / Get / Get inactive

Resources / Get

Services / Get all services

Search engine service (MVP)

Smart search engine as a service. Optional service paring + smart result. Elastic search with improved settings + smarter search using our technology + related services such as PRC for keyword extraction + related products + classifier service suggesting categories.

Features:

  • Search
  • Classifier Service Integration
  • Service with create, prepare and activate functions
  • Typo auto fix
  • Activation settings as default
  • Settings: autoCompleteSettings, SearchSettings, ClassifierSettings
  • SearchSettings: filter, weights, responseFieldList, searchFieldList
  • Classifier result validation flag: searchResultMatch field: true, false.
  • Search history save
  • useDefaultFilter for get search
  • useDefaultWeights
  • Order settings (by field, asc, desc)
  • Total search count
  • Highlight
  • Auto Complete

Search Service activation error.

Server: Skye

Api version: 1.4.17030.02
Insight version: 1.3.17026.01

I created two new services, one classifier and one search service.

After activating the classifier service, I try to activate the search service. The search service settings contains the related classifier service settings as well. Fatal error occures.

Storage capacity check

Make a middleware or action filter, if the HDD do not have enough capacity then some of the request (document add, document bulk add, tag add, service prepare... etc) should return with a human readable error message.

Service prepare double call

If a service prepare (or activate) calling two times in a short time that can cause the process to start twice

Endpoint statistics

The API store the endpoint statistics (hit numbers) in Redis.
There can be an endpoint to show these statistics for the user.

delete processes?

A user can have hundreds or thousands of processes, after a time. And can be a big (and slow) response if you want to get all.
Maybe a "delete all processes" endpoint can be useful.

PRC recommendation across datasets

When we create a PRC service on dataset A, the service can only recommend documents from the A dataset. That would be useful if the service can recommend documents across the datasets

High Availability (HA) feature

Let the user create multiple API (mainly 2) instances and make them work together in a cluster. The main goal is the high availability

Dataset groups

Grouping datasets by labels. Create labels and assign datasets to labels. Display datasets grouping by labels via Insight. Visual help for better dataset management.

Processes 30 days list + all time

Process list returns all the processes from the last 30 days as default. There is a new settings option to set the process time interval.

Classifier Service parent tag filter problem

If the ParentIdList contains a tag that is not prepared (which is a usual case) then it says:
"There is at least one parent tag in the ParentTagIdList which is not a parent of any activated tags!"

HA (High Availability) research

Basic market research about the available SLA solutions. Preparing for SLA development capability built-in the Slamby API server.

Estimated time need: 3 days.

File Manager (MVP)

Storing and managing files on the Slamby API server. List files and folders, create folders, delete files and folders, move and copy files, download files.

PRC keywords tag id is not required

When there is a new PRC request, tagId filed is not required anymore. When tagId is empty, PRC predict the most suitable tagId and uses it.

Filters with invalid Query can cause 500

Wherever the Query property is used, the API pass the query as it is to the elasticsearch. So it can fail. But the API not response the error message from the elasticsearch (which is usually human readable) rather just return with an 500.

It can be better if we return with something like "QueryParserException" and the error message from the elasticsearch?

Log management

Manage logs into files, and make them available via file manager. + log settings with log levels.

SearchFieldList validation problem

Th API validate if the given field is a valid field in the dataset.
But the validation failed for boosted fields, for e.g.: title^2 but (in this case) the title is a valid field
Validation should allow the boosted fields also

concurrent requests limit

Basic research about max connection number. What happen when 1000 requests hit the API server? Any settings?

Twister QC

Quality measurement during/after the training process. Top result measurement, + preciseness measurement. Integrating preciseness value in classifier service recommend. Visualize result data.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.