Comments (8)
Pinging @elastic/ingest-management (Team:Ingest Management)
from elastic-agent.
I think with the new Fleet Server architecture we should have the Elastic Agent send the status information directly to Fleet Server, and from there Fleet Server should append the status information to an index. This removes the need for Elastic Agent to write directly to Elasticsearch, and it ensures that Fleet Server has the updated information to perform required adjustments based on the coordinator work that Fleet Server will need to perform in the future.
from elastic-agent.
I think with the new Fleet Server architecture we should have the Elastic Agent send the status information directly to Fleet Server, and from there Fleet Server should append the status information to an index
I agree that we should send the data to Fleet Server so the coordinator can take advantage of it. I am just wondering whether the agent should also persist its status (maybe in the logs) for agent observability purposes; that would make agent status available for standalone agents too.
If we go in the direction where Fleet Server writes the agent status, we should probably implement the same in Kibana. I think we can have an index similar to uptime synthetics-agent-status-default (could this be defined in an Elastic Agent package?). Right now it is kind of tricky to write from the Kibana checkin route because we do not have permission to write to ES unless we use the fleet_enroll admin user.
from elastic-agent.
The status should be reported to Fleet Server, but I still think Elastic Agent should also persist it to ES directly as a log or similar. This means status reporting also works in the standalone case, for example.
from elastic-agent.
Agent can fail/be degraded in these cases (* meaning overall agent failure not component related):
| Component | Failure case | Message |
| --- | --- | --- |
| * | metadata loading | failed to load metadata |
| * | recover previous state | could not recover state, error %+v, skipping... |
| * | prepare FS | filesystem error from os |
| gateway | dispatch failed | failed to dispatch actions, error: %s |
| gateway | encode request | fail to encode the checkin request |
| gateway | decode response | fail to decode checkin response |
| gateway | request failed (auth, network) | fail to checkin to fleet (more detail in a message) |
| process | download - req fail | fetching package failed |
| process | download - !200 response | call to '%s' returned unsuccessful status code: %d |
| process | download - FS error | {details from os} |
| process | verify - hash not found | hash for '%s' not found |
| process | verify - hash mismatch | hash does not match |
| process | verify - asc fetch req fail | fetching asc file from %s {details} |
| process | verify - asc armored key fail | read armored key ring |
| process | verify - asc signature check | check detached signature |
| process | install - os error | e.g. "TarInstaller: error writing to %s: %v" or "tar contained invalid filename: %q" |
| process | start | failed to start '{details from go runtime}' |
| process | config | e.g. application stopping |
| agent server | | e.g. "failed to marshal connection information", "failed to write connection information" |
| upgrade | non upgradeable | agent cannot be upgraded; must be installed with install sub-command and running under control of the systems supervisor |
| upgrade | copy actions store | failed to copy action store |
| upgrade | fs errors | error details from FS |
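As a rough illustration of how the cases above could feed a single agent status, the Go sketch below records each failure with its component and rolls the reports up into an overall level. The `Level` values, the `Report` struct, and the `Overall` and `ByComponent` helpers are all assumptions made for this example; the thread does not say which case maps to degraded and which to failed.

```go
package main

// Level is a hypothetical severity scale; higher values are worse.
type Level int

const (
	Healthy Level = iota
	Degraded
	Failed
)

func (l Level) String() string {
	switch l {
	case Healthy:
		return "healthy"
	case Degraded:
		return "degraded"
	case Failed:
		return "failed"
	}
	return "unknown"
}

// Report describes one failure case from the list above.
// Component "*" denotes an overall agent failure, not a component one.
type Report struct {
	Component string
	Case      string
	Level     Level
	Message   string
}

// Overall returns the worst level across all reports,
// or Healthy when there are none.
func Overall(reports []Report) Level {
	worst := Healthy
	for _, r := range reports {
		if r.Level > worst {
			worst = r.Level
		}
	}
	return worst
}

// ByComponent returns the worst level seen for each component.
func ByComponent(reports []Report) map[string]Level {
	out := make(map[string]Level)
	for _, r := range reports {
		if r.Level > out[r.Component] {
			out[r.Component] = r.Level
		}
	}
	return out
}
```

Keeping the component alongside each report lets a consumer show both the overall agent status and which part (gateway, process, upgrade, or the agent itself) caused it.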
from elastic-agent.
Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)
from elastic-agent.
@michalpristas do you think that it's still relevant?
from elastic-agent.
Closing this as done; we have already implemented more detailed status reporting.
cc @ycombinator
from elastic-agent.