docs(monitoring): use alloy instead of promtail#260
Conversation
✅ Deploy Preview for metal-stack-io ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
vknabel
left a comment
There was a problem hiding this comment.
Just a few nits. Waiting for review and merge of actual changes before a full approve.
219065f to
a367764
Compare
Gerrit91
left a comment
There was a problem hiding this comment.
Thanks for updating the docs accordingly. This section really misses a lot of information.
|
|
||
|  | ||
|
|
||
| The diagram above shows the full monitoring and logging stack: partition hosts ship logs to Loki and expose metrics for Prometheus scraping; control-plane and Gardener seed Alloy instances push both logs and self-metrics centrally; Grafana provides unified dashboards and alerting across all tiers. |
There was a problem hiding this comment.
I think regarding Thanos this diagram is kind of wrong, too. Can you maybe correct it, too? Add that partition prometheuses remote write their metrics to it.
|
|
||
| #### Gardener | ||
|
|
||
| Gardener ships with a built-in logging stack (Vali + fluent-bit per seed). The metal-stack deployment disables this stack and instead uses Alloy to forward all logs centrally — giving platform operators a single place to query infrastructure logs across all Gardener clusters. |
There was a problem hiding this comment.
Gardener logging must not be disabled. It's just that we do it. Users can always decide to enable it. We just deploy our own logging in addition to their logging because we have an own centralized control plane.
| - `grafana-dashboard-sonic-exporter` | ||
|
|
||
| and also some gardener related dashboards: | ||
| Metrics are supplied by |
There was a problem hiding this comment.
These are not the only metrics, these are just additional metrics exporters that we have.
| - `ipmi-exporter` | ||
| - `sonic-exporter` | ||
| - `metal-core` | ||
| - `frr-exporter` |
There was a problem hiding this comment.
Unfortunately, the frr-exporter is deployable for some reason, maybe we should remove it from the list.
Description
Update monitoring docs to reflect metal-stack/metal-roles#552.
Only merge this, after metal-stack/metal-roles#592 and metal-stack/metal-roles#595 are merged and released.
Used AI-Tools ✨
References: