Skip to content

Improve Longevity Test #1744

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
pleshakov opened this issue Mar 21, 2024 · 1 comment
Open

Improve Longevity Test #1744

pleshakov opened this issue Mar 21, 2024 · 1 comment
Labels
refined Requirements are refined and the issue is ready to be implemented. size/medium Estimated to be completed within a week tests Pull requests that update tests
Milestone

Comments

@pleshakov
Copy link
Contributor

pleshakov commented Mar 21, 2024

As a maintainer of NGF
I want our longevity test to more closely mimic a real customer environment
So that we can discover bugs before they reach our users.

Acceptance Criteria

  • A second replica is created to test leader election.
  • The cluster name and NGF pod name are present to the generated output.
@mpstefan mpstefan added the tests Pull requests that update tests label Mar 21, 2024
@mpstefan mpstefan added this to the v1.3.0 milestone Mar 21, 2024
@mpstefan mpstefan modified the milestones: v1.3.0, v2.0.0 Apr 23, 2024
@mpstefan mpstefan added refined Requirements are refined and the issue is ready to be implemented. size/medium Estimated to be completed within a week labels Jun 24, 2024
@mpstefan mpstefan modified the milestones: v1.4.0, v2.2.0 Jul 24, 2024
@bjee19
Copy link
Contributor

bjee19 commented Apr 18, 2025

With the changes introduced in our control plane data plane split and the updates to the test in #3305, all of the nginx metrics tracked in the GKE dashboard are unavailable. We think that this is because we stopped using the NGINX Prometheus Exporter, which disqualifies us from accessing the metrics shown here https://cloud.google.com/monitoring/api/metrics_kubernetes#kubernetes-nginx.

For us to access the metrics from agent, we would need to set up some GKE specific service which would cost money.

An alternative, and possible addition to the acceptance criteria, is to set up OSS prometheus in the cluster while running the longevity tests, scrape that, and create similar graphs. We may be able to recover some of the lost nginx metrics.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
refined Requirements are refined and the issue is ready to be implemented. size/medium Estimated to be completed within a week tests Pull requests that update tests
Projects
Status: 🆕 New
Development

No branches or pull requests

3 participants