Changes in klog and logr have made automatic bumps from Dependabot
problematic.
We also shouldn't need klog v1 anymore, so that dependency has been removed.
Signed-off-by: Brian Goff <cpuguy83@gmail.com>
In error cases these goroutines never exit.
When trying to debug such cases we end up with a bunch of these goroutines
stuck, which makes troubleshooting difficult.
We could just use a buffered channel, but that would make it less clear,
when an error occurs, what is actually happening.
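A minimal sketch of the failure mode (the names here are illustrative, not
the actual virtual-kubelet code):

package main

import (
	"context"
	"errors"
	"fmt"
)

// doWork launches a worker that reports its result on an unbuffered
// channel. If the caller bails out on ctx.Done() first, the worker
// blocks on its send forever and the goroutine leaks. Making the
// channel buffered (make(chan error, 1)) avoids the leak, at the cost
// of hiding what is still in flight when an error occurs.
func doWork(ctx context.Context) error {
	result := make(chan error) // unbuffered: the send blocks until received
	go func() {
		result <- errors.New("boom") // stuck here if the select already returned
	}()
	select {
	case err := <-result:
		return err
	case <-ctx.Done():
		return ctx.Err()
	}
}

func main() {
	ctx, cancel := context.WithCancel(context.Background())
	cancel() // force the error path
	fmt.Println(doWork(ctx))
}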
This starts watching for events prior to the start of the controller.
This smells like a bug in the fakeclient bits, but it seems to fix
the problem.
Signed-off-by: Sargun Dhillon <sargun@sargun.me>
We had added an optimization to dedupe pod status updates from the
provider. It ignored two subfields that can be updated along with status.
Because the details of subresource updating are somewhat API-server-centric,
I wrote an envtest which checks for this behaviour.
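A sketch of the dedupe comparison after the fix. The choice of subfields
here (labels and annotations, which a status-subresource update can also
carry) is an assumption for illustration, not the verbatim fix:

package main

import (
	"fmt"
	"reflect"

	corev1 "k8s.io/api/core/v1"
)

// podStatusEqual reports whether two pods are equal for the purpose of
// deduping provider status updates. Comparing Status alone misses
// subfields that can change alongside it.
func podStatusEqual(a, b *corev1.Pod) bool {
	return reflect.DeepEqual(a.Status, b.Status) &&
		reflect.DeepEqual(a.Labels, b.Labels) &&
		reflect.DeepEqual(a.Annotations, b.Annotations)
}

func main() {
	a := &corev1.Pod{}
	b := a.DeepCopy()
	b.Labels = map[string]string{"updated": "true"}
	fmt.Println(podStatusEqual(a, b)) // false: a label change must not be deduped
}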
Signed-off-by: Sargun Dhillon <sargun@sargun.me>
This refactor is preparation for another commit: I want to add instrumentation
around our queues. Queue-handling code was spread throughout the code base,
which made adding such instrumentation complicated.
This centralizes the queue management logic in queue.go, and only requires
the user to provide a name, a handler, and, optionally, a custom rate
limiter, roughly as sketched below.
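A sketch of the resulting shape (not the exact queue.go signatures):

package main

import (
	"context"
	"fmt"

	"k8s.io/client-go/util/workqueue"
)

// Queue bundles a rate-limited workqueue with the handler that drains it,
// so construction and instrumentation live in one place.
type Queue struct {
	name    string
	wq      workqueue.RateLimitingInterface
	handler func(ctx context.Context, key string) error
}

// New builds a queue from an optional custom rate limiter, a name, and a
// handler invoked for every dequeued key.
func New(rl workqueue.RateLimiter, name string, handler func(ctx context.Context, key string) error) *Queue {
	if rl == nil {
		rl = workqueue.DefaultControllerRateLimiter()
	}
	return &Queue{
		name:    name,
		wq:      workqueue.NewNamedRateLimitingQueue(rl, name),
		handler: handler,
	}
}

func main() {
	q := New(nil, "pods", func(ctx context.Context, key string) error {
		fmt.Println("handling", key)
		return nil
	})
	q.wq.Add("default/mypod")
	fmt.Println(q.name, "len:", q.wq.Len())
}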
The lease code is moved into its own package to simplify testing: the
goroutine leak tester was triggering incorrectly when other tests were
running, because it was measuring leaks from those tests.
This also identified buggy behaviour:
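// repro against k8s.io/client-go/util/workqueue (plus "fmt"):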
wq := workqueue.NewNamedRateLimitingQueue(workqueue.DefaultItemBasedRateLimiter(), "test")
wq.AddRateLimited("hi")
fmt.Printf("Added hi, len: %d\n", wq.Len())
wq.Forget("hi")
fmt.Printf("Forgot hi, len: %d\n", wq.Len())
wq.Done("hi")
fmt.Printf("Done hi, len: %d\n", wq.Len())
---
Prints all 0s, because even non-delayed items are delayed. If you call Add
directly instead, the last line prints a len of 2.
// Workqueue docs:
// Forget indicates that an item is finished being retried. Doesn't matter whether it's for perm failing
// or for success, we'll stop the rate limiter from tracking it. This only clears the `rateLimiter`, you
// still have to call `Done` on the queue.
^----- Even this seems untrue
If a pod is being gracefully deleted when the pod controller starts up,
it will not get deleted via the deleteDanglingPods code. This ensures
the normal deletion loop covers that case.
This moves from forcefully deleting pods to deleting them gracefully
via the API server: we wait for the pod to reach a terminal status
(as sketched below) before deleting the pod object from the API server.
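The terminal check is conceptually this (a sketch; the helper name is
illustrative):

package example

import (
	corev1 "k8s.io/api/core/v1"
)

// terminal reports whether a pod has reached a phase from which it will
// never run again, and so can safely be removed from the API server.
func terminal(pod *corev1.Pod) bool {
	return pod.Status.Phase == corev1.PodSucceeded ||
		pod.Status.Phase == corev1.PodFailed
}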
This removes the legacy sync provider interface. All new providers
are expected to implement the async NotifyPods interface.
The legacy sync provider interface creates complexities around
how the deletion flow works, and the mixed sync and async APIs
block us from evolving functionality.
This collapses the NotifyPods interface into the PodLifecycleHandler
interface.
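The combined interface is roughly of this shape (a sketch, not the
verbatim definition):

package node // illustrative package name

import (
	"context"

	corev1 "k8s.io/api/core/v1"
)

// PodLifecycleHandler is the single interface a provider now implements;
// the async NotifyPods is part of it rather than a separate interface.
type PodLifecycleHandler interface {
	CreatePod(ctx context.Context, pod *corev1.Pod) error
	UpdatePod(ctx context.Context, pod *corev1.Pod) error
	DeletePod(ctx context.Context, pod *corev1.Pod) error
	GetPod(ctx context.Context, namespace, name string) (*corev1.Pod, error)
	GetPodStatus(ctx context.Context, namespace, name string) (*corev1.PodStatus, error)
	GetPods(ctx context.Context) ([]*corev1.Pod, error)

	// NotifyPods asynchronously delivers pod updates to the controller
	// via the supplied callback.
	NotifyPods(ctx context.Context, cb func(*corev1.Pod))
}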
Allows callers to wait for pod controller exit in addition to readiness.
This means the caller does not have to handle errors from the pod
controller running in a goroutine, since it can wait for exit via `Done()`
and check the error with `Err()`.
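Callers can then drive the controller like this (a sketch; the interface
below just captures the surface this commit adds):

package example

import (
	"context"
	"log"
)

type podController interface {
	Run(ctx context.Context, workers int) error
	Ready() <-chan struct{}
	Done() <-chan struct{}
	Err() error
}

func run(ctx context.Context, pc podController) error {
	go func() { _ = pc.Run(ctx, 10) }() // errors surface via Done()/Err(), not here

	select {
	case <-pc.Ready():
		log.Println("pod controller ready")
	case <-pc.Done():
		return pc.Err() // exited before ever becoming ready
	}

	<-pc.Done() // wait for exit instead of plumbing errors out of the goroutine
	return pc.Err()
}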
We poll legacy providers for their pods' status periodically, because we
have no way of knowing when a pod is updated. If a pod somehow goes
missing in the provider, that state must be handled: currently we update
the API server and mark the pod as failed, or ignore it.
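The polling loop is conceptually this (a sketch; the interval and the
markFailed callback are illustrative):

package example

import (
	"context"
	"time"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/util/wait"
)

// legacyProvider is the poll-only surface assumed for this sketch.
type legacyProvider interface {
	GetPods(ctx context.Context) ([]*corev1.Pod, error)
	GetPodStatus(ctx context.Context, namespace, name string) (*corev1.PodStatus, error)
}

// pollPodStatuses re-reads every pod's status on a fixed interval, since
// a legacy provider cannot notify us of changes.
func pollPodStatuses(ctx context.Context, p legacyProvider, markFailed func(*corev1.Pod)) {
	wait.UntilWithContext(ctx, func(ctx context.Context) {
		pods, err := p.GetPods(ctx)
		if err != nil {
			return
		}
		for _, pod := range pods {
			if _, err := p.GetPodStatus(ctx, pod.Namespace, pod.Name); err != nil {
				// The pod went missing in the provider: reflect that in
				// the API server as failed (or ignore it, per policy).
				markFailed(pod)
			}
		}
	}, 30*time.Second)
}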
If the informers are starting at the same time as createPods, we can get
into a situation where the pod seems to get "lost". Instead, we wait for
the informer to sync before the create-pod event (see the sketch below).
This also moves to one informer as a micro-optimization in the tests.
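The ordering fix looks roughly like this (a sketch using a fake clientset):

package main

import (
	"fmt"
	"time"

	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes/fake"
	"k8s.io/client-go/tools/cache"
)

func main() {
	client := fake.NewSimpleClientset()
	factory := informers.NewSharedInformerFactory(client, 30*time.Second)
	podInformer := factory.Core().V1().Pods().Informer()

	stop := make(chan struct{})
	defer close(stop)
	factory.Start(stop)

	// Block until the informer has synced before creating test pods;
	// otherwise a pod created during startup can look "lost".
	if !cache.WaitForCacheSync(stop, podInformer.HasSynced) {
		panic("informer failed to sync")
	}
	fmt.Println("informer synced; safe to create pods")
}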
The lifecycle test had a hot loop: it ran a never-yielding function while
processing was going on elsewhere. This inserts a sleep. A sleep is used
rather than a yield to be kind to people's battery life.
It turns out that running atomic loads in a tight loop breaks Go: the
goroutine never yields control to the scheduler (tight loops that make no
function calls are not preempted by older Go runtimes), so we ended up in
a situation where the test would get stuck forever. This moves to a
different model, with a condition variable instead of atomics in loops.
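A sketch of the pattern (illustrative names; not the test code itself):

package main

import (
	"fmt"
	"sync"
)

// state lets waiters block on a condition variable instead of spinning
// on an atomic load, so they never starve the scheduler.
type state struct {
	mu    sync.Mutex
	cond  *sync.Cond
	count int
}

func newState() *state {
	s := &state{}
	s.cond = sync.NewCond(&s.mu)
	return s
}

func (s *state) bump() {
	s.mu.Lock()
	s.count++
	s.mu.Unlock()
	s.cond.Broadcast()
}

// waitFor blocks, without burning CPU, until count reaches n.
func (s *state) waitFor(n int) {
	s.mu.Lock()
	defer s.mu.Unlock()
	for s.count < n {
		s.cond.Wait()
	}
}

func main() {
	s := newState()
	go func() {
		for i := 0; i < 3; i++ {
			s.bump()
		}
	}()
	s.waitFor(3)
	fmt.Println("done:", s.count)
}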
* Fix the deletion test to actually test the pod is deleted
* Fix the update pods test to update a value which is allowed
to be updated
* Shut down watches after tests
* Do not delete pod statuses on DeletePod in mock_test
This intentionally leaks pod statuses, but it makes the situation
much less complicated when handling race conditions with the
GetPodStatus callback.
This makes sure the update function works correctly if the pod spec is
changed after the pod is running. While writing the test, I realized we
were accessing counters from outside the goroutines the test workers were
running in, with no locks. Therefore, I converted all of those counters
to atomics (the standard sync/atomic pattern, sketched below).
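A sketch of the pattern (counter names are illustrative):

package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

func main() {
	var created, updated int64 // counters shared with worker goroutines

	var wg sync.WaitGroup
	for i := 0; i < 4; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			atomic.AddInt64(&created, 1) // safe concurrent increment
			atomic.AddInt64(&updated, 1)
		}()
	}
	wg.Wait()

	// Reads from the test goroutine also go through atomic loads.
	fmt.Println(atomic.LoadInt64(&created), atomic.LoadInt64(&updated))
}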