386 Commits

Author SHA1 Message Date
Paulo Pires
28a757f4da use shared informers and workqueue (#425)
* vendor: add vendored code

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* controller: use shared informers and a work queue

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* errors: use cpuguy83/strongerrors

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* aci: fix test that uses resource manager

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* readme: clarify skaffold run before e2e

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* cmd: use root context everywhere

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: refactor pod lifecycle management

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* e2e: fix race in test when observing deletions

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* e2e: test pod forced deletion

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* cmd: fix root context potential leak

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: rename metaKey

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: remove calls to HandleError

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* Revert "errors: use cpuguy83/strongerrors"

This reverts commit f031fc6d.

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* manager: remove redundant lister constraint

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: rename the pod event recorder

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: amend misleading comment

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* mock: add tracing

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: add tracing

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* test: observe timeouts

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* trace: remove unnecessary comments

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: limit concurrency in deleteDanglingPods

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: never store context, always pass in calls

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: remove HandleCrash and just panic

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: don't sync succeeded pods

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* sync: ensure pod deletion from kubernetes

Signed-off-by: Paulo Pires <pjpires@gmail.com>
v0.7.3
2018-11-30 15:53:58 -08:00
Paulo Pires
0e9cfca585 ci: move go vet to validate check (#430)
* ci: move go vet to validate check

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* cri: don't build when not on linux

Signed-off-by: Paulo Pires <pjpires@gmail.com>
2018-11-30 13:58:55 -08:00
Robbie Zhang
ea40a180f4 Install bash in the Dockerfile (#426) 2018-11-28 16:22:09 -08:00
Paulo Pires
d252a9a1d7 Merge pull request #424 from virtual-kubelet/pires/fix_pod_deletion
Fix pod deletion
2018-11-28 22:43:21 +00:00
Paulo Pires
fe7b363f10 rbac: improve and add missing verb
Signed-off-by: Paulo Pires <pjpires@gmail.com>
2018-11-28 20:31:56 +00:00
Paulo Pires
0f8ef994a3 sync: don't swallow delete errors
Signed-off-by: Paulo Pires <pjpires@gmail.com>
2018-11-28 20:31:55 +00:00
Robbie Zhang
338a7c2213 [ACI] Fallback to parse authConfig.Auth when Username field is not specified (#421)
* Fallback to parse authConfig.Auth when Username field is not specified
2018-11-28 11:33:07 -08:00
Ria Bhatia
8d264db90d updated meeting times (#420) 2018-11-28 11:12:12 -08:00
Robbie Zhang
30f9c8cf23 Add MasterURI setting in the README.md
Add MasterURI setting in the README.md
2018-11-28 10:44:06 -08:00
Paulo Pires
579823e6a5 tests: introduce e2e suite (#422)
* mock: implement GetStatsSummary

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* make: use skaffold to deploy vk

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* test: add an e2e test suite

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* test: add vendored code

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* docs: update README.md

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* ci: run e2e on circleci

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* make: improve the skaffold target

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* e2e: fix defer pod deletion

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* e2e: improve instructions

Signed-off-by: Paulo Pires <pjpires@gmail.com>

* makefile: default shell is bash

Signed-off-by: Paulo Pires <pjpires@gmail.com>
2018-11-28 09:01:36 -08:00
shidao-ytt
688c10fa8b [AliCloud] Add cluster name and bug fix (#412)
* [AliCloud] add tag ClusterName for different k8s cluster

* [AliCloud] BugFix: vk get wrong pod ip

* update serverless kubernetes help doc link
2018-11-26 11:08:37 -08:00
Robbie Zhang
f07daf494b BugFix: secret data missing tail newline (#406) 2018-11-26 11:01:00 -08:00
Brian Goff
662afccd0e Merge pull request #416 from cpuguy83/update_contributing
Update docs for how to add a provider
2018-11-26 10:35:30 -08:00
Brian Goff
4cb94dd19b Update docs for how to add a provider 2018-11-21 10:04:36 -08:00
Brian Goff
023fd1214a Merge pull request #410 from tariq1890/add_dep_check
Replace dep validation with dep check and resyncing Gopkg.lock
2018-11-15 12:37:25 -08:00
tariqibrahim
bf61f7f2e0 Replace dep validation with dep check and resyncing Gopkg.lock 2018-11-14 14:12:29 -08:00
Brian Goff
cd42fdd7b8 Use strongerrors in provider implementations. (#403)
This ensures that we can catch certain types of errors from providers
and handle accordingly in the core. There is still more to do here to
improve that but this resolves an immediate need to know why a Delete
failed.

vic provider was not updated since I could not figure out where to get
this information.
2018-11-07 16:02:48 -08:00
Brian Goff
1d76b1341e Merge pull request #390 from cpuguy83/update_on_provider_delete
Fix a case where provider pod status is not found
2018-11-06 16:23:36 -08:00
Robbie Zhang
edb8c516f3 Merge branch 'master' into update_on_provider_delete 2018-11-06 16:21:36 -08:00
Brian Goff
8cc888176a Merge pull request #387 from cpuguy83/ocagent_exporter
Add ocagent exporter
2018-11-06 16:20:55 -08:00
Brian Goff
aee1fde504 Fix a case where provider pod status is not found
Updates the pod status in Kubernetes to "Failed" when the pod status is
not found from the provider.

Note that currently thet most providers return `nil, nil` when a pod is
not found. This works but should possibly return a typed error so we can
determine if the error means not found or something else... but this
works as is so I haven't changed it.
2018-11-06 16:11:42 -08:00
Brian Goff
bec818bf3c Do not close pod sync, use context cancel instead. (#402)
Closing the channel is racey and can lead to a panic on exit.
Instead rely on context cancellation to know if workers should exit.
2018-11-05 11:37:00 -08:00
Andrey Moor
2743c07222 Fix loganalytics settings in chart's secret.yaml file (#388)
* Fix chart secret.yaml file

* Update packages
2018-11-05 10:38:52 -08:00
Brian Goff
180cf34c3e Merge pull request #395 from cpuguy83/1_worker_default
Use 1 worker by default
v0.7.2
2018-11-02 12:42:49 -07:00
Jeremy Rickard
55bd7ebaed Merge branch 'master' into 1_worker_default 2018-11-02 13:37:06 -06:00
Brian Goff
f10596562d Use 1 worker by default
This is to work around an issue with concurrent workers and makes the
default config just work.
2018-11-02 12:35:26 -07:00
Carlos Mendible
4edd39a8ef Fixed typo on helm install sample (#392)
Fixed typo on helm install sample. Replaced providers.azure.clientID with provider.azure.clientId so the values are correctly parsed.
2018-11-02 15:04:50 +00:00
shidao-ytt
77c3315da4 [AliCloud] Add init container (#382)
* clean up eci sdk code

remove unnecessary code

* add init containers support
2018-11-01 15:57:30 -07:00
Brian Goff
143d9f71cc Add ocagent exporter
ocagent allows users to send traces out of VK to a "standard"  external forwarder
(standard as in this is a format/protocol defined in the opencensus project).

This allows uers to implement metrics for whatever backend they want
without having to add it into VK directly.
2018-10-31 14:31:13 -07:00
Ria Bhatia
a9ff4fed4a adding vnet documentation changes (#354)
* adding vnet changes

* format error

* change

* add exec

* added sp instructions

*  adding output

* adding link

* name change

* node selector change

*  plz be the last change

* vnet changes

* vnet changes, took out sp variables

* Adding more changes

* aci subnet

* added master uri

* fixed stuff

* adding master uri again

* add /
2018-10-30 23:43:17 +00:00
Robbie Zhang
ebc9463783 [ACI] Set the ACI_EXTRA_USER_AGENT in the helm charts (#380)
* Set the ACI_EXTRA_USER_AGENT in the helm charts

* Update the latest helm charts
2018-10-22 14:42:25 -07:00
Jeremy Rickard
1d22783951 Merge pull request #381 from cpuguy83/fix_typo_alicloud
Typo in build tag for alicloud provider
2018-10-19 13:46:47 -06:00
Brian Goff
4b6f79111f Typo in build tag for alicloud provider
It's what I do...
2018-10-19 12:22:25 -07:00
Robbie Zhang
5991b29f76 [ACI] Support to add extra user agent to the request (#378)
* Add UserAgent

* Change to join the user agent with a whitespace and set in the header

* Remove empty entry from user agent
v0.7.1
2018-10-18 14:28:43 -07:00
Brian Goff
0dbc88826c Merge pull request #379 from robbiezhang/on-delete
[VK] Fix the potential runtime type casting error
2018-10-18 13:17:02 -07:00
robbiezhang
966c76368f user %T instead of reflect.TypeOf 2018-10-18 20:06:03 +00:00
robbiezhang
a6bab6e3bb Fix the potential runtime type casting error 2018-10-18 19:15:05 +00:00
Robbie Zhang
4a7b74ed42 [VK] Use Cache controller and Make create/delete pod Concurrently (#373)
* Add k8s.io/client-go/tools/cache package

* Add cache controller

* Add pod creator and terminator

* Pod Synchronizer

* Clean up

* Add back reconcile

* Remove unnecessary space in log

* Incorprate feedbacks

* dep ensure

* Fix the syntax error

* Fix the merge errors

* Minor Refactor

* Set status

* Pass context together with the pod to the pod channel

* Change to use flag to specify the number of pod sync workers

* Remove the unused const

* Use Stable PROD Region WestUS in Test

EastUS2EUAP is not reliable
v0.7.0
2018-10-16 17:20:02 -07:00
Brian Goff
b082eced13 Merge pull request #377 from robbiezhang/master
[ACI] A few bug fix for the Network Setup
v0.6.4
2018-10-16 13:58:04 -07:00
robbiezhang
10cf53cc9c Use a hash of the subnetID instead of virtual node name to avoid conflict 2018-10-16 20:31:39 +00:00
robbiezhang
c193fbe3c0 Remove the validation on the Subnet IP Configuration Profile
NRP used to add a profile entry when a service association link is added to the subnet

Now, the profile entry is added when a network profile is created

This breaks the current validation
2018-10-16 19:52:23 +00:00
Brian Goff
f5c47d59c4 Merge pull request #376 from robbiezhang/master
[VK] Change the default taint effect to NoSchedule
v0.6.3
2018-10-15 12:59:49 -07:00
robbiezhang
055f5a2e01 Change the default taint effect to NoSchedule 2018-10-15 19:46:42 +00:00
Brian Goff
957ee81634 Merge pull request #375 from robbiezhang/network-sdk
[ACI] Update the network SDK and add more validations
2018-10-13 11:45:10 -07:00
Brian Goff
c1fe923131 Minor refactorings (#368)
* Split vkubelet funcitons into separate files.

* Minor re-org for cmd/census*

* refactor run loop
2018-10-12 17:36:37 -07:00
robbiezhang
eb77d5686f Update the network sdk and add more validations 2018-10-13 00:16:19 +00:00
Brian Goff
d710e0391c Merge pull request #369 from robbiezhang/master
Update kubernetes projects version in vendor to 1.12.1
2018-10-11 10:06:20 -07:00
Brian Goff
d9e37b9fcc Merge pull request #374 from robbiezhang/fix-metrics-test
[ACI] Make the metrics test more robust in multi-container case
2018-10-11 08:16:03 -07:00
Robbie Zhang
8a5d5b670f Make the metrics test more robust in multi-container case 2018-10-10 18:43:37 -07:00
Robbie Zhang
05595835f1 Make default waiting reason to be Creating (#372) 2018-10-10 10:40:41 -07:00