Commit Graph

9854 Commits

Author SHA1 Message Date
Rajalakshmi-Girish
98948ad809 fixes the unit tests to be more tolerant with error messages 2020-12-24 04:47:46 +00:00
Gurleen Grewal
2414bce4c3 Fix golint issues in pkg/kubelet/types 2020-12-23 19:21:31 -08:00
Pingan2017
2f76666ff4 add operator for allocateMemory.available signal 2020-12-24 10:04:09 +08:00
Kubernetes Prow Robot
e20300b1a0 Merge pull request #93285 from aarnaud/windows-devicemanager
Port deviceManager to windows container manager to enable GPU access
2020-12-23 12:00:26 -08:00
Ricardo Pchevuzinske Katz
a1b66f11b5 Change GetBootTime to work specificaly with FreeBSD
Signed-off-by: Ricardo Pchevuzinske Katz <ricardo.katz@gmail.com>
2020-12-22 16:52:47 -03:00
Lorenz Brun
670452cfc5 Fix tests to test for new behavior 2020-12-21 20:20:54 +01:00
Kevin Klues
2fcbd2206d Fix bug in CPUManager with race on map acccess
Signed-off-by: Kevin Klues <kklues@nvidia.com>
2020-12-21 19:11:53 +00:00
Lorenz Brun
89d72977d7 Add warnings after cleanup back 2020-12-21 19:22:18 +01:00
Lorenz Brun
80e9bd30ea Automatically remove orphaned pod's dangling volumes 2020-12-21 19:15:26 +01:00
Chris Henzie
9d8f994d4e Separate test Kubelet and AttachDetach VolumeHost types
fakeVolumeHost previously implemented both the KubeletVolumeHost and
AttachDetachVolumeHost interfaces. This design makes it difficult to test the
CSIAttacher since it behaves differently depending on what type of
VolumeHost is supplied.
2020-12-17 15:17:04 -08:00
Anthony ARNAUD
8bdc3d8970 Port deviceManager in windows container manager 2020-12-16 00:25:26 -05:00
Alex Dudko
088030a4d1 Migrate prober.probe logs to structured logging 2020-12-15 18:12:20 -08:00
Ruofeng Lei
161a801f71 fix typo in dns.go 2020-12-15 16:36:07 +08:00
Kubernetes Prow Robot
4efd56485d Merge pull request #97259 from saad-ali/addLogsFor85330
Add more logging for "Mount cannot be satisfied for container" error
2020-12-14 14:02:18 -08:00
saad-ali
6391c97f99 Add more logging for Mount error
Add additional logging for "Mount cannot be satisfied for container"
error to help debug #85330.
2020-12-14 11:58:09 -08:00
Kubernetes Prow Robot
ac101cbdda Merge pull request #97042 from joelsmith/eviction
Measure/compute used ephemeral storage in stats provider, not eviction manager
2020-12-14 08:47:30 -08:00
jornshen
93606f8ba3 [flaky test] fix devicemanager TestDevicePluginReRegistrationProbeMode fail 2020-12-10 21:07:49 +08:00
Kubernetes Prow Robot
a20aeb8eed Merge pull request #96873 from ruiwen-zhao/accelerator_stats
Add AcceleratorStats to cri_stats_provider
2020-12-09 07:55:02 -08:00
Kubernetes Prow Robot
cef6681602 Merge pull request #97056 from sjenning/status-loglevel
kubelet: reduce no-op status manager msg log level
2020-12-08 22:01:11 -08:00
Kubernetes Prow Robot
a8c4a2453c Merge pull request #96978 from JornShen/flaky_test_TestVersion
[flaky test] fix race in TestVersion in pkg/kubelet/cri/remote
2020-12-08 21:05:42 -08:00
Kubernetes Prow Robot
32188d5076 Merge pull request #96958 from sjenning/ip-msg-loglevel
kubelet: remove periodic messages from log-level 2
2020-12-08 21:05:09 -08:00
Kubernetes Prow Robot
80c1c0fc04 Merge pull request #96946 from houjun41544/20201130
Define const for metric name
2020-12-08 21:04:53 -08:00
Kubernetes Prow Robot
8a65055c2d Merge pull request #96638 from hasheddan/cadvisor-util
Fix link to CRI-O sock path
2020-12-08 18:34:31 -08:00
Kubernetes Prow Robot
125530629a Merge pull request #96572 from sjenning/dont-rerun-init
kubelet: do not rerun init containers if any main containers have status
2020-12-08 17:29:18 -08:00
Kubernetes Prow Robot
e40cba59e3 Merge pull request #95269 from SataQiu/kubelet-20201003
Fix panic when kubelet register if a node object already exists with no Status.Capacity or Status.Allocatable
2020-12-08 16:29:19 -08:00
Kubernetes Prow Robot
1588d58151 Merge pull request #95099 from brianpursley/TestReadLogs
Added unit tests for ReadLogs
2020-12-08 16:29:02 -08:00
Kubernetes Prow Robot
b6e0aac05c Merge pull request #93920 from zhouya0/log_with_limited_tail
[Flaky Test] Add limited lines to log when having tail option
2020-12-08 16:28:45 -08:00
Erik Wilson
a4037d2684 Fix cadvisor machine metrics
Signed-off-by: Ling Samuel <lingsamuelgrace@gmail.com>
2020-12-04 10:08:05 +08:00
Joel Smith
29ff2fe528 Remove now-unused eviction helpers, fix unit test TestCRIListPodStats 2020-12-03 13:04:25 -07:00
Derek Carr
acb43c7c4a Rework hostfs metrics
Ephemeral storage usage should be calculated by the metrics code,
not the eviction code.
2020-12-03 13:04:25 -07:00
Seth Jennings
acae34be79 kubelet: reduce no-op status manager msg log level 2020-12-03 13:06:02 -06:00
Joel Smith
39a11744ce Partially revert "Include pod /etc/hosts in ephemeral storage calculation for eviction"
This reverts (most of) commit f34b586d01.
2020-12-03 04:47:16 -07:00
jornshen
c38637a08b [flaky test] fix race in TestVersion in pkg/kubelet/cri/remote 2020-12-03 14:26:11 +08:00
Seth Jennings
c8d02f703b kubelet: do not rerun init containers if any main containers have status 2020-12-01 14:59:03 -06:00
Kubernetes Prow Robot
61dc69ac2c Merge pull request #87461 from bboreham/fix-uid-gen
kubelet: ensure static pod UIDs are unique
2020-12-01 08:18:50 -08:00
ruiwen-zhao
36b4ccdb7c Add AcceleratorStats to cri_stats_provider 2020-11-30 19:30:02 +00:00
Seth Jennings
ee60ee26e0 kubelet: remove periodic messages from log-level 2 2020-11-30 11:34:00 -06:00
00041544
f2b8fdb265 Define const for metric name 2020-11-30 14:40:26 +08:00
Jay Li
4ea18b6e1e fix privileged config of Pod Sandbox 2020-11-26 12:30:55 +08:00
Jordan Liggitt
5c88880584 Restore beta os/arch labels on initial node registration 2020-11-23 11:23:59 -05:00
fighterhit
0eaceb7eb5 Fix: kubelet return error when device plugin sets PreStartRequired true while creating pods with 0 resource 2020-11-21 22:44:27 +08:00
Kubernetes Prow Robot
ece591f722 Merge pull request #96758 from msau42/revert-84206-refactor/remove-mount-volume-check-orphaned-pod-cleanup
Revert "check volume directories instead of mounts for cleanupOrphanedPodDirs"
2020-11-20 22:37:33 -08:00
Kubernetes Prow Robot
8c7cd8a8cc Merge pull request #96553 from AlexeyPerevalov/FixesKubeletCrashEmptyTopology
Fixes sigfault in case of empty TopologyInfo
2020-11-20 16:03:33 -08:00
Michelle Au
25edb8bc69 Revert "check volume directories instead of mounts for cleanupOrphanedPodDirs" 2020-11-20 09:06:09 -08:00
hasheddan
97c358fe5b Fix link to cadvisor CRI-O sock path
Fixes link to point to CRI-O sock constant defined in cadvisor. We
cannot pin directly because of linux build tags in transitive dependency
opencontaines/runc.

Signed-off-by: hasheddan <georgedanielmangum@gmail.com>
2020-11-17 12:02:27 -06:00
Andrew Sy Kim
a59189e213 kubelet: dockershim should return grpc status with DeadlineExceeded code
Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>
2020-11-16 22:51:59 -05:00
xiongzhongliang
90f4aeeea4 use klog.Info and klog.Warning when had no format 2020-11-14 00:55:06 +08:00
Alexey Perevalov
5e6aed4137 Fixes sigfault in case of empty TopologyInfo
Device plugin which implements v1beta interface can return nil in
Topology field

For example nvidia-gpu-deviceplugin
3520254b75/nvidia.go (L147)
Signed-off-by: Alexey Perevalov <alexey.perevalov@huawei.com>
2020-11-13 11:51:47 +03:00
Kubernetes Prow Robot
0e0cc1ead8 Merge pull request #95839 from egernst/pod-usage
resource-metrics: add pod/sandbox metrics to endpoint
2020-11-12 19:36:23 -08:00
David Porter
16f71c6d47 Implement shutdown manager in kubelet
Implements KEP 2000, Graceful Node Shutdown:
https://github.com/kubernetes/enhancements/tree/master/keps/sig-node/2000-graceful-node-shutdown

* Add new FeatureGate `GracefulNodeShutdown` to control
enabling/disabling the feature
* Add two new KubeletConfiguration options
  * `ShutdownGracePeriod` and `ShutdownGracePeriodCriticalPods`
* Add new package, `nodeshutdown` that implements the Node shutdown
manager
  * The node shutdown manager uses the systemd inhibit package, to
  create an system inhibitor, monitor for node shutdown events, and
  gracefully terminate pods upon a node shutdown.
2020-11-12 21:47:55 +00:00