kubernetes

Author	SHA1	Message	Date
Tim Hockin	f558554ce0	kube-proxy: minor cleanup Get rid of overlapping helper functions.	2021-11-05 12:28:19 -07:00
Antonio Ojea	909925b492	kube-proxy: fix stale detection logic The logic to detect stale endpoints was not assuming the endpoint readiness. We can have stale entries on UDP services for 2 reasons: - an endpoint was receiving traffic and is removed or replaced - a service was receiving traffic but not forwarding it, and starts to forward it. Add an e2e test to cover the regression	2021-11-05 20:14:56 +01:00
Dan Winship	229ae58520	proxy/iptables: fix all-vs-ready endpoints a bit Filter the allEndpoints list into readyEndpoints sooner, and set "hasEndpoints" based (mostly) on readyEndpoints, not allEndpoints (so that, eg, we correctly generate REJECT rules for services with no _functioning_ endpoints, even if they have unusable terminating endpoints). Also, write out the endpoint chains at the top of the loop when we iterate the endpoints for the first time, rather than copying some of the data to another set of variables and then writing them out later. And don't write out endpoint chains that won't be used Also, generate affinity rules only for readyEndpoints rather than allEndpoints, so affinity gets broken correctly when an endpoint becomes unready.	2021-11-04 16:32:08 -04:00
Dan Winship	3679639cf1	proxy/iptables: Remove a no-op check There was code to deal with endpoints that have invalid/empty IP addresses, but EndpointSlice validation already ensures that these can't exist.	2021-11-04 16:32:08 -04:00
Dan Winship	6ab3dc6875	proxy/iptables: Add more stuff to the unit test The external traffic policy terminating endpoints test was testing LoadBalancer functionality against a NodePort service with no nodePorts (or loadBalancer IPs). It managed to test what it wanted to test, but it's kind of dubious (and we probably _shouldn't_ have been generating the rules it was looking for since there was no way to actually reach the XLB chains). So fix that. Also make the terminating endpoints test use session affinity, to add more testing for that. Also, remove the multiple copies of the same identical Service that is used for all of the test cases in that test. Also add a "Cluster traffic policy and no source ranges" test to TestOverallIPTablesRulesWithMultipleServices since we weren't really testing either of those. Also add a test of --masquerade-all.	2021-11-04 16:32:08 -04:00
Dan Winship	22a951c096	proxy/iptables: Fix TestOnlyLocalNodePortsNoClusterCIDR The test got broken to not actually use "no cluster CIDR" when LocalDetector was implemented (and the old version of the unit test didn't check enough to actually notice this).	2021-11-04 16:32:08 -04:00
Dan Winship	799c222c84	proxy/iptables: test that we create a consistent set of iptables rules	2021-11-04 16:32:08 -04:00
Dan Winship	9403bfb178	proxy/iptables: Misc improvements to unit test The original tests here were very shy about looking at the iptables output, and just relied on checks like "make sure there's a jump to table X that also includes string Y somewhere in it" and stuff like that. Whereas the newer tests were just like, "eh, here's a wall of text, make sure the iptables output is exactly that". Although the latter looks messier in the code, it's more precise, and it's easier to update correctly when you change the rules. So just make all of the tests do a check on the full iptables output. (Note that I didn't double-check any of the output; I'm just assuming that the output of the current iptables proxy code is actually correct...) Also, don't hardcode the expected number of rules in the metrics tests, so that there's one less thing to adjust when rules change. Also, use t.Run() in one place to get more precise errors on failure.	2021-11-04 16:32:06 -04:00
Dan Winship	a1a12ca1da	proxy/iptables: Improve the sorting logic in TestOverallIPTablesRulesWithMultipleServices The test was sorting the iptables output so as to not depend on the order that services get processed in, but this meant it wasn't checking the relative ordering of rules (and in fact, the ordering of the rules in the "expected" string was wrong, in a way that would break things if the rules had actually been generated in that order). Add a more complicated sorting function that sorts services alphabetically while preserving the ordering of rules within each service.	2021-11-04 16:31:16 -04:00
Dan Winship	08680192fb	proxy/iptables: Fix sync_proxy_rules_iptables_total metric It was counting the number of lines including the "COMMIT" line at the end, so it was off by one.	2021-11-04 16:30:12 -04:00
Shivanshu Raj Shrivastava	81636f2158	Fixed improperly migrated logs (#105763 ) * fixed improperly migrated logs * small fixes * small fix * Update pkg/proxy/iptables/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/healthcheck/service_health.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/iptables/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/iptables/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/iptables/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/iptables/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/ipvs/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/ipvs/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/ipvs/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * Update pkg/proxy/winkernel/proxier.go Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com> * refactoring * refactoring * refactoring * reverted some files back to master Co-authored-by: Marek Siarkowicz <marek.siarkowicz@protonmail.com>	2021-10-20 03:55:58 -07:00
Ricardo Pchevuzinske Katz	37d11bcdaf	Move node and networking related helpers from pkg/util to component helpers Signed-off-by: Ricardo Katz <rkatz@vmware.com>	2021-09-16 17:00:19 -03:00
Kubernetes Prow Robot	648559b63e	Merge pull request #104742 from khenidak/health-check-port change health-check port to listen to node port addresses	2021-09-13 15:43:52 -07:00
Khaled (Kal) Henidak	acdf50fbed	change proxiers to pass nodePortAddresses	2021-09-13 18:27:07 +00:00
Dan Winship	7f6fbc4482	Drop broken/no-op proxyconfig.EndpointsHandler implementations Because the proxy.Provider interface included proxyconfig.EndpointsHandler, all the backends needed to implement its methods. But iptables, ipvs, and winkernel implemented them as no-ops, and metaproxier had an implementation that wouldn't actually work (because it couldn't handle Services with no active Endpoints). Since Endpoints processing in kube-proxy is deprecated (and can't be re-enabled unless you're using a backend that doesn't support EndpointSlice), remove proxyconfig.EndpointsHandler from the definition of proxy.Provider and drop all the useless implementations.	2021-09-13 09:32:38 -04:00
Antonio Ojea	0cd75e8fec	run hack/update-netparse-cve.sh	2021-08-20 10:42:09 +02:00
Antonio Ojea	a2a22903bc	delete stale UDP conntrack entries for loadbalancer IPs	2021-07-29 17:35:07 +02:00
Swetha Repakula	0a42f7b989	Graduate EndpointSliceProxying and WindowsEndpointSliceProxying Gates	2021-07-07 13:33:30 -07:00
Kubernetes Prow Robot	96dff7d0c7	Merge pull request #102832 from Yuan-Junliang/migrateProxyEventAPI Migrate kube-proxy event to use v1 Event API	2021-07-05 17:44:17 -07:00
Kubernetes Prow Robot	7cd40e1885	Merge pull request #103116 from chenyw1990/reducekubeproxycpu reduce cpu usage of kube-proxy with iptables mode	2021-07-05 15:13:38 -07:00
chenyw1990	1f24a198e7	reduce cpu usage of kube-proxy with iptables mode	2021-07-05 16:08:19 +08:00
Swetha Repakula	03b7a699c2	Kubeproxy uses V1 EndpointSlice	2021-06-30 18:41:57 -07:00
Yuan-Junliang	2e06066bab	Migrate kube-proxy to use v1 Event API	2021-06-13 18:57:52 +08:00
Andrew Sy Kim	ed4fe07375	proxy/iptables: add unit test Test_HealthCheckNodePortWhenTerminating for ensuring health check node port fails when all local endpoints are terminating Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	68ebd16a2c	proxier/iptables: refactor terminating endpoints unit tests with test table and test for feature gate Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	8c514cb232	proxier/iptables: check feature gate ProxyTerminatingEndpoints Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	d82d851d89	proxier/iptables: include Service port in unit tests Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	4c8b190372	proxier/iptables: reuse the same variable for endpointchains for better memory consumption Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	b54c0568d8	proxier/iptables: add unit tests for falling back to terminating endpoints Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:15:40 -04:00
Andrew Sy Kim	732635fd4b	proxier/iptables: fallback to terminating endpoints if there are no ready endpoints Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:15:40 -04:00
刁浩 10284789	580b557592	Log spelling formatting and a redundant conversion Signed-off-by: 刁浩 10284789 <diao.hao@zte.com.cn>	2021-05-27 07:07:22 +00:00
Antonio Ojea	c6d97ee156	kube-proxy copy node labels	2021-04-28 13:26:26 +02:00
Surya Seetharaman	d3fe48e848	Kube-proxy: perf-enhancement: Reduce NAT table KUBE-SERVICES/NODEPORTS chain rules The nat KUBE-SERVICES chain is called from OUTPUT and PREROUTING stages. In clusters with large number of services, the nat-KUBE-SERVICES chain is the largest chain with for eg: 33k rules. This patch aims to move the KubeMarkMasq rules from the kubeServicesChain into the respective KUBE-SVC-* chains. This way during each packet-rule matching we won't have to traverse the MASQ rules of all services which get accumulated in the KUBE-SERVICES and/or KUBE-NODEPORTS chains. Since the jump to KUBE-MARK-MASQ ultimately sets the 0x400 mark for nodeIP SNAT, it should not matter whether the jump is made from KUBE-SERVICES or KUBE-SVC-* chains. Specifically we change: 1) For ClusterIP svc, we move the KUBE-MARK-MASQ jump rule from KUBE-SERVICES chain into KUBE-SVC-* chain. 2) For ExternalIP svc, we move the KUBE-MARK-MASQ jump rule in the case of non-ServiceExternalTrafficPolicyTypeLocal from KUBE-SERVICES chain into KUBE-SVC-* chain. 3) For NodePorts svc, we move the KUBE-MARK-MASQ jump rule in case of non-ServiceExternalTrafficPolicyTypeLocal from KUBE-NODEPORTS chain to KUBE-SVC-* chain. 4) For load-balancer svc, we don't change anything since it is already svc specific due to creation of KUBE-FW-* chains per svc. This would cut the rules per svc in KUBE-SERVICES and KUBE-NODEPORTS in half.	2021-04-21 16:41:03 +02:00
Surya Seetharaman	667e50abc8	Add TestOverallIPTablesRulesWithMultipleServices	2021-04-21 16:41:03 +02:00
Kubernetes Prow Robot	eda1de301a	Merge pull request #100874 from lojies/proxyiptableslog improve the readability of log	2021-04-10 19:04:37 -07:00
卢振兴10069964	98d4bdb5d7	improve the readability of log	2021-04-07 15:10:05 +08:00
Masashi Honma	d43b8dbf4e	Use simpler expressions for error messages 1. Do not describe port type in message because lp.String() already has the information. 2. Remove duplicate error detail from event log. Previous log is like this. 47s Warning listen tcp4 :30764: socket: too many open files node/127.0.0.1 can't open port "nodePort for default/temp-svc:834" (:30764/tcp4), skipping it: listen tcp4 :30764: socket: too many open files	2021-04-01 09:13:45 +09:00
Masashi Honma	3266136c1d	Fire an event when failing to open NodePort [issue] When creating a NodePort service with the kubectl create command, the NodePort assignment may fail. Failure to assign a NodePort can be simulated with the following malicious command[1]. $ kubectl create service nodeport temp-svc --tcp=`python3 <<EOF print("1", end="") for i in range(2, 1026): print("," + str(i), end="") EOF ` The command succeeds and shows following output. service/temp-svc created The service has been successfully generated and can also be referenced with the get command. $ kubectl get svc NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) temp-svc NodePort 10.0.0.139 <none> 1:31335/TCP,2:32367/TCP,3:30263/TCP,(omitted),1023:31821/TCP,1024:32475/TCP,1025:30311/TCP 12s The user does not recognize failure to assign a NodePort because create/get/describe command does not show any error. This is the issue. [solution] Users can notice errors by looking at the kube-proxy logs, but it may be difficult to see the kube-proxy logs of all nodes. E0327 08:50:10.216571 660960 proxier.go:1286] "can't open port, skipping this nodePort" err="listen tcp4 :30641: socket: too many open files" port="\"nodePort for default/temp-svc:744\" (:30641/tcp4)" E0327 08:50:10.216611 660960 proxier.go:1286] "can't open port, skipping this nodePort" err="listen tcp4 :30827: socket: too many open files" port="\"nodePort for default/temp-svc:857\" (:30827/tcp4)" ... E0327 08:50:10.217119 660960 proxier.go:1286] "can't open port, skipping this nodePort" err="listen tcp4 :32484: socket: too many open files" port="\"nodePort for default/temp-svc:805\" (:32484/tcp4)" E0327 08:50:10.217293 660960 proxier.go:1612] "Failed to execute iptables-restore" err="pipe2: too many open files ()" I0327 08:50:10.217341 660960 proxier.go:1615] "Closing local ports after iptables-restore failure" So, this patch will fire an event when NodePort assignment fails. In fact, when the externalIP assignment fails, it is also notified by event. The event will be displayed like this. $ kubectl get event LAST SEEN TYPE REASON OBJECT MESSAGE ... 2s Warning listen tcp4 :31055: socket: too many open files node/127.0.0.1 can't open "nodePort for default/temp-svc:901" (:31055/tcp4), skipping this nodePort: listen tcp4 :31055: socket: too many open files 2s Warning listen tcp4 :31422: socket: too many open files node/127.0.0.1 can't open "nodePort for default/temp-svc:474" (:31422/tcp4), skipping this nodePort: listen tcp4 :31422: socket: too many open files ... This PR fixes iptables and ipvs proxier. Since userspace proxier does not seem to be affected by this issue, it is not fixed. [1] Assume that fd limit is 1024(default). $ ulimit -n 1024	2021-04-01 08:27:51 +09:00
Rob Scott	f07be06a19	Adding support for TopologyAwareHints to kube-proxy	2021-03-08 15:37:47 -08:00
Fangyuan Li	0621e90d31	Rename fields and methods for BaseServiceInfo Fields: 1. rename onlyNodeLocalEndpoints to nodeLocalExternal; 2. rename onlyNodeLocalEndpointsForInternal to nodeLocalInternal; Methods: 1. rename OnlyNodeLocalEndpoints to NodeLocalExternal; 2. rename OnlyNodeLocalEndpointsForInternal to NodeLocalInternal;	2021-03-07 16:52:59 -08:00
Fangyuan Li	7ed2f1d94d	Implements Service Internal Traffic Policy 1. Add API definitions; 2. Add feature gate and drops the field when feature gate is not on; 3. Set default values for the field; 4. Add API Validation 5. add kube-proxy iptables and ipvs implementations 6. add tests	2021-03-07 16:52:59 -08:00
Antonio Ojea	654be57022	kube-proxy iptables expose number of rules metrics add a new metric to kube-proxy iptables, so it exposes the number of rules programmed in each iteration.	2021-03-05 10:00:38 +01:00
Benjamin Elder	56e092e382	hack/update-bazel.sh	2021-02-28 15:17:29 -08:00
Kubernetes Prow Robot	6dc317a107	Merge pull request #98130 from JornShen/optimze_redundant_listenPortOpener migrate to use k8s.io/util/net/port in kube-proxy	2021-02-18 10:02:51 -08:00
jornshen	dbe89a5683	migrate kube canary chain as const	2021-02-15 16:50:48 +08:00
jornshen	e68e105102	migrate to use k8s.io/util LocalPort and ListenPortOpener in iptables.proxier	2021-02-15 16:36:06 +08:00
Antonio Ojea	ed21a0e16c	kube-proxy: clear conntrack entries after rules are in place Clear conntrack entries for UDP NodePorts, this has to be done AFTER the iptables rules are programmed. It can happen that traffic to the NodePort hits the host before the iptables rules are programmed this will create an stale entry in conntrack that will blackhole the traffic, so we need to clear it ONLY when the service has endpoints.	2021-02-10 16:22:03 +01:00
Kubernetes Prow Robot	c1b3797f4b	Merge pull request #97824 from hanlins/fix/97225/hc-rules Explicitly add iptables rule to allow healthcheck nodeport	2021-02-04 15:54:52 -08:00
Hanlin Shi	4cd1eacbc1	Add rule to allow healthcheck nodeport traffic in filter table 1. For iptables mode, add KUBE-NODEPORTS chain in filter table. Add rules to allow healthcheck node port traffic. 2. For ipvs mode, add KUBE-NODE-PORT chain in filter table. Add KUBE-HEALTH-CHECK-NODE-PORT ipset to allow traffic to healthcheck node port.	2021-02-03 15:20:10 +00:00
Kubernetes Prow Robot	e89e7b4ed1	Merge pull request #98083 from JornShen/optimize_proxier_duplicate_localaddrset optimize proxier duplicate localaddrset	2021-01-29 01:21:40 -08:00
jornshen	3f506cadb0	optimize proxier duplicate localaddrset	2021-01-29 10:52:01 +08:00
jornshen	3783821553	move the redundant writeline writeBytesLine to proxy/util/util.go	2021-01-21 10:51:39 +08:00
Kubernetes Prow Robot	eb08f36c7d	Merge pull request #96371 from andrewsykim/kube-proxy-terminating kube-proxy: track serving/terminating conditions in endpoints cache	2021-01-11 18:38:25 -08:00
Andrew Sy Kim	9c096292cc	kube-proxy: iptables proxy should ignore endpoints with condition ready=false Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-01-11 16:27:38 -05:00
Andrew Sy Kim	55cb453a3c	kube-proxy: update internal endpoints map with 'serving' and 'terminating' condition from EndpointSlice Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-01-11 16:17:58 -05:00
jornshen	5af5a2ac7d	migrate proxy.UpdateServiceMap to be a method of ServiceMap	2021-01-11 11:07:30 +08:00
jornshen	07990e44bf	migrate proxy/iptables/proxier.go logs to structured logging	2021-01-07 10:48:01 +08:00
Kubernetes Prow Robot	d2662b9842	Merge pull request #96488 from basantsa1989/kproxy_cleanup Kube-proxy cleanup: Changing FilterIncorrectIP/CIDR functions to MapIPsToIPFamily that returns a map	2020-12-08 17:28:52 -08:00
Antonio Ojea	120472032c	kube-proxy: treat ExternalIPs as ClusterIP Currently kube-proxy treat ExternalIPs differently depending on: - the traffic origin - if the ExternalIP is present or not in the system. It also depends on the CNI implementation to discriminate between local and non-local traffic. Since the ExternalIP belongs to a Service, we can avoid the roundtrip of sending outside the traffic originated in the cluster. Also, we leverage the new LocalTrafficDetector to detect the local traffic and not rely on the CNI implementations for this.	2020-11-22 00:54:33 +01:00
Basant Amarkhed	707073d2f9	Fixup #1 addressing review comments	2020-11-17 07:13:51 +00:00
Basant Amarkhed	8fb895f3f1	Updating after merging with a conflicting commit	2020-11-14 01:09:46 +00:00
Patrik Cyvoct	d29665cc17	Revert "Merge pull request #92312 from Sh4d1/kep_1860" This reverts commit `ef16faf409`, reversing changes made to `2343b8a68b`.	2020-11-11 10:26:53 +01:00
Patrik Cyvoct	20fc86df25	fix defaulting Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:59 +01:00
Patrik Cyvoct	0768b45e7b	add nil case in proxy Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:58 +01:00
Patrik Cyvoct	11b97e9ef8	fix tests Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:55 +01:00
Patrik Cyvoct	540901779c	fix reviews Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:53 +01:00
Patrik Cyvoct	af7494e896	Update generated Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:52 +01:00
Patrik Cyvoct	0153b96ab8	fix review Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:27 +01:00
Patrik Cyvoct	d562b6924a	Add tests Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 09:59:59 +01:00
Patrik Cyvoct	47ae7cbf52	Add route type field to loadbalancer status ingress Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 09:59:58 +01:00
Kubernetes Prow Robot	48a2bca893	Merge pull request #96251 from ravens/nodeport_udp_conntrack_fix Correctly fix clearing conntrack entry on endpoint changes (nodeport)	2020-11-06 14:25:37 -08:00
Kubernetes Prow Robot	0451848d64	Merge pull request #95787 from qingsenLi/k8s201022-format format incorrectAddresses in klog	2020-11-05 11:50:33 -08:00
Yan Grunenberger	fdee7b2faa	Correctly fix clearing conntrack entry on endpoint changes (nodeport) A previous PR (#71573) intended to clear conntrack entry on endpoint changes when using nodeport by introducing a dedicated function to remove the stale conntrack entry on the node port and allow traffic to resume. By doing so, it has introduced a nodeport specific bug where the conntrack entries related to the ClusterIP does not get clean if endpoint is changed (issue #96174). We fix by doing ClusterIP cleanup in all cases.	2020-11-05 09:45:17 +01:00
Khaled Henidak (Kal)	6675eba3ef	dual stack services (#91824 ) * api: structure change * api: defaulting, conversion, and validation * [FIX] validation: auto remove second ip/family when service changes to SingleStack * [FIX] api: defaulting, conversion, and validation * api-server: clusterIPs alloc, printers, storage and strategy * [FIX] clusterIPs default on read * alloc: auto remove second ip/family when service changes to SingleStack * api-server: repair loop handling for clusterIPs * api-server: force kubernetes default service into single stack * api-server: tie dualstack feature flag with endpoint feature flag * controller-manager: feature flag, endpoint, and endpointSlice controllers handling multi family service * [FIX] controller-manager: feature flag, endpoint, and endpointSlicecontrollers handling multi family service * kube-proxy: feature-flag, utils, proxier, and meta proxier * [FIX] kubeproxy: call both proxier at the same time * kubenet: remove forced pod IP sorting * kubectl: modify describe to include ClusterIPs, IPFamilies, and IPFamilyPolicy * e2e: fix tests that depends on IPFamily field AND add dual stack tests * e2e: fix expected error message for ClusterIP immutability * add integration tests for dualstack the third phase of dual stack is a very complex change in the API, basically it introduces Dual Stack services. Main changes are: - It pluralizes the Service IPFamily field to IPFamilies, and removes the singular field. - It introduces a new field IPFamilyPolicyType that can take 3 values to express the "dual-stack(mad)ness" of the cluster: SingleStack, PreferDualStack and RequireDualStack - It pluralizes ClusterIP to ClusterIPs. The goal is to add coverage to the services API operations, taking into account the 6 different modes a cluster can have: - single stack: IP4 or IPv6 (as of today) - dual stack: IPv4 only, IPv6 only, IPv4 - IPv6, IPv6 - IPv4 * [FIX] add integration tests for dualstack * generated data * generated files Co-authored-by: Antonio Ojea <aojea@redhat.com>	2020-10-26 13:15:59 -07:00
Kubernetes Prow Robot	766ae2b81b	Merge pull request #95252 from tssurya/shrink-input-chain Kube-proxy: Perf-fix: Shrink INPUT chain	2020-10-22 22:16:02 -07:00
qingsenLi	9ad39c9eda	format incorrectAddresses in klog	2020-10-22 17:26:29 +08:00
Surya Seetharaman	477b14b3c4	Kube-proxy: Perf-fix: Shrink INPUT chain In #56164, we had split the reject rules for non-ep existing services into KUBE-EXTERNAL-SERVICES chain in order to avoid calling KUBE-SERVICES from INPUT. However in #74394 KUBE-SERVICES was re-added into INPUT. As noted in #56164, kernel is sensitive to the size of INPUT chain. This patch refrains from calling the KUBE-SERVICES chain from INPUT and FORWARD, instead adds the lb reject rule to the KUBE-EXTERNAL-SERVICES chain which will be called from INPUT and FORWARD.	2020-10-19 11:26:04 +02:00
Antonio Ojea	880baa9f6f	kube-proxy: log stale services operations	2020-10-19 09:35:34 +02:00
Lion-Wei	1f7ea16560	kube-proxy ensure KUBE-MARK-DROP exist but not modify their rules	2020-10-16 14:52:07 +08:00
Amim Knabben	a18e5de51a	LockToDefault the ExternalPolicyForExternalIP feature gate	2020-09-16 13:16:33 -04:00
Rob Scott	c382c79f60	Updating kube-proxy to trim space from loadBalancerSourceRanges Before this fix, a Service with a loadBalancerSourceRange value that included a space would cause kube-proxy to crashloop. This updates kube-proxy to trim any space from that field.	2020-08-20 18:19:52 -07:00
Vinod K L Swamy	4505d5b182	Changes to Proxy common code	2020-06-29 14:29:46 -07:00
Kubernetes Prow Robot	73fa63a86d	Merge pull request #92035 from danwinship/unmark-before-masq kubelet, kube-proxy: unmark packets before masquerading them	2020-06-16 00:50:03 -07:00
Dan Winship	c12534d8b4	kubelet, kube-proxy: unmark packets before masquerading them It seems that if you set the packet mark on a packet and then route that packet through a kernel VXLAN interface, the VXLAN-encapsulated packet will still have the mark from the original packet. Since our NAT rules are based on the packet mark, this was causing us to double-NAT some packets, which then triggered a kernel checksumming bug. But even without the checksum bug, there are reasons to avoid double-NATting, so fix the rules to unmark the packets before masquerading them.	2020-06-15 18:45:38 -04:00
Kubernetes Prow Robot	35fc65dc2c	Merge pull request #89998 from Nordix/issue-89923 Filter nodePortAddresses to proxiers	2020-06-13 09:39:55 -07:00
Kubernetes Prow Robot	8f5e8514b3	Merge pull request #90103 from SataQiu/refactor-proxy-20200413 kube-proxy: move GetNodeAddresses call out of internal loop to avoid repeated computation	2020-06-02 19:44:17 -07:00
SataQiu	b68312e688	kube-proxy: move GetNodeAddresses call out of internal loop to avoid repeated computation Signed-off-by: SataQiu <1527062125@qq.com>	2020-05-26 15:32:05 +08:00
Davanum Srinivas	07d88617e5	Run hack/update-vendor.sh Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:33 -04:00
Davanum Srinivas	442a69c3bd	switch over k/k to use klog v2 Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2020-05-16 07:54:27 -04:00
Lars Ekman	f54b8f98b9	Filter nodePortAddresses to the proxiers. Log a warning for addresses of wrong family.	2020-05-15 09:54:33 +02:00
Casey Callendrello	042daa24ac	proxy: followup to last-queued-change metric Fixes two small issues with the metric added in #90175: 1. Bump the timestamp on initial informer sync. Otherwise it remains 0 if restarting kube-proxy in a quiescent cluster, which isn't quite right. 2. Bump the timestamp even if no healthz server is specified.	2020-05-11 18:48:47 +02:00
Casey Callendrello	2e1a884bf3	pkg/proxy: add last-queued-timestamp metric This adds a metric, kubeproxy_sync_proxy_rules_last_queued_timestamp, that captures the last time a change was queued to be applied to the proxy. This matches the healthz logic, which fails if a pending change is stale. This allows us to write alerts that mirror healthz. Signed-off-by: Casey Callendrello <cdc@redhat.com>	2020-04-21 15:19:32 +02:00
Tim Hockin	efb24d44c6	Rename iptables IsIpv6 to IsIPv6	2020-04-10 15:29:50 -07:00
Tim Hockin	ef934a2c5e	Add Protocol() method to iptables Enables simpler printing of which IP family the iptables interface is managing.	2020-04-10 15:29:49 -07:00
Tim Hockin	b874f7c626	Encapsulate sysctl test and log	2020-04-10 15:29:49 -07:00
Tim Hockin	341022f8d1	kube-proxy: log service and endpoint updates	2020-04-10 15:29:44 -07:00
Tim Hockin	37da906db2	kube-proxy: more logging at startup	2020-04-10 15:17:46 -07:00
Kubernetes Prow Robot	4a63d95916	Merge pull request #89792 from andrewsykim/remove-redundant-len-check proxy: remove redundant length check on local address sets	2020-04-10 00:31:47 -07:00
Andrew Sy Kim	5169ef5fb5	proxy: remove redundant length check on local address set Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2020-04-02 16:06:51 -04:00
Kubernetes Prow Robot	bbe5594409	Merge pull request #89296 from danwinship/random-emptily Don't log whether we're using iptables --random-fully	2020-04-02 12:42:24 -07:00
Kubernetes Prow Robot	c2ae0bd763	Merge pull request #74073 from Nordix/issue-70020 Issue #70020; Flush Conntrack entities for SCTP	2020-04-01 22:14:24 -07:00
Dan Winship	8edd656238	Don't log whether we're using iptables --random-fully	2020-03-20 08:06:27 -04:00
Kubernetes Prow Robot	1b3c94b034	Merge pull request #89146 from SataQiu/fix-kube-proxy-20200316 comment cleanup for kube-proxy	2020-03-18 22:25:05 -07:00
SataQiu	64a496e645	kube-proxy: some code cleanup	2020-03-17 21:46:54 +08:00
Minhan Xia	068963fc06	add testing	2020-03-13 14:59:40 -07:00
Minhan Xia	efc4b12186	add ExternalTrafficPolicy support for External IPs in iptables kubeproxy	2020-03-13 14:59:39 -07:00
Lars Ekman	aa8521df66	Issue #70020 ; Flush Conntrack entities for SCTP Signed-off-by: Lars Ekman <lars.g.ekman@est.tech>	2020-03-11 09:56:54 +01:00
Satyadeep Musuvathy	8c6956e5bb	Refactor handling of local traffic detection.	2020-02-21 17:57:34 -08:00
Kubernetes Prow Robot	ad68c4a8b5	Merge pull request #87699 from michaelbeaumont/fix_66766 kube-proxy: Only open ipv4 sockets for ipv4 clusters	2020-02-13 23:54:18 -08:00
Andrew Sy Kim	1653476e3f	proxier: use IPSet from k8s.io/utils/net to store local addresses This allows the proxier to cache local addresses instead of fetching all local addresses every time in IsLocalIP. Signed-off-by: Andrew Sy Kim <kiman@vmware.com>	2020-02-11 16:44:34 -05:00
Andrew Sy Kim	313c3b81e3	iptables proxier: get local addresses only once per sync loop This avoids fetching all local network interfaces everytime we sync an external IP. For clusters with many external IPs this gets really expensive. This change caches all local addresses once per sync. Signed-off-by: Andrew Sy Kim <kiman@vmware.com>	2020-02-11 16:35:49 -05:00
Michael Beaumont	3eea0d1405	kube-proxy: Only open ipv4 sockets for ipv4 clusters	2020-01-30 18:54:16 +01:00
Rob Scott	47b2593d59	Creating new EndpointSliceProxying feature gate for kube-proxy This creates a new EndpointSliceProxying feature gate to cover EndpointSlice consumption (kube-proxy) and allow the existing EndpointSlice feature gate to focus on EndpointSlice production only. Along with that addition, this enables the EndpointSlice feature gate by default, now only affecting the controller. The rationale here is that it's really difficult to guarantee all EndpointSlices are created in a cluster upgrade process before kube-proxy attempts to consume them. Although masters are generally upgraded before nodes, and in most cases, the controller would have enough time to create EndpointSlices before a new node with kube-proxy spun up, there are plenty of edge cases where that might not be the case. The primary limitation on EndpointSlice creation is the API rate limit of 20QPS. In clusters with a lot of endpoints and/or with a lot of other API requests, it could be difficult to create all the EndpointSlices before a new node with kube-proxy targeting EndpointSlices spun up. Separating this into 2 feature gates allows for a more gradual rollout with the EndpointSlice controller being enabled by default in 1.18, and EndpointSlices for kube-proxy being enabled by default in the next release.	2020-01-17 16:17:40 -08:00
danielqsj	a8f2feaeb5	remove deprecated metrics of proxy	2020-01-10 17:05:38 +08:00
Kubernetes Prow Robot	5373fa3f59	Merge pull request #82462 from vllry/dualstack-iptables Dualstack support for kube-proxy iptables mode	2020-01-07 04:38:20 -08:00
Kubernetes Prow Robot	30090d0809	Merge pull request #86665 from SataQiu/clean-proxy-20191227 kube-proxy: add some interface type assertions	2020-01-02 22:25:40 -08:00
SataQiu	134c545b96	proxy: add some interface type assertions	2019-12-27 18:30:25 +08:00
libnux	f0e01bcfde	Change log level to 3 when --random-fully is not supported	2019-12-24 17:47:27 +08:00
Mark Janssen	a54e5cec54	Fix staticcheck failures for pkg/proxy/... Errors from staticcheck: pkg/proxy/healthcheck/proxier_health.go:55:2: field port is unused (U1000) pkg/proxy/healthcheck/proxier_health.go:162:20: printf-style function with dynamic format string and no further arguments should use print-style function instead (SA1006) pkg/proxy/healthcheck/service_health.go:166:20: printf-style function with dynamic format string and no further arguments should use print-style function instead (SA1006) pkg/proxy/iptables/proxier.go:737:2: this value of args is never used (SA4006) pkg/proxy/iptables/proxier.go:737:15: this result of append is never used, except maybe in other appends (SA4010) pkg/proxy/iptables/proxier.go:1287:28: this result of append is never used, except maybe in other appends (SA4010) pkg/proxy/userspace/proxysocket.go:293:3: this value of n is never used (SA4006) pkg/proxy/winkernel/metrics.go:74:6: func sinceInMicroseconds is unused (U1000) pkg/proxy/winkernel/metrics.go:79:6: func sinceInSeconds is unused (U1000) pkg/proxy/winuserspace/proxier.go:94:2: field portMapMutex is unused (U1000) pkg/proxy/winuserspace/proxier.go:118:2: field owner is unused (U1000) pkg/proxy/winuserspace/proxier.go:119:2: field socket is unused (U1000) pkg/proxy/winuserspace/proxysocket.go:620:4: this value of n is never used (SA4006)	2019-12-22 21:32:06 +01:00
SataQiu	2497a1209b	bump k8s.io/utils version	2019-12-21 14:54:44 +08:00
Vallery Lancey	23957a6b28	Allow kube-proxy iptables mode to support dual-stack, with the meta-proxier.	2019-12-16 22:50:25 -08:00
Kubernetes Prow Robot	459b1d76bf	Merge pull request #85527 from aojea/fix#85414 Revert "kube-proxy: check KUBE-MARK-DROP"	2019-11-23 13:19:49 -08:00
Antonio Ojea	98be7831e4	Revert "kube-proxy: check KUBE-MARK-DROP" This reverts commit `1ca0ffeaf2`. kube-proxy is not recreating the rules associated to the KUBE-MARK-DROP chain, that is created by the kubelet. Is preferrable avoid the dependency between the kubelet and kube-proxy and that each of them handle their own rules.	2019-11-22 06:37:42 +01:00
Andrew Sy Kim	884582d892	proxier: improve node topology event handler logic Signed-off-by: Andrew Sy Kim <kiman@vmware.com>	2019-11-15 08:53:56 -05:00
Roc Chan	80c6524cd0	kube-proxy: sync rules when current node labels change detected	2019-11-15 13:36:43 +08:00
Roc Chan	c9cf3f5b72	Service Topology implementation * Implement Service Topology for ipvs and iptables proxier * Add test files * API validation	2019-11-15 13:36:43 +08:00
Rob Scott	a7e589a8c6	Promoting EndpointSlices to beta	2019-11-13 14:20:19 -08:00
Rob Scott	0fa9981e01	Splitting IP address type into IPv4 and IPv6 for EndpointSlices	2019-11-12 09:03:53 -08:00
Kubernetes Prow Robot	2b3540068b	Merge pull request #84422 from aojea/kubemarkdrop kube-proxy: ensure KUBE-MARK-DROP exists	2019-11-03 13:41:39 -08:00
Kubernetes Prow Robot	1da7210180	Merge pull request #84440 from lsytj0413/fix-gosimple refactor(*): fix comparison to bool constant, return redundant	2019-11-01 18:08:10 -07:00
Kubernetes Prow Robot	85575e929b	Merge pull request #83387 from danwinship/proxy-error-retry If an iptables proxier sync fails, retry after iptablesSyncPeriod	2019-10-31 21:53:23 -07:00
Dan Winship	2fd42dee95	If an iptables proxier sync fails, retry after iptablesSyncPeriod	2019-10-29 07:36:00 -04:00
lsytj0413	948a578179	refactor(*): fix comparison to bool constant, return redundant	2019-10-28 16:41:08 +08:00
Antonio Ojea	1ca0ffeaf2	kube-proxy: check KUBE-MARK-DROP	2019-10-27 18:46:51 +01:00
zouyee	a3e0ac2951	set config.BindAddress to IPv4 address "127.0.0.1" if not specified Signed-off-by: Zou Nengren <zouyee1989@gmail.com>	2019-10-25 21:46:41 +08:00
Kubernetes Prow Robot	af6f302e46	Merge pull request #83498 from danwinship/proxy-health Fix kube-proxy healthz server for proxier sync loop changes	2019-10-15 23:04:58 -07:00
Rob Scott	3924364585	Making iptables probability more granular in kube-proxy. Until now, iptables probabilities had 5 decimal places of granularity. That meant that probabilities would start to repeat once a Service had 319 or more endpoints. This doubles the granularity to 10 decimal places, ensuring that probabilities will not repeat until a Service reaches 100,223 endpoints.	2019-10-07 17:37:33 -07:00
Dan Winship	f83474916e	Fix kube-proxy healthz server for proxier sync loop changes The proxy healthz server assumed that kube-proxy would regularly call UpdateTimestamp() even when nothing changed, but that's no longer true. Fix it to only report unhealthiness when updates have been received from the apiserver but not promptly pushed out to iptables/ipvs.	2019-10-04 13:37:09 -04:00
Dan Winship	0f10102c16	Better distinguish the two kinds of proxy health check servers Kube-proxy runs two different health servers; one for monitoring the health of kube-proxy itself, and one for monitoring the health of specific services. Rename them to "ProxierHealthServer" and "ServiceHealthServer" to make this clearer, and do a bit of API cleanup too.	2019-10-04 10:37:58 -04:00
Rob Scott	af56f25797	Only detecting stale connections for UDP ports in kube-proxy. The detectStaleConnections function in kube-proxy is very expensive in terms of CPU utilization. The results of this function are only actually used for UDP ports. This adds a protocol attribute to ServicePortName to make it simple to only run this function for UDP connections. For clusters with primarily TCP connections this can improve kube-proxy performance by 2x.	2019-09-25 17:48:54 -07:00
Dan Winship	3948f16ff4	Add iptables.Monitor, use it from kubelet and kube-proxy Kubelet and kube-proxy both had loops to ensure that their iptables rules didn't get deleted, by repeatedly recreating them. But on systems with lots of iptables rules (ie, thousands of services), this can be very slow (and thus might end up holding the iptables lock for several seconds, blocking other operations, etc). The specific threat that they need to worry about is firewall-management commands that flush all dynamic iptables rules. So add a new iptables.Monitor() function that handles this by creating iptables-flush canaries and only triggering a full rule reload after noticing that someone has deleted those chains.	2019-09-17 10:19:26 -04:00
Kubernetes Prow Robot	61ecdba9ca	Merge pull request #82289 from robscott/endpointslice-fixes Fixing bugs related to Endpoint Slices	2019-09-05 09:03:10 -07:00
Rob Scott	8f9483d827	Fixing bugs related to Endpoint Slices This should fix a bug that could break masters when the EndpointSlice feature gate was enabled. This was all tied to how the apiserver creates and manages it's own services and endpoints (or in this case endpoint slices). Consumers of endpoint slices also need to know about the corresponding service. Previously we were trying to set an owner reference here for this purpose, but that came with potential downsides and increased complexity. This commit changes behavior of the apiserver endpointslice integration to set the service name label instead of owner references, and simplifies consumer logic to reference that (both are set by the EndpointSlice controller). Additionally, this should fix a bug with the EndpointSlice GenerateName value that had previously been set with a "." as a suffix.	2019-09-04 09:09:32 -07:00
Mike Spreitzer	d86d1defa1	Made IPVS and iptables modes of kube-proxy fully randomize masquerading if possible Work around Linux kernel bug that sometimes causes multiple flows to get mapped to the same IP:PORT and consequently some suffer packet drops. Also made the same update in kubelet. Also added cross-pointers between the two bodies of code, in comments. Some day we should eliminate the duplicate code. But today is not that day.	2019-09-01 22:07:30 -04:00
Kubernetes Prow Robot	4495d09282	Merge pull request #81430 from robscott/endpointslice-proxy Adding EndpointSlice support for kube-proxy ipvs and iptables proxiers	2019-08-29 15:36:44 -07:00
Rob Scott	9665c590c7	Adding EndpointSlice support for kube-proxy ipvs and iptables proxiers	2019-08-29 01:06:52 -07:00
Kubernetes Prow Robot	454e8e6e92	Merge pull request #80514 from liuxu623/master don't delete KUBE-MARK-MASQ chain in iptables/ipvs proxier	2019-08-28 23:49:56 -07:00
Kubernetes Prow Robot	0a486d97ed	Merge pull request #81415 from oxddr/asdf kube-proxy: improve logging around network programming latency SLI.	2019-08-23 15:48:39 -07:00
Janek Łukaszewicz	c33be173bf	kube-proxy: improve logging around network programming latency SLI.	2019-08-23 15:48:25 +02:00
Kubernetes Prow Robot	9c736445f5	Merge pull request #79846 from aramase/fix-golint-pkg/proxy Fix golint failures in pkg/proxy	2019-08-23 00:51:17 -07:00
Kubernetes Prow Robot	37651f1cef	Merge pull request #80368 from danwinship/iptables-checks iptables feature detection improvements	2019-08-22 13:31:20 -07:00
liuxu	c90b295ef1	don't delete KUBE-MARK-MASQ chain in iptables/ipvs proxier	2019-08-20 15:43:54 +08:00
Tim Hockin	5b14394f4e	Don't track syncProxyRules runtime if not running	2019-08-16 17:05:03 -07:00
hui luo	a2ef00c1b1	Add iptables restore failure metrics As mentioned in issue #80061, in iptables lock contention case, we can see increasing rate of iptables restore failures because it need to grab iptables file lock. The failure metric can provide administrators more insight Metrics will be collected in kube-proxy iptables and ipvs modes Signed-off-by: Hui Luo <luoh@vmware.com>	2019-08-09 10:18:19 -07:00
Dan Winship	a735c97356	kube-proxy: drop iptables version check Kube-proxy's iptables mode used to care whether utiliptables's EnsureRule was able to use "iptables -C" or if it had to implement it hackily using "iptables-save". But that became irrelevant when kube-proxy was reimplemented using "iptables-restore", and no one ever noticed. So remove that check.	2019-08-01 12:05:31 -04:00
Anish Ramasekar	2878270f5b	Fix golint failures in pkg/proxy Review feedback - remove alias from imports fix comments	2019-07-08 11:48:33 -07:00
Kubernetes Prow Robot	da0f51ffed	Merge pull request #78820 from haosdent/fix_typos Fix typos.	2019-07-01 15:09:20 -07:00
Andrew Sy Kim	ba19451020	iptables proxier: fix comments for LB IP traffic from local address Signed-off-by: Andrew Sy Kim <kiman@vmware.com>	2019-06-28 16:42:01 -04:00
Kubernetes Prow Robot	0c9964fac3	Merge pull request #76160 from JacobTanenbaum/BaseServiceInfo-cleanup enforce the interface relationship between ServicePort and BaseServiceInfo	2019-06-13 20:37:13 -07:00
Haosdent Huang	7ce6e71891	Fix typos.	2019-06-11 01:52:14 +08:00
Jacob Tanenbaum	c0392d72e9	enforce the interface relationship between ServicePort and BaseServiceInfo Currently the BaseServiceInfo struct implements the ServicePort interface, but only uses that interface sometimes. All the elements of BaseServiceInfo are exported and sometimes the interface is used to access them and othertimes not I extended the ServicePort interface so that all relevent values can be accessed through it and unexported all the elements of BaseServiceInfo	2019-06-05 14:50:24 -04:00
Kubernetes Prow Robot	bdf3d248eb	Merge pull request #77523 from andrewsykim/fix-xlb-from-local iptables proxier: route local traffic to LB IPs to service chain	2019-05-31 12:22:53 -07:00
Kubernetes Prow Robot	929adb69e3	Merge pull request #76165 from JacobTanenbaum/minor-cleanups Minor cleanups in pkg/proxy/endpoints.go	2019-05-15 22:55:55 -07:00
Kubernetes Prow Robot	746404f82a	Merge pull request #77560 from dcbw/proxy-sig-network-owners pkg/proxy: add sig-network-approvers/sig-network-reviewers to OWNERS files	2019-05-15 03:08:33 -07:00
Kubernetes Prow Robot	74743793f2	Merge pull request #74027 from squeed/kube-proxy-metrics proxy: add some useful metrics	2019-05-15 03:08:19 -07:00
Dan Williams	91716989b6	pkg/proxy: add sig-network-approvers/sig-network-reviewers to OWNERS files This PR also adds m1093782566 (Jun Du) to sig-network-reviewers in recognition of his contributions to the proxy.	2019-05-13 10:30:29 -05:00
Brad Hoekstra	62e58a66aa	Fix some lint errors in pkg/proxy	2019-05-09 16:48:29 -04:00
Andrew Sy Kim	8dfd4def99	add unit tests for -src-type=LOCAL from LB chain Signed-off-by: Andrew Sy Kim <kiman@vmware.com>	2019-05-07 15:22:46 -04:00
Andrew Sy Kim	b926fb9d2b	iptables proxier: route local traffic to LB IPs to service chain Signed-off-by: Andrew Sy Kim <kiman@vmware.com>	2019-05-07 15:22:46 -04:00
Jacob Tanenbaum	9d4693a70f	changing UpdateEndpointsMap to Update changing UpdateEndpointsMap to be a function of the EndpointsMap object	2019-05-07 14:41:15 -04:00
Casey Callendrello	017f57a6b0	proxy: add some useful metrics This adds some useful metrics around pending changes and last successful sync time. The goal is for administrators to be able to alert on proxies that, for whatever reason, are quite stale. Signed-off-by: Casey Callendrello <cdc@redhat.com>	2019-05-07 14:21:13 +02:00
Krzysztof Siedlecki	941629d37a	Revert "Add better logging when iptables-restore fails"	2019-05-07 13:37:29 +02:00
JieJhih Jhang	176d49300d	combine two logics avoid for range the same thing	2019-05-01 18:35:52 +08:00
Kubernetes Prow Robot	a143d07b27	Merge pull request #76254 from JieJhih/fix/word Fix spell error	2019-04-26 14:26:34 -07:00
Kubernetes Prow Robot	fa833a1e33	Merge pull request #74840 from anfernee/connreset kube-proxy: Drop packets in INVALID state	2019-04-26 14:26:22 -07:00
Davanum Srinivas	7b8c9acc09	remove unused code Change-Id: If821920ec8872e326b7d85437ad8d2620807799d	2019-04-19 08:36:31 -04:00
WanLinghao	d0138ca3fe	This commit does two things in pkg package: 1. Remove unused ptr functions. 2. Replace ptr functions with k8s.io/utils/pointer	2019-04-09 10:56:35 +08:00
Jay	9f2147161e	Fix spell error	2019-04-08 15:49:29 +08:00
Tim Hockin	f8a7936894	Add better logging when iptables-restore fails	2019-04-04 16:34:10 -07:00
Yongkun Gui	a07169bcad	kube-proxy: Drop packets in INVALID state Fixes: #74839	2019-03-18 15:22:30 -07:00
Kubernetes Prow Robot	aa9cbd112c	Merge pull request #75265 from JacobTanenbaum/ClearExternalIPs Clear conntrack entries on 0 -> 1 endpoint transition with externalIPs	2019-03-18 11:06:23 -07:00
Jacob Tanenbaum	c3548165d5	Clear conntrack entries on 0 -> 1 endpoint transition with externalIPs As part of the endpoint creation process when going from 0 -> 1 conntrack entries are cleared. This is to prevent an existing conntrack entry from preventing traffic to the service. Currently the system ignores the existance of the services external IP addresses, which exposes that errant behavior This adds the externalIP addresses of udp services to the list of conntrack entries that get cleared. Allowing traffic to flow Signed-off-by: Jacob Tanenbaum <jtanenba@redhat.com>	2019-03-15 11:18:51 -04:00
Tim Hockin	de25d6cb95	Kube-proxy: REJECT LB IPs with no endpoints We REJECT every other case. Close this FIXME. To get this to work in all cases, we have to process service in filter.INPUT, since LB IPS might be manged as local addresses.	2019-03-11 20:33:45 -07:00
danielqsj	10ab3fb832	clean the deprecated metrics which introduced recently	2019-03-06 15:23:46 +08:00
danielqsj	f7b437cae0	convert latency in mertics name to duration	2019-02-22 21:40:13 +08:00
Kubernetes Prow Robot	059d6057dd	Merge pull request #73323 from prameshj/clear-externalip-conntrack Clear conntrack entries for externalIP and LoadBalancer IP	2019-02-19 18:38:17 -08:00
Kubernetes Prow Robot	808f2cf0ef	Merge pull request #72525 from justinsb/owners_should_not_be_executable Remove executable file permission from OWNERS files	2019-02-14 23:55:45 -08:00
Pavithra Ramesh	24d3ab83dc	Remove conntrack entries from loadbalancer ip too.	2019-02-13 09:55:31 -08:00
Matt Matejczyk	7141ece4bf	Start exporting the in-cluster network programming latency metric.	2019-02-12 08:09:59 +01:00
Kubernetes Prow Robot	5b7a790d35	Merge pull request #72185 from dcbw/owners-label-sig-network OWNERS: add label:sig/network to a bunch of places	2019-02-08 10:36:16 -08:00
Roy Lenferink	b43c04452f	Updated OWNERS files to include link to docs	2019-02-04 22:33:12 +01:00
Ashish Ranjan	7be223e798	Refactor to use k8s.io/utils/net/ package instead of kubernetes/pkg/util/net/sets Signed-off-by: Ashish Ranjan <ashishranjan738@gmail.com>	2019-02-04 10:34:53 +05:30
Kubernetes Prow Robot	b8d6de320f	Merge pull request #72334 from danielqsj/kp Change proxy metrics to conform metrics guidelines	2019-01-25 18:32:12 -08:00
prameshj	5667ebd4f6	Merge branch 'master' into clear-externalip-conntrack	2019-01-25 11:12:16 -08:00
Pavithra Ramesh	168602e597	Clear conntrack entries for externalIP When an endpoint is deleted, the conntrack entries are cleared for clusterIP but not for externalIP of the service. This change adds that step.	2019-01-25 11:05:18 -08:00
Justin SB	dd19b923b7	Remove executable file permission from OWNERS files	2019-01-11 16:42:59 -08:00
Tim Hockin	df77e8eefd	kube-proxy: reject 0 endpoints on forward Previously we only REJECTed on OUTPUT which works for packets from the node but not for packets from pods on the node.	2019-01-03 10:59:13 -08:00
Tim Hockin	0d451d7a4c	kube-proxy: remove old cleanup rules	2019-01-03 10:59:10 -08:00
Tim Hockin	51442b1e8e	kube-proxy: rename field for congruence	2019-01-03 10:59:10 -08:00
Tim Hockin	2106447d21	kube-proxy: rename vars for clarity, fix err str	2019-01-03 10:59:10 -08:00
Tim Hockin	b3c2888e71	kube-proxy: rename internal field for clarity	2019-01-03 10:59:06 -08:00
danielqsj	8975e62254	Change proxy metrics to conform guideline	2018-12-26 17:25:10 +08:00
Dan Williams	2e339188ed	OWNERS: add label:sig/network to a bunch of places	2018-12-19 00:00:02 -06:00
Jacob Tanenbaum	144280e7a7	Correctly Clear conntrack entrty on endpoint changes when using nodeport When using NodePort to connect to an endpoint using UDP, if the endpoint is deleted on restoration of the endpoint traffic does not flow. This happens because conntrack holds the state of the connection and the proxy does not correctly clear the conntrack entry for the stale endpoint. Introduced a new function to conntrack ClearEntriesForPortNAT that uses the endpointIP and NodePort to remove the stale conntrack entry and allow traffic to resume when the endpoint is restored. Signed-off-by: Jacob Tanenbaum <jtanenba@redhat.com>	2018-12-03 15:02:48 -05:00
AdamDang	cc4d38c768	Typo fix: healtcheck->healthcheck (#65394 ) Typo fix: healtcheck->healthcheck Typo fix: healtcheck->healthcheck	2018-11-13 19:45:24 -08:00
Davanum Srinivas	954996e231	Move from glog to klog - Move from the old github.com/golang/glog to k8s.io/klog - klog as explicit InitFlags() so we add them as necessary - we update the other repositories that we vendor that made a similar change from glog to klog * github.com/kubernetes/repo-infra * k8s.io/gengo/ * k8s.io/kube-openapi/ * github.com/google/cadvisor - Entirely remove all references to glog - Fix some tests by explicit InitFlags in their init() methods Change-Id: I92db545ff36fcec83afe98f550c9e630098b3135	2018-11-10 07:50:31 -05:00
k8s-ci-robot	941fc26418	Merge pull request #67888 from tanshanshan/glogformat remove unused format log print	2018-10-01 22:20:28 -07:00
k8s-ci-robot	3fe21e5433	Merge pull request #68922 from BenTheElder/version-staging move pkg/util/version to staging	2018-09-26 22:59:42 -07:00
Benjamin Elder	8b56eb8588	hack/update-gofmt.sh	2018-09-24 12:21:29 -07:00
Benjamin Elder	f828c6f662	hack/update-bazel.sh	2018-09-24 12:03:24 -07:00
Benjamin Elder	088cf3c37b	find & replace version import	2018-09-24 12:03:24 -07:00
Jess Frazelle	f8ba640ced	pkg/proxy: only set sysctl if not already set This will allow for kube-proxy to be run without `privileged` and with only adding the capability `NET_ADMIN`. Signed-off-by: Jess Frazelle <acidburn@microsoft.com>	2018-09-19 15:29:53 -04:00
Kubernetes Submit Queue	11c47e1872	Merge pull request #67948 from wojtek-t/use_buffers_in_kube_proxy Automatic merge from submit-queue (batch tested with PRs 66577, 67948, 68001, 67982). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md. Reduce amount of allocations in kube-proxy Follow up from https://github.com/kubernetes/kubernetes/pull/65902	2018-08-29 16:33:34 -07:00
wojtekt	8fb365df32	Reduce amount of allocations in kube-proxy	2018-08-28 15:18:58 +02:00
tanshanshan	8598c9dceb	remove unused format log print	2018-08-27 17:10:24 +08:00
Laszlo Janosi	cbe94df8c6	gofmt update	2018-08-27 05:59:50 +00:00
Laszlo Janosi	e466bdc67e	Changes according to the approved KEP. SCTP is supported for HostPort and LoadBalancer. Alpha feature flag SCTPSupport controls the support of SCTP. Kube-proxy config parameter is removed.	2018-08-27 05:58:36 +00:00
Laszlo Janosi	a6da2b1472	K8s SCTP support implementation for the first pull request The requested Service Protocol is checked against the supported protocols of GCE Internal LB. The supported protocols are TCP and UDP. SCTP is not supported by OpenStack LBaaS. If SCTP is requested in a Service with type=LoadBalancer, the request is rejected. Comment style is also corrected. SCTP is not allowed for LoadBalancer Service and for HostPort. Kube-proxy can be configured not to start listening on the host port for SCTP: see the new SCTPUserSpaceNode parameter changed the vendor github.com/nokia/sctp to github.com/ishidawataru/sctp. I.e. from now on we use the upstream version. netexec.go compilation fixed. Various test cases fixed SCTP related conformance tests removed. Netexec's pod definition and Dockerfile are updated to expose the new SCTP port(8082) SCTP related e2e test cases are removed as the e2e test systems do not support SCTP sctp related firewall config is removed from cluster/gce/util.sh. Variable name sctp_addr is corrected to sctpAddr in pkg/proxy/ipvs/proxier.go cluster/gce/util.sh is copied from master	2018-08-27 05:56:27 +00:00
fisherxu	5a9bea0353	update bazel	2018-08-16 09:59:33 +08:00
x00416946 fisherxu	79e17e6cd7	use versioned api in kube-proxy	2018-08-16 09:59:33 +08:00
tanshanshan	f68af9e584	fix spell	2018-07-14 10:05:56 +08:00
Kubernetes Submit Queue	13f9c26fd7	Merge pull request #65902 from wojtek-t/kube_proxy_less_allocations_2 Automatic merge from submit-queue (batch tested with PRs 65902, 65781). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Avoid unnecessary allocations in kube-proxy	2018-07-09 23:07:01 -07:00
wojtekt	6e50f39dbd	Avoid allocations when parsing iptables	2018-07-08 10:55:19 +02:00
Kubernetes Submit Queue	28e78ec987	Merge pull request #65755 from wojtek-t/optimize_kube_proxy Automatic merge from submit-queue (batch tested with PRs 65882, 65896, 65755, 60549, 65927). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Avoid printing some service comments in iptables rules According to some profiles, with large number of endpoints in the system, comments mentioning the service in appropriate iptables rules may be responsible for 40% of all iptables contents. Given that ~70% of memory usage of kube-proxy seems to be because of generated iptables rules, the overall saving may be at the level of 30% or so. OTOH, we sacrifise a bit understandability of iptables, but this PR only changes some of iptables that contribute to the most painful rules. @thockin @danwinship @dcbw - thoughts? Ref #65441	2018-07-07 18:41:09 -07:00
wojtekt	d073b2097f	Optimize iptables	2018-07-06 14:25:56 +02:00
wojtekt	bbd0a98346	Avoid printing service comments in proxy rules	2018-07-04 08:45:19 +02:00
Jeff Grafton	23ceebac22	Run hack/update-bazel.sh	2018-06-22 16:22:57 -07:00
Jeff Grafton	a725660640	Update to gazelle 0.12.0 and run hack/update-bazel.sh	2018-06-22 16:22:18 -07:00
xujieasd	368cb99d0b	fix iptables_test typo	2018-06-13 15:12:40 +08:00
m1093782566	029a16a1eb	fix review comments	2018-05-14 16:07:13 +08:00
m1093782566	8b16d66b46	add some comment message	2018-05-02 17:02:07 +08:00
m1093782566	b2f5c8e610	fix localport open - iptables part changes	2018-04-02 11:53:12 +08:00
Zihong Zheng	6004452bed	Auto-updated BUILD files	2018-02-27 11:18:11 -08:00
Zihong Zheng	f6eed81f21	[kube-proxy] Mass service/endpoint info functions rename and comments	2018-02-27 11:14:02 -08:00
Zihong Zheng	95cde4fb98	[kube-proxy] Harden change tracker and proxiers for unmatched IP versions	2018-02-27 11:14:02 -08:00
Zihong Zheng	dfbec1a63a	[kube-proxy] Move ipv6 related funcs to utils pkg	2018-02-27 11:12:45 -08:00
Zihong Zheng	b485f7b5b4	[kube-proxy] Move Service/EndpointInfo common codes to change tracker	2018-02-27 11:05:59 -08:00
Kubernetes Submit Queue	42378eab40	Merge pull request #58052 from m1093782566/nodeip-config Automatic merge from submit-queue (batch tested with PRs 60430, 60115, 58052, 60355, 60116). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Make nodeport ip configurable What this PR does / why we need it: By default, kube-proxy accepts everything from NodePort without any filter. It can be a problem for nodes which has both public and private NICs, and people only want to provide a service in private network and avoid exposing any internal service on the public IPs. This PR makes nodeport ip configurable. Which issue(s) this PR fixes: Closes: #21070 Special notes for your reviewer: Design proposal see: https://github.com/kubernetes/community/pull/1547 Issue in feature repo: https://github.com/kubernetes/features/issues/539 Release note: ```release-note Make NodePort IP addresses configurable ```	2018-02-27 09:38:44 -08:00
Kubernetes Submit Queue	05425f0826	Merge pull request #60256 from danwinship/review-iptables-stuff Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. add me to iptables/kube-proxy reviewers kube-proxy needs reviewers!	2018-02-26 07:50:58 -08:00
m1093782566	9bb4807e25	update bazel	2018-02-26 23:48:48 +08:00
m1093782566	ddfa04e8f4	iptables part implementation	2018-02-26 23:48:47 +08:00
Kubernetes Submit Queue	c11ae9d21e	Merge pull request #60306 from danwinship/proxier-connstate-new Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Only run connection-rejecting rules on new connections Kube-proxy has two iptables chains full of rules to reject incoming connections to services that don't have any endpoints. Currently these rules get tested against all incoming packets, but that's unnecessary; if a connection to a given service has already been established, then we can't have been rejecting connections to that service. By only checking the first packet in each new connection, we can get rid of a lot of unnecessary checks on incoming traffic. Fixes #56842 Release note: ```release-note Additional changes to iptables kube-proxy backend to improve performance on clusters with very large numbers of services. ```	2018-02-24 16:19:56 -08:00
Kubernetes Submit Queue	c1a73ea685	Merge pull request #59286 from prameshj/udp-conntrack Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Delete stale UDP conntrack entries that use hostPort What this PR does / why we need it: This PR introduces a change to delete stale conntrack entries for UDP connections, specifically for udp connections that use hostPort. When the pod listening on that udp port get updated/restarted(and gets a new ip address), these entries need to be flushed so that ongoing udp connections can recover once the pod is back and the new iptables rules have been installed. Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Fixes #59033 Special notes for your reviewer: Release note: ```release-note NONE ```	2018-02-23 19:54:08 -08:00
Kubernetes Submit Queue	e6c2a5de10	Merge pull request #57461 from danwinship/proxier-no-dummy-nat-rules Automatic merge from submit-queue (batch tested with PRs 55637, 57461, 60268, 60290, 60210). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Don't create no-op iptables rules for services with no endpoints Currently for all services we create `-t nat -A KUBE-SERVICES` rules that match the destination IPs (ClusterIP, ExternalIP, NodePort IPs, etc) and then jump to the appropriate `KUBE-SVC-XXXXXX` chain. But if the service has no endpoints then the `KUBE-SVC-XXXXXX` chain will be empty and so nothing happens except that we wasted time (a) forcing iptables-restore to parse the match rules, and (b) forcing the kernel to test matches that aren't going to have any effect. This PR gets rid of the match rules in this case. Which is to say, it changes things so that every incoming service packet is matched either by nat rules to rewrite it or by filter rules to ICMP reject it, but not both. (Actually, that's not quite true: there are no filter rules to reject Ingress-addressed packets, and I think that's a bug?) I also got rid of some comments that seemed redundant. The patch is mostly reindentation, so best viewed with `diff -w`. Partial fix for #56842 / Related to #56164 (which it conflicts with but I'll fix that after one or the other merges). Release note: ```release-note Removed some redundant rules created by the iptables proxier, to improve performance on systems with very many services. ```	2018-02-23 09:49:38 -08:00
Dan Winship	225941679e	Only run connection-rejecting rules on new connections	2018-02-23 08:50:58 -05:00
Pavithra Ramesh	098a4467fe	Remove conntrack entry on udp rule add. Moved conntrack util outside of proxy pkg Added warning message if conntrack binary is not found Addressed review comments. ran gofmt	2018-02-22 23:34:42 -08:00
Kubernetes Submit Queue	f0ca996274	Merge pull request #56164 from danwinship/proxier-chain-split Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Split KUBE-SERVICES chain to re-shrink the INPUT chain What this PR does / why we need it: #43972 added an iptables rule "`-A INPUT -j KUBE-SERVICES`" to make NodePort ICMP rejection work. (Previously the KUBE-SERVICES chain was only run from OUTPUT, not INPUT.) #44547 extended that patch for ExternalIP rejection as well. However, the KUBE-SERVICES chain may potentially have a very large number of ICMP reject rules for plain ClusterIP services (the ones that get run from OUTPUT), and it seems that for some reason the kernel is much more sensitive to the length of the INPUT chain than it is to the length of the OUTPUT chain. So a node that worked fine with kube 1.6 (when KUBE-SERVICES was only run from OUTPUT) might fall over with kube 1.7 (with KUBE-SERVICES being run from both INPUT and OUTPUT). (Specifically, a node with about 5000 ClusterIP reject rules that ran fine with OpenShift 3.6 [kube 1.6] slowed almost to a complete halt with OpenShift 3.7 [kube 1.7].) This PR fixes things by splitting out the "new" part of KUBE-SERVICES (NodePort and ExternalIP reject rules) into a separate KUBE-EXTERNAL-SERVICES chain run from INPUT, and moves KUBE-SERVICES back to being only run from OUTPUT. (So, yes, this assumes that you don't have 5000 NodePort/ExternalIP services, but, if you do, there's not much we can do, since those rules have to be run on the INPUT side.) Oh, and I left in the code to clean up the "`-A INPUT -j KUBE-SERVICES`" rule even though we don't generate it any more, so it gets fixed on upgrade. Release note: ```release-note Reorganized iptables rules to fix a performance regression on clusters with thousands of services. ``` @kubernetes/sig-network-bugs @kubernetes/rh-networking	2018-02-22 18:52:53 -08:00
Dan Winship	fc03cfe7a8	add me to iptables/kube-proxy reviewers	2018-02-22 17:36:57 -05:00
Jeff Grafton	ef56a8d6bb	Autogenerated: hack/update-bazel.sh	2018-02-16 13:43:01 -08:00
Dan Winship	07ead7d8e2	Don't create no-op iptables rules for services with no endpoints	2018-02-13 07:52:47 -05:00

... 3 4 5 6 7 ...

696 Commits