kubernetes

Author	SHA1	Message	Date
Dan Winship	37ada4b04f	proxy/iptables: Don't create unused chains, and enable the unit test for that	2022-02-21 09:16:22 -05:00
Dan Winship	ef4324eaf5	proxy/iptables: refactor unit test code / fix error reporting Only run assertIPTablesRuleJumps() on the expected output, not on the actual output, since if there's a problem with the actual output, we'd rather see it as the diff from the expected output.	2022-02-21 09:16:22 -05:00
Dan Winship	4af471f8be	proxy/iptables: move GetChainLines unit tests to the right package GetChainLines is a utiliptables method, so it should be part of the unit tests there.	2022-02-21 09:16:22 -05:00
Quan Tian	6ce612ef65	kube-proxy: fix duplicate port opening When nodePortAddresses is not specified for kube-proxy, it tried to open the node port for a NodePort service twice, triggered by IPv4ZeroCIDR and IPv6ZeroCIDR separately. The first attempt would succeed and the second one would always generate an error log like below: "listen tcp4 :30522: bind: address already in use" This patch fixes it by ensuring nodeAddresses of a proxier only contain the addresses for its IP family.	2022-01-08 02:35:35 +08:00
Quan Tian	95a706ba7c	Remove redundant forwarding rule in filter table	2021-11-11 10:27:53 +08:00
Dan Winship	a4e6d2f6fa	proxy/iptables: add a unit test for the comment elision code	2021-11-10 09:08:02 -05:00
Dan Winship	8ef1255cdd	proxy/iptables: Abstract out code for writing service-chain-to-endpoint-chain rules The same code appeared twice, once for the SVC chain and once for the XLB chain, with the only difference being that the XLB version had more verbose comments.	2021-11-09 20:59:33 -05:00
Dan Winship	9cd0552ddd	proxy/iptables: Remove unnecessary /32 and /128 in iptables rules If you pass just an IP address to "-s" or "-d", the iptables command will fill in the correct mask automatically. Originally, the proxier was just hardcoding "/32" for all of these, which was unnecessary but simple. But when IPv6 support was added, the code was made more complicated to deal with the fact that the "/32" needed to be "/128" in the IPv6 case, so it would parse the IPs to figure out which family they were, which in turn involved adding some checks in case the parsing fails (even though that "can't happen" and the old code didn't check for invalid IPs, even though that would break the iptables-restore if there had been any). Anyway, all of that is unnecessary because we can just pass the IP strings to iptables directly rather than parsing and unparsing them first. (The diff to proxier_test.go is just deleting "/32" everywhere.)	2021-11-09 09:32:50 -05:00
Kubernetes Prow Robot	0940dd6fc4	Merge pull request #106163 from aojea/conntrack_readiness kube-proxy consider endpoint readiness to delete UDP stale conntrack entries	2021-11-08 13:11:44 -08:00
Tim Hockin	f662170ff7	kube-proxy: make iptables buffer-writing cleaner	2021-11-05 12:28:19 -07:00
Antonio Ojea	909925b492	kube-proxy: fix stale detection logic The logic to detect stale endpoints was not assuming the endpoint readiness. We can have stale entries on UDP services for 2 reasons: - an endpoint was receiving traffic and is removed or replaced - a service was receiving traffic but not forwarding it, and starts to forward it. Add an e2e test to cover the regression	2021-11-05 20:14:56 +01:00
Dan Winship	229ae58520	proxy/iptables: fix all-vs-ready endpoints a bit Filter the allEndpoints list into readyEndpoints sooner, and set "hasEndpoints" based (mostly) on readyEndpoints, not allEndpoints (so that, eg, we correctly generate REJECT rules for services with no _functioning_ endpoints, even if they have unusable terminating endpoints). Also, write out the endpoint chains at the top of the loop when we iterate the endpoints for the first time, rather than copying some of the data to another set of variables and then writing them out later. And don't write out endpoint chains that won't be used Also, generate affinity rules only for readyEndpoints rather than allEndpoints, so affinity gets broken correctly when an endpoint becomes unready.	2021-11-04 16:32:08 -04:00
Dan Winship	6ab3dc6875	proxy/iptables: Add more stuff to the unit test The external traffic policy terminating endpoints test was testing LoadBalancer functionality against a NodePort service with no nodePorts (or loadBalancer IPs). It managed to test what it wanted to test, but it's kind of dubious (and we probably _shouldn't_ have been generating the rules it was looking for since there was no way to actually reach the XLB chains). So fix that. Also make the terminating endpoints test use session affinity, to add more testing for that. Also, remove the multiple copies of the same identical Service that is used for all of the test cases in that test. Also add a "Cluster traffic policy and no source ranges" test to TestOverallIPTablesRulesWithMultipleServices since we weren't really testing either of those. Also add a test of --masquerade-all.	2021-11-04 16:32:08 -04:00
Dan Winship	22a951c096	proxy/iptables: Fix TestOnlyLocalNodePortsNoClusterCIDR The test got broken to not actually use "no cluster CIDR" when LocalDetector was implemented (and the old version of the unit test didn't check enough to actually notice this).	2021-11-04 16:32:08 -04:00
Dan Winship	799c222c84	proxy/iptables: test that we create a consistent set of iptables rules	2021-11-04 16:32:08 -04:00
Dan Winship	9403bfb178	proxy/iptables: Misc improvements to unit test The original tests here were very shy about looking at the iptables output, and just relied on checks like "make sure there's a jump to table X that also includes string Y somewhere in it" and stuff like that. Whereas the newer tests were just like, "eh, here's a wall of text, make sure the iptables output is exactly that". Although the latter looks messier in the code, it's more precise, and it's easier to update correctly when you change the rules. So just make all of the tests do a check on the full iptables output. (Note that I didn't double-check any of the output; I'm just assuming that the output of the current iptables proxy code is actually correct...) Also, don't hardcode the expected number of rules in the metrics tests, so that there's one less thing to adjust when rules change. Also, use t.Run() in one place to get more precise errors on failure.	2021-11-04 16:32:06 -04:00
Dan Winship	a1a12ca1da	proxy/iptables: Improve the sorting logic in TestOverallIPTablesRulesWithMultipleServices The test was sorting the iptables output so as to not depend on the order that services get processed in, but this meant it wasn't checking the relative ordering of rules (and in fact, the ordering of the rules in the "expected" string was wrong, in a way that would break things if the rules had actually been generated in that order). Add a more complicated sorting function that sorts services alphabetically while preserving the ordering of rules within each service.	2021-11-04 16:31:16 -04:00
Dan Winship	08680192fb	proxy/iptables: Fix sync_proxy_rules_iptables_total metric It was counting the number of lines including the "COMMIT" line at the end, so it was off by one.	2021-11-04 16:30:12 -04:00
Dan Winship	7f6fbc4482	Drop broken/no-op proxyconfig.EndpointsHandler implementations Because the proxy.Provider interface included proxyconfig.EndpointsHandler, all the backends needed to implement its methods. But iptables, ipvs, and winkernel implemented them as no-ops, and metaproxier had an implementation that wouldn't actually work (because it couldn't handle Services with no active Endpoints). Since Endpoints processing in kube-proxy is deprecated (and can't be re-enabled unless you're using a backend that doesn't support EndpointSlice), remove proxyconfig.EndpointsHandler from the definition of proxy.Provider and drop all the useless implementations.	2021-09-13 09:32:38 -04:00
Antonio Ojea	0cd75e8fec	run hack/update-netparse-cve.sh	2021-08-20 10:42:09 +02:00
Antonio Ojea	a2a22903bc	delete stale UDP conntrack entries for loadbalancer IPs	2021-07-29 17:35:07 +02:00
Swetha Repakula	0a42f7b989	Graduate EndpointSliceProxying and WindowsEndpointSliceProxying Gates	2021-07-07 13:33:30 -07:00
Kubernetes Prow Robot	7cd40e1885	Merge pull request #103116 from chenyw1990/reducekubeproxycpu reduce cpu usage of kube-proxy with iptables mode	2021-07-05 15:13:38 -07:00
chenyw1990	1f24a198e7	reduce cpu usage of kube-proxy with iptables mode	2021-07-05 16:08:19 +08:00
Swetha Repakula	03b7a699c2	Kubeproxy uses V1 EndpointSlice	2021-06-30 18:41:57 -07:00
Andrew Sy Kim	ed4fe07375	proxy/iptables: add unit test Test_HealthCheckNodePortWhenTerminating for ensuring health check node port fails when all local endpoints are terminating Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	68ebd16a2c	proxier/iptables: refactor terminating endpoints unit tests with test table and test for feature gate Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	d82d851d89	proxier/iptables: include Service port in unit tests Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:17:43 -04:00
Andrew Sy Kim	b54c0568d8	proxier/iptables: add unit tests for falling back to terminating endpoints Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-06-04 15:15:40 -04:00
Surya Seetharaman	d3fe48e848	Kube-proxy: perf-enhancement: Reduce NAT table KUBE-SERVICES/NODEPORTS chain rules The nat KUBE-SERVICES chain is called from OUTPUT and PREROUTING stages. In clusters with large number of services, the nat-KUBE-SERVICES chain is the largest chain with for eg: 33k rules. This patch aims to move the KubeMarkMasq rules from the kubeServicesChain into the respective KUBE-SVC-* chains. This way during each packet-rule matching we won't have to traverse the MASQ rules of all services which get accumulated in the KUBE-SERVICES and/or KUBE-NODEPORTS chains. Since the jump to KUBE-MARK-MASQ ultimately sets the 0x400 mark for nodeIP SNAT, it should not matter whether the jump is made from KUBE-SERVICES or KUBE-SVC-* chains. Specifically we change: 1) For ClusterIP svc, we move the KUBE-MARK-MASQ jump rule from KUBE-SERVICES chain into KUBE-SVC-* chain. 2) For ExternalIP svc, we move the KUBE-MARK-MASQ jump rule in the case of non-ServiceExternalTrafficPolicyTypeLocal from KUBE-SERVICES chain into KUBE-SVC-* chain. 3) For NodePorts svc, we move the KUBE-MARK-MASQ jump rule in case of non-ServiceExternalTrafficPolicyTypeLocal from KUBE-NODEPORTS chain to KUBE-SVC-* chain. 4) For load-balancer svc, we don't change anything since it is already svc specific due to creation of KUBE-FW-* chains per svc. This would cut the rules per svc in KUBE-SERVICES and KUBE-NODEPORTS in half.	2021-04-21 16:41:03 +02:00
Surya Seetharaman	667e50abc8	Add TestOverallIPTablesRulesWithMultipleServices	2021-04-21 16:41:03 +02:00
Fangyuan Li	7ed2f1d94d	Implements Service Internal Traffic Policy 1. Add API definitions; 2. Add feature gate and drops the field when feature gate is not on; 3. Set default values for the field; 4. Add API Validation 5. add kube-proxy iptables and ipvs implementations 6. add tests	2021-03-07 16:52:59 -08:00
Antonio Ojea	654be57022	kube-proxy iptables expose number of rules metrics add a new metric to kube-proxy iptables, so it exposes the number of rules programmed in each iteration.	2021-03-05 10:00:38 +01:00
jornshen	e68e105102	migrate to use k8s.io/util LocalPort and ListenPortOpener in iptables.proxier	2021-02-15 16:36:06 +08:00
Antonio Ojea	ed21a0e16c	kube-proxy: clear conntrack entries after rules are in place Clear conntrack entries for UDP NodePorts, this has to be done AFTER the iptables rules are programmed. It can happen that traffic to the NodePort hits the host before the iptables rules are programmed this will create an stale entry in conntrack that will blackhole the traffic, so we need to clear it ONLY when the service has endpoints.	2021-02-10 16:22:03 +01:00
Hanlin Shi	4cd1eacbc1	Add rule to allow healthcheck nodeport traffic in filter table 1. For iptables mode, add KUBE-NODEPORTS chain in filter table. Add rules to allow healthcheck node port traffic. 2. For ipvs mode, add KUBE-NODE-PORT chain in filter table. Add KUBE-HEALTH-CHECK-NODE-PORT ipset to allow traffic to healthcheck node port.	2021-02-03 15:20:10 +00:00
Kubernetes Prow Robot	eb08f36c7d	Merge pull request #96371 from andrewsykim/kube-proxy-terminating kube-proxy: track serving/terminating conditions in endpoints cache	2021-01-11 18:38:25 -08:00
Andrew Sy Kim	9c096292cc	kube-proxy: iptables proxy should ignore endpoints with condition ready=false Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-01-11 16:27:38 -05:00
Andrew Sy Kim	55cb453a3c	kube-proxy: update internal endpoints map with 'serving' and 'terminating' condition from EndpointSlice Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>	2021-01-11 16:17:58 -05:00
jornshen	5af5a2ac7d	migrate proxy.UpdateServiceMap to be a method of ServiceMap	2021-01-11 11:07:30 +08:00
Patrik Cyvoct	d29665cc17	Revert "Merge pull request #92312 from Sh4d1/kep_1860" This reverts commit `ef16faf409`, reversing changes made to `2343b8a68b`.	2020-11-11 10:26:53 +01:00
Patrik Cyvoct	11b97e9ef8	fix tests Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:55 +01:00
Patrik Cyvoct	0153b96ab8	fix review Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 10:00:27 +01:00
Patrik Cyvoct	d562b6924a	Add tests Signed-off-by: Patrik Cyvoct <patrik@ptrk.io>	2020-11-07 09:59:59 +01:00
Khaled Henidak (Kal)	6675eba3ef	dual stack services (#91824 ) * api: structure change * api: defaulting, conversion, and validation * [FIX] validation: auto remove second ip/family when service changes to SingleStack * [FIX] api: defaulting, conversion, and validation * api-server: clusterIPs alloc, printers, storage and strategy * [FIX] clusterIPs default on read * alloc: auto remove second ip/family when service changes to SingleStack * api-server: repair loop handling for clusterIPs * api-server: force kubernetes default service into single stack * api-server: tie dualstack feature flag with endpoint feature flag * controller-manager: feature flag, endpoint, and endpointSlice controllers handling multi family service * [FIX] controller-manager: feature flag, endpoint, and endpointSlicecontrollers handling multi family service * kube-proxy: feature-flag, utils, proxier, and meta proxier * [FIX] kubeproxy: call both proxier at the same time * kubenet: remove forced pod IP sorting * kubectl: modify describe to include ClusterIPs, IPFamilies, and IPFamilyPolicy * e2e: fix tests that depends on IPFamily field AND add dual stack tests * e2e: fix expected error message for ClusterIP immutability * add integration tests for dualstack the third phase of dual stack is a very complex change in the API, basically it introduces Dual Stack services. Main changes are: - It pluralizes the Service IPFamily field to IPFamilies, and removes the singular field. - It introduces a new field IPFamilyPolicyType that can take 3 values to express the "dual-stack(mad)ness" of the cluster: SingleStack, PreferDualStack and RequireDualStack - It pluralizes ClusterIP to ClusterIPs. The goal is to add coverage to the services API operations, taking into account the 6 different modes a cluster can have: - single stack: IP4 or IPv6 (as of today) - dual stack: IPv4 only, IPv6 only, IPv4 - IPv6, IPv6 - IPv4 * [FIX] add integration tests for dualstack * generated data * generated files Co-authored-by: Antonio Ojea <aojea@redhat.com>	2020-10-26 13:15:59 -07:00
Surya Seetharaman	477b14b3c4	Kube-proxy: Perf-fix: Shrink INPUT chain In #56164, we had split the reject rules for non-ep existing services into KUBE-EXTERNAL-SERVICES chain in order to avoid calling KUBE-SERVICES from INPUT. However in #74394 KUBE-SERVICES was re-added into INPUT. As noted in #56164, kernel is sensitive to the size of INPUT chain. This patch refrains from calling the KUBE-SERVICES chain from INPUT and FORWARD, instead adds the lb reject rule to the KUBE-EXTERNAL-SERVICES chain which will be called from INPUT and FORWARD.	2020-10-19 11:26:04 +02:00
Amim Knabben	a18e5de51a	LockToDefault the ExternalPolicyForExternalIP feature gate	2020-09-16 13:16:33 -04:00
Rob Scott	c382c79f60	Updating kube-proxy to trim space from loadBalancerSourceRanges Before this fix, a Service with a loadBalancerSourceRange value that included a space would cause kube-proxy to crashloop. This updates kube-proxy to trim any space from that field.	2020-08-20 18:19:52 -07:00
Vinod K L Swamy	4505d5b182	Changes to Proxy common code	2020-06-29 14:29:46 -07:00
Dan Winship	c12534d8b4	kubelet, kube-proxy: unmark packets before masquerading them It seems that if you set the packet mark on a packet and then route that packet through a kernel VXLAN interface, the VXLAN-encapsulated packet will still have the mark from the original packet. Since our NAT rules are based on the packet mark, this was causing us to double-NAT some packets, which then triggered a kernel checksumming bug. But even without the checksum bug, there are reasons to avoid double-NATting, so fix the rules to unmark the packets before masquerading them.	2020-06-15 18:45:38 -04:00

1 2 3 4

181 Commits