In the dual-stack case, iptables.NewDualStackProxier and
ipvs.NewDualStackProxier filtered the nodeport address values by IP
family before creating the single-stack proxiers. But in the
single-stack case, the kube-proxy startup code just passed the value
through to the single-stack proxiers without validation, so they had to
re-check it themselves. Fix that.
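For illustration, the per-family filtering amounts to something like the
sketch below (filterCIDRsByIPFamily is a hypothetical helper, not the
actual kube-proxy code):

    package main

    import (
        "fmt"
        "net"
    )

    // filterCIDRsByIPFamily keeps only the CIDR strings whose IP family
    // matches wantIPv6, dropping anything unparseable. It stands in for
    // the filtering the dual-stack constructors do before handing the
    // nodeport addresses to each single-stack proxier.
    func filterCIDRsByIPFamily(cidrs []string, wantIPv6 bool) []string {
        var out []string
        for _, c := range cidrs {
            ip, _, err := net.ParseCIDR(c)
            if err != nil {
                continue
            }
            if (ip.To4() == nil) == wantIPv6 {
                out = append(out, c)
            }
        }
        return out
    }

    func main() {
        nodePortAddresses := []string{"10.0.0.0/8", "fd00::/64", "192.168.1.0/24"}
        fmt.Println(filterCIDRsByIPFamily(nodePortAddresses, false)) // IPv4 values
        fmt.Println(filterCIDRsByIPFamily(nodePortAddresses, true))  // IPv6 values
    }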
We had a test verifying that creating a Service with an SCTP port would
create an iptables rule with "-p sctp" in it, which let us check that
kube-proxy was doing vaguely the right thing with SCTP even if the e2e
environment didn't have SCTP support. But this really makes much more
sense as a unit test.
We currently invoke /sbin/iptables 24 times on each syncProxyRules
before calling iptables-restore. Since even trivial iptables
invocations are slow on hosts with lots of iptables rules, this adds a
lot of time to each sync. Since these checks are expected to be a
no-op 99% of the time, skip them on partial syncs.
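A minimal sketch of the idea, with made-up field and method names
standing in for the real proxier's (not the actual implementation):

    package main

    import "fmt"

    // proxier is a pared-down stand-in for the iptables Proxier, with just
    // enough state to show the "skip checks on partial syncs" idea.
    type proxier struct {
        needFullSync bool
    }

    // ensureBaseChains stands in for the ~24 individual /sbin/iptables
    // invocations that make sure the top-level KUBE-* chains and jump
    // rules exist. They are a no-op almost every time.
    func (p *proxier) ensureBaseChains() error {
        fmt.Println("running base-chain checks")
        return nil
    }

    func (p *proxier) syncProxyRules() {
        tryPartialSync := !p.needFullSync
        if !tryPartialSync {
            // Only run the expensive per-chain checks on full syncs.
            if err := p.ensureBaseChains(); err != nil {
                p.needFullSync = true // retry the checks next time
                return
            }
            p.needFullSync = false
        }
        // ... build and apply the iptables-restore input ...
    }

    func main() {
        p := &proxier{needFullSync: true}
        p.syncProxyRules() // full sync: checks run
        p.syncProxyRules() // partial sync: checks skipped
    }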
iptables-restore requires that if you change any rule in a chain, you
have to rewrite the entire chain. But if you avoid mentioning a chain
at all, it will leave it untouched. Take advantage of this by not
rewriting the SVC, SVL, EXT, FW, and SEP chains for services that have
not changed since the last sync, which should drastically cut down on
the size of each iptables-restore in large clusters.
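Roughly, the restore input generation changes along these lines (a
simplified sketch; the changed-service tracking and rule text are
illustrative, not the real code):

    package main

    import (
        "bytes"
        "fmt"
    )

    // writeServiceChains appends the per-service chains (SVC/SVL/EXT/FW/SEP)
    // for one service to the restore buffer. Heavily simplified.
    func writeServiceChains(buf *bytes.Buffer, svc string) {
        chain := "KUBE-SVC-" + svc
        fmt.Fprintf(buf, ":%s - [0:0]\n", chain)
        fmt.Fprintf(buf, "-A %s -j KUBE-SEP-%s\n", chain, svc)
    }

    func buildRestoreInput(services []string, changed map[string]bool, partial bool) string {
        var buf bytes.Buffer
        buf.WriteString("*nat\n")
        // KUBE-SERVICES is always rewritten in full...
        buf.WriteString(":KUBE-SERVICES - [0:0]\n")
        for _, svc := range services {
            fmt.Fprintf(&buf, "-A KUBE-SERVICES -j KUBE-SVC-%s\n", svc)
            // ...but a service's own chains are only rewritten if the
            // service changed (or on a full resync); chains that aren't
            // mentioned at all are left untouched by iptables-restore.
            if !partial || changed[svc] {
                writeServiceChains(&buf, svc)
            }
        }
        buf.WriteString("COMMIT\n")
        return buf.String()
    }

    func main() {
        fmt.Print(buildRestoreInput([]string{"AAA", "BBB"}, map[string]bool{"BBB": true}, true))
    }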
Back when iptables was first made the default, there were
theoretically some users who wouldn't have been able to support it due
to having an old /sbin/iptables. But kube-proxy no longer does the
things that didn't work with old iptables, and we removed that check a
long time ago. There is also a check for a new-enough kernel version,
but it's checking for a feature which was added in kernel 3.6, and no
one could possibly be running Kubernetes with a kernel that old. The
fallback code therefore never actually falls back, so it should just be
removed.
The proxies watch node labels for topology changes, but node labels
can change in bursts especially in larger clusters. This causes
pressure on all proxies because they can't filter the events, since
the topology could match on any label.
Change node event handling to queue a sync request rather than syncing
immediately. The sync runner can already handle short bursts, so this
shouldn't change behavior in most cases.
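A minimal sketch of the change, with a trivial stand-in for the
proxier's rate-limited sync runner (names are illustrative):

    package main

    import (
        "fmt"
        "reflect"
    )

    // syncRunner is a stand-in for the bounded-frequency runner that
    // coalesces bursts of sync requests into a limited number of real
    // syncs.
    type syncRunner struct{ requests int }

    func (r *syncRunner) Run() { r.requests++ } // just queue a request

    type proxier struct {
        nodeLabels map[string]string
        runner     *syncRunner
    }

    // OnNodeUpdate records the new labels and queues a sync instead of
    // calling syncProxyRules directly, so a burst of label changes doesn't
    // force a burst of full resyncs.
    func (p *proxier) OnNodeUpdate(labels map[string]string) {
        if reflect.DeepEqual(p.nodeLabels, labels) {
            return
        }
        p.nodeLabels = labels
        p.runner.Run()
    }

    func main() {
        p := &proxier{runner: &syncRunner{}}
        for i := 0; i < 5; i++ {
            p.OnNodeUpdate(map[string]string{"topology.kubernetes.io/zone": fmt.Sprint("zone-", i%2)})
        }
        fmt.Println("queued sync requests:", p.runner.requests)
    }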
Signed-off-by: Dan Williams <dcbw@redhat.com>
Part of reorganizing the syncProxyRules loop to do:
1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)
This moves the FW chain creation to the end (rather than having it in
the middle of adding the jump rules for the LB IPs).
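In outline, the reorganized per-service loop looks something like this
(a simplified sketch with illustrative rule text, not the real proxier
code):

    package main

    import (
        "bytes"
        "fmt"
    )

    func writeRules(services []string, natChains, natRules *bytes.Buffer) map[string]bool {
        activeNATChains := map[string]bool{}
        for _, svc := range services {
            svcChain := "KUBE-SVC-" + svc

            // 1. figure out which chains are needed, mark them active
            activeNATChains[svcChain] = true
            fmt.Fprintf(natChains, ":%s - [0:0]\n", svcChain)

            // 2. write the jump rule into KUBE-SERVICES / KUBE-NODEPORTS
            fmt.Fprintf(natRules, "-A KUBE-SERVICES -j %s\n", svcChain)

            // 3. write the servicePort-specific chains (SVC/SVL/EXT/FW/SEP)
            //    themselves, at the bottom of the loop
            fmt.Fprintf(natRules, "-A %s -j KUBE-SEP-%s\n", svcChain, svc)
        }
        return activeNATChains
    }

    func main() {
        var chains, rules bytes.Buffer
        active := writeRules([]string{"AAA"}, &chains, &rules)
        fmt.Print("*nat\n" + chains.String() + rules.String() + "COMMIT\n")
        fmt.Println(active)
    }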
Part of reorganizing the syncProxyRules loop to do:
1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)
This fixes the jump rules for internal traffic. Previously we were
handling "jumping from kubeServices to internalTrafficChain" and
"adding masquerade rules to internalTrafficChain" in the same place.
Part of reorganizing the syncProxyRules loop to do:
1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)
This fixes the handling of the EXT chain.
Part of reorganizing the syncProxyRules loop to do:
1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)
This fixes the handling of the SVC and SVL chains. We were already
filling them in at the end of the loop; now we create them at the
bottom of the loop as well.
Part of reorganizing the syncProxyRules loop to do:
1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)
This fixes the handling of the endpoint chains. Previously they were
handled entirely at the top of the loop. Now we record which ones are
in use at the top but don't create them and fill them in until the
bottom.
We figure out early on whether we're going to end up outputting no
endpoints, so update the metrics then.
(Also remove a redundant feature gate check; svcInfo already checks
the ServiceInternalTrafficPolicy feature gate itself and so
svcInfo.InternalPolicyLocal() will always return false if the gate is
not enabled.)
Rather than marking packets to be dropped in the "nat" table and then
dropping them from the "filter" table later, just use rules in
"filter" to drop the packets we don't like directly.
Re-sync the rules from TestOverallIPTablesRulesWithMultipleServices to
make sure we're testing all the right kinds of rules. Remove a
duplicate copy of the KUBE-MARK-MASQ and KUBE-POSTROUTING rules.
Update the "REJECT" test to use the new svc6 from
TestOverallIPTablesRulesWithMultipleServices. (Previously it had used
a modified version of TOIPTRWMS's svc3.)
svc2b was using the same ClusterIP as svc3; change it and rename the
service to svc5 to make everything clearer.
Move the test of LoadBalancerSourceRanges from svc2 to svc5, so that
svc2 tests the rules for dropping packets due to
externalTrafficPolicy, and svc5 tests the rules for dropping packets
due to LoadBalancerSourceRanges, rather than having them both mixed
together in svc2.
Add svc6 with no endpoints.
"iptables-save" takes several seconds to run on machines with lots of
iptables rules, and we only use its result to figure out which chains
are no longer referenced by any rules. While it makes things less
confusing if we delete unused chains immediately, it's not actually
_necessary_ since they never get called during packet processing. So
in large clusters, make it so we only clean up chains periodically
rather than on every sync.
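A rough sketch of the gating (the field names and the notion of "large
cluster mode" here are illustrative):

    package main

    import (
        "fmt"
        "time"
    )

    // proxier is a pared-down stand-in; field names are illustrative.
    type proxier struct {
        largeCluster     bool
        lastChainCleanup time.Time
        cleanupInterval  time.Duration
    }

    // shouldCleanupChains decides whether this sync should run the
    // expensive iptables-save pass that finds and deletes
    // no-longer-referenced chains. Stale chains are harmless (nothing
    // jumps to them any more), so in large clusters we only bother
    // periodically.
    func (p *proxier) shouldCleanupChains(now time.Time) bool {
        if !p.largeCluster {
            return true
        }
        return now.Sub(p.lastChainCleanup) >= p.cleanupInterval
    }

    func main() {
        p := &proxier{largeCluster: true, cleanupInterval: time.Hour}
        p.lastChainCleanup = time.Now().Add(-30 * time.Minute)
        fmt.Println(p.shouldCleanupChains(time.Now())) // false: skip this sync
        p.lastChainCleanup = time.Now().Add(-2 * time.Hour)
        fmt.Println(p.shouldCleanupChains(time.Now())) // true: clean up now
    }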
We don't need to parse out the counter values from the iptables-save
output (since they are always 0 for the chains we care about). Just
parse the chain names themselves.
Also, all of the callers of GetChainLines() pass it input that
contains only a single table, so just assume that, rather than
carefully parsing only a single table's worth of the input.
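The simplified parsing boils down to something like this sketch
(parseChains is a stand-in for GetChainLines, not its real signature):

    package main

    import (
        "bufio"
        "fmt"
        "strings"
    )

    // parseChains takes iptables-save output for a single table and
    // returns just the chain names from the ":CHAIN ..." lines, ignoring
    // the policy and the "[packets:bytes]" counters.
    func parseChains(save string) []string {
        var chains []string
        scanner := bufio.NewScanner(strings.NewReader(save))
        for scanner.Scan() {
            line := strings.TrimSpace(scanner.Text())
            if !strings.HasPrefix(line, ":") {
                continue
            }
            // ":KUBE-SERVICES - [0:0]" -> "KUBE-SERVICES"
            if fields := strings.Fields(line[1:]); len(fields) > 0 {
                chains = append(chains, fields[0])
            }
        }
        return chains
    }

    func main() {
        save := "*nat\n" +
            ":PREROUTING ACCEPT [0:0]\n" +
            ":KUBE-SERVICES - [0:0]\n" +
            ":KUBE-SVC-EXAMPLE - [0:0]\n" +
            "-A KUBE-SERVICES -j KUBE-SVC-EXAMPLE\n" +
            "COMMIT\n"
        fmt.Println(parseChains(save))
    }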
The iptables and ipvs proxies have code to try to preserve certain
iptables counters when modifying chains via iptables-restore, but the
counters in question only actually exist for the built-in chains (e.g.,
INPUT, FORWARD, PREROUTING, etc.), which we never modify via
iptables-restore (and in fact, *can't* safely modify via
iptables-restore), so we are really just doing a lot of unnecessary
work to copy the constant string "[0:0]" over from iptables-save
output to iptables-restore input. So stop doing that.
Also fix a confused error message when iptables-save fails.
kube-proxy generates iptables rules to forward traffic from Services to
Endpoints. kube-proxy uses iptables-restore to configure the rules
atomically; however, this has the downside that a large number of rules
takes a long time to be processed, causing disruption.
There are different parameters that influence the number of rules
generated:
- ServiceType
- Number of Services
- Number of Endpoints per Service
This test will fail when the number of rules changes, so the person
modifying the code gets feedback about the performance impact of their
changes. It also runs test cases with several different numbers of
rules to check whether the number of rules grows linearly.
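A toy sketch of the rule-counting approach (generateRules is a
hypothetical stand-in for running the proxier and capturing its
iptables-restore input):

    package main

    import (
        "fmt"
        "strings"
    )

    // generateRules pretends to run the proxier with n ClusterIP services
    // of one endpoint each and returns the iptables-restore input it
    // would produce. The real test drives the actual proxier instead.
    func generateRules(n int) string {
        var b strings.Builder
        for i := 0; i < n; i++ {
            fmt.Fprintf(&b, "-A KUBE-SERVICES -j KUBE-SVC-%d\n", i)
            fmt.Fprintf(&b, "-A KUBE-SVC-%d -j KUBE-SEP-%d\n", i, i)
            fmt.Fprintf(&b, "-A KUBE-SEP-%d -j DNAT\n", i)
        }
        return b.String()
    }

    // countRules counts the "-A" lines, i.e. the number of generated rules.
    func countRules(rules string) int {
        n := 0
        for _, line := range strings.Split(rules, "\n") {
            if strings.HasPrefix(line, "-A ") {
                n++
            }
        }
        return n
    }

    func main() {
        // Asserting on these counts makes the test fail whenever a change
        // alters the number of generated rules, and comparing counts
        // across sizes shows whether growth is roughly linear.
        for _, n := range []int{10, 20, 40} {
            fmt.Printf("services=%d rules=%d\n", n, countRules(generateRules(n)))
        }
    }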
Signed-off-by: gkarthiks <github.gkarthiks@gmail.com>
refactor: svc port name variable #108806
Signed-off-by: gkarthiks <github.gkarthiks@gmail.com>
refactor: rename struct for service port information to servicePortInfo and rename fields for better readability
Signed-off-by: gkarthiks <github.gkarthiks@gmail.com>
fix: drop chain rule
Signed-off-by: gkarthiks <github.gkarthiks@gmail.com>
Sort the ":CHAINNAME" lines in the same order as the "-A CHAINNAME"
lines (meaning, KUBE-NODEPORTS and KUBE-SERVICES come first).
(This will simplify IPTablesDump because it won't need to keep track
of the declaration order and the rule order separately.)
The various loops in the LoadBalancer rule section were mis-nested
such that if a service had multiple LoadBalancer IPs, we would write
out the firewall rules multiple times (and the allowFromNode rule for
the second and later IPs would end up being written after the "else
DROP" rule from the first IP).
The LoadBalancer rules change if the node IP is in one of the
LoadBalancerSourceRange subnets, so make sure to set nodeIP on the
fake proxier so we can test this, and add a second source range to
TestLoadBalancer containing the node IP. (This changes the result of
one flow test that previously expected that node-to-LB would be
dropped.)