
Panic from dynamicRESTMapper when using lazy initialization #1712

Closed
siliconbrain opened this issue Nov 8, 2021 · 4 comments · Fixed by #1891

Comments

@siliconbrain
Contributor

siliconbrain commented Nov 8, 2021

When dynamicRESTMapper's lazy initialization fails, every subsequent call to it panics (runtime error: invalid memory address or nil pointer dereference) because staticMapper is left uninitialized.

I ran into this when my cluster could not be reached, so I'll illustrate the problem this way, but any error returned by setStaticMapper during init will result in the same issue.

How to reproduce

import (
	"fmt"

	"k8s.io/apimachinery/pkg/runtime/schema"
	"sigs.k8s.io/controller-runtime/pkg/client/apiutil"
	"sigs.k8s.io/controller-runtime/pkg/client/config"
)

func main() {
	// get a client config; this should succeed
	cfg := config.GetConfigOrDie()
	// to simulate a refused connection, set cfg.Host to some URL that won't answer
	cfg.Host = "https://127.0.0.1:12345"
	// create a new dynamicRESTMapper instance with lazy initialization; this should succeed
	mapper, _ := apiutil.NewDynamicRESTMapper(cfg, apiutil.WithLazyDiscovery)
	// call any method on mapper; this should result in an error
	_, err := mapper.KindFor(schema.GroupVersionResource{Group: "", Version: "v1", Resource: "pods"})
	if err != nil {
		fmt.Println("failed to get kind for pods")
	}
	// later on (or from another place/goroutine in your code), call any method on mapper again; this panics
	_, err = mapper.KindFor(schema.GroupVersionResource{Group: "", Version: "v1", Resource: "pods"})
	if err != nil {
		fmt.Println("failed to get kind for pods")
	}
}

Suggested fix

sync.Once is not suitable for guarding the lazy initialization code, because the initialization should be retried until it succeeds, not attempted just once. I couldn't find an existing solution, so I propose a custom variant of sync.Once that is only marked done when the function passed to Do returns true, although other implementations might be fine too.

type Once struct {
	done uint32
	m    sync.Mutex
}

// Do runs f unless a previous call already returned true; the Once is marked
// done only when f reports success.
func (o *Once) Do(f func() bool) {
	if atomic.LoadUint32(&o.done) == 0 {
		o.doSlow(f)
	}
}

func (o *Once) doSlow(f func() bool) {
	o.m.Lock()
	defer o.m.Unlock()
	if o.done == 0 {
		if f() {
			atomic.StoreUint32(&o.done, 1)
		}
	}
}

This should be used in place of sync.Once, updating the inline function passed to Do to end with a return err == nil statement.
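A minimal sketch of what that could look like inside apiutil's dynamicRESTMapper (the init method and the initOnce field are placeholders of mine, not the actual controller-runtime code; setStaticMapper is the existing initializer mentioned above):

func (drm *dynamicRESTMapper) init() error {
	var err error
	// initOnce is assumed to be the custom Once defined above
	drm.initOnce.Do(func() bool {
		err = drm.setStaticMapper()
		// mark initialization done only on success, so the next call
		// retries discovery instead of dereferencing a nil staticMapper
		return err == nil
	})
	return err
}

With this, a failed discovery call leaves the Once not done, so the next method call on the mapper retries initialization and returns the error instead of panicking.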

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 6, 2022
@siliconbrain
Contributor Author

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 6, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 7, 2022
@siliconbrain
Contributor Author

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 7, 2022