Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a CLI command to initialize kedro ipython with specified dataset_name as data variable #3994

Open
yury-fedotov opened this issue Jul 5, 2024 · 2 comments
Labels
Issue: Feature Request New feature or improvement to existing feature

Comments

@yury-fedotov
Copy link
Contributor

yury-fedotov commented Jul 5, 2024

Situation

One operation I do very frequently is:

kedro ipython

...waiting for it to load. Then...

data = catalog.load("dataset_name")

And explore it somehow.

Idea

If that's a common scenario for many, maybe a CLI command like that may be in demand:

kedro ipython --fetch dataset_name

And it does the following:

  1. Initializes kedro ipython
  2. Performs data = catalog.load(dataset_name) automatically
  3. You end up in ipython session just like after kedro ipython, with 1 difference: you have the data variable where a dataset you requested is available.

Alternatives

Instead of fetch, it can be provide.

@yury-fedotov yury-fedotov added the Issue: Feature Request New feature or improvement to existing feature label Jul 5, 2024
@yury-fedotov yury-fedotov changed the title Add a CLI command to initialize kedro ipython with specified dataset_name at data variable Add a CLI command to initialize kedro ipython with specified dataset_name as data variable Jul 5, 2024
@datajoely
Copy link
Contributor

datajoely commented Jul 8, 2024

could you elaborate on what this would help you achieve?

@yury-fedotov
Copy link
Contributor Author

yury-fedotov commented Jul 8, 2024

could you elaborate on what this would help you achieve?

Just save ~ 10 mins per day and improve developer experience slightly.

Why I do this a lot of times per day:

kedro ipython
data = catalog.load("dataset_name")

Is to then do things like:

data.info()
data["Column"].describe()
(data["Column"] > 10).mean()
...
# And other things I need to look at to explore data, depending on what I'm modifying now.

So what my suggestion assumed is like a CLI alias that allows to do one command:

kedro ipython --fetch dataset_name

Instead of two

kedro ipython
data = catalog.load("dataset_name")

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Issue: Feature Request New feature or improvement to existing feature
Projects
None yet
Development

No branches or pull requests

2 participants