Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Self-learning GCP, Datalab, Python #201

Open
ccasimiro9444 opened this issue May 24, 2017 · 13 comments
Open

Self-learning GCP, Datalab, Python #201

ccasimiro9444 opened this issue May 24, 2017 · 13 comments

Comments

@ccasimiro9444
Copy link

No description provided.

@ccasimiro9444
Copy link
Author

Using NYC yellow taxi dataset

@danfowler
Copy link

👀

@danfowler
Copy link

GCP == Google Cloud Platform ? Cool : What is GCP?

@ccasimiro9444
Copy link
Author

Yes, Google Cloud Platform. There are some large public datasets on Google's Bigquery, that could be imported into Google Cloud Datalab (similar to Jupyter Notebooks) and then I want to use Python to visually render the data.
I am from a stats background, so Python and GCP are kinda new to me. But saw this in action and hence will try to replicate some of it. But I am already stuck importing the Bigquery data into Datalab :) Hope you guys can help me figuring it out.

@ccasimiro9444
Copy link
Author

@ccasimiro9444
Copy link
Author

SELECT
pickup_datetime,
dropoff_datetime
FROM
bigquery-public-data.new_york.tlc_yellow_trips_20*

@kev-ho
Copy link

kev-ho commented May 25, 2017

Go here
https://cloud.google.com/bigquery/public-data/nyc-tlc-trips

and click on the "goto the new york city dataset" button

@danfowler
Copy link

"""
#standardSQL
SELECT
pickup_datetime,
dropoff_datetime
FROM
`bigquery-public-data.new_york.tlc_yellow_trips_20*`
""")

@danfowler
Copy link

@ccasimiro9444
Copy link
Author

Got it to run on datalab, just used Dan's code.
screen shot 2017-05-26 at 00 15 49

@danfowler
Copy link

Excellent! 💯

By the way, I finally got datalab working on my Google account 😉

Maybe, if you haven't gotten around to it by then, we work on the visualization piece at the next bkkhack.

@kev-ho
Copy link

kev-ho commented May 26, 2017

Nice!

Unfortunately I've had 0 success so far. So I'm going to just have to leave it here for now. :(

Next time I'll just clear everything and start from scratch and see if that helps.

@ccasimiro9444
Copy link
Author

Sounds good. Gotta get the whole dataset into a dataframe first, the direct import from Bigquery to dataframe takes too long.
Maybe loading the table and then transforming will be faster.
Let's try that and the visualization out at the next hack.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants