🚀 Work in progress 👷
The course will be hands-on. We have access to a computer room, but if possible, I suggest you bring your own laptop. That way you can be sure that R and RStudio are installed on it, and after the workshop you will be ready to start your own data explorations.
💾 You can install R and RStudio on your laptop.
Afterwards, you can install the Tidyverse 🌌, which collects most of the packages that we will use for our explorations. To install it, open RStudio and type in your R console:
install.packages("tidyverse")
If you get any ❌ error message, we will fix it together 🎇.
Alternatively, RStudio ☁️ Cloud lets you run RStudio in the cloud.
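Whichever setup you choose, a quick check like the sketch below can confirm that the installation worked. It relies only on base R's `requireNamespace()`; the variable name `tidyverse_ready` is just an illustrative choice.

```r
# Check whether the tidyverse package is installed, without loading it.
tidyverse_ready <- requireNamespace("tidyverse", quietly = TRUE)

if (tidyverse_ready) {
  message("The tidyverse is installed. You are ready for the workshop!")
} else {
  message("The tidyverse is missing. Try install.packages(\"tidyverse\") again.")
}
```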
-
My contact details and not much else…
-
🔗 Meet R
What is an object in R? What is a variable? Why do we need functions?
-
🔗 Load and Manipulate Data - Tidyverse, part 1
A quick introduction to the tidyverse, including how to manipulate data with dplyr and how to pipe many steps of your analysis.
-
🔗 Visualize Data - Tidyverse, part 2
Build a graphical representation of your data with ggplot2.
-
🔗 Clean Data - Tidyverse, part 3
Most of the time you’ll need to clean and reshape your data with tidyr and janitor.
-
🔗 More practice - Tidyverse, part 4
Practice more Exploratory Data Analysis with Open Data from the City of Milan.
-
Pick a dataset and explore it!
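To give a flavour of what such an exploration might look like, here is a minimal sketch that uses the `mtcars` dataset bundled with R, so nothing needs to be downloaded. The dataset choice and the name `mpg_by_gear` are assumptions made for illustration, not part of the course material; it assumes dplyr is installed (it comes with the tidyverse).

```r
library(dplyr)

# mtcars ships with R, so no download is needed.
mpg_by_gear <- mtcars %>%
  filter(cyl == 4) %>%        # keep only 4-cylinder cars
  group_by(gear) %>%          # one group per number of gears
  summarise(
    n_cars   = n(),           # how many cars in each group
    mean_mpg = mean(mpg)      # average miles per gallon
  ) %>%
  arrange(desc(mean_mpg))     # most fuel-efficient group first

print(mpg_by_gear)
```

Each step receives the result of the previous one through the pipe, so the analysis reads top to bottom like a recipe.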
The R community is active online, and committed to creating a friendly and welcoming environment for everybody.
This includes writing outstanding 📖 open access material that you can use to learn R 🐳.
- 🔗 R programming for Data Science - Roger D. Peng - 🐯 Jump start your R!
- 🔗 Advanced R - Hadley Wickham - 🐘 Everything you wish to know about R.
🎷 Remember to read the articles on the packages’ websites! 🎷
- 🔗 R for Data Science - Grolemund, Wickham - 🐦 An overview of most data science topics, with great tips.
- 🔗 Introduction to Statistical Learning in R - Gareth James et al. - 🐶 Kick start your statistical models.
Check the 📚 bookdown repository for more books on data science, including 🌍 geocomputation, 🎩 forecasting and ⛏️ text mining!
- 🔗 Data Visualization - Kieran Healy - 🐠 Communication-oriented data visualization in R.
- 🔗 R Graphics Cookbook - Winston Chang - 🐙 Practical introduction to visualization with ggplot2.
Also check the Viz chapters in “R for Data Science” (see above) ☝️.
- 🔗 HarvardX Biomedical Data Science Open Online Training - Love, Irizarry - 🐌 Full course on R for life science.
- 🔗 It goes together with this book.
Did I mention that the R community is great? Online you can find wonderful learning material.
by [@EvaMaeRey](https://twitter.com/EvaMaeRey)
…and Others
- 🔗 Fronkonstin - Experiments in R, by [@aschinchon](https://twitter.com/aschinchon).
- 🔗 Data Imaginist, by [@thomasp85](https://twitter.com/thomasp85).
- 🔗 Chi’s Impe[r]fect Blog, by [@chisatini](https://twitter.com/chisatini)
Also check out the work of Cédric Scherer, Sil Aarts, Jake Kaupp and many other TidyTuesdayers with Neal Grantham’s app.
This list is far from complete; suggestions are welcome! 🙌
- 🔗 Tidy Tuesday - 🍥 Best community, weekly social data exercises in R (also check the R4DS learning community).
- 🔗 Kaggle - 🍧 Advanced Data Science and Machine Learning community.
- 🔗 Data is Beautiful - Reddit 🍢 - Monthly data visualization competitions.
I would like to thank the University of Milano and the PhD School in Molecular and Cell Biology for financing and hosting this workshop. Thanks to Accurat for the great support.
🎓 Best!