Skip to content

Latest commit

 

History

History
381 lines (306 loc) · 30.5 KB

README.md

File metadata and controls

381 lines (306 loc) · 30.5 KB

Data Science Resources

This list started out as a way for me to keep track of data science resources I've found helpful. However, I frequently get asked for data science resource recommendations by other data scientists and friends looking to break into data science. So I've continued to add to this, with a focus on beginner- and intermediate-level resources. Where possible, I've included links to the (legitimate) free versions of books. One of the great things about the data science community is the willingness to open-source and make things available for free. Within each category or sub-category the resources are listed very loosely in order of usefulness/introductory level to more advanced (but not entirely).

This list is far from complete, but I'll try to continue to add to it. Hopefully you find it helpful.

Non-exhaustive list of additional topics to add:

  • Spark
  • time series forecasting
  • docker



Technical Resources

1. Foundational

1.1 Python

1.2 Statistics

1.3 SQL

1.4 Computer Science, data structures and algorithms

2. General ML

2.1 ML overview

2.2 University ML courses

2.3 Dimensionality reduction

2.4 Clustering

2.5 Curse of dimensionality

2.6 Data issues

3. ML in production

4. MLOps

5. Deep Learning

5.1 General DL

5.2 University DL courses

5.3 DL papers

5.4 TensorFlow

5.5 PyTorch

5.6 Reinforcement Learning

5.7 Graph Neural Networks

6. NLP

6.1 NLP overview

6.2 Embeddings

6.3 Topic modeling

6.4 Transformers

7. Experimentation

7.1 A/B testing

7.2 Bayesian A/B testing

7.3 Multi-Armed Bandits (MAB)

8. Coding best practices

8.1 GitHub

8.2 Structuring projects

8.3 Code refactoring workflow

8.4 Unit testing

8.5 Creating PyPI packages

9. Helpful tools and packages

9.1 AWS

9.2 Flask

9.3 Hyperopt

10. Datasets

10.1 General

10.2 NLP

10.3 Time series

11. Domain applications

11.1 Rewewable Energy

11.2 Healthcare

12. Additional topics

12.1 Ethics

12.2 Bias and explanability

13. Other learning resource lists

14. Industry resources and trends

14.1 Company tech blogs

14.2 Newsletters

14.3 Podcasts

Career resources

Career advice

Defining data science

Becoming a data scientist

Generalist vs specialist

IC vs Management and career progression

Team structure

Data-driven culture

Interviewing

Non-technical resources

Agile & Project management

Product

Business