If you want to know more about Data Science but don’t know where to start, this list is for you! :chart_with_upwards_trend:
No previous knowledge is required, but basic Python and statistics will definitely come in handy. These resources have been used successfully by many beginners at my local Data Science student group ML-KA.
This list covers only Python, as many are already familiar with this language. For R, see Data Science tutorials using R.
numpy is a Python library which provides large multidimensional arrays and fast mathematical operations on them.
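
To give a feel for what this means in practice, here is a minimal sketch of a multidimensional array and a few operations on it. The values are made up for illustration.

```python
import numpy as np

# A 2-D array (matrix) with 2 rows and 3 columns.
a = np.array([[1, 2, 3],
              [4, 5, 6]])

# Arithmetic applies element-wise to the whole array,
# without writing an explicit Python loop.
doubled = a * 2

# Aggregations can run along an axis: axis=0 sums each column.
column_sums = a.sum(axis=0)

print(a.shape)       # (2, 3)
print(doubled)
print(column_sums)   # [5 7 9]
```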
pandas provides efficient data structures and analysis tools for Python. It is built on top of numpy.
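
As a rough illustration, the sketch below builds a small `DataFrame` and computes a per-group mean. The column names and values are invented for this example and do not come from any of the linked resources.

```python
import pandas as pd

# Made-up example data, purely for illustration.
df = pd.DataFrame({
    "city": ["Karlsruhe", "Berlin", "Karlsruhe", "Berlin"],
    "temperature": [21.0, 19.5, 23.0, 18.5],
})

# Group rows by city and compute the mean temperature per group.
mean_by_city = df.groupby("city")["temperature"].mean()

# A numeric column can be handed to numpy as a plain array.
values = df["temperature"].to_numpy()

print(mean_by_city)
print(type(values))
```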
scikit-learn is the most common library for Machine Learning and Data Science in Python.
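
Most scikit-learn models follow the same pattern: create an estimator, call `fit` on training data, then call `predict`. The sketch below shows that pattern with a linear regression on toy data that follows y = 2x + 1. The data is an assumption chosen only to keep the example small.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Toy data following y = 2x + 1 exactly (made up for illustration).
X = np.array([[0.0], [1.0], [2.0], [3.0]])  # shape (n_samples, n_features)
y = np.array([1.0, 3.0, 5.0, 7.0])

# Create the estimator and fit it to the training data.
model = LinearRegression()
model.fit(X, y)

# Predict the target for a new, unseen input.
prediction = model.predict(np.array([[4.0]]))

print(model.coef_, model.intercept_)
print(prediction)
```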
Jupyter Notebook is a web application for easy data visualisation and code presentation.
Sorted by increasing complexity.
Contributions welcome! Read the contribution guidelines first.
To the extent possible under law, Simon Böhm has waived all copyright and related or neighboring rights to this work. Disclaimer: Some of the links are affiliate links.