This repository contains the entire Python Data Science Handbook, in the form of (free!) Jupyter notebooks.
You can read the book in its entirety online at https://jakevdp.github.io/PythonDataScienceHandbook/
The book was written and tested with Python 3.5, though older Python versions (including Python 2.7) should work in nearly all cases.
The book introduces the core libraries essential for working with data in Python: particularly IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and related packages. Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project, A Whirlwind Tour of Python: it's a fast-paced introduction to the Python language aimed at researchers and scientists.
See Index.ipynb for an index of the notebooks available to accompany the text.
The code in the book was tested with Python 3.5, though most (but not all) will also work correctly with Python 2.7 and other older Python versions.
The packages I used to run the code in the book are listed in requirements.txt (Note that some of these exact version numbers may not be available on your platform: you may have to tweak them for your own use). To install the requirements using conda, run the following at the command-line:
To create a stand-alone environment named with Python 3.5 and all the required package versions, run the following:
You can read more about using conda environments in the Managing Environments section of the conda documentation.
Jake VanderPlas, Summer 2016
A Whirlwind Tour of Python is a fast-paced introduction to essential components of the Python language for researchers and developers who are already familiar with programming in another language.
The material is particularly aimed at those who wish to use Python for data science and/or scientific programming, and in this capacity serves as an introduction to The Python Data Science Handbook (also with notebooks on github). These materials are adapted from courses and workshops I've given on these topics at University of Washington and at various conferences, meetings, and workshops around the world.
This material was written and tested using Python 3.5, and should work for any Python 3.X version. I have done my best to note places where the syntax of Python 2.X will differ.
(Note: sometimes GitHub's notebook rendering can be slow or finicky. If you're having trouble with the following links, try viewing the material on nbviewer)
If you do use this material, I would appreciate attribution. An attribution usually includes the title, author, publisher, and ISBN. For example:
A Whirlwind Tour of Python by Jake VanderPlas (O’Reilly). Copyright 2016 O’Reilly Media, Inc., 978-1-491-96465-1.