Data Science with Python
Last updated on Oct 1, 2021
⚠️ WORK IN PROGRESS! ⚠️
Welcome to my Data Science with Python course!
You can find all the Jupyter notebook on my Github page here.
Please, if you find any typos or mistakes, open a new issue. Or even better, fork the repo and submit a pull request. I am happy to share my work and I am even happier if it can be useful.
Content
- Data Structures
- Lists
- Tuples
- Sets
- Dictionaries
- Numpy arrays
- Pandas DataFrames
- Pyspark DataFrames
- Data Exploration
- Import, export data
- Descriprives and summary statistics
- Pivot tables and aggregation
- Data Types
- Numerical data
- String data
- Time data
- Missing data
- Data Wrangling
- Rows: sorting, indexing, ….
- Columns: renaming, ordering, ….
- Collapse and aggregate
- Reshape
- Concatenate and merge
- Plotting
- Distributions
- Time Series
- Correlations
- Regression
- Geographical data
- Machine Learning Pipeline
- Data exploration
- Encoding and normalization
- Missing values
- Weighting
- Prediction
- Cross-validation
- Web Scraping
- Pandas
- APIs
- Static Webscraping
- Dynamic Webscraping
- TBD
- What is missing? Let me know!
Contacts
All feedback is greatly appreciated!