
Open Access Books
1. Foundational Programming Skills (Python and R)
Begin with these books to build programming fundamentals, essential for data engineering, data science, and analytics.
- Python for Everybody - https://books.trinket.io/pfe/index.html
- Automate the Boring Stuff - https://automatetheboringstuff.com/
- Python datasciencebook - https://python.datasciencebook.ca/
- R for Data Science - https://r4ds.had.co.nz/
- Data Science - R Basics - https://rafalab.dfci.harvard.edu/dsbook/
2. Practical Programming and Data Manipulation
Expand your programming skills by exploring libraries and frameworks for data manipulation.
- Modern Polars - https://kevinheavey.github.io/modern-polars/
- Python for Geocomputation - https://py.geocompx.org/
- Hands-On Programming with R - https://rstudio-education.github.io/hopr/
- R Packages - https://r-pkgs.org/
3. Data Engineering Foundations
Develop a solid understanding of data engineering principles, design patterns, and tools.
- Fundamentals of Data Engineering - https://redpanda.com/guides/fundamentals-of-data-engineering
- Modern Data Engineering Playbook - https://www.thoughtworks.com/en-cl/what-we-do/data-and-ai/modern-data-engineering-playbook
- Data Engineering Design Patterns (DEDP) - https://www.dedp.online/about-this-book.html
- The Data Engineering Cookbook - https://github.com/project303/The-Data-Engineering-Cookbook-
- Data Engineering Vault: A Second Brain Knowledge Network - https://www.ssp.sh/brain/data-engineering/
4. Database and Data Management
Learn about databases, data models, and data management practices.
- Database Technology Overview - https://halvorsen.blog/documents/technology/database/database.php
- Erwin in Database Design - https://halvorsen.blog/documents/technology/database/erwin.php
- Data Management in R - https://dmbook.org/
5. Data Visualization and Communication
Focus on effective data visualization and storytelling with data.
- Data Visualization - A Practical Introduction - https://socviz.co/index.html?#preface
- Interactive Data Visualization - https://jjallaire.github.io/visualization-curriculum/
- Telling Stories with Data - https://tellingstorieswithdata.com/
6. Statistical Thinking and Data Science Techniques
Build statistical knowledge and explore techniques used in data science.
- Exploratory Data Analysis with R - https://bookdown.org/rdpeng/exdata/
- The Art of Data Science - https://bookdown.org/rdpeng/artofdatascience/
7. Advanced Topics in Machine Learning and Forecasting
Deepen your knowledge by studying advanced machine learning, forecasting, and time series analysis.
- Elements of Statistical Learning - https://hastie.su.domains/ElemStatLearn/
- Pattern Recognition and Machine Learning - https://www.microsoft.com/en-us/research/publication/pattern-recognition-machine-learning/
- Mathematics for Machine Learning - https://mml-book.github.io/
- Machine Learning - a Probabilistic Perspective - https://probml.github.io/pml-book/book0.html
- Time Series Analysis with R - https://vlyubchich.github.io/tsar/
- Forecasting: Principles and Practice with R - https://otexts.com/fpp2/
8. Specialized Areas and Use Cases
Explore books focused on specific domains and practical applications of data analysis.
- Spatial Data Science - https://www.paulamoraga.com/book-spatial/index.html
- Cookbook for R Polars - https://ddotta.github.io/cookbook-rpolars/
- NFL Analytics with R - https://bradcongelio.com/nfl-analytics-with-r-book/
- mlr3book - https://mlr3book.mlr-org.com/
- Pandas for Everyone - https://wesmckinney.com/book/
9. Web Scraping and Data Collection
Learn how to automate data collection through web scraping techniques.
- Web Scraping with R - https://steviep42.github.io/webscraping/book/