Big Data Basics
The video below gives a definition of big data and discusses common issues that arise when dealing with big data.
I highly recommend watching the video using the ‘full’ Panopto player. There is a ‘pop out’ button in the bottom right of the video to enter this viewer.
Notes
Additional Readings for Week 7
The Big Data Paradigm
- Academic overview
- Overview 2 (SUSE)
- Overview 3 (Oracle)
Databases
- Basics of Databases (SQL and NoSQL)
- What is a database? and What is a relational database? (Oracle)
- Python SQL Libraries (Real Python - a nice site but some stuff goes out of date!)
- SQLite in Python (tutorial from pynative.com)
- SQLite schema table (sqlite.org documentation - a lot of other useful stuff there)
- Three plus table joins (learnsql.com)
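If you want to try the ideas from these database readings before diving in, here is a minimal sketch using Python's built-in sqlite3 module. It creates a small in-memory database, lists its tables from the SQLite schema table, and joins three tables. The table names, column names, and rows (students, courses, enrollments) are made up for illustration and do not come from any of the readings.

```python
import sqlite3

# In-memory database, so the sketch leaves no file behind.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical tables, invented for this example only.
cur.executescript("""
CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE courses (id INTEGER PRIMARY KEY, title TEXT);
CREATE TABLE enrollments (
    student_id INTEGER REFERENCES students(id),
    course_id INTEGER REFERENCES courses(id)
);
""")

# Parameterized inserts; the rows are made-up sample data.
cur.executemany("INSERT INTO students VALUES (?, ?)",
                [(1, "Ada"), (2, "Grace")])
cur.executemany("INSERT INTO courses VALUES (?, ?)",
                [(10, "Databases"), (20, "Statistics")])
cur.executemany("INSERT INTO enrollments VALUES (?, ?)",
                [(1, 10), (1, 20), (2, 10)])
conn.commit()

# The schema table (sqlite_master, also reachable as sqlite_schema in
# newer SQLite versions) records every table in the database.
table_names = [row[0] for row in cur.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]

# Three-table join: the enrollments table links students to courses.
rows = cur.execute("""
SELECT s.name, c.title
FROM enrollments AS e
JOIN students AS s ON s.id = e.student_id
JOIN courses AS c ON c.id = e.course_id
ORDER BY s.name, c.title
""").fetchall()

conn.close()

print(table_names)
print(rows)
```

The join follows the usual pattern for a many-to-many relationship: start from the linking table and join outward to each side. Adding a fourth table would just mean one more JOIN clause.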
Data Storage
- The first answer here is useful to read
- Data Warehouses: Article 1 (Oracle), Article 2 (Amazon)
- Databases and Data Warehouses: Article 1, Article 2
- Data Marts (IBM)
- MDM (SAS)
- Databases, Data Warehouses, and Data Lakes: Article 1, Article 2, Article 3
- Data Lakes: Article 1, Article 2
- Lakehouse (Databricks - they actually have a lot of useful training resources as well!)
Big Data Storage
- Data pipelines: Article 1, Article 2