Course Schedule

Part 1: Resources

Week 1

Mon, Sep 1
Labor Day!
Wed, Sep 3
Course Intro
Read: Syllabus
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Fri, Sep 5
Deployment (Linux Shell)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 2

Wed, Sep 10
Deployment (Docker)
Release: P1 (Docker)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Online Quiz: week 1
Fri, Sep 12
Network Resources (Overview)
Read: Designing Data Intensive Applications, Kleppmann (Chapter 4, "Encoding and Evolution")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 3

Mon, Sep 15
Network Resources (gRPC)
Read: gRPC Basics Tutorial
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Sep 17
gRPC demo
Read: gRPC Basics Tutorial
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Online Quiz: week 2 and before (cumulative)
Fri, Sep 19
Network Resources (Compose)
Watch: Lecture
Slides: PDF

Week 4

Mon, Sep 22
Memory Resources (Caching)
Read: Systems Performance, Gregg (6.2.2; "CPU Caches" and "Latency" subsections of 6.4.1)
Due: P1
Release: P2 (Network+Memory)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Sep 24
In-Person Quiz (in class)
Catchup+Quiz
Watch: Lecture
Fri, Sep 26
Memory Resources (Caching Practice)
Watch: Lecture

Week 5

Mon, Sep 29
Memory Resources (PyArrow)
Read: Gallery of Processor Cache Effects (Examples 1 and 2)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Fri, Oct 3
Compute Resources (Locks)
Read: Mastering Concurrency in Python ("Working With Threads In Python" chapter)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 6

Mon, Oct 6
Storage Resources (File Systems)
Due: P2
Release: P3 (Compute+Storage)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Oct 8
Storage Resources (Formats and DBs)
Read: Designing Data Intensive Applications, Kleppmann ("Transaction Processing or Analytics?" and "Column-Oriented Storage" sections of Chapter 3, "Storage and Retrieval")
Evening: Exam 1
  • Regular exam: 5:45 to 6:45 pm; Location: TBD
  • McBurney exam: 5:45 to 7:45 pm; Location: TBD

Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Part 2: Clusters

Week 7

Mon, Oct 13
SQL Practice
Watch: Lecture

Week 8

Mon, Oct 20
MapReduce (contd.)
Watch: Lecture
Anki Flashcards: Deck

Week 9

Wed, Oct 29
In-Person Quiz (in class)
Catchup+Quiz
Watch: Lecture
Fri, Oct 31
Spark Internals and Performance
Read: Learning Spark, 2nd edition by Damji et al. (Chapter 10, "Machine Learning with MLlib")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 10

Mon, Nov 3
Spark Internals and Performance (contd.)
Watch: Lecture
Wed, Nov 5
Spark Machine Learning API
Read: Cassandra, The Definitive Guide, by Carpenter et al. (Chapter 4, "The Cassandra Query Language")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Online Quiz: week 9 and before (cumulative)
Fri, Nov 7
Spark Machine Learning: PLANET
Watch: Lecture

Week 11

Wed, Nov 12
Cassandra Query Language (CQL)
Due: P5
Release: P6 (Cassandra, Weather)
Watch: Lecture

Week 12

Mon, Nov 17
Cassandra Replication
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Nov 19
Exam 2 review
Review/Catchup
Evening: Exam 2
  • Regular exam: 5:45 to 6:45 pm; Location: TBD
  • McBurney exam: 5:45 to 7:45 pm; Location: TBD

Watch: Lecture
Anki Flashcards: Deck

Week 13

Mon, Nov 24
Streaming: Kafka Demos
Watch: Lecture
Wed, Nov 26
Streaming: Kafka demo (contd.)
Read: Kafka, The Definitive Guide, 2nd edition by Shapira et al. (Chapter 7, "Reliable Data Delivery")
Watch: Lecture
Anki Flashcards: Deck
Fri, Nov 28
Thanksgiving Break

Part 3: Cloud

Week 14

Mon, Dec 1
Kafka Reliability
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Dec 3
The Cloud + BigQuery 1: Basics
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Online Quiz: week 13 and before (cumulative)

Week 15

Wed, Dec 10
BigQuery 2: Data Sources + Geo Data
Watch: Lecture
Fri, Dec 12
No Class
Due: P8