CS 672: System Reliability at Scale

Spring 2024

Course Plan (subject to revision)

Week  Date  Assignments  Topic  Reading(s)

1 Jan 16 Survey Assigned
(will be on Canvas)
Preview of the syllabus,
course objectives and activities.
  • Recommended:
    • None
Jan 18 Survey Due
(not graded)
Background on large-scale
computing systems.
  • Required:
    • None
2 Jan 23 Quality of service and
availability.
Jan 25 Assignment 1
Released on Canvas
Failures at scale.
3 Jan 30 Hardware reliability testing.
Feb 1 Production testing.
4 Feb 6 Silent data corruption.
Feb 8 Assignment 1 Due
(14:00 on Canvas)
Assignment 1 Discussion
  • Required:
    • None - no review due today.
5 Feb 13 Assignment 2
Released on Feb 14
Project Discussion
Feb 15 Logical Time in
Distributed Systems
6 Feb 20 Fault Tolerance and
Commit Consensus
Feb 22 Project:
Idea Bids Due
No Class Today
Project & Assignment Time
7 Feb 27 Assignment 2 Due
(Feb 28, 11:59 pm)
Data Consensus; Raft
Reading(s): None
Feb 29 Byzantine Fault Tolerance
8 Mar 5 Fault Injection
Mar 7 Project:
Proposal Report Due
Group Membership
Reading(s): None
- Mar 12 Spring Break
Mar 14
9 Mar 19 Resource Disaggregation
Mar 21 Resource Disaggregation (cont.)
10 Mar 26 Memory Disaggregation:
Technologies
  • Required:
    • None (readings from last week)
  • Recommended:
    • None
Mar 28 Project:
Checkpoint 1 Report Due
Memory Disaggregation:
Coherence and Consistency
  • Required:
    • None (readings from last week)
11 Apr 2 Disaggregation:
Hardware Accelerators
Apr 4 Disaggregation:
CXL and Programmability
12 Apr 9 Abstractions
and Transparency
  • Required:
    • None
  • Recommended:
    • None
Apr 11 Project:
Checkpoint 2 Report Due
Near Data Processing
  • Required:
    • None
  • Recommended:
13 Apr 16 Course Recap
and Reflection
  • Required:
    • None
  • Recommended:
    • None
Apr 18 No Class - Project Time
  • Required:
    • None
  • Recommended:
    • None
14 Apr 23 Project:
Final Presentations
See Canvas for each group's
specific date assignment
Apr 25
- May 7 @ 11:59 pm
Project:
Final Report Due