Syllabus

Course Meeting Times

Lectures: 2 sessions / week, 1.5 hours / session

One optional 2-hour open-problem session / week

Course Description

Data structures play a central role in modern computer science. You interact with data structures even more often than with algorithms (think Google, your mail server, and even your network routers). In addition, data structures are essential building blocks in obtaining efficient algorithms. This course covers major results and current directions of research in data structures:

Time Travel: We can remember the past efficiently (a technique called persistence), but in general it's difficult to change the past and see the outcomes on the present (retroactivity). So alas, Back To The Future isn't really possible.
Geometry: When data has more than one dimension (e.g. maps, database tables).
Dynamic Optimality: Is there one binary search tree that's as good as all others? We still don't know, but we're close.
Memory Hierarchy: Real computers have multiple levels of caches. We can optimize the number of cache misses, often without even knowing the size of the cache.
Hashing: Hashing is the most used data structure in computer science. And it's still an active area of research.
Integers: Logarithmic time is too easy. By careful analysis of the information you're dealing with, you can often reduce the operation times substantially, sometimes even to constant. We will also cover lower bounds that illustrate when this is not possible.
Dynamic Graphs: A network link went down, or you just added or deleted a friend in a social network. We can still maintain essential information about the connectivity as it changes.
Strings: Searching for phrases in giant text (think Google or DNA).
Succinct: Most "linear size" data structures you know are much larger than they need to be, often by an order of magnitude. Some data structures require almost no space beyond the raw data but are still fast (think heaps, but much cooler).

Prerequisites

The prerequisite for this course is 6.046, Design and Analysis of Algorithms, or an equivalently thorough undergraduate algorithms class from another school (e.g., covering much of CLRS). I recommend that you take 6.854, Advanced Algorithms, the broad entry-level graduate course in Theory / Algorithms; it normally makes sense to start there before jumping into deeper graduate courses. If you haven't taken 6.854, you must have a strong understanding of algorithms at the undergraduate level, shown for example by receiving an A in 6.046, doing relevant research, or taking part in computer competitions.

Grading

There are three requirements, other than attending lectures:

  • Scribing one, maybe two, lectures. Note in particular that scribe notes are due on the day of the lecture. The entire calendar for the course has been posted, so you can select a lecture that interests you. We will circulate a sign-up sheet during the second week. Listeners may also be required to scribe one lecture.
  • Lightweight homework assignments. Problems will be posted weekly, and will not be distributed otherwise.
  • Research-oriented final project (paper and presentation). We allow theoretical, experimental, and survey final projects.

LaTeX Help

Homework solutions, scribe notes, and final projects must be typeset in LaTeX. If you are not familiar with LaTeX, there is no need to worry. Start with this good introduction: "The Not So Short Introduction to LaTeX 2ε" (PDF - 2.04MB). You need to know very little to start writing problem sets in LaTeX: just skim through the mathematics section in the introduction, and download this template.
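
To give a sense of how little LaTeX a problem set needs, here is a minimal sketch of a solution file. It is not the course template; the title, author, problem text, and choice of packages are placeholders assumed for illustration.

```latex
% Minimal problem-set skeleton (illustrative only, not the course template).
\documentclass[11pt]{article}
\usepackage{amsmath, amssymb} % standard math environments and symbols

\title{Problem Set 1}   % placeholder title
\author{Your Name}      % placeholder author
\date{}

\begin{document}
\maketitle

\section*{Problem 1}
% Inline math goes between dollar signs.
A balanced binary search tree on $n$ keys supports search in $O(\log n)$ time.

% Displayed math goes between \[ and \].
\[
  T(n) = 2\,T(n/2) + O(n) \quad\Longrightarrow\quad T(n) = O(n \log n).
\]

\end{document}
```

Compiling this file with pdflatex should produce a one-page PDF. Most write-ups only need more inline and displayed math of the kind shown here.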