BU CS599

BU CS599 Advanced Topics in CS
Graph Analytics – Fall 2021

Graphs are mathematical objects that model pairwise relations between entities/objects. They model a wide variety of important datasets including the World Wide Web, online social networks and other social media, transportation networks, scientific collaborations, and even the human brain. This class aims to introduce students to cutting-edge research on applied graph algorithms, and machine learning with graphs. Topics of interest include motifs as building blocks of networks, motif detection and counting algorithms, worst case optimal joins, graph generative models starting from random binomial graphs to deep learning based models, graph clustering and dense cluster detection algorithms, node embeddings for downstream machine learning tasks on graphs, and graph neural networks.

Syllabus

A tentative syllabus is available here.

Where and When

  • Time: Weekly on Tue and Thu, 11am-12.15pm
  • Location:  MCS B33
  • Office hours: Weekly on Friday 5-6pm, MCS 102

Blackboard

We will use blackboard for online discussions/questions etc.

Prerequisites

One should have had the Algorithm Design course (CS330). Familiarity with coding is necessary. Some more advanced courses in machine learning, statistics, and computer science will be helpful but are not required.

Grading

The final grade will be computed as a weighted sum of two grades:

  1. Project (60%)
  2. Assignment (40%)
  • Teams up to two (2) people are allowed for the class project. The project is an opportunity for you to explore an interesting graph mining problem. A wide variety of projects is allowed ranging from systems-oriented projects on managing efficiently big graphs, to graph algorithm design and deep learning on graphs. Your performance will be evaluated on the technical depth, the scope of the project, the clarity of your technical report, and your final presentation.
  • The assignment is individual. It is a challenging assignment, so make sure you start working early You may discuss the problems with other students, but you must write your solutions independently, based on your own understanding. Academic integrity dictates citing any sources that you use on the problem set. Finally, using Github is mandatory for submitting your code.

Schedule

– 9/2: Lecture 1, Introduction (Slides)

– 9/7: Lecture 2, Motifs (Slides)

– 9/9: Lecture 3, Color coding (Slides)

– 9/14: Lecture 4, Triangle counting (Slides)

– 9/16: Lecture 5, Triangle counting (cont.), Loomis-Whitney inequality (Slides)

– 9/21: Lecture 6, WCOJ (Slides)

– 9/23: Lecture 7, WCOJ (cont.) (Slides)

– 9/28: Lecture 8, Empirical properties of real world networks (Slides)

– 9/28: Lecture 9, Random graph models (Slides)

– 10/5: Lecture 10, Erdős–Rényi model (Notes)

– 10/7: Lecture 11, Erdős–Rényi model (cont.) (Notes)

– 10/14: Lecture 12 (Guest Lecture by Kostas Sotiropoulos)
Link prediction (Slides Part 1,
Part 2 Signed Networks (Leskovec slides),
Part 3 Multi-layered Networks(TwitterMancer))


%d bloggers like this: