NAVIGATION
Large-Scale Systems Group
Private
Large-Scale Systems Group (LSSG) @ University of Chicago
Large-Scale Systems Group -> People -> Andrew A. Chien -> Andrew A. Chien Teaching -> Data-Intensive Computing Systems (2013)
Spring 2013
Instructor:   Andrew A. Chien
Meetings: MF 130-250pm, Ry 277

"Big Data" and Data Analytics have become hot topics as well as drivers of multi-billion dollar industries.  We live in an era of unprecedented data collection from sources as diverse as e-commerce, the WWW, scientific instruments, wireless sensors, and a rich electronic, networked infrastructure.  While cheap computing, sensors, storage, and pervasive networking make the collection of these exabytes of data possible, significant challenges exist in the analysis of "big data" to deliver internet-scale services, scientific insights, and of course commercial insights.  

The course objective is to e xpose students to the technical challenges of data-intensive computing systems, including canonical driving problems, research systems, and emerging technologies. While other classes focus on analysis algorithms (or even underlying statistical or machine learning methods), in this class we focus on the computer systems and technology needed to achieve scalable and efficient data-intensive computing systems.  Through intensive research paper reading, interactive discussions, presentations, and in-depth course projects, students will develop 
a broad familiarity with current challenges, the state of the art, including leading edge research in the area, and hands-on experience with a range of systems which together provide a solid preparation for research in the area.  Course topics include: parallel filesystems, SQL databases, NoSQL/Mapreduce systems, storage class memories (from Flash to Memristor to ReRAM), and popular open source infrastructures such as Hadoop, VoltDB/HadoopDB, Cassandra, Memcached, MongoDB, and others.   Several unique project opportunities this quarter include experimentation with Presto/Blockus (a parallel R system we're developing in CS), Graphlab/GraphChi, and Cleversafe's "Limitless Storage".

Course Activites will include:
- paper reading, presentation and discussion 
- hands-on labs/projects with leading edge data-intensive computing systems
- invited speakers from leading companies and projects

Syllabus   May 20 Version (subject to change)
Lecture Slides ( 4/1 , 4/5 , 4/8 , 4/12 , 4/15, 4/19, 4/22, 4/28 ,...)
Assignments  
Overall Project Assignment ( here )... includes
- What is a DI Project (Assigned 4/5, due 4/12)
- Assessing a DI Computing infrastructure (Assigned 4/5, due 4/15)
- Sketch Project Plan (due May 3rd - or over the weekend)
- Full Project Plan (due May 10th - or over the weekend)
- Project status report (due May 31st)
- Final Project Presentation and Demo and Report (due ~June 10th)

If you have questions, please come to the first class meeting, Monday April 1, 130pm, Ryerson 277.



Subpages


Comments


author: Andrew A. Chien, achien7242@gmail.com
updated: June 16, 2015, 05:39 AM
revision: 23

Attachments

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICSys-Syllabus4-14-2013.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

CS33001S13Lec3.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICSys-Syllabus4-28-2013.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICSys-Syllabus5-13-2013.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

CS33001S13Lec8.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

CS33001S13Lec4.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICSys-Syllabus5-7-2013.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICSys-Syllabus5-9-2013.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICsys2013-Project-v1.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

DICSys-Syllabus5-20-2013.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

CS33001S13Lec1.pdf

Andrew A. Chien, achien7242@gmail.com, June 16, 2015, 05:39 AM, revision: 1

CS33001S13Lec2.pdf