ELL886

From IITD Wiki
Big Data Systems
Credits: 3
Structure: 3-0-0
Pre-requisites
Overlaps

ELL886 : Big Data Systems

Introduction; Hadoop, MapReduce, GFS/HDFS, Bigtable/HBase; extensions of MapReduce: iMapReduce (iterative), incremental MapReduce. SQL and data-parallel programming, DryadLINQ. Data-flow parallelism vs. message passing. Data locality. Memory hierarchies. Sequential versus random access to secondary storage. NoSQL systems. NewSQL systems. Finding similar items and locality-sensitive hashing (LSH). Search technology: link analysis and the PageRank algorithm. Large-scale graph processing. Mining streaming data and real-time analytics: window semantics and window joins. Sampling and approximating aggregates (no joins). Querying histograms. Maintaining histograms of streams. Use of Haar wavelets. Incremental and online query processing: online aggregation.