Home|Publications|Projects|Research Interests|Teaching|Students|Experience|Announcements|Contact


BIG DATA ANALYTICS
Description
Big data, data mining, MapReduce, finding similar items, distance measures, mining data streams, link analysis, clustering, frequent itemsets, association rules, information retrieval, advertising on the web.
Grading
Midterm - 25%
Homeworks - 15%
Final project - 30% 
Final exam - 30%
Textbooks
(1) Mining of Massive Datasets, Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, Stanford University, 2011.
Supplementary books
(1) Real-Time Big Data Analytics: Emerging Architecture, Mike Barlow, O’Reilly Media, 2013.
(2) Big Data, Data Mining, and Machine Learning: Value Creation for Business Leaders and Practitioners, Jared Dean, Wiley, 2014.
(3) Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data, EMC Education Services, 2015.
Outline
(1) Big Data mining
(2) MapReduce
(3) Finding similar items
(4) Distance measures
(5) Mining data streams
(6) Link analysis
(7) Clustering
(8) Frequent itemsets
(9) Association rules
(10) Information retrieval
(11) Advertising on the web
General information about homeworks
- The papers reviewed will be published in the last 3 years. 
- The papers will be published in a journal in Q1, Q2 or Q3 quarter.
- The assignment document will be a single file in pdf format and will contain the following documents:
      (1) Document showing the quarter in which the article was published
      (2) Full text of the reviewed article
      (3) Prepared report
- The file name will be in the format DersKodu_StudentNo_HomeworkNo.pdf.