Mining of massive datasets /

This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain...

Full description

Saved in:
Bibliographic Details
Main Authors: Leskovec, Jurij (Author)
Group Author: Rajaraman, Anand; Ullman, Jeffrey D., 1942-
Published: Cambridge University Press,
Publisher Address: Cambridge :
Publication Dates: 2014.
Literature type: Book
Language: English
Edition: Second edition.
Subjects:
Summary: This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction. It includes a range of over 150 challenging exercises. --
Item Description: Previous edition: 2012.
Carrier Form: xi, 467 pages : illustrations ; 26 cm
Bibliography: Includes bibliographical references and index.
ISBN: 9781107077232 :
9781316147047
1316147045
Index Number: QA76
CLC: TP311.131
Call Number: TP311.131/R161/2nd ed.