《MapReduce进行密集文本数据处理》英文版
基本信息
打开支付宝首页搜“673273051”领红包,领到大红包的小伙伴赶紧使用哦!
相关书籍
- 《美河制作.软件观念革命.交互设计精髓》[]
- 《iPhone和iPod Touch iOS 5的基础操作视频教程》英文版[]
- 《iPod终极宝典》(The Ultimate iPod Guide)(Nik Rawlinson)2010 Edition[]
- 《数字人像创作艺术》电子书[]
- 《数据科学实战》电子书[]
- 《莫斯科会战》电子书[]
- 《银座红牌驭男术-男人都爱小魔女》电子书[]
- 《万寿寺》电子书[]
内容介绍
目录:
1.Introduction
2.MapReduce Basics
3.MapReduce algorithm design
4.Inverted Indexing for Text Retrieval
5.Graph Algorithms
6.EM Algorithms for Text Processing
7.Closing Remarks
内容介绍:
Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well.
