[Simplified Chinese edition] Hadoop MapReduce v2 Cookbook, 2nd Edition (reprint edition)

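Store catalog no.: 2706994
Category: Simplified Chinese books → Mainland China books → Computers/Networking: Programming
Author: Thilina Gunarathne
ISBN: 9787564160890
Publisher: Southeast University Press (东南大学出版社)
Publication date: 2016-01-01
Pages: 304
Format: 16开 (16mo)
Price: NT$ 480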


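Suggested to buy together:
Hadoop: The Definitive Guide, 4th Edition (reprint edition): NT$ 822
MapReduce Design Patterns: NT$ 441
Programming Hive (reprint edition): NT$ 502
HBase: The Definitive Guide ("12th Five-Year Plan" national key book publishing project): NT$ 828
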
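Description:
Hadoop MapReduce v2 Cookbook, 2nd Edition (English-language reprint edition) opens with the installation of Hadoop YARN, MapReduce, HDFS, and other Hadoop ecosystem components. Guided by the book, you will quickly learn about topics such as MapReduce patterns and using Hadoop for analytics, classification, online sales, recommendations, and data indexing and searching. You will also learn how to use Hadoop ecosystem projects including Hive, HBase, Pig, Mahout, Nutch, and Giraph, and how to deploy them in cloud environments.

Table of Contents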
Preface
Chapter 1: Getting Started with Hadoop v2
Introduction
Setting up Hadoop v2 on your local machine
Writing a WordCount MapReduce application, bundling it and running it using the Hadoop local mode
Adding a combiner step to the WordCount MapReduce program
Setting up HDFS
Setting up Hadoop YARN in a distributed cluster environment using Hadoop v2
Setting up Hadoop ecosystem in a distributed cluster environment using a Hadoop distribution
HDFS command-line file operations
Running the WordCount program in a distributed cluster environment
Benchmarking HDFS using DFSIO
Benchmarking Hadoop MapReduce using TeraSort
Chapter 2: Cloud Deployments - Using Hadoop YARN on Cloud Environments
Introduction
Running Hadoop MapReduce v2 computations using Amazon Elastic MapReduce
Saving money using Amazon EC2 Spot Instances to execute EMR job flows
Executing a Pig script using EMR
Executing a Hive script using EMR
Creating an Amazon EMR job flow using the AWS Command Line Interface
Deploying an Apache HBase cluster on Amazon EC2 using EMR
Using EMR bootstrap actions to configure VMs for the Amazon EMR jobs
Using Apache Whirr to deploy an Apache Hadoop cluster in a cloud environment
Chapter 3: Hadoop Essentials - Configurations, Unit Tests, and Other APIs
Introduction
Optimizing Hadoop YARN and MapReduce configurations for cluster deployments
Shared user Hadoop clusters - using Fair and Capacity schedulers
Setting classpath precedence to user-provided JARs
Speculative execution of straggling tasks
Unit testing Hadoop MapReduce applications using MRUnit
Integration testing Hadoop MapReduce applications using MiniYarnCluster
Adding a new DataNode
Decommissioning DataNodes
Using multiple disks/volumes and limiting HDFS disk usage
Setting the HDFS block size
Setting the file replication factor
Using the HDFS Java API
Chapter 4: Developing Complex Hadoop MapReduce Applications
Introduction
Choosing appropriate Hadoop data types
Implementing a custom Hadoop Writable data type
Implementing a custom Hadoop key type
Emitting data of different value types from a Mapper
Choosing a suitable Hadoop InputFormat for your input data format
Adding support for new input data formats - implementing a custom InputFormat
Formatting the results of MapReduce computations - using Hadoop OutputFormats
Writing multiple outputs from a MapReduce computation
Hadoop intermediate data partitioning
Secondary sorting - sorting Reduce input values
Broadcasting and distributing shared resources to tasks in a MapReduce job - Hadoop DistributedCache
Using Hadoop with legacy applications - Hadoop streaming
Adding dependencies between MapReduce jobs
Hadoop counters to report custom metrics
Chapter 5: Analytics
Introduction
Simple analytics using MapReduce
Performing GROUP BY using MapReduce
Calculating frequency distributions and sorting using MapReduce
Plotting the Hadoop MapReduce results using gnuplot
Calculating histograms using MapReduce
Calculating Scatter plots using MapReduce
Parsing a complex dataset with Hadoop
Joining two datasets using MapReduce
Chapter 6: Hadoop Ecosystem - Apache Hive
Introduction
Getting started with Apache Hive
Creating databases and tables using Hive CLI
Simple SQL-style data querying using Apache Hive
Creating and populating Hive tables and views using Hive query results
Utilizing different storage formats in Hive - storing table data using ORC files
Using Hive built-in functions
Hive batch mode - using a query file
Performing a join with Hive
Creating partitioned Hive tables
Writing Hive User-defined Functions (UDF)
HCatalog - performing Java MapReduce computations on data mapped to Hive tables
HCatalog - writing data to Hive tables from Java MapReduce computations
Chapter 7: Hadoop Ecosystem II - Pig, HBase, Mahout, and Sqoop
Introduction
Getting started with Apache Pig
Joining two datasets using Pig
Accessing a Hive table data in Pig using HCatalog
Getting started with Apache HBase
Data random access using Java client APIs
Running MapReduce jobs on HBase
Using Hive to insert data into HBase tables
Getting started with Apache Mahout
Running K-means with Mahout
Importing data to HDFS from a relational database using Apache Sqoop
Exporting data from HDFS to a relational database using Apache Sqoop
Chapter 8: Searching and Indexing
Introduction
Generating an inverted index using Hadoop MapReduce
Intradomain web crawling using Apache Nutch
Indexing and searching web documents using Apache Solr
Configuring Apache HBase as the backend data store for Apache Nutch
Whole web crawling with Apache Nutch using a Hadoop/HBase cluster
Elasticsearch for indexing and searching
Generating the in-links graph for crawled web pages
Chapter 9: Classifications, Recommendations, and Finding Relationships
Introduction
Performing content-based recommendations
Classification using the naive Bayes classifier
Assigning advertisements to keywords using the Adwords balance algorithm
Chapter 10: Mass Text Data Processing
Introduction
Data preprocessing using Hadoop streaming and Python
De-duplicating data using Hadoop streaming
Loading large datasets to an Apache HBase data store - importtsv and bulkload
Creating TF and TF-IDF vectors for the text data
Clustering text data using Apache Mahout
Topic discovery using Latent Dirichlet Allocation (LDA)
Document classification using Mahout Naive Bayes Classifier
Index

 

 
