Close Banner

Deploying a Hadoop Cluster

Analyze Data with Hadoop and MapReduce

中级

大约 3 个礼拜

6小时每周 (按照自己的节奏)

加入成千上万的全球学员

开始免费课程

加入课程
免费
可享受
课程视频
实战练习与参考项目指导
中级

大约 3 个礼拜

6小时每周 (按照自己的节奏)

加入成千上万的全球学员

课程概述

Learn how to tackle big data problems with your own Hadoop clusters! In this course, you’ll deploy Hadoop clusters in the cloud and use them to gain insights from large datasets.

为什么学习这门课程?

Using massive datasets to guide decisions is becoming more and more important for modern businesses. Hadoop and MapReduce are fundamental tools for working with big data. By knowing how to deploy your own Hadoop clusters, you’ll be able to start exploring big data on your own.

先修要求

This course is intended for students with some experience with Hadoop and MapReduce, Python, and bash commands.

You’ll have to be able to work with HDFS and write MapReduce programs. You can learn about these in our Intro to Hadoop and MapReduce course.

The MapReduce programs in the course are written in Python. It is possible to use Java and other languages, but we suggest using Python, on the level of our Intro to Computer Science course.

You’ll also be using remote cloud machines, so you’ll need to know these bash commands:

  • ssh
  • scp
  • cat
  • head/tail

You’ll also need to be able to work in an editor such as vim or nano. You can learn about these in our Linux Command Line Basics course.

查看使用优达学城的技术要求

你将学习什么内容?

项目

Hadoop Cluster

Deploy your own Hadoop cluster to analyze a huge dataset.

学习计划

Deploying a Hadoop cluster on Amazon EC2

Deploy a small Hadoop cluster on Amazon EC2 instances.

Deploy a Hadoop cluster with Ambari

Use Apache Ambari to automatically deploy a larger, more powerful Hadoop cluster.

On-demand Hadoop clusters

Use Amazon’s ElasticMapReduce to deploy a Hadoop cluster on-demand.

Project: Analyzing a big dataset with Hadoop and MapReduce

Use Hadoop and MapReduce to analyze a 150 GB dataset of Wikipedia page views.

讲师与合作伙伴

Mat Leonard

Mat Leonard

Mat 拥有加州大学伯克利分校的物理学博士学位,研究方向为与短期记忆有关的神经元活动。在研究生工作期间,Mat 开始热衷于分享自己的学习心得、Python 及一切与数据相关的事物。Mat 于 2015 年加入优达学城,从此便面向更多学生传道授业解惑。在设计新课程之外,他喜欢跑步、骑自行车以及在 matatat.org 上发表与数据相关的文章。

官方微信公众号二维码

优达学城(Udacity)微信