Big Data. Introduction to Big Data
Interested in increasing your knowledge of the Big Data landscape? This course is for those new to data science and interested in understanding why the Big Data Era has come to be. It is for those who want to become conversant with the terminology and the core concepts behind big data problems, applications, and systems. It is for those who want to start thinking about how Big Data might be useful in their business or career. It provides an introduction to one of the most common frameworks, Hadoop, that has made big data analysis easier and more accessible -- increasing the potential for data to transform our world!
At the end of this course, you will be able to:
* Describe the Big Data landscape including examples of real world big data problems including the three key sources of Big Data: people, organizations, and sensors.
* Explain the V’s of Big Data (volume, velocity, variety, veracity, valence, and value) and why each impacts data collection, monitoring, storage, analysis and reporting.
* Get value out of Big Data by using a 5-step process to structure your analysis.
* Identify what are and what are not big data problems and be able to recast big data problems as data science questions.
* Provide an explanation of the architectural components and programming models used for scalable big data analysis.
* Summarize the features and value of core Hadoop stack components including the YARN resource and job management system, the HDFS file system and the MapReduce programming model.
* Install and run a program using Hadoop!
This course is for those new to data science. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments.
Критерий завершения
все сабтаски закрыты
Личные ресурсы
базовые навыки математики и программирования
Экологичность цели
за большими данными будущее
-
WEEK 1
Welcome
Welcome to the Big Data Specialization! We're excited for you to get to know us and we're looking forward to learning about you!
2 видео, 3 материалов для самостоятельного изучения
- Видео: What's in Big Data Applications and Systems?
- Материал для самостоятельного изучения: By the end of this course you will be able to...
- Материал для самостоятельного изучения: Optional: Watch this fun video about the San Diego Supercomputer Center!
- Видео: Tell us about yourself and learn about your classmates
- Вопрос для обсуждения: Let's Discuss: Why are you taking this class?
- Материал для самостоятельного изучения: FAQ
Big Data: Why and Where
Data -- it's been around (even digitally) for a while. What makes data "big" and where does this big data come from?
13 видео, 13 материалов для самостоятельного изучения
- Видео: What launched the Big Data era?
- Видео: Applications: What makes big data valuable
- Вопрос для обсуждения: Let's Discuss: What application area interests you?
- Видео: Example: Saving lives with Big Data
- Видео: Example: Using Big Data to Help Patients
- Видео: A Sentiment Analysis Success Story: Meltwater helping Danone
- Материал для самостоятельного изучения: Did you know?: 25 facts about big data
- Материал для самостоятельного изучения: Slides: What Launched the Big Data Era?
- Материал для самостоятельного изучения: Slides: Applications: What Makes Big Data Valuable?
- Материал для самостоятельного изучения: Slides: Saving Lives With Big Data
- Материал для самостоятельного изучения: Slides: Using Big Data to Help Patients
- Видео: Getting Started: Where Does Big Data Come From?
- Видео: Machine-Generated Data: It's Everywhere and There's a Lot!
- Видео: Machine-Generated Data: Advantages
- Видео: Big Data Generated By People: The Unstructured Challenge
- Видео: Big Data Generated By People: How Is It Being Used?
- Видео: Organization-Generated Data: Structured but often siloed
- Видео: Organization-Generated Data: Benefits Come From Combining With Other Data Types
- Видео: The Key: Integrating Diverse Data
- Вопрос для обсуждения: Let's discuss: Who are you providing data to?
- Материал для самостоятельного изучения: Extra Resources
- Материал для самостоятельного изучения: Slides: Machine-Generated Data: It's Everywhere and There's a Lot!
- Материал для самостоятельного изучения: Slides: Machine-Generated Data: Advantages
- Материал для самостоятельного изучения: Slides: Big Data Generated By People: The Unstructured Challenge
- Материал для самостоятельного изучения: Slides: Big Data Generated By People: How is it Being Used?
- Материал для самостоятельного изучения: Slides: Organization-Generated Big Data: Structured But Often Siloed
- Материал для самостоятельного изучения: Slides: Organizaton-Generated Big Data: Benefits
- Материал для самостоятельного изучения: Slides: The Key - Integrating Diverse Data
Оцениваемый: Why Big Data and Where Did it Come From?
-
Welcome
-
Big Data: Why and Where
-
WEEK 2
Characteristics of Big Data and Dimensions of Scalability
You may have heard of the "Big Vs". We'll give examples and descriptions of the commonly discussed 5. But, we want to propose a 6th V and we'll ask you to practice writing Big Data questions targeting this V -- value.
7 видео, 9 материалов для самостоятельного изучения
- Видео: Getting Started: Characteristics Of Big Data
- Видео: Characteristics of Big Data - Volume
- Материал для самостоятельного изучения: What does astronomical scale mean?
- Видео: Characteristics of Big Data - Variety
- Видео: Characteristics of Big Data - Velocity
- Видео: Characteristics of Big Data - Veracity
- Видео: Characteristics of Big Data - Valence
- Видео: The Sixth V: Value
- Материал для самостоятельного изучения: A Small Definition of Big Data
- Вопрос для обсуждения: Practice: Writing Big Data questions
- Вопрос для обсуждения: Let's Discuss: Improving the Flamingo Game
- Материал для самостоятельного изучения: Slides: Getting Started - Characteristics of Big Data
- Материал для самостоятельного изучения: Slides: Characteristics of Big Data - Volume
- Материал для самостоятельного изучения: Slides: Characteristics of Big Data - Variety
- Материал для самостоятельного изучения: Slides: Characteristics of Big Data - Velocity
- Материал для самостоятельного изучения: Slides: Characteristics of Big Data - Veracity
- Материал для самостоятельного изучения: Slides: Characteristics of Big Data - Value
- Материал для самостоятельного изучения: Slides: Characteristics of Big Data - Valence
Оцениваемый: V for the V's of Big Data
-
Characteristics of Big Data and Dimensions of Scalability
-
Data Science: Getting Value out of Big Data
-
WEEK 3
Foundations for Big Data Systems and Programming
Big Data requires new programming frameworks and systems. For this course, we don't programming knowledge or experience -- but we do want to give you a grounding in some of the key concepts.
4 видео, 4 материалов для самостоятельного изучения
- Видео: Getting Started: Why worry about foundations?
- Видео: What is a Distributed File System?
- Видео: Scalable Computing over the Internet
- Видео: Programming Models for Big Data
- Материал для самостоятельного изучения: Slides: Getting Started-Why Worry About Foundations?
- Материал для самостоятельного изучения: Slides: What is a Distributed File System?
- Материал для самостоятельного изучения: Slides: Scalable Computing Over the Internet
- Материал для самостоятельного изучения: Slides: Programming Models for Big Data
Оцениваемый: Foundations for Big Data
Systems: Getting Started with Hadoop
Let's look at some details of Hadoop and MapReduce. Then we'll go "hands on" and actually perform a simple MapReduce task in the Cloudera VM. Pay attention - as we'll guide you in "learning by doing" in diagramming a MapReduce task as a Peer Review.
11 видео, 7 материалов для самостоятельного изучения
- Видео: Hadoop: Why, Where and Who?
- Видео: The Hadoop Ecosystem: Welcome to the zoo!
- Видео: The Hadoop Distributed File System: A Storage System for Big Data
- Видео: YARN: A Resource Manager for Hadoop
- Видео: MapReduce: Simple Programming for Big Results
- Материал для самостоятельного изучения: MapReduce in the Pasta Sauce Example
- Видео: When to Reconsider Hadoop?
- Видео: Cloud Computing: An Important Big Data Enabler
- Видео: Cloud Service Models: An Exploration of Choices
- Видео: Value From Hadoop and Pre-built Hadoop Images
- Материал для самостоятельного изучения: Slides for Getting Started With Hadoop
- Материал для самостоятельного изучения: Downloading and Installing the Cloudera VM Instructions (Mac)
- Материал для самостоятельного изучения: Downloading and Installing the Cloudera VM Instructions (Windows)
- Материал для самостоятельного изучения: Copy your data into the Hadoop Distributed File System (HDFS) Instructions
- Видео: Copy your data into the Hadoop Distributed File System (HDFS)
- Материал для самостоятельного изучения: Run the WordCount program Instructions
- Видео: Run the WordCount program
- Вопрос для обсуждения: Let's Discuss: Map Reduce in your life
- Материал для самостоятельного изучения: How do I figure out how to run Hadoop MapReduce programs?
Оцениваемый: Intro to Hadoop
Оцениваемый: Understand by Doing: MapReduce
Оцениваемый: Running Hadoop MapReduce Programs Quiz
-
Foundations for Big Data Systems and Programming
-
Systems: Getting Started with Hadoop
-
Cloudera. Introduction
-
Get Started
-
Analyze Your Data
-
Manage Your Cluster
-
- 1534
- 02 октября 2016, 19:46
Не пропустите новые записи!
Подпишитесь на цель и следите за ее достижением