Step 1

Introduction. Examples, data science articulated, history and context, technology landscape

Step 2

Databases and the relational algebra

Step 3

Parallel databases, parallel query processing, in-database analytics

Step 4

MapReduce, Hadoop, relationship to databases, algorithms, extensions, languages

Step 5

Key-value stores and NoSQL; tradeoffs of SQL and NoSQL

Step 6

Topics in statistical modeling: basic concepts, experiment design, pitfalls

Step 7

Topics in machine learning

Step 8

Visualization, data products, visual data analytics

Step 9

Provenance, privacy, ethics, governance

Step 10

Guest Lectures

Step 11

Graph Analytics

10 July 2014
Goal completed

Goal author

Алексей

Moscow, Russia

42 years old

Knowledge and Skills

Introduction to Data Science on Coursera

Commerce and research are being transformed by data-driven discovery and prediction. Skills required for data analytics at massive levels – scalable data management on and off the cloud, parallel algorithms, statistical modeling, and proficiency with a complex ecosystem of tools and platforms – span a variety of disciplines and are not easy to obtain through conventional curricula. Tour the basic techniques of data science, including both SQL and NoSQL solutions for massive data management (e.g., MapReduce and contemporaries), algorithms for data mining (e.g., clustering and association rule mining), and basic statistical modeling (e.g., linear and non-linear regression).

  1. Introduction. Examples, data science articulated, history and context, technology landscape

    Readings

  2. Databases and the relational algebra

    Readings
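
As a rough illustration of what this step covers, the sketch below implements three relational algebra operators (selection, projection, natural join) over lists of dictionaries. The function names, relation names, attributes, and sample rows are all invented for illustration and are not taken from the course materials.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Relation = List[Row]


def select(rel: Relation, pred: Callable[[Row], bool]) -> Relation:
    """Selection (sigma): keep the rows that satisfy the predicate."""
    return [row for row in rel if pred(row)]


def project(rel: Relation, attrs: List[str]) -> Relation:
    """Projection (pi): keep only the named attributes, dropping duplicate rows."""
    seen = set()
    out: Relation = []
    for row in rel:
        key = tuple(row[a] for a in attrs)
        if key not in seen:
            seen.add(key)
            out.append(dict(zip(attrs, key)))
    return out


def natural_join(left: Relation, right: Relation) -> Relation:
    """Natural join: combine rows that agree on every shared attribute."""
    out: Relation = []
    for l_row in left:
        for r_row in right:
            shared = set(l_row) & set(r_row)
            if all(l_row[a] == r_row[a] for a in shared):
                out.append({**l_row, **r_row})
    return out


# Hypothetical sample relations.
students = [
    {"sid": 1, "name": "Ann"},
    {"sid": 2, "name": "Bob"},
]
enrolled = [
    {"sid": 1, "course": "datasci"},
    {"sid": 2, "course": "algebra"},
    {"sid": 1, "course": "algebra"},
]

# pi_name(sigma_{course = 'algebra'}(students JOIN enrolled))
result = project(
    select(natural_join(students, enrolled), lambda r: r["course"] == "algebra"),
    ["name"],
)
print(result)
```

The nested-loop join is the simplest possible strategy; a real database engine would normally pick a hash or sort-merge join instead.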

  3. Parallel databases, parallel query processing, in-database analytics

    Readings for steps 3, 4, and 5

    Data cleaning, entity resolution, data integration, information extraction
    (NOT COVERED IN LECTURES)

    Readings / Talks

    Elmagarmid et al., Duplicate Record Detection: A Survey; Koudas et al., Record Linkage: Similarity Measures and Algorithms
  4. MapReduce, Hadoop, relationship to databases, algorithms, extensions, languages

  5. Key-value stores and NoSQL; tradeoffs of SQL and NoSQL

  6. Topics in statistical modeling: basic concepts, experiment design, pitfalls

    Readings

  7. Topics in machine learning

    1. Supervised learning (rules, trees, forests, nearest neighbor, regression),
    2. Optimization (gradient descent and variants),
    3. Unsupervised learning

    Readings

    Unsupervised learning: k-means, multi-dimensional scaling

    Readings
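
To make the "regression" and "gradient descent" topics in this step a little more concrete, here is a minimal sketch of batch gradient descent fitting a straight line by minimizing mean squared error. The function name `fit_line`, the learning rate, the step count, and the data are assumptions chosen for illustration; they do not come from the course.

```python
from typing import List, Tuple


def fit_line(xs: List[float], ys: List[float],
             lr: float = 0.05, steps: int = 2000) -> Tuple[float, float]:
    """Fit y ~ w * x + b by batch gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Residuals of the current model on every training point.
        errs = [w * x + b - y for x, y in zip(xs, ys)]
        # Gradients of (1/n) * sum(err^2) with respect to w and b.
        grad_w = 2.0 / n * sum(e * x for e, x in zip(errs, xs))
        grad_b = 2.0 / n * sum(errs)
        # Step against the gradient.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Made-up, noise-free data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = fit_line(xs, ys)
print(round(w, 3), round(b, 3))
```

With this data the fitted slope and intercept should land very close to 2 and 1. A learning rate that is too large for the data makes the updates diverge, which is one reason the "variants" of gradient descent exist.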

  8. Visualization, data products, visual data analytics

    Readings (well, watchings)

  9. Provenance, privacy, ethics, governance

    Backlash: Ethics, privacy, unreliable methods, irreproducible results
    (NOT COVERED IN LECTURES)

  10. Guest Lectures

  11. Graph Analytics

    • structure
    • traversals
    • analytics
    • PageRank
    • community detection
    • recursive queries
    • semantic web

    Readings

    Sherif Sakr, Processing large-scale graph data: A guide to current technology, June 2013 (more to come)
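
Of the graph analytics topics listed above, PageRank is compact enough to sketch in a few lines. The version below uses plain power iteration on an adjacency-list graph; the damping factor of 0.85, the iteration count, and the four-page example graph are assumptions made for illustration, not details from the course.

```python
from typing import Dict, List


def pagerank(graph: Dict[str, List[str]], damping: float = 0.85,
             iterations: int = 100) -> Dict[str, float]:
    """Power-iteration PageRank on a directed graph given as adjacency lists.

    Every node must appear as a key, even if it has no out-links.
    """
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Rank held by dangling nodes (no out-links) is spread over all nodes.
        dangling = sum(rank[v] for v in nodes if not graph[v])
        new_rank = {v: (1.0 - damping) / n + damping * dangling / n for v in nodes}
        # Each node shares its rank equally among the pages it links to.
        for v in nodes:
            for target in graph[v]:
                new_rank[target] += damping * rank[v] / len(graph[v])
        rank = new_rank
    return rank


# Hypothetical four-page web.
web = {
    "a": ["b", "c"],
    "b": ["c"],
    "c": ["a"],
    "d": ["c"],
}
ranks = pagerank(web)
for page, score in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(page, round(score, 3))
```

Page "c" collects links from every other page and should come out on top, while "d", which nothing links to, keeps only the baseline share. A fixed iteration count keeps the sketch short; checking the change between iterations against a tolerance is the more usual stopping rule.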