Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales.
About this Course
Skills you will gain
- 5 stars57.18%
- 4 stars25.42%
- 3 stars9.09%
- 2 stars4.74%
- 1 star3.55%
TOP REVIEWS FROM DATA MANIPULATION AT SCALE: SYSTEMS AND ALGORITHMS
Well structured and nice overview of data manipulation. But the assignments should really be updated in order to use python 3.x instead of 2.7, which is not maintained anymore...
This is a quite wonderful course for large-scale data science. I believe I will have learned a lot via completing the courses.
It's pretty tough in assignments especially when there are mistakes in the given description, but I do learn the basic concepts of relational algorithm and MapReduce from them.
Last week of the course is too much information and without any assignments it kind of doesn't make much sense and it doesn't stick.
About the Data Science at Scale Specialization
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
More questions? Visit the Learner Help Center.