Big Data Analysis with Apache Spark

Course Description:

Organizations use their data for decision support and to build data-intensive products and services, such as recommendation, prediction, and diagnostic systems. The collection of skills required to support these functions has been grouped under the term Data Science. This course attempts to articulate the expected output of data scientists and then teaches students how to use PySpark (part of Apache Spark) to deliver against these expectations. The course assignments include Log Mining, Textual Entity Recognition, and Collaborative Filtering exercises that teach students how to manipulate data sets using parallel processing with PySpark. This course covers advanced undergraduate-level material. It requires a programming background and experience with Python (or the ability to learn it quickly). All exercises use PySpark, but previous experience with Spark or distributed computing is NOT required.

  • Instructor(s) Anthony D. Joseph
  • University
  • Provider
  • Start Date 01/Jun/2015
  • Duration 5 weeks
  • Main Language English