Machine Learning with Spark

Inquire now

Course Overview:

Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Machine Learning algorithms comb through data and identify patterns that are too complex to be discerned by the human mind.

These patterns can then be used for decision making and action Apache Spark is a powerful platform that for running Machine Learning. This course will how you how to perform various Machine Learning using Apache Spark built in MLib component.

Course Objectives:

  • Overview of Apache Spark
  • Clustering
  • Regression
  • Classification
  • Recommendation

Pre-requisites:

  • This is an intermediate course. Participants should have basic knowledge on the following subjects: Python Apache Spark

Target Audience:

  • Big Data Analysts
  • Data Scientists
  • Data Analysts

Course Duration:

  • 14 hours – 2 days

Course Content:

Module 1: Apache Spark Basics

  • Recap of Apache Spark Basics
  • Install Apache Spark on Local Computer
  • Read CSV Data
  • Manipulating Dataframe
  • ML Libraries

Module 2: Preprocessing 

  • Normalizer
  • Standardizer
  • Tokenizer
  • TF-IDF

Module 3: Clustering 

  • What is Clustering
  • Clustering Algorithms
  • KMeans Clustering
  • Hierarchical Clustering

Module 4: Classification 

  • What is Classification
  • Naives Bayes Clasiifier
  • Decision Tree Classifer
  • •Multi Layer Perception

Module 5: Regression 

  • What is Clustering
  • Clustering Algorithms
  • Linear Regression
  • Decision Tree Regression
  • Gradient Boosted Tree Regression

Module 6: ML Pipeline

  • What is Pipeline
  • Creating a Pipeline for Movie Review Classification

Module 7: Recommendation (Optional) 

  • Recommendation Systems
  • Collaborative Filtering
  • Summary and Closing Remarks

 

 

Course Customization Options

To request a customized training for this course, please contact us to arrange.

Inquire now

Best selling courses

Duration 3 days – 21 hrs   Overview    This Portfolio Management Training Course is designed to provide banking professionals with a comprehensive understanding of how to effectively manage investment...

Duration 2 days – 14 hrs   Overview   This comprehensive Planning and Forecasting Training Course is designed to empower professionals with the tools and techniques necessary to accurately predict...

Duration 2 days – 14 hrs   Overview   This hands-on course provides an introduction to Splunk, a powerful platform for searching, monitoring, and analyzing machine-generated data. The training focuses...

Duration 3 days – 21 hrs   Overview.   This course is designed for fresh graduates aspiring to build a career in Data Science. It introduces the fundamentals of data...

Among the most popular and widely implemented NoSQL databases is MongoDB. Its scalability, robustness, and flexibility have made it extremely popular among the Fortune 500 and Global 500 companies who use it to implement a variety of activities including social communications, analytics, content management, archiving, and other activities.

PROGRAMMING / CODING

ASP.NET

SP.NET is a framework for developing dynamic web applications. It supports languages like VB.Net, C#, Jscript.Net, etc. The programming logic and content can be developed separately in Microsoft Asp.Net.

CYBER SECURITY

Physical Security

Duration 3 days – 21 hrs   Overview   This course provides a comprehensive introduction to physical security principles, policies, technologies, and practices. It covers methods to assess physical risks,...

Duration 5 days – 35 hrs   Overview   This intensive 5-day course is designed for professionals seeking advanced-level skills in Microsoft SQL Server’s BI stack: SSRS (SQL Server Reporting...

We use cookies on our website to personalize your experience by storing your preferences and recognizing repeat visits. By clicking “Accept”, you agree to the use of all cookies. You can also select “Cookie Settings” to adjust your preferences and provide more specific consent. Cookie Policy