Training with Iverson classes

Training is not a commodity: not all training centres are the same. Iverson Associates Sdn Bhd is the most established, the most reputable, and the top professional IT training provider in Malaysia. With a large pool of experienced and certified trainers, state-of-the-art facilities, and well-designed courseware, Iverson offers superior training, a more impactful learning experience, and highly effective results.

At Iverson, our focus is on providing high-quality IT training to corporate customers, meeting their learning needs and helping them achieve their training objectives. Iverson has the flexibility to provide training solutions for anyone from a single individual to the largest corporation, in either a well-paced or an accelerated training programme.

Our courses continue to evolve alongside fast-changing technology. Our instructor-led training services are available on both a public and a private (in-company) basis. Some of our courses are also available as online, on-demand, and hybrid training.

This four-day workshop covers data science and machine learning workflows at scale using Apache Spark 2 and other key components of the Hadoop ecosystem. The workshop emphasizes the use of data science and machine learning methods to address real-world business challenges.

 

Using scenarios and datasets from a fictional technology company, students discover insights to support critical business decisions and develop data products to transform the business. The material is presented through a sequence of brief lectures, interactive demonstrations, extensive hands-on exercises, and discussions. The Apache Spark demonstrations and exercises are conducted in Python (with PySpark) and R (with sparklyr) using the Cloudera Data Science Workbench (CDSW) environment.

Additional Info

  • Certification: Course only
  • Course Code: CDST
  • Price: RM13,600
  • Exam Price: Excluded
  • Duration: 4 Days
  • Principal: Cloudera
  • Schedule:

    11-14 Mar 2024

    4-7 Jun 2024

    16-19 Dec 2024

  • Audience

    The workshop is designed for data scientists who currently use Python or R to work with smaller datasets on a single machine and who need to scale up their analyses and machine learning models to large datasets on distributed clusters. Data engineers and developers with some knowledge of data science and machine learning may also find this workshop useful.

  • Prerequisites

    Workshop participants should have a basic understanding of Python or R and some experience exploring and analyzing data and developing statistical or machine learning models. Knowledge of Hadoop or Spark is not required.

  • Module 1: Overview of data science and machine learning at scale
  • Module 2: Overview of the Hadoop ecosystem
  • Module 3: Working with HDFS data and Hive tables using Hue
  • Module 4: Introduction to Cloudera Data Science Workbench
  • Module 5: Overview of Apache Spark 2
  • Module 6: Reading and writing data
  • Module 7: Inspecting data quality
  • Module 8: Cleansing and transforming data
  • Module 9: Summarizing and grouping data
  • Module 10: Combining, splitting, and reshaping data
  • Module 11: Exploring data
  • Module 12: Configuring, monitoring, and troubleshooting Spark applications
  • Module 13: Overview of machine learning in Spark MLlib
  • Module 14: Extracting, transforming, and selecting features
  • Module 15: Building and evaluating regression models
  • Module 16: Building and evaluating classification models
  • Module 17: Building and evaluating clustering models
  • Module 18: Cross-validating models and tuning hyperparameters
  • Module 19: Building machine learning pipelines
  • Module 20: Deploying machine learning models
RM13,600.00 (+RM1,088.00 Tax)