Big Data Certification

Become a SAS® Certified Big Data Professional.

Duration 522 Hours
Certificate SAS Global
Language English

Fees 2250


Power up your staff’s skills and boost your business

@

or Call us on 732-593-8343

Demonstrate your ability to use the tools and technology designed to handle big data. The SAS Certified Big Data Professional program delivers the extra edge you're looking for.

SAS® Big Data Certification Curriculum

Academic Courses Count

9 Courses

Course content is designed to prepare you for the certification exams.

Case Studies Details

Case Studies

Real-world case studies enable you to apply what you have learned.

Certification Exams Count

2 Exams

Pass both exams to earn your certification credential.

Topics Covered

  • Critical SAS programming skills.
  • Exploring and visualizing data.
  • Working with Hadoop, Hive, Pig and SAS.
  • Fundamentals of statistics and analytics.
  • Improving data quality for reporting and analytics.
  • Accessing, transforming and manipulating data.
  • Essential communication skills.
  • Critical SAS programming skills.
  • Exploring and visualizing data.
  • Working with Hadoop, Hive, Pig and SAS.
  • Fundamentals of statistics and analytics.
  • Improving data quality for reporting and analytics.
  • Accessing, transforming and manipulating data.
  • Essential communication skills.

SAS Software Covered

  • Base SAS
  • SAS® Enterprise Guide®
  • SAS® Enterprise Miner
  • SAS® In-Memory Statistics
  • SAS® Studio
  • SAS/STAT®
  • SAS® Visual Analytics
  • DataFlux® Data Management Server
  • DataFlux® Data Management Studio
  • Base SAS
  • SAS® Enterprise Guide®
  • SAS® Enterprise Miner
  • SAS® In-Memory Statistics
  • SAS® Studio
  • SAS/STAT®
  • SAS® Visual Analytics
  • DataFlux® Data Management Server
  • DataFlux® Data Management Studio

Choose a Format

Course Format Image
Course Format Image

Instructor-Led Classroom

  • Instructor-led training in a classroom setting.
  • Monday-Friday classes for six weeks.
  • Real-world case studies that help you apply what you learn.
  • Access to SAS software for practice.
  • Dedicated coach to guide you.
  • Certification exam vouchers.
  • Free access to SAS Programming for Data Science Fast Track to prepare.
Course Format Image 1
Course Format Image 2

Self-Paced e-Learning

  • Online e-learning courses accessible 24/7.
  • Available as monthly subscription or 12-month license.
  • Real-world case studies that help you apply what you learn.
  • Access to SAS software for practice.
  • An online community to support your learning.
  • 50% off instructor-led classes.

Prerequisite Skills

To enroll in the program, you need at least six months of programming experience in SAS or another programming language. If you need to brush up on your programming skills, the SAS Programming for Data Science Fast Track will give you a good foundation.

 

What You Will Learn

Module 1: Big Data Preparation, Statistics and Visual Exploration

This course provides an overview of the challenges associated with big data and analysis-driven data. 

Topics Covered

  • Reading external data files.
  • Storing and processing data.
  • Combining Hadoop and SAS.
  • Recognizing and overcoming big data challenges.

In this course, you'll learn how to use SAS Visual Analytics Explorer to explore in-memory tables from the SAS® LASR™ Analytic Server and perform advanced data analyses.

Topics Covered

  • Finding previously unknown relationships and spotting trends in your data.
  • Visualizing data using charts, plots and tables.
  • Using the autocharting function to visualize data in the best possible way.
  • Using advanced graphs, such as network diagrams, Sankey diagrams and word clouds.
  • Easily adding analytics to your graphs, and including descriptions of the analytics results.
  • Navigating through your data using on-the-fly hierarchies.

This introductory SAS/STAT® course focuses on t-tests, ANOVA and linear regression, and includes a brief introduction to logistic regression.

Topics Covered

  • Generating descriptive statistics and exploring data with graphs.
  • Performing analysis of variance and applying multiple comparison techniques.
  • Performing linear regression and assessing the assumptions.
  • Using regression model selection techniques to aid in the choice of predictor variables in multiple regression.
  • Using diagnostic statistics to assess statistical assumptions and identify potential outliers in multiple regression.
  • Using chi-square statistics to detect associations among categorical variables.
  • Fitting a multiple logistic regression model.
  • Scoring new data using developed models.

In this course, you'll learn how to perform data management tasks, such as improving data quality, entity resolution and data monitoring.

Topics Covered

  • Creating and reviewing data explorations.
  • Creating and reviewing data profiles.
  • Creating data jobs for data improvement.
  • Establishing monitoring aspects for your data.
  • Understanding the QKB components.
  • Using the component editors.
  • Understanding various definition types.
  • Building a new data type (optional).

 

Storytelling is a necessary skill when talking to key stakeholders. Insights uncovered in your data can move mountains if the right people say yes. But how do you move someone from simply being curious, all the way to, "Let's do this!" In this course, you'll learn why storytelling is a skill you need to develop, when a story works and when it doesn't, and how to communicate data in a meaningful way.

Module 2: Big Data Programming and Loading

This course teaches you how to use SAS programming methods to read, write and manipulate Hadoop data. You'll learn how to use Base SAS methods to read and write raw data with the DATA step, manage the Hadoop Distributed File System (HDFS) and execute MapReduce and Pig code from SAS via the HADOOP procedure. You'll also learn how to use SAS/ACCESS® Interface to Hadoop methods that allow LIBNAME access and SQL pass-through techniques to read and write Hive or Impala table structures.

Topics Covered

  • Accessing Hadoop distributions using the LIBNAME statement and the SQL pass-through facility.
  • Creating and using SQL procedure pass-through queries.
  • Using options and efficiency techniques for optimizing data access performance.
  • Joining data using the SQL procedure and the DATA step.
  • Reading and writing Hadoop files with the FILENAME statement.
  • Executing and using Hadoop commands with PROC HADOOP.
  • Using Base SAS procedures with Hadoop.

This course focuses on DS2, a fourth-generation SAS proprietary language for advanced data manipulation, which enables parallel processing and storage of large data with reusable methods and packages.

Topics Covered

  • Identifying the similarities and differences between the SAS DATA step and the DS2 DATA step.
  • Converting a Base SAS DATA step to DS2.
  • Creating DS2 variable declarations, expressions and methods for data conversion, manipulation and conditional processing.
  • Creating user-defined and predefined packages to store, share and execute DS2 methods.
  • Creating and executing DS2 threads for parallel processing.
  • Using the SAS In-Database Code Accelerator to execute DS2 code outside of a SAS session.
  • Executing DS2 code in the SAS High-Performance Analytics grid using the HPDS2 procedure.

In this course, you will use processing methods to prepare structured and unstructured big data for analysis. You will learn to organize the data into structured tabular form using Apache Hive and Apache Pig. You will also learn SAS software technology and techniques that integrate with Hive and Pig, as well as how to use these open source capabilities by programming with Base SAS and SAS/ACCESS Interface to Hadoop, and with SAS Data Integration Studio.

Topics Covered

  • Moving data into the Hadoop ecosystem.
  • Using Hive to design a data warehouse in Hadoop, perform data analysis using the Hive query language (HiveQL) and join data sources.
  • Performing extract, transform and load (ETL).
  • Organizing data in Hadoop by usage.
  • Analyzing unstructured data using Pig.
  • Joining massive data sets using Pig.
  • Using user-defined functions (UDFs).
  • Analyzing big data in Hadoop using Hive and Pig.
  • Using SAS programming to submit Hive and Pig programs that execute in Hadoop, and store results in Hadoop or return results to SAS.
  • Using SAS programming to move data between the SAS server and the HDFS.
  • Constructing SAS Data Integration Studio jobs that integrate with Hive and Pig processes and the HDFS.

This course focuses on accessing data on the SAS LASR Analytic Server and performing exploratory analysis and preparation. Topics include starting the server, loading data and manipulating data on the SAS LASR Analytic Server using the IMSTAT procedure. IMSTAT topics include deriving new temporary and permanent tables and columns, calculating summary statistics (e.g., mean, frequency and percentile), and creating filters and joins on in-memory data.

Topics Covered

  • Starting up a SAS LASR Analytic Server.
  • Loading tables into memory on the SAS LASR Analytic Server.
  • Processing in-memory tables with PROC LASR and PROC IMSTAT.
  • Accessing data more efficiently via intelligent partitioning.
  • Deriving new temporary and permanent tables and variables.
  • Creating filters and joins on in-memory data.
  • Exporting ODS result tables for client-side graphic development.
  • Producing descriptive statistics including counts, percentiles and means.
  • Creating multidimensional summaries including cross-tabulations and contingency tables.
  • Deriving kernel density estimates using normal functions.

Register Now

Self-Paced e-Learning

Monthly Subscription

  • 9 award-winning courses.
  • 15 hours of software access for practice.

$375/month


299/month

(20% savings)

Enroll Now

Annual License

  • 9 award-winning courses.
  • 100 hours of software access for practice.

Best value!


2,250/year

 

Enroll Now

Instructor-Led Classroom


Six-Week Program

  • 9 award-winning courses.


 


9,000

 

Enroll Now

SAS® Certified Big Data Professional

SAS® Certified Big Data Professional

Learn to manage big data, focusing on data quality and visual data exploration for advanced analytics, plus communication skills.

Select
SAS® Certified Advanced Analytics Professional

SAS® Certified Advanced Analytics Professional

Learn analytical modeling, machine learning, experimentation, forecasting and optimization.

Select
SAS® Certified Data Scientist

SAS® Certified Data Scientist

Learn it all. This program includes all coursework from both the big data and advanced analytics programs.

Select

Get Started Now

Our program advisers can help you get started.

To learn more about which program is right for you connect with us via your channel of choice.

Contact us Mobile732-593-8343 Contact us Emailinfo.usa@epoch.co.in

Grow With SAS

View More Training

View More Training

Browse our learning paths to find more courses and expand your knowledge.

Know More
View webinars

View webinars

Take an e-learning course to get started with SAS or advance your skills.

Know More
Subscribe to newsletter

Subscribe to newsletter

Stay current on training, certification and books with the SAS Learning Report.

Know More