Big Data Challenges and Analysis-Driven Data

Duration Days
Certificate SAS Global
Language English


Power up your staff’s skills and boost your business

or Call us on +91 79 403 270 00


Course Description

Data Science Academy attendees

About Program

This course provides an overview of the challenges associated with big data and analysis-driven data. 

Format of Training

Taught by certified instructors at high-tech facilities across the country

  • A SAS expert at your side.
  • Focused learning away from the office
  • Networking opportunities
  • State-of-the-art facilities
  • Electronic course notes downloadable to your device and permission to print
  • Business Knowledge Series: in-depth courses on the latest business topics
  • We offer Connected Classes! Watch for courses in Cary, New York, Arlington, Dallas and San Francisco that connect remote students via our Live Web classroom.

Prerequisite

This course addresses Base SAS, SAS Enterprise Guide, SAS/STAT, DataFlux Data Management Studio software.

Training Features

  • read external data files
  • store and process data
  • discuss how to combine Hadoop and SAS
  • recognize and overcome big data challenges.

Course Curriculum

Data and Digital Computers

  • introduction
  • digital representation of numeric values
  • redundancy and error-checking
  • reading external data files

Programming and Digital Computers

  • machine language versus higher level programming languages
  • program design considerations
  • software engineering considerations

Storing and Processing Data

  • relational databases
  • querying and reporting
  • analytics

Big Data Overview

  • why big data?
  • why Hadoop?
  • why Hadoop and SAS?
  • why now?

Challenges Associated with Big Data

  • machine language versus higher level programming languages
  • program design considerations
  • software engineering considerations
  • relational databases
  • querying and reporting
  • analytics
  • data quality challenges associated with more data from more sources (and raw data)
  • challenges associated with more users in more roles accessing the data