Big Data Foundation & Certification

Who is this certification for?

This course is best suited to IT professionals who possess intermediate to advanced programming, system administration, or relational database skills and are looking to move into the area of big data. These include:

  • Software Engineers
  • Application Developers
  • IT Architects
  • System Administrators
  • Data Analysts and Scientists

Syllabus – Big Data Foundation

Module 1. Big Data Fundamentals

  • Big Data – History, Overview, and Characteristics
    • History
    • Big Data Definition
    • Big Data Benefits
    • Big Data Characteristics – Volume, Velocity & Variety
  • Big Data Technologies – Overview
  • Big Data Success Stories
  • Big Data – Privacy and Ethics
    • Privacy – Compliance
    • Privacy – Challenges
    • Privacy – Approach
    • Ethics
  • Big Data Projects
    • Who Should Be Involved?
    • What Is Involved?

Module 2. Big Data Sources

  • Enterprise Data Sources
    • Enterprise Systems
    • Oracle
    • SAP
    • Microsoft
    • Data Warehouses
    • Unstructured Data – Introduction
    • Unstructured Data – Metadata
  • Social Media Data Sources
    • Introduction
    • Facebook – Introduction
    • Facebook – Public Feed API
    • Facebook – Keyword Insights API
    • Facebook – Graph API
    • Twitter – Introduction
    • Twitter – Streaming APIs
    • Twitter – REST APIs
    • Other Social Media
  • Public Data Sources
    • Introduction
    • Weather
    • Economics
    • Finance
    • Regulatory Bodies

Module 3. Data Mining – Concepts and Tools

  • Data Mining – Introduction
    • Introduction
    • Types of Data Mining – Overview
    • Types of Data Mining – Classification
    • Types of Data Mining – Association
    • Types of Data Mining – Clustering
  • Data Mining – Tools
    • Introduction
    • Weka
    • Modules of Weka Applications
    • KNIME
    • KNIME – Example
    • R Language

Module 4. Big Data Technologies – Hadoop

  • Hadoop Fundamentals
    • Introduction
    • Main Components of Hadoop
    • Additional Components of Hadoop
  • Install and Configure
    • Download
    • How to Install and Configure
  • MapReduce
    • Introduction
    • How Does It Work?
  • Data Processing with Hadoop
    • Introduction
    • Twitter Sentiment Analysis – Overview
    • Twitter Sentiment Analysis – Algorithm
    • Network Log Analysis – Overview
    • Network Log Analysis – Algorithm

Module 5. Big Data Technologies – MongoDB

  • MongoDB Fundamentals
    • Introduction
    • Replication
    • Sharding
    • Sharding and Replication
    • MongoDB Ecosystem – Languages and Drivers
    • MongoDB Ecosystem – Hadoop Integration
    • MongoDB Ecosystem – Tools
  • Install and Configure
    • Download
    • How to Install and Configure
  • Document Databases
    • Introduction
    • Documents
    • Document Design Considerations
    • Fields
  • Data Modelling with Document Databases
    • Introduction
    • Twitter Sentiment Analysis
    • Twitter Sentiment Analysis – Algorithm
    • Network Log Analysis
    • Network Log Analysis – Algorithm

Exam Details

Big Data Foundation Certification Exam

Exam Type: Multiple Choice
No. of Questions: 40
Duration: 60 minutes
Additional Time Provisions: 15 minutes additional time for candidates who speak English as a second language.
Prerequisite: There are no required prerequisites. We recommend that participants possess intermediate to advanced programming, system administration, or relational database skills to understand the concepts in this certification.
Supervised (Proctored): Yes (Web/Live)
Open Book: No
Pass Score: 65%
Delivery: Online

Cloud Credential Council

The Cloud Credential Council (CCC) is an international member-based organization mandated to drive cloud readiness through effective competence development. The CCC has established critical cloud certifications for key IT roles in order to cultivate cloud-ready IT professionals. The certification scheme was developed after several years of research investment covering more than 20 roles, led by industry experts in conjunction with the leading technology vendors in the cloud computing arena.