Technology Course

Post Graduate Program in Big Data Analytics (PGP-Big Data Analytics)

Course Type: Certification | Study Mode: Online with Classroom

Course Detail

Introduction
-Statistical foundations necessary for data science
-Big Data technologies for hands-on exploration of large, complex, disparate data
-Machine Learning and Advanced Analytics techniques to draw inferences from complex datasets
-Visualization skills necessary to display data in a useful and compelling way
Curriculum Overview
Statistical Foundations 
-Descriptive & inferential statistics
-Experiment design
-Hypothesis testing and estimation
-Predictive analytics – regression (ordinary least squares, multiple linear, logistic)
-Sampling
-Probability distributions
-Correlation and interactions
-Tools: R, Excel
 
Big Data Technologies 
-Hadoop and Spark ecosystem
-Data discovery and acquisition – real-time, web, databases, archives, machine logs
-Data storage and manipulation in HDFS
-NoSQL databases (MongoDB)
-Big Data in the cloud with AWS
-Data processing with Spark, Hive, Pig
-Tools: Hadoop, HBase, Spark, Pig, Hive, MongoDB, AWS
 
Machine Learning on Big Data 
-Feature Engineering
-Dimensionality reduction
-Tree-based methods: decision trees, random forests
-Classification
-Clustering
-Recommendation systems
-Graphical models and the PageRank algorithm
-Tools: R, Python, MLlib, GraphX
 
Visualization & Insight 
-Exploratory data analysis
-Graphical representation using libraries
-Visualizing graphical and network models
-Campaign analysis and dashboards
-Insight presentation – written & visual
-Case studies on real world data sets
-Tools: Tableau, Gephi, R libraries
Other Information
A Bachelor’s Degree in Engineering, Computer Science or Mathematics/Statistics with a minimum of 50% aggregate marks or equivalent.
Programming experience, preferably in Python, Java or C++
Familiarity with college-level mathematics and statistics
 
Course Duration: 12 Months