logo
  • youtube
  • googleplus
  • linkedin
  • gmail
  • whatsapp
icon9

R Language + Data Science

The Apache Data Science software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering…

Course Curriculum

  1. Introduction to signal and pattern detection
  • Basic commands in R
  • Vectors and matrices in R
  • Two main file types in Rstudio and importing data into R
  • Installing packages in Rstudio

2.Univariate analysis

  • Statistical concepts of Frequency Distribution/Central Distribution and Dispersion
  • Understanding various test
  • Test for mean/proportion
  • Difference of mean/proportions
  • Chi square
  • Regression test
  • Paired test
  • Statistical understanding and R implementation of Univariate analysis

3.Bivariate analysis

  • Statistical concepts of Cross tabulations and Correlation
  • P value interpretation
  • Types of correlation explained using a data set in R
  • Concept of hypothesis
  • Chi square test and worked example
  • Correlation explained with example
  • Statistical understanding and R implementation of Bivariate analysis

4.Advanced visualization

  • Heat maps
  • Geospatial maps usage and explation of importance
  • Small multiples
  • Various advanced visualization tools and techniques

5.business story telling

6.end to end case study

  • Survival analysis end-to-end case study And its interpretation
  • Attrition analysis and its interpretation
  • Active and inactive customers case study and its interpretation
  • Repeat purchase case study
  • Sales trends case study
  • segmenting customers case study

7.Machine learning 

  • Supervised Learning
  • Decisions tree plotting in R using a dataset
  • Concept of decision tree
  • Classification
  • Unsupervised Learning
  • Dimension Reduction
  • Principle component analysis and implementation in R using dataset.
  • Clustering
  • Time series analysis
  • supervised and unsupervised ML from statistical point and in R

8.Regression Analysis

statistical perspective of Regression

ROAD MAP

Priority Training

FAQ's

Play Video

news letter
GET CONNECTED
skype
facebook
gmail
youtube
twitter
whatsapp
google+
linkedin
viber
2009 - 2017 - erpXstreem. All Rights Reserved.
SAP® is the trademark or registered trademark of SAP AG. erpXstreem is not affiliated with or endorsed by SAP AG
MENU