Learn By Example: Statistics and Data Science in R

371 коментарий
Обучение платное
Сертификация бесплатная
9 часов курса
О курсе

Taught by a Stanford-educated, ex-Googler and an IIT, IIM - educated ex-Flipkart lead analyst. This team has decades of practical experience in quant trading, analytics and e-commerce. 

This course is a gentle yet thorough introduction to Data Science, Statistics and R using real life examples. 

Let’s parse that.

Gentle, yet thorough: This course does not require a prior quantitative or mathematics background. It starts by introducing basic concepts such as the mean, median etc and eventually covers all aspects of an analytics (or) data science career from analysing and preparing raw data to visualising your findings. 

Data Science, Statistics and R: This course is an introduction to Data Science and Statistics using the R programming language. It covers both the theoretical aspects of Statistical concepts and the practical implementation using R. 

Real life examples: Every concept is explained with the help of examples, case studies and source code in R wherever necessary. The examples cover a wide array of topics and range from A/B testing in an Internet company context to the Capital Asset Pricing Model in a quant finance context. 

What's Covered:

Data Analysis with R: Datatypes and Data structures in R, Vectors, Arrays, Matrices, Lists, Data Frames, Reading data from files, Aggregating, Sorting & Merging Data Frames

Linear Regression: Regression, Simple Linear Regression in Excel, Simple Linear Regression in R, Multiple Linear Regression in R, Categorical variables in regression, Robust regression, Parsing regression diagnostic plots

Data Visualization in R: Line plot, Scatter plot, Bar plot, Histogram, Scatterplot matrix, Heat map, Packages for Data Visualisation : Rcolorbrewer, ggplot2

Descriptive Statistics: Mean, Median, Mode, IQR, Standard Deviation, Frequency Distributions, Histograms, Boxplots

Inferential Statistics: Random Variables, Probability Distributions, Uniform Distribution, Normal Distribution, Sampling, Sampling Distribution, Hypothesis testing, Test statistic, Test of significance

You, This course and Us
This course is a gentle yet thorough introduction to Data Science, Statistics and R using real life examples.
Top Down vs Bottoms Up : The Google vs McKinsey way of looking at data
Q. How do companies make decisions?  A. Using data We talk about what it takes to go from data to making a decision from data. This sets the agenda for the rest of the course - each of the things on this journey is covered in the upcoming sections
R and RStudio installed
Get setup with R and Rstudio. All the examples that follow in this course will have source code attached. Download and run them in Rstudio
The 10 second answer : Descriptive Statistics
Descriptive Statistics : Mean, Median, Mode
Bosses are impatient. They often want you to cut to the chase, and give them an answer that's ok, but in a short amount of time. Descriptive statistics are the first place to start - they are often the 10s answer to any question about the data. 
Our first foray into R : Frequency Distributions
Computing a frequency distribution using R
Draw your first plot : A Histogram
A histogram is a good visual summary of your data. 
Computing Mean, Median, Mode in R
Computing the Mean, Median, Mode in R
What is IQR (Inter-quartile Range)?
The mean, median and mode are point estimates to represent your data. IQR is a measure that explains the spread of the data.
Box and Whisker Plots
Visualize the IQR and outliers using box and whisker plots
The Standard Deviation
The standard deviation measures the spread of a dataset, and it so happens, the standard deviation is actually very profound.
  • No prerequisites : We start from basics and cover everything you need to know. We will be installing R and RStudio as part of the course and using it for most of the examples. Excel is used for one of the examples and basic knowledge of excel is assumed.
Что Вы изучите?
  • Harness R and R packages to read, process and visualize data
  • Understand linear regression and use it confidently to build models
  • Understand the intricacies of all the different data structures in R
  • Use Linear regression in R to overcome the difficulties of LINEST() in Excel
  • Draw inferences from data and support them using tests of significance
  • Use descriptive statistics to perform a quick study of some data and present results
Loony Corn
Loony Corn
An ex-Google, Stanford and Flipkart team

Loonycorn is us, Janani Ravi and Vitthal Srinivasan. Between us, we have studied at Stanford, been admitted to IIM Ahmedabad and have spent years  working in tech, in the Bay Area, New York, Singapore and Bangalore.

Janani: 7 years at Google (New York, Singapore); Studied at Stanford; also worked at Flipkart and Microsoft

Vitthal: Also Google (Singapore) and studied at Stanford; Flipkart, Credit Suisse and INSEAD too

We think we might have hit upon a neat way of teaching complicated tech courses in a funny, practical, engaging way, which is why we are so excited to be here on Udemy!

We hope you will try our offerings, and think you'll like them :-)

Курсы Udemy подойдут для профессионального развития. Платформа устроена таким образом, что эксперты сами запускают курсы. Все материалы передаются в пожизненный доступ. На этой платформе можно найти курс, без преувеличений, на любую тему – начиная от тьюториала по какой-то камере и заканчивая теоретическим курсом по управлению финансовыми рисками. Язык и формат обучения устанавливается преподавателем, поэтому стоит внимательно изучить информацию о курсе перед покупкой.