Jun 12-13, 2014 - Introduction to Data Science with R
SupStat (Official Training partner with RStudio Inc) is hosting our two day Introduction to Data Science with R course in New York City this June. This is a two-day workshop, designed to provide a comprehensive introduction to R. We'll get you programing & analyzing data with R in no time. All participants will receive a copy of all slides, exercises, data sets, & R scripts used in the course.
We will emphasize how you can get work done easiy with Rstudio IDE.
Vivian S. Zhang (CTO of SupStat, Organizer of NYC Open Data Meetup, Founder of NYC Data Science Academy)
Check our past students' excellent testimonial
Discount pricingavailable for academics (33% off) & students (66% off). Space is limited, please write to Send Mail to confirm your eligibility.
Who should take this course?
This class will be a good fit for you if you are just starting with R or have dabbled in R, but wish to improve your skills. No prior experience with R or data science is required.
What will you learn?
Practical skills for visualizing, transforming, & modeling data in R. During this two-day course, you will learn how to explore & understand data as well as how to do basic programming in R. A full list of topics for each day is below.
What should you bring?
Be ready to learn. You need your laptop & the latest version ofR. We also recommend downloading the Rstudio IDE, as it provides a great learning environment for beginners as well as tools for when you transition into an advanced user.
Are you interested in better understanding your data, & not so interested in mastering a programming language? Have you tried learning R from a book or website, but have been discouraged? If so, this is the course for you. We assume that you've never programmed before (although some experience doesn't hurt), & we teach you the best tools to help analyze your data.
You won't be a master programmer by the end of this two-day course, but through immersion you will have learned the basics of R's syntax & grammar, & you'll have started building an effective R vocabulary for visualizing, transforming, & modeling data. You will learn how to load, save, & transform data as well as how to write functions, generate beautiful graphs, & fit basic statistical models to your data. We'll give you a theoretical framework to help you understand the process of data analysis, but our focus is on practical tools that you can use as soon as you get back from the course.
All techniques are motivated by real problems, & you'll be exposed to a number of real datasets throughout the course. We alternate brief lectures with hands-on practice: you'll get plenty of experience actually using R (not just hearing about it!), & there's plenty of help available if you get stuck. The course concludes with a 90-minute data analysis project. You can use this as an opportunity to start using R with your data, or work on answering some of our questions about a dataset.
This tried & true course has been taken by over 200 students, from biologists to humanists, many of whom had never programmed before. This course teaches the basic skills needed by anyone seriously interested in data.
Day 1 - Getting started & working with data
Thursday, June 12nd, 2013
An Introduction to R & data analysis- R is more than just a programming language. R is a statistical software application in its own right, an environment for interactive data analysis, & a community of passionate users. This orientation to the R language will help you get up & running.
How to download & update R & SupStat
How to find resources & help for R
Stages of data analysis
Best practices of data analysis
Visualizing data- R's is well known for its beautiful graphics. R packages, like `ggplot2`, provide an expressive & logical language for building clear & effective data visualizations.
Visualize the distribution of a variable
Exploring & plotting relationships between variables
Display very large data sets through graphs without over-plotting
Use best practices for Exploratory Data Analysis in R code
Working with data- R is a programming language with a purpose: to analyze data. Learning how R stores & handles data will help you apply R to any data source.
Loading different data formats into R
Working with factors in R
How to clean poorly formatted data
Saving your data
Manipulating data- R's methods for data manipulation make it easy & fast to extract information from data sets & to prepare raw data for analysis.
Subset, transform, summarize, & reorder data sets
Perform targeted, groupwise operations on data
Join multiple data sets together
Day 2 - Programming & modeling in R
Friday, Jun 13rd, 2013
Programming in R- Many people use R as an application, a sort of statistical calculator, but R is also a programming language. Once you learn to program in R, you will be a more versatile & capable data analyst. You'll learn to write code that provides the precise solutions you are looking for.
Create an if else statement
Write & optimize for & while loops in R
Use best practices for programming in R
R functions- Functions allow you to save your code for later or to share it with other R users. Knowing how to write a function will also streamline your workflow. Functions give code a more efficient structure that avoids duplication & aids debugging.
Organize a problem into a series of functions
Write a function in R
Apply best practices for writing functions in R
Simulation in R- Simulating data provides a way to test hypotheses & discover the uncertainty in your estimates.
Generate random numbers in R
Visualize uncertainty with bootstrapping in R
Construct a confidence interval with bootstrapping in R
Test a hypothesis with a permutation test in R
Modeling in R- R excels at statistical analysis & modeling, but its methods for modeling may seem unintuitive at first.
Write a formula in R
Fit a model to data in R
Explore data sets with models
In certain cases, we may need to cancel this workshop due to circumstances beyond our control or otherwise. If this happens, SupStat will refund all registration fees for those who signed up. SupStat is not responsible for any related expenses incurred by registered attendees (including but not limited to travel & hotel expenses).
Until May 30th, 2014 - Full refund, less 10% of registration fees|
May 31st, 2014 to June 5th, 2014 - 50% refund of registration fees
Jun 06, 2014 & after - No refund available
All public workshops hosted by SupStat with a no-questions-asked money-back guarantee.