Events  Deals  Jobs  NFT NYC 2024 
    Sign in  
 
 
With Vivian Zhang (CTO @ SupStat, Founder @ NYC Data Science Academy & Adjunct Prof. @ NYU).
Sun, Jun 07, 2015 @ 01:00 PM   $1590   NYC Data Science Academy, 205 E 42nd St
 
     
 
 
              

      
 
Sign up for our awesome New York
Tech Events weekly email newsletter.
   
LOCATION
EVENT DETAILS
Details
Each class is 20 hours of classroom guidance with an optional three week-long showcase project of students' own choices and optional presentation of their projects.

Date: Sunday |
May 17th, 30th and Jun 7th,14th, 28th (We take breaks on Memorial Weekend on May 24th and Father's Day on Jun 21st)
Time:
1:00pm - 5:00pm
Instructor:
Vivian Zhang, CTO of SupStat., Founder of NYC Data Science Academy, Adjunct Professor at NYU and Stony Brook Univ., with Masters Degree of Computer Science, and Masters Degree of Applied Math and Statistics
Venue:
205 E 42nd Street, 16th Floor, New York, NY 10017 (5 min from Grand Central)
Online course option:
Students can take it remotely through recorded youtube sessions with google hangout TA support, email info@nycdatascience.com to get enrolled
For corporate training or small group training inquiry:
Email info@nycdatascience.com to get corporate/group discount
Instructor
Vivian Zhang, Founder and CTO of SupStat Inc, Adjunct Prof at NYU and Stony Brook Univ.
Vivian is a data scientist who has being devoted to the analytics industry and data technologies over years. Prior to taking entrepreneurial steps, she worked as a Senior Financial Analyst at Memorial Sloan-Kettering Cancer Center and Scientific Programmer at the Center of Statistics of Brown University. Vivian received Double Master Degrees in Computer Science and Statistics.

Course Overview
This five week course is an introduction to data analysis with the Python programming language, and is aimed at beginners. We introduce how to work with different data structure in Python. We covered the most popular modules, including Numpy, Scipy, Pandas, matplotlib, Seaborn, and ggplot, to do data analytics and visualization. We use ipython notebook to demonstrate the results of codes and change codes interactively during the class. Our past students include people have no programming experience and people have little exposure by taking Python class at NYU or General Assembly. Students told us our classes are very engaging, interactive, hands-on and have tons of content.

Syllabus
Day 1 - Introduction to Python

Python is a high-level programming language.You will learn the basic syntax and data structures in Python. Ipython provides a robust and productive environment for interactive and exploratory computing, which is great tool to do scientific computation and education.

Introduction to Ipython
Basic objects in Python
Variables and self-defining functions
Control flow
Advanced data structures
Day 2 - Explore deeper with Python

Python is a object-oriented programming language. Learn a little about OOP will help you understand how Python codes work. To do data analysis, the first thing you need to know is how to deal with files which contains data. Sometime the data is dirty and unstructured, you will learn text processing including regular expressions to deal with them.

Classes: introduction to object-oriented programming
How to deal with files
Run Python scripts
Handling and processing strings
Day 3 - Scientific computation tools

There are three modules for scientific computation that make Python as powerful as Matlab: Numpy, Matplotlib and Scipy. Numpy, short for Numerical Python, is the foundational package for scientific computing in Python. Matplotlib is the most popular Python library for producing plots and other 2D data visualizations. SciPy is a collection of packages addressing a number of different standard problem domains in scientific computing.

Numpy
Matplotlib (mainly the sub-module *pyplot*)
Scipy (manily the sub-module *stats*)
Day 4 - Data Visualization

Python can also generate graphics easily by using appropriate tools like *Seaborn* and *ggplot*. Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface for drawing attractive statistical graphics. ggplot is a famous library port from R, which is a plotting system. ggplot provides a powerful model of graphics that makes it easy to produce complex multi-layered graphics.

Seaborn
ggplot
Pandas
Day 5 - Data manipulation with Pandas

Pandas provides rich data structures and functions designed to make working with structured data fast, easy, and expressive. The *DataFrame* object in pandas is just like the *data.frame* object in R. Pandas makes data manipulation(filter, select, group, aggregate, etc.) as easier as in R.

Project Demo Day and Certificates
From the rudimentary building blocks of programming basics, to data manipulation and use of advanced drawing packages, the course ends with a demonstration of a project of your choice on Project Demo Day. On Demo Day you will access and analyze real data, utilizing the tools and skill sets taught to you throughout the course. Upon successful completion of the course and demonstration of your final project, you will qualify for one of three certificates: Extraordinary Standing, Honorable Graduation, and Active Participation.

Certificates are awarded according to your understanding, skill, and participation. No prerequisite needed for the course.
 
 
 
 
© 2024 GarysGuide      About    Feedback    Press    Terms