Data Science
Register for Free Demo
R Programming:
R - Core
- R Environment, R Studio, R installation , Advantages of R, R scripts
- Variables
- Data Types
- Creating Functions
- Installing and using Packages
- Reading Data into R ( Excel, CSV, JSON )
- Control Statements and Loops ( if-else, while, for etc.)
- Data Reshaping, Data manipultation, Formatting
- Graphs, using various modules ( Bar, Line, Histogram,pie etc)
Statistics with R
- Summary
- Correlation and Covariance
- T-Tests
- ANOVA
- Probability Distributions ( Normal, Bionomial, Poission Distributions )
- Probability Distributions ( Linear Regression, Logical regression, Market Basket Analysis,Naive Bayes)
- Neural Networks ( Artificial Neural networks ) / Machine Learning / Deep Learning
- Principal Component Analysis
- Time Series Analysis
- Unsupervised learning: Clustering
- Decision Trees
- K Nearest Neighbours (kNN)
Very Advance and important for projects
- RServer
- RShiny
- RMarkdown
Python: ( Eclipse with PyDev)
- Introduction to Python, Concepts of Pythons language, Installations.
- Basic Syntax and Indenting, Variables, Data types, Collections ( Lists,Tuples, Dictionary)
- Control Statements and Loops ( if-else, while, for etc.)
- Functions, Built-in, User Defined
- Lambda, Filter, Zip and Reduce
- Modules, Standard Modules, Importing into code
- Exceptions. Built in, User defined
- Writing, Reading files ( Excel, CSV, JSON )
- Using the pickle module (Shelve)
- Regular Expressions
Object Oriented
- Classes
- Inheritance, Polymorphism and other OOP concepts
Database:
- connections to various databases ( MySQL, Postgresql : Any One)
- DB Related statements ( Create, Select, insert, update etc )
Important Features
- Decorators
- Python Threads, locks (Rlock) and multiprocessing with some basic examples.
- Iterators and generators.
Data science: ( Basic and Relevant concepts in Statistics )
Basics
- What is Sample and Population?
- What is Variable? How many types of Variables?
- What is Percentile? What are plots?
- What is Central Limit theorem? Mean, Median, Mode
- Standard deviation, Skewness, Degrees of freedom
- What is covariance? What is correlation?
- Bernoulli, Binomial, Poisson, Normal distribution
- Decision Making Rules,
- Test of Hypothesis
- Type I Error, Type II Error
- Null and Alternative Hypothesis
- Reject or acceptance criterion
- One sample Test, Two sample Test, ANOVA, P Value
Prediction Modelling
- Simple Linear Regression
- Multiple Linear Regression
- Extra sum of squares
- R – Square, R- Square Adjusted
- Variable Selection
- Multicollinearity
- Transformations
- Logistic Regression
Artificial Intelligence / Machine learning
- Supervised and Unsupervised Learning
- Neural Network
- Naïve Bayes
- Laplace Estimator
- Market Basket Analysis
- KNN – Classifier
- Random Forests
ML/DL using Python - Very Advance and important for projects
- Tensorflow - Image Preiction
- Tensorflow - Image classifier /
- Tensorboard
- Pandas ( Data structures & Analysis )
- Keras ( Deep Learning)
- Numpy ( fundamental package for scientific computing with Python)
- Scipy ( Python-based ecosystem of open-source software for mathematics, science, and engineering)
- Matplotlib ( Graphs)
- Jupyter Notebook - Interactive Web
- convolutional Neural network
Mini Project
Pre Requisites: C, C++ or any Programming Language knowledge.
Target Audience: B.E/B.Tech/MCA/M.Sc