Celebrate This Festive Season! Get 35% Off Till 22nd Oct! Enrol Now!
Watch a free video through the sample class tab to learn more about Data Science using Python

  • 48 hours (8 weekends) Live, Online, Instructor led, Practical training
  • Weekend batch: Live, online, interactive classes held on Saturdays & Sundays
  • Get the benefits of learning from your home in a fully 2-way interactive online environment
  • Interact with the instructors and fellow participants through chat, voice and video as if you are in a classroom
  • Learn practically through over 12 case studies
  • We will award you with a certificate after working on multiple real-world projects post the course
  • Online sessions will be recorded for you to revise later or if you miss a class
  • 24x7 lifetime access to the recorded sessions, course material and Python lab (installed on your own machine)
Click on the button below to enrol now and receive the attractive discount.

Fees: Rs. 30,000 / USD 480. Discounted fees: Rs. 19,500 / USD 315 till 22nd Oct only (taxes extra)

Enrol Now

We offer a full refund till 6 days post the start date of the batch if you would like to cancel, though with such a good deal we wonder who would!

Contact us at info@edvancer.in or call on +91 8080928948 for more details



  • 48 hours self-paced, online video based course
  • Learn at your own time and pace from anywhere through recorded course class videos
  • Get faculty support through email, forum & scheduled calls
  • Get 24x7, life-time access to recorded classes and course material on our learning management system. No time limits!
  • Learn practically through over 12 case studies
  • Download the free Python software on your own computer to practice 24x7x365
  • Work on and submit real-world projects at your own time and pace to earn the certificate
  • Get career assistance post certification to get into a career in data science
Fees: Rs. 22,000 / USD 360. Pay only Rs. 15,500 / USD 249 (plus taxes) till 22nd Oct.
So enrol now and save money!

Enrol Now

We offer a full refund up to 3 days post your enrolment if you would like to cancel, though with such a good deal we wonder who would!

Contact us at info@edvancer.in or call us on +91 8080928948 for more details


Topic

What does it mean?

Introduction to Data Science

  • What is data science and why is it so important?
  • Applications of data science
  • Various data science tools
  • Data Science project methodology
  • Tool of choice-Python: what & why?
  • Case study
In this section we give you an overview of the world of data science & machine learning. You will learn about the various applications of data science and how companies from all sorts of domains are solving their business problems, from day-to-day to long-term ones. We’ll learn about the skill sets that make data scientists capable of filling this vital role. Once the stage is set and we understand where we are heading, we discuss why Python is the tool of choice in data science.

Introduction to Python

  • Installation of Python framework and packages: Anaconda & pip
  • Writing/Running Python programs using Spyder & Command Prompt
  • Working with Jupyter notebooks
  • Creating Python variables
  • Numeric, string and logical operations
  • Data containers: Lists, Dictionaries, Tuples & Sets
  • Practice assignment
Python is one of the most popular & powerful languages for data science used by most top companies like Facebook, Amazon, Google, Yahoo etc. It is free and open source. This module is all about learning how to start working with Python. We shall teach you how to use the Python language to work with data.
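
As a small illustration of the variables, operations and data containers listed above, here is a minimal sketch. All names are hypothetical and chosen only for this example; the numbers reuse the course figures quoted on this page (8 weekends, Rs. 30,000 and Rs. 22,000), and the 6 hours per weekend is derived from 48 hours over 8 weekends.

```python
# Illustrative values only; the variable names are not from the course material.
hours_per_weekend = 6                 # numeric variable
course = "Data Science using Python"  # string variable
is_weekend_batch = True               # logical (boolean) variable

# Numeric, string and logical operations
total_hours = hours_per_weekend * 8
shout = course.upper()
needs_weekday_slot = not is_weekend_batch

# Data containers
topics = ["pandas", "numpy", "seaborn"]        # list: ordered, mutable
fees = {"live": 30000, "self_paced": 22000}    # dictionary: key -> value
batch_days = ("Saturday", "Sunday")            # tuple: ordered, immutable
tools = {"python", "git", "python"}            # set: duplicates are dropped

topics.append("scikit-learn")                  # lists can grow in place
cheapest = min(fees, key=fees.get)             # key with the smallest value
```

Lists and dictionaries can be changed after creation, tuples cannot, and sets keep only unique items, which is why `tools` ends up with two elements.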

Iterative Operations & Functions in Python

  • Writing for loops in Python
  • While loops and conditional blocks
  • List/Dictionary comprehensions with loops
  • Writing your own functions in Python
  • Writing your own classes and functions
  • Practice assignment
This is where you shall learn the functionalities and powerful capabilities of Python that will make it easy for you to work with data and set the stage for using Python for machine learning & data science.
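
To give a flavour of the loops, comprehensions and functions covered in this module, here is a short sketch. The `scores` data and the `mean` helper are made up for illustration and are not part of the course material.

```python
def mean(values):
    """Return the arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)

# Hypothetical marks per student
scores = {"asha": [72, 88, 95], "ravi": [60, 75], "meera": [91, 85, 78, 90]}

# for loop with a conditional block
high_scorers = []
for name, marks in scores.items():
    if mean(marks) >= 80:
        high_scorers.append(name)

# The same logic written as a list comprehension
high_scorers_lc = [name for name, marks in scores.items() if mean(marks) >= 80]

# Dictionary comprehension: name -> rounded average
averages = {name: round(mean(marks), 1) for name, marks in scores.items()}

# while loop: halve a step size until it drops below a tolerance
step, halvings = 1.0, 0
while step >= 0.1:
    step /= 2
    halvings += 1
```

The loop and the comprehension build the same list; the comprehension is simply the more compact form.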

Data summary & visualization in Python

  • Need for data summary & visualization
  • Summarising numeric data in pandas
  • Summarising categorical data
  • Group wise summary of mixed data
  • Basics of visualisation with ggplot & Seaborn
  • Inferential visualisation with Seaborn
  • Visual summary of different data combinations
  • Practice assignment
Data visualization is extremely important for understanding what the data is saying and for gaining insights at a glance. Visualization is a strong point of Python, thanks to packages such as ggplot & Seaborn, and you will learn to use them in this module.
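
The module itself works with pandas, where summaries like these are usually obtained with `describe`, `value_counts` and `groupby`. The sketch below uses only the Python standard library to show what numeric, categorical and group-wise summaries actually compute. The `orders` records are hypothetical.

```python
from collections import Counter, defaultdict
from statistics import mean, median

# Hypothetical records: (city, product, order value)
orders = [
    ("Mumbai", "loan", 120.0),
    ("Mumbai", "card", 80.0),
    ("Pune", "loan", 200.0),
    ("Pune", "loan", 100.0),
    ("Mumbai", "loan", 100.0),
]

# Numeric summary of one column
values = [value for _, _, value in orders]
numeric_summary = {
    "min": min(values),
    "max": max(values),
    "mean": mean(values),
    "median": median(values),
}

# Categorical summary: frequency of each category
product_counts = Counter(product for _, product, _ in orders)

# Group-wise summary: mean order value per city
by_city = defaultdict(list)
for city, _, value in orders:
    by_city[city].append(value)
mean_by_city = {city: mean(vals) for city, vals in by_city.items()}
```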

Data Handling in Python using NumPy & Pandas

  • Introduction to NumPy arrays, functions & properties
  • Introduction to Pandas & data frames
  • Importing and exporting external data in Python
  • Feature engineering using Python
Python is a very versatile language and in this module we expand on its data handling capabilities. Focusing on the NumPy and pandas packages, we learn how to manipulate data, which will eventually be useful in converting raw data into a form suitable for machine learning algorithms.

Data Science & Machine Learning in Python

Machine Learning Basics

  • Converting business problems to data problems
  • Understanding supervised and unsupervised learning with examples
  • Understanding biases associated with any machine learning algorithm
  • Ways of reducing bias and increasing generalisation capabilities
  • Drivers of machine learning algorithms
  • Cost functions
  • Brief introduction to gradient descent
  • Importance of model validation
  • Methods of model validation
  • Cross validation & average error
In this module we understand how to transform business problems into data problems so that we can use machine learning algos to solve them. We then look at the categories of business problems and the corresponding categories of machine learning algos. Next we cover the methodologies associated with solving such problems; these will form the basis of the techniques we learn later in the course. We’ll wrap up this module with a discussion on the importance and methods of validating our results.
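
The idea of a cost function and gradient descent can be sketched in a few lines. The example below assumes a mean squared error cost and a straight-line model; the toy data, the learning rate and the number of steps are arbitrary choices for illustration, not values from the course.

```python
def mse(w, b, xs, ys):
    """Cost function: mean squared error of the line y = w*x + b."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def gradient_descent(xs, ys, learning_rate=0.05, steps=2000):
    """Fit y = w*x + b by repeatedly stepping against the MSE gradient."""
    w, b, n = 0.0, 0.0, len(xs)
    for _ in range(steps):
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        grad_w = 2 * sum(e * x for e, x in zip(errors, xs)) / n  # d(cost)/dw
        grad_b = 2 * sum(errors) / n                             # d(cost)/db
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

# Toy data generated from y = 2x + 1 (no noise)
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = gradient_descent(xs, ys)
final_cost = mse(w, b, xs, ys)
```

On this noise-free data the fitted `w` and `b` approach 2 and 1, and the cost approaches zero. With a learning rate that is too large the updates would diverge instead.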

Generalised Linear Models in Python

  • Linear Regression
  • Regularisation of Generalised Linear Models
  • Ridge and Lasso Regression
  • Logistic Regression
  • Methods of threshold determination and performance measures for classification score models
  • Case Study
We start implementing machine learning algorithms in this module. We also get exposed to some important concepts related to regression and classification which we will use in the later modules as well. This is also where we get introduced to scikit-learn, the legendary Python library famous for its machine learning prowess.
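
As an illustration of threshold determination for a classification score model, the sketch below takes predicted probabilities (such as those produced by a logistic regression), tries a few cut-offs and keeps the one with the best F1 score. The scores, labels and candidate thresholds are hypothetical, and F1 is only one of several performance measures that could be used here.

```python
def confusion_counts(y_true, scores, threshold):
    """Count TP, FP, FN, TN when scores >= threshold are predicted positive."""
    tp = fp = fn = tn = 0
    for actual, score in zip(y_true, scores):
        predicted = 1 if score >= threshold else 0
        if predicted == 1 and actual == 1:
            tp += 1
        elif predicted == 1 and actual == 0:
            fp += 1
        elif predicted == 0 and actual == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn

def f1_score(y_true, scores, threshold):
    """Harmonic mean of precision and recall at the given threshold."""
    tp, fp, fn, _ = confusion_counts(y_true, scores, threshold)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical true labels and predicted probabilities
y_true = [0, 0, 1, 0, 1, 1, 0, 1]
scores = [0.10, 0.30, 0.35, 0.40, 0.60, 0.70, 0.80, 0.90]

candidates = [0.2, 0.35, 0.5, 0.75]
best_threshold = max(candidates, key=lambda t: f1_score(y_true, scores, t))
```

Lowering the threshold catches more true positives at the cost of more false positives, so the best cut-off depends on which measure the business cares about.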

Case Studies:
  1. Automate lender & borrower matching through prediction of loan interest rates - In this case study, we try to automate the process of lender and borrower matching for a fintech company by predicting interest rates offered.
  2. Classify customers based on revenue potential for a wealth management firm - In this classification case study, we help a financial institution predict which of its customers are going to fall in the high revenue grid, so that they can be given selective discounts for customer acquisition in the highly competitive wealth management industry.

Tree Models using Python

  • Introduction to decision trees
  • Tuning tree size with cross validation
  • Introduction to bagging algorithm
  • Random Forests
  • Grid search and randomized grid search
  • ExtraTrees (Extremely Randomised Trees)
  • Partial dependence plots
  • Case Study & Assignment
In this module you will learn a very popular class of machine learning models which are rule based tree structures also known as Decision Trees. We'll examine the biased nature of these models and learn how to use bagging methodologies to arrive at a new technique known as Random Forest to analyse data.
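
To show how a single rule in a decision tree can be chosen, here is a minimal sketch that picks the best split on one variable. It assumes Gini impurity as the splitting criterion, which is one common choice; the course may use others. The loan data is hypothetical.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 class labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(values, labels):
    """Return (threshold, weighted impurity) of the best rule 'value <= threshold'."""
    best = (None, float("inf"))
    ordered = sorted(set(values))
    # Candidate thresholds are midpoints between consecutive distinct values
    for low, high in zip(ordered, ordered[1:]):
        threshold = (low + high) / 2
        left = [y for x, y in zip(values, labels) if x <= threshold]
        right = [y for x, y in zip(values, labels) if x > threshold]
        weighted = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if weighted < best[1]:
            best = (threshold, weighted)
    return best

# Hypothetical loan data: applicant income (in thousands) and default flag
income = [20, 25, 30, 45, 50, 60]
defaulted = [1, 1, 1, 0, 0, 0]
threshold, impurity = best_split(income, defaulted)
```

A full tree repeats this search on each resulting subset. Bagging and Random Forests then train many such trees on resampled data and combine their votes.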

Case Studies: In the class we continue with the case studies from the previous module on linear models and see how the tree based models compare with the linear models in terms of performance. In the take home exercises we have two case studies:
  1. Capture risks associated with micro loans: In the 1st exercise you will work on micro loans. It is inherently risky to hand out micro loans because the micro loan process lacks the usual checks, and in this case study we try to capture the risk associated with these loans.
  2. How do the tech specifications of a vehicle impact its emissions? In the 2nd case study we find out the effect of a vehicle's technical design specifications on its average emissions and thus its environmental impact.

Boosting Algorithms using Python

  • Concept of weak learners
  • Introduction to boosting algorithms
  • Adaptive Boosting
  • Extreme Gradient Boosting (XGBoost)
  • Case Study & assignment
Want to win a data science contest on Kaggle or a data hackathon, or be known as a top data scientist? Then learning boosting algorithms is a must, as they provide a very powerful way of analysing data and solving hard-to-crack problems.

Case Studies:
  1. Save lives by predicting health issues in diabetics: A health care system in a state is struggling with poor detection of the severity of health issues in diabetic people. This results in the need for re-hospitalisation which, unfortunately, often does not happen in time. Find out if boosting algos can save lives!
  2. Predicting annual income based on census data: In the take home exercise, find out whether someone is going to have an annual income higher than a certain amount using just simple census data, and thus identify potential fraud cases in tax filings.

Support Vector Machines (SVM) & kNN in Python

  • Introduction to idea of observation based learning
  • Distances and similarities
  • k Nearest Neighbours (kNN) for classification
  • Brief mathematical background on SVM
  • Regression with kNN & SVM
  • Case Study
We step into the powerful world of “observation based algorithms” which can capture patterns in the data that otherwise go undetected. We start this discussion with kNN, which is fairly simple. After that we move to SVM, which is very powerful at capturing non-linear patterns in the data.

Case Study: Since kNN and SVM take a lot of processing time, we have kept the class discussion case study simple. The same implementation steps can be used to work on any complex business problem as well.
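
The idea behind kNN classification fits in a few lines: measure the distance from a new observation to every known observation and let the k closest ones vote. The sketch below assumes Euclidean distance and k = 3; the points and labels are hypothetical.

```python
from collections import Counter
from math import dist

def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    by_distance = sorted(
        zip(train_points, train_labels),
        key=lambda pair: dist(pair[0], query),  # Euclidean distance to the query
    )
    nearest_labels = [label for _, label in by_distance[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical 2-D observations with two classes
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (6.0, 6.0), (7.0, 7.5), (6.5, 8.0)]
labels = ["low", "low", "low", "high", "high", "high"]

prediction_a = knn_predict(points, labels, (2.0, 2.0))
prediction_b = knn_predict(points, labels, (6.0, 7.0))
```

Because every prediction compares the query with all stored observations, the cost grows with the size of the training data, which is the processing time issue mentioned above.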

Unsupervised learning in Python

  • Need for dimensionality reduction
  • Principal Component Analysis (PCA)
  • Difference between PCAs and Latent Factors
  • Factor Analysis
  • Hierarchical, K-means & DBSCAN Clustering
  • Case study
Many machine learning algos become difficult to work with when the data has many variables. We will learn methods which help solve this problem, along with clustering techniques.

Case Studies:
  1. Understanding impact of cash assistance programs in New York: To understand PCA, we take up data on cash assistance programs in New York. This has more than 60 variables. We’ll see how we can reduce the size of the data.
  2. Car Survey Data: We take up car survey data which contains technical & price details of vehicles through 11 numeric variables. We’ll see if these 11 variables reflect any hidden factors representing different properties of a vehicle.
  3. Pricing wines based on chemical properties: For K-Means we take data containing chemical properties of 4000+ white wines and examine whether we can find segments of wines based on their chemical compositions.
  4. Customer spend data at a retail chain: For DBSCAN we see how DBSCAN can be used for anomaly detection using expense data of customers from a retail chain.
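
To make the K-means idea concrete, here is a minimal standard-library sketch of the usual assign-and-update loop (Lloyd's algorithm). The points and the starting centroids are hypothetical stand-ins for real measurements such as the chemical properties in the wine case study.

```python
from math import dist

def k_means(points, centroids, iterations=10):
    """Plain k-means starting from the given centroids (at least one iteration)."""
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid
        clusters = [[] for _ in centroids]
        for point in points:
            nearest = min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))
            clusters[nearest].append(point)
        # Update step: move each centroid to the mean of its cluster
        new_centroids = []
        for centroid, cluster in zip(centroids, clusters):
            if cluster:
                new_centroids.append(tuple(sum(c) / len(cluster) for c in zip(*cluster)))
            else:
                new_centroids.append(centroid)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # converged: assignments can no longer change
        centroids = new_centroids
    return centroids, clusters

# Hypothetical 2-D data forming two well separated groups
points = [(1.0, 2.0), (2.0, 1.0), (1.0, 1.0), (2.0, 2.0),
          (8.0, 9.0), (9.0, 8.0), (8.0, 8.0), (9.0, 9.0)]
centroids, clusters = k_means(points, [(0.0, 0.0), (10.0, 10.0)])
```

The result depends on the starting centroids, so in practice the algorithm is usually run from several random starts.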

Text Mining in Python

  • Gathering text data using web scraping with urllib
  • Processing raw web data with BeautifulSoup
  • Interacting with Google search using urllib with custom user agent
  • Collecting twitter data with Twitter API
  • Naive Bayes Algorithm
  • Feature Engineering with text data
  • Sentiment analysis
  • Case study
Text data forms a big chunk of data available in the world today. Analysing text data can give a business very powerful insights to take advantage of. Python provides very useful ways to scrape data from the web or extract data from social media sites using APIs and then analyse the data.

Case Studies:
  1. Live demonstrations of web scraping and data cleaning
  2. Making a portfolio tracking tool using Yahoo finance with Python
  3. Tagging an SMS as SPAM or NON-SPAM based on its content algorithmically with Naive Bayes
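
The SPAM / NON-SPAM tagging idea can be sketched with a tiny Naive Bayes classifier. This version assumes word counts as features and add-one (Laplace) smoothing; the four training messages are made up, and a real case study would use far more data.

```python
from collections import Counter
from math import log

LABELS = ("spam", "non_spam")

def train(messages):
    """messages: list of (text, label). Returns per-label word counts and message counts."""
    word_counts = {label: Counter() for label in LABELS}
    label_counts = Counter()
    for text, label in messages:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    return word_counts, label_counts

def classify(text, word_counts, label_counts):
    """Pick the label with the highest log posterior, using add-one smoothing."""
    vocabulary = set()
    for label in LABELS:
        vocabulary |= set(word_counts[label])
    total_messages = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in LABELS:
        score = log(label_counts[label] / total_messages)  # log prior
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            count = word_counts[label][word]  # 0 for unseen words
            score += log((count + 1) / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical training messages
training = [
    ("win a free prize now", "spam"),
    ("free entry claim your prize", "spam"),
    ("are we meeting for lunch today", "non_spam"),
    ("call me when you are free", "non_spam"),
]
word_counts, label_counts = train(training)
tag_a = classify("claim your free prize", word_counts, label_counts)
tag_b = classify("lunch meeting today", word_counts, label_counts)
```

Working with log probabilities avoids multiplying many small numbers, and the smoothing keeps a single unseen word from forcing a probability of zero.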

Version Control using Git and Interactive Data Products

  • Need and Importance of Version Control
  • Setting up git and GitHub accounts on local machine
  • Creating and uploading GitHub Repos
  • Push and pull requests with GitHub App
  • Merging and forking projects
  • Introduction to Bokeh charts and plotting
  • Examples of static and interactive data products
  • Case study
We finish the course with a discussion on two very important aspects of a data scientist’s work. The first is version control, which enables you to work on large projects with multiple team members scattered across the globe. We learn about git and GitHub, the most widely used public version control platform. The second is making a quick prototype of your solutions as an interactive visualisation in the form of standalone or hosted web pages. We introduce you to Bokeh, an evolving Python library which has all the tools that you’ll need to make small prototypes of data products which can be scaled later.

We would love to hear from you regarding any query that you may have, be it about the course or about your career.

Contact us for more info



Or email us at info@edvancer.in

Or call us at +91 8080928948



  • Edvancer’s content is better than other institutes with whom I enquired and at much economical cost. After the course I got a job as a Campaign Management Analyst using SAS in ICICI Lombard.

    Rohit Kashid – Campaign Analyst, ICICI Lombard
  • It was a great experience and pleasure to learn from Edvancer.  The online class room is as good as a real class room. It was highly interactive with brainstorming on many ideas. The course content also depicts real life scenarios. Altogether it was a great learning experience.

    Vinodh S, Sr. Specialist Architect, Sapient Corp.
  • The course was of very high quality and engaging. The interactive atmosphere and live examples were refreshing. The instructor had the real world experience to understand our needs and was easily reachable at any point of the time. I highly recommend this course.

    Sumit Kamra, Project Manager, ICICI Bank
  • The business analytics course provides an in-depth understanding of analytics with hands-on experience on SAS using case studies from varied domains. You get all one needs for excelling in the field of analytics. The faculty have a very good grasp of all the concepts and the Edvancer team is very supportive.

    Girish Punjabi, Senior Business Analyst, IKen-IIT Bombay
  • I got a great job as Sr. Analyst with a 75% pay hike post this course! The course is a perfect blend of analytics tools and techniques. If you want to learn real stuff in analytics and not just the theoretical concepts, this course is for you.

    Ashish Kumar – B.Tech, IIT Madras

Benefits of taking the Data Science with Python course

  • Learn to analyze data using machine learning techniques in Python
  • Become one of the most in-demand Data Scientists in the world today
  • Learn how to analyze large amounts of data to bring out insights
  • Relevant examples and cases make the learning more effective and easier
  • Gain hands-on knowledge through the problem solving based approach of the course along with working on a project at the end of the course


Who should take this course?

This course is designed for anyone who:
  • wants to get into a career in Data Science
  • wants to analyse large amounts of data to bring out the insights from the same
  • wants to learn Python for working on data science projects


Pre-requisites

  • Ideally you should be familiar with some programming (in any language).