Application Development & Programming Training Icon
Data Analyst Training Icon

Machine Learning with Python


3 Days Classroom Session   |  
3 Days Live Online
Classroom Registration
Individual:
$1995.00
Group Rate:
$1795.00
(per registrant, 2 or more)
GSA Individual:
$1456.35
Live Online Registration
Live Online:
$1995.00
Private Onsite Package

This course can be tailored to your needs for private, onsite delivery at your location.

Request a Private Onsite Price Quote

Professional Credits

IIBA (CDU)

ASPE is an IIBA Endorsed Education Provider of business analysis training. Select Project Delivery courses offer IIBA continuing development units (CDU) in accordance with IIBA standards.

PMI (PDU)

Select courses offer Leadership (PDU-L), Strategic (PDU-S) and Technical PMI professional development units that vary according to certification. Technical PDUs are available in the following types: ACP, PBA, PfMP, PMP/PgMP, RMP, and SP.

Certification
Overview

This hands-on machine learning course advances your data analysis skills into the realm of real-world data science. If you have a working familiarity with Python, our three-day class equips you to go back to work with real-world predictive modeling and basic machine learning techniques. Led by expert data scientists, you will work with Python to lay your data science foundation and learn techniques that allow you to leverage your data in sophisticated, powerful new ways.

In this Machine Learning Python Course, You Will Learn How to:

  • Address business needs and identifying new business opportunities using machine learning
  • Work with missing values, outliners and duplicate records with Python
  • Implement hypothesis testing for model evaluation analysis
  • Utilize both supervised and unsupervised machine learning
  • Build a linear regression model with Python
  • Build a classification model with Python
  • Use the K-means clustering method for cluster analysis with Python
Upcoming Dates and Locations
All Live Online times are listed in Eastern Time Guaranteed To Run
Request a quote for private onsite training Request
Dec 16, 2019 – Dec 18, 2019    8:30am – 4:30pm Live Online Register
Jan 27, 2020 – Jan 29, 2020    8:30am – 4:30pm San Francisco, California

Learn IT
33 New Montgomery St.
Suite 300
San Francisco, CA 94105
United States

Register
Jan 27, 2020 – Jan 29, 2020    11:30am – 7:30pm Live Online Register
Feb 24, 2020 – Feb 26, 2020    8:30am – 4:30pm Seattle, Washington

Allied Business Systems - Computer Classrooms
10604 NE 38th Place, Suite 118
Yarrow Bay Office Park-1 North
Kirkland, WA 98033
United States

Register
Feb 24, 2020 – Feb 26, 2020    11:30am – 7:30pm Live Online Register
Mar 30, 2020 – Apr 1, 2020    8:30am – 4:30pm Live Online Register
Mar 30, 2020 – Apr 1, 2020    8:30am – 4:30pm Washington, District of Columbia

Microtek-Washington, DC
1110 Vermont Avenue NW
Suite 700
Washington, DC 20005
United States

Register
Apr 27, 2020 – Apr 29, 2020    8:30am – 4:30pm Austin, Texas

Embassy Suites Austin Central
5901 North IH-35
Frontage Rd
Austin, TX 78723
United States

Register
Apr 27, 2020 – Apr 29, 2020    9:30am – 5:30pm Live Online Register
May 26, 2020 – May 28, 2020    8:30am – 4:30pm Live Online Register
May 26, 2020 – May 28, 2020    8:30am – 4:30pm Atlanta, Georgia

Microtek Atlanta
1000 Abernathy Rd. NE Ste 194
Northpark Bldg 400
Atlanta, GA 30328
United States

Register
Jun 29, 2020 – Jul 1, 2020    8:30am – 4:30pm Live Online Register
Jun 29, 2020 – Jul 1, 2020    8:30am – 4:30pm Chicago, Illinois

Microtek Chicago
230 W. Monroe
Suite 900
Chicago, IL 60606
United States

Register
Jul 27, 2020 – Jul 29, 2020    8:30am – 4:30pm Live Online Register
Jul 27, 2020 – Jul 29, 2020    8:30am – 4:30pm New York, New York

NYC Seminar and Conference Center
71 West 23rd
Suite 515-Lower Level
New York, NY 10010
United States

Register
Aug 24, 2020 – Aug 26, 2020    8:30am – 4:30pm Live Online Register
Aug 24, 2020 – Aug 26, 2020    8:30am – 4:30pm Raleigh, North Carolina

ASPE Training
2000 Regency Parkway
Suite 335
Cary, NC 27518
United States

Register
Sep 28, 2020 – Sep 30, 2020    8:30am – 4:30pm Live Online Register
Sep 28, 2020 – Sep 30, 2020    8:30am – 4:30pm Philadelphia, Pennsylvania

Hyatt Place
440 American Avenue
King Of Prussia, PA 19406
United States

Register
Oct 26, 2020 – Oct 28, 2020    8:30am – 4:30pm Houston, Texas

Texas Training and Conference
11490 Westheimer Rd.
Suite 600
Houston, TX 77077
United States

Register
Course Outline

Part 1: Overview of Data Science

  1. Data Science as a quantitative discipline
    • How to define Data Science scopes
    • The many faces of Data Science: Data Mining, Data Analysis, Data Analytics, Machine Learning, Predictive Modeling, Statistical Learning, Mathematical Modeling. What are these all about?
    • Data Mining as a data exploration process
    • Machine Learning: supervised vs. unsupervised
    • Machine Learning vs. Predictive Analytics
    • Big Data Analytics: what is it and why it's important
  2. Overview of a Data Mining process cycle
    • Understanding business needs and identifying new business opportunities
    • Formulating a business problem and associated requirements
    • Defining key quantitative metrics to measure success and evaluating business benefits
    • Translating business requirements into technical requirements and documentation
    • Formulating data models based on business and technical requirements
    • Identifying a set of quantitative models based on technical requirements and metrics of success
    • Running the models and evaluating results
    • Selecting the best model
    • Deploying the model

Part 2: The Data Foundation

  1. Data sources
  2. Types of data
    • Structured vs. unstructured data
    • Static data vs. real-time data
    • Types of data attributes: numerical vs. categorical
    • Role of time factor and time trends in data analysis
  3. Working with missing values
    • Main causes of missing data
    • Understanding the importance of missing information
    • Types of missing information
    • Restoring missing values
    • Imputing missing values and selecting imputation techniques
    • Understanding and evaluating potential consequences of manipulating records with missing values
  4. Working with outliers
    • Defining quantitative criteria for outlier detection in 1D cases
    • Understanding role of outliers in model building
    • Deciding on outlier removal
    • Defining outlier detection metrics in multi-dimensional space
  5. Working with duplicate records
    • Defining duplicates
    • Understanding sources of duplicates
    • Deciding on duplicate removal

Part 3: Sampling and Hypothesis Testing

  1. Why sampling may be important for Machine Learning
  2. Sampling techniques and sample bias
  3. Statistical hypothesis
  4. Z-score, t-score and F statistic
  5. P-values
  6. Implementation of hypothesis testing for model evaluation analysis

Part 4: Machine Learning Fundamentals

  1. What is Machine Learning?
  2. Supervised vs. unsupervised learning
  3. Overview of supervised Machine Learning
    • Regression models
    • Classification models
  4. Overview of unsupervised Machine Learning
    • Clustering methods
    • Principal component analysis and dimension reduction
    • Association rules
  5. Overview of major steps in building and testing quantitative models
    • Criteria for model selection
    • How to prepare a training set
    • Criteria for selecting model attributes/predictors
    • Working with collinear variables
    • Addressing imbalance problem
    • Dealing with over-fitting; bias-variance tradeoff
    • Validation and cross-validation

Part 5: Building a Linear Regression Model with Python

  1. Univariate regression vs. multiple regression
  2. Mathematical foundation of linear regression overview: least square method vs. maximum likelihood method
  3. Model assumptions
  4. Working with continuous attributes
  5. Dealing with collinear variable
  6. Model subset selection:
    • Forward stepwise selection
    • Backward selection
    • Shrinkage methods: ridge regression and Lasso
    • Dimension reduction
    • Information criteria
  7. Automating model selection procedure
  8. Model parameter evaluation, R squared vs. adjusted R squared
  9. Validating the model
  10. Working with categorical variables
  11. Considering input variable interactions

Part 6: Example of building a Classification Model with Python

  1. Dealing with imbalanced training sets
  2. Understanding confusion matrix
  3. Evaluating binary classifiers using ROC / AUC

Part 7: Example of Cluster Analysis with Python

  1. Overview of cluster analysis mathematical foundation
  2. K-means clustering method
    • Algorithm overview
    • Convergence criteria
    • How to determine the number of clusters

Part 8: Dimension Reduction techniques with Python

  1. What is dimension reduction?
  2. The practical goals of dimension reduction implementation
  3. Principal component analysis vs. singular value decomposition
  4. How many components to choose

Part 9: Class Conclusion

  1. What was not covered in the class
  2. Big Data Analytics – the future of machine learning: main tools and concepts
Who should attend

Intermediate level data analysts interested in expanding their data mining processes. We emphasize Data Foundation and Machine Learning concepts. All exercises are performed using Python.

Pre-Requisites

This machine learning with Python course is for individuals intermediate data analysis skills and basic knowledge of descriptive statistics. Any experience with Python is also beneficial. 

Technical requirements: Installed Python and some Python packages. 

Additionally, although it is not mandatory, students who have completed the Introduction to Basic Python Programming course have found it very helpful when completing this course.

0
1