Logistic Regression, LDA & KNN in Python - Predictive Modeling
A free online course about the classification techniques involved in logistic regression, LDA and KNN in Python.
Take this certificate on your own.
Start now and learn at your own pace.
CertificationView course modules
Are you looking to become an expert in solving real-life problems using different classification algorithms in Python? This course will give you the ability to interpret the outcomes of a logistic regression model in Python. You will be able to use these results when making strategic decisions in your organization. Gain insight into the methods of dispersion, which will help you understand the spread of a data set, namely range, standard deviation and variance. Discover that when comparing centres, the mean is not always the best measure of central tendency as outliers heavily influence it, which is the principal reason why the median is preferred over the mean. You will be taught about the advantages of using the mode to measure centres, including the fact that it can be calculated for both quantitative and qualitative data, and that mean and median can only be used for quantitative data. The course will also introduce you to the main Python libraries, which are Pandas, NumPy and Seaborn.
Discover the first key steps in building a machine learning model, where you convert your business problem into a statistical problem, define the dependent and independent variables, and identify whether you want to predict or infer. You will learn about training data and testing data, where training data refers to the information used to train an algorithm and testing data includes only the input data and is used to access the created model’s accuracy or the predictor function made using the training data. Uncover the importance of handling missing values in real-world data and the importance of managing it appropriately since many machine learning algorithms do not support data sets with missing values. You will then study the most common methods of imputing missing values, which are segment-based imputation, impute with zero and impute with median, mean or mode.
Next, you will learn about the linear discriminant analysis technique, which is based on Bayes’ theorem, as the preferred method when the response variable has more than two classes. Discover how, with a given set of predictor values, you can use this technique to calculate the probability of a particular observation belonging to each group and assign the group with the highest probability to that observation. You will then identify the drawbacks of the k-nearest neighbors technique, including that it does not mention each variable’s relationship and the response variable. Finally, you will learn about interpreting the classification models’ results, creating a confusion matrix in Python, evaluating model performance, and dummy variable creation in Python. This course will be of interest to data scientists, executives or students interested in learning about classification techniques. Why wait? Start this course today and become a classification model and problem-solving expert.Start Course Now
Machine Learning and Statistics
Machine Learning and Statistics - Learning Outcomes
Setting Up Python
Diverse Libraries in Python
Machine Learning and Statistics - Lesson Summary
Classification Models in Python
Classification Models in Python - Learning Outcomes
Outlier Treatment in Python
Confusion Matrix and LDA
Test-Train Split and Classifiers
Classification Models in Python - Lesson Summary
Upon successful completion of this course, you should be able to:
- Outline the examples of descriptive statistic tools
- Describe the components of a comprehensive data dictionary
- Explain the important libraries in Python
- Discuss the different types of Seaborn plot functions
- Explain the three measures of dispersion
- Discuss the three performance metrics for evaluating models in Python
- Explain the three examples of classification techniques
- Discuss the linear discriminant analysis technique
- Describe the k-nearest neighbors classification technique
All Alison courses are free to enrol, study and complete. To successfully complete this Certificate course and become an Alison Graduate, you need to achieve 80% or higher in each course assessment. Once you have completed this Certificate course, you have the option to acquire an official Certificate, which is a great way to share your achievement with the world. Your Alison Certificate is:
Ideal for sharing with potential employers - include it in your CV, professional social media profiles and job applications
An indication of your commitment to continuously learn, upskill and achieve high results
An incentive for you to continue empowering yourself through lifelong learning
Alison offers 3 types of Certificates for completed Certificate courses:
Digital Certificate - a downloadable Certificate in PDF format, immediately available to you when you complete your purchase
Certificate - a physical version of your officially branded and security-marked Certificate, posted to you with FREE shipping
Framed Certificate - a physical version of your officially branded and security-marked Certificate in a stylish frame, posted to you with FREE shipping
All Certificates are available to purchase through the Alison Shop. For more information on purchasing Alison Certificates, please visit our FAQs. If you decide not to purchase your Alison Certificate, you can still demonstrate your achievement by sharing your Learner Record or Learner Achievement Verification, both of which are accessible from your Dashboard. For more details on our Certificate pricing, please visit our Pricing Page.