Introduction to Feature Engineering in Machine Learning

Показать описание

In this video, we will learn about feature engineering in Machine Learning.
Feature engineering is a critical task that data scientists have to perform prior to training AI/ML models.

Check out top-rated Udemy courses here:

10 days of No Code AI Bootcamp

Modern Artificial Intelligence with Zero Coding

Python & Machine Learning for Financial Analysis

Modern Artificial Intelligence Masterclass: Build 6 Projects

AWS SageMaker Practical for Beginners | Build 6 Projects

Data Science for Business | 6 Real-world Case Studies

AWS Machine Learning Certification Exam | Complete Guide

TensorFlow 2.0 Practical

TensorFlow 2.0 Practical Advanced

Machine Learning Regression Masterclass in Python

Machine Learning Practical Workout | 8 Real-World Projects

Machine Learning Classification Bootcamp in Python

MATLAB/SIMULINK Bible|Go From Zero to Hero!

Python 3 Programming: Beginner to Pro Masterclass

Autonomous Cars: Deep Learning and Computer Vision in Python

Control Systems Made Simple | Beginner's Guide

Artificial Intelligence in Arabicالذكاء الصناعي مبتدئ لمحترف

The Complete MATLAB Computer Programming Bootcamp

As a data scientist, you may need to:
- Highlight important information in the data
- Remove/isolate unnecessary information (e.x.: outliers).
- Add your own expertise and domain knowledge to alter the data.

Feature engineering is an art of introducing new features that weren’t existing before. Data scientists spend 80% of their time performing feature engineering. The remaining 20% is the easy part which includes training the model and performing hyperparameters optimization.

Performing proper feature engineering is crucial to improve AI/ML model performance.

As a data scientist, you need to answer the following questions:
- What are the capabilities of the ML model I have?
- Which features should I select?
- Can I add my domain knowledge to use less features?
- Can I come up with new features from the data I have at hand?
- What should I put in the missing data locations?

It is important to choose features that are most relevant to the problem.

Adding new features that are unnecessary will increase the computational requirements needed to train the model (curse of dimensionality).

There are numerous techniques that could be used to reduce the number of features (compress/encode the data) such as Principal Component Analysis (PCA).

Thanks.

#featureengineering #machinelearning