Get the feature names output by a ColumnTransformer

Need to get the feature names output by a ColumnTransformer?
Use get_feature_names(), which now works with "passthrough" columns (new in version 0.23)!
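As a rough illustration of the tip, here is a minimal sketch with one encoded transformer and passthrough columns. The toy DataFrame and its column names are invented for the example. Note that in later scikit-learn releases `get_feature_names()` was deprecated (1.0) and then removed (1.2) in favor of `get_feature_names_out()`, so the sketch tries the newer method first and only falls back to the older one.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder

# Toy data, invented for illustration only
df = pd.DataFrame({
    "Embarked": ["S", "C", "S", "Q"],
    "Sex": ["male", "female", "female", "male"],
    "Fare": [7.25, 71.28, 7.92, 8.05],
})

# One-hot encode two columns and pass the remaining column through unchanged
ct = ColumnTransformer(
    [("ohe", OneHotEncoder(), ["Embarked", "Sex"])],
    remainder="passthrough",
)
ct.fit(df)

# Newer versions expose get_feature_names_out(); older ones (0.23 up to 1.1)
# expose get_feature_names()
if hasattr(ct, "get_feature_names_out"):
    feature_names = list(ct.get_feature_names_out())
else:
    feature_names = list(ct.get_feature_names())

print(feature_names)
```

The exact name format (prefixes such as the transformer name) differs between versions, so it is safer to print the list than to hard-code the names.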

👉 New tips every TUESDAY and THURSDAY! 👈

Comments
Author

If I may suggest, it would be interesting to see the difference between pipeline inside a column transformer vs. a column transformer inside a pipeline. I personally always put CT inside a pipeline, so keen to know a use case for the other one.

Dara-ljrk
Author

Hello, I want to get the whole set of column names, including the numerical ones, but with your method it returned "Transformer num (type Pipeline) does not provide get_feature_names." Do you have any other suggestions?

haitingyou
Author

Oh, I didn't know that you can get the feature names from the output. What's the difference, then, if I were to call df.columns on the output dataframe?

I still cannot get the idea of doing OHE and then doing PCA on the output in a pipeline. Have you done a video on that already? If not, could you give some tips on how to transform the output from the previous steps, as in the scenario I've described? Thanks very much in advance.

eatbreathedatascience
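A minimal sketch of the scenario described above (one-hot encoding followed by PCA in one pipeline), under the assumption that the encoded output should be dense before it reaches PCA. The toy columns and step names are invented. By default the transformed output is a NumPy array or sparse matrix rather than a DataFrame, which is why there is no `.columns` attribute to read the names from.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.decomposition import PCA
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Toy data, invented for illustration only
df = pd.DataFrame({
    "Embarked": ["S", "C", "S", "Q", "C", "S"],
    "Sex": ["male", "female", "female", "male", "male", "female"],
})

# sparse_threshold=0 makes the ColumnTransformer always return a dense array
encode = ColumnTransformer(
    [("ohe", OneHotEncoder(handle_unknown="ignore"), ["Embarked", "Sex"])],
    sparse_threshold=0,
)

# Each step receives the output of the previous one, so PCA sees the
# one-hot encoded matrix rather than the raw strings
pipe = Pipeline([
    ("encode", encode),
    ("pca", PCA(n_components=2)),
])

components = pipe.fit_transform(df)
print(components.shape)
```

Whether PCA on one-hot columns is useful depends on the data; the sketch only shows how the steps chain together.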
Author

this just saved me from a big headache

zeinat
Author

Hey, I have a question and would like a solution from you. I have an array of strings like ["4599"], ["6625"], ["7777"], ["12345"], [7070], but I want to print only ["4599"], ["6625"], ["7777"]. That is, I want only the strings whose last two characters are the same, or whose first two characters are the same, or whose characters are all the same, as in ["7777"]. How can I solve this?

python
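One way to read the question above is "keep the items whose first two or last two characters match". The sketch below follows that reading; the helper name `has_adjacent_repeat` is hypothetical, and the rule itself is an interpretation of the comment rather than a confirmed requirement.

```python
def has_adjacent_repeat(value) -> bool:
    """Return True if the first two or the last two characters are equal."""
    s = str(value)  # the sample data mixes strings and an int (7070)
    if len(s) < 2:
        return False
    return s[0] == s[1] or s[-1] == s[-2]


items = [["4599"], ["6625"], ["7777"], ["12345"], [7070]]

# Keep each one-element list whose value satisfies the rule
selected = [item for item in items if has_adjacent_repeat(item[0])]
print(selected)  # [['4599'], ['6625'], ['7777']]
```

A string whose characters are all the same, such as "7777", already passes the first-two check, so it needs no separate case.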