AWS Glue PySpark: Change Column Data Types

preview_player
Показать описание
This video is about how to change column data types in AWS Glue using PySpark. This tutorial will walk through how to achieve this using the resolveChoice method in a dynamic frame

#aws #awsglue #pyspark
Рекомендации по теме
Комментарии
Автор

Thanks @DataEng Uncomplicated Very Informative Videos!! could you also please create a video to move Pyspark dynamic transformed Data Frame to Redshift Table

harishtripathi
Автор

I saw aws glue does not support binary type. But my situation is I have to do ETL job from mongodb which some of column's type is UUID by binary. is there any chance that i can change to useable type in glue?

KyuGShim
Автор

Interesting video, thanks! I was under the impression that resolveChoice could only be used on the "ChoiceType" schema in AWS Glue? I've been using applyMapping to resolve this issue (which requires generating a mapping for all columns you wish to keep, not just the ones you want to change... tedious for frames with lots of columns!)

ryanalex