filmov
tv
13. Pepsico pyspark interview question and answer | azure data engineer interview Q & A | databricks
Показать описание
Pepsico PySpark interview questions and answers?
PySpark interview q & a?
Databricks interview question and answers?
#Databricks #PysparkInterviewQuestions #deltalake
Azure Databricks #spark #pyspark #azuredatabricks #azure
In this video, I discussed Pepsico PySpark scenario based interview questions and answers.
Pepsico PySpark interview questions and answers?
Create dataframe:
======================================================
data=[(1,'2024-01-01',"I1",10,1000),(2,"2024-01-15","I2",20,2000),(3,"2024-02-01","I3",10,1500),(4,"2024-02-15","I4",20,2500),(5,"2024-03-01","I5",30,3000),(6,"2024-03-10","I6",40,3500),(7,"2024-03-20","I7",20,2500),(8,"2024-03-30","I8",10,1000)]
schema=["SOId","SODate","ItemId","ItemQty","ItemValue"]
display(df1)
-----------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------
display(df2)
-------------------------------------------------------------------------------------------------------------------
display(df3)
-------------------------------------------------------------------------------------------------------------------
display(df4)
-------------------------------------------------------------------------------------------------------------------
============================================================
Learn PySpark, an interface for Apache Spark in Python. PySpark is often used for large-scale data processing and machine learning.
Azure data factory tutorial playlist:
ADF interview question & answer:
1. pyspark introduction | pyspark tutorial for beginners | pyspark tutorial for data engineers:
2. what is dataframe in pyspark | dataframe in azure databricks | pyspark tutorial for data engineer:
3. How to read write csv file in PySpark | Databricks Tutorial | pyspark tutorial for data engineer:
4. Different types of write modes in Dataframe using PySpark | pyspark tutorial for data engineers:
5. read data from parquet file in pyspark | write data to parquet file in pyspark:
6. datatypes in PySpark | pyspark data types | pyspark tutorial for beginners:
7. how to define the schema in pyspark | structtype & structfield in pyspark | Pyspark tutorial:
8. how to read CSV file using PySpark | How to read csv file with schema option in pyspark:
9. read json file in pyspark | read nested json file in pyspark | read multiline json file:
10. add, modify, rename and drop columns in dataframe | withcolumn and withcolumnrename in pyspark:
11. filter in pyspark | how to filter dataframe using like operator | like in pyspark:
12. startswith in pyspark | endswith in pyspark | contains in pyspark | pyspark tutorial:
13. isin in pyspark and not isin in pyspark | in and not in in pyspark | pyspark tutorial:
14. select in PySpark | alias in pyspark | azure Databricks #spark #pyspark #azuredatabricks #azure
15. when in pyspark | otherwise in pyspark | alias in pyspark | case statement in pyspark:
16. Null handling in pySpark DataFrame | isNull function in pyspark | isNotNull function in pyspark:
17. fill() & fillna() functions in PySpark | how to replace null values in pyspark | Azure Databrick:
18. GroupBy function in PySpark | agg function in pyspark | aggregate function in pyspark:
19. count function in pyspark | countDistinct function in pyspark | pyspark tutorial for beginners:
20. orderBy in pyspark | sort in pyspark | difference between orderby and sort in pyspark:
21. distinct and dropduplicates in pyspark | how to remove duplicate in pyspark | pyspark tutorial:
PySpark interview q & a?
Databricks interview question and answers?
#Databricks #PysparkInterviewQuestions #deltalake
Azure Databricks #spark #pyspark #azuredatabricks #azure
In this video, I discussed Pepsico PySpark scenario based interview questions and answers.
Pepsico PySpark interview questions and answers?
Create dataframe:
======================================================
data=[(1,'2024-01-01',"I1",10,1000),(2,"2024-01-15","I2",20,2000),(3,"2024-02-01","I3",10,1500),(4,"2024-02-15","I4",20,2500),(5,"2024-03-01","I5",30,3000),(6,"2024-03-10","I6",40,3500),(7,"2024-03-20","I7",20,2500),(8,"2024-03-30","I8",10,1000)]
schema=["SOId","SODate","ItemId","ItemQty","ItemValue"]
display(df1)
-----------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------
display(df2)
-------------------------------------------------------------------------------------------------------------------
display(df3)
-------------------------------------------------------------------------------------------------------------------
display(df4)
-------------------------------------------------------------------------------------------------------------------
============================================================
Learn PySpark, an interface for Apache Spark in Python. PySpark is often used for large-scale data processing and machine learning.
Azure data factory tutorial playlist:
ADF interview question & answer:
1. pyspark introduction | pyspark tutorial for beginners | pyspark tutorial for data engineers:
2. what is dataframe in pyspark | dataframe in azure databricks | pyspark tutorial for data engineer:
3. How to read write csv file in PySpark | Databricks Tutorial | pyspark tutorial for data engineer:
4. Different types of write modes in Dataframe using PySpark | pyspark tutorial for data engineers:
5. read data from parquet file in pyspark | write data to parquet file in pyspark:
6. datatypes in PySpark | pyspark data types | pyspark tutorial for beginners:
7. how to define the schema in pyspark | structtype & structfield in pyspark | Pyspark tutorial:
8. how to read CSV file using PySpark | How to read csv file with schema option in pyspark:
9. read json file in pyspark | read nested json file in pyspark | read multiline json file:
10. add, modify, rename and drop columns in dataframe | withcolumn and withcolumnrename in pyspark:
11. filter in pyspark | how to filter dataframe using like operator | like in pyspark:
12. startswith in pyspark | endswith in pyspark | contains in pyspark | pyspark tutorial:
13. isin in pyspark and not isin in pyspark | in and not in in pyspark | pyspark tutorial:
14. select in PySpark | alias in pyspark | azure Databricks #spark #pyspark #azuredatabricks #azure
15. when in pyspark | otherwise in pyspark | alias in pyspark | case statement in pyspark:
16. Null handling in pySpark DataFrame | isNull function in pyspark | isNotNull function in pyspark:
17. fill() & fillna() functions in PySpark | how to replace null values in pyspark | Azure Databrick:
18. GroupBy function in PySpark | agg function in pyspark | aggregate function in pyspark:
19. count function in pyspark | countDistinct function in pyspark | pyspark tutorial for beginners:
20. orderBy in pyspark | sort in pyspark | difference between orderby and sort in pyspark:
21. distinct and dropduplicates in pyspark | how to remove duplicate in pyspark | pyspark tutorial:
Комментарии