Extracted
Extracted
Below is a comprehensive list of Python-related questions and topics from the provided
interview guides and resources. These questions cover general Python programming, data
manipulation, ETL, PySpark, and automation as relevant to data engineering roles.
Implement a Python function to count unique words from a file and write them to another
file.
Write a decorator function to log the execution time of a function.
Create a Python program to demonstrate the use of set operations (union, intersection).
Implement file handling in Python to read a CSV and store only specific columns in a
dictionary.
Explain the difference between mutable and immutable objects in Python. [2] [3]
How would you handle an exception in Python? Provide an example.
What are lambda functions in Python? How are they different from regular functions?
How would you iterate over a dictionary in Python and print its keys and values?
Explain the concept of generators in Python. Provide an example of a generator function.
How would you sort a list of dictionaries based on a specific key in Python?
What is the difference between shallow copy and deep copy in Python? When would you
use each?
How can you read data from a CSV file in Python? Provide an example.
Explain the concept of object-oriented programming (OOP) in Python. Give an example of a
class and its usage.
How would you handle memory management in Python? What is the purpose of garbage
collection? [3]
Scheduling jobs in Databricks and defining tasks using Python scripts. [13]
Sample Scenario-Based and Conceptual Questions
How would you handle missing data in a DataFrame in Python?
How do you handle duplicate or corrupted data in a batch ETL job?
How would you debug a failing ETL pipeline in production?
How do you automate data insertion into BigQuery using Python?
How do you interact with Google BigQuery using Python?
How would you merge and deduplicate two sorted lists in Python?
How would you create a function to detect anomalies in sales trends using Pandas and
NumPy?
How would you build a reusable ETL framework using Airflow and Python?
How do you handle schema changes in PySpark over time?
How do you monitor and troubleshoot data pipeline failures using Python-based tools?
How do you manage memory allocation in Spark using PySpark?
How do you handle skewed data in a Spark job using PySpark?
How do you validate data using Python in ETL pipelines?
How do you implement file handling in Python for reading and writing CSVs?
How do you implement object-oriented programming concepts in Python for data
engineering tasks? [4] [8] [5] [13] [12] [1] [2] [9] [11] [10] [14] [6] [15] [3] [7]
This list covers the breadth of Python-related questions you may encounter in data engineering
interviews, including practical coding, data pipeline design, ETL automation, PySpark, and data
manipulation tasks.
⁂
Implement a Python function to count unique words from a file and write them to another
file. [19]
Write a decorator function to log the execution time of a function. [19]
Create a Python program to demonstrate the use of set operations (union, intersection). [19]
Implement file handling in Python to read a CSV and store only specific columns in a
dictionary. [19]
Explain the difference between mutable and immutable objects in Python. [19]
This list comprehensively covers all Python-related questions and scenarios from the provided
interview materials, including coding, ETL, automation, PySpark, and data pipeline design.
⁂
1. Amazon-Fresher.pdf
2. American-Express.pdf
3. https://www.careerflow.ai/blog/amazon-data-engineer-interview
4. Aarate_1.pdf
5. Adidas.pdf
6. https://www.linkedin.com/posts/prakhar-srivastava-615922150_dataengineer-adidasinterview-bigdata-
activity-7280997662022144000-DfmK
7. https://dataengineeracademy.com/blog/data-engineer-interview-questions-with-python-detailed-answ
ers/
8. Accenture-Azure-Data-Engineer-3.pdf
9. Bitwise.pdf
10. Bristol-Myers-Squibb.pdf
11. Boston-Consulting-Group-_BCG.pdf
12. Amazon-Experienced.pdf
13. Altimetrik.pdf
14. https://www.interviewquery.com/p/data-engineer-python-questions
15. https://www.interviewquery.com/interview-guides/altimetrik-data-engineer
16. Aarate_1.pdf
17. Amazon-Fresher.pdf
18. Accenture-Azure-Data-Engineer-3.pdf
19. American-Express.pdf
20. Adidas.pdf
21. Boston-Consulting-Group-_BCG.pdf
22. Amazon-Experienced.pdf
23. Bitwise.pdf
24. Bristol-Myers-Squibb.pdf
25. Altimetrik.pdf