The interviewer asked about projects and the fundamentals of Python, SQL, and PySpark, including ETL processes, SQL queries, joins, data transformations, and distributed data processing concepts as used in real-time scenarios. There were also questions about handling duplicate records.