1. How to trigger a lambda when file is placed in S3?
  2. Lets assume same file is being placed in S3 multiple times. Design a solution that does to trigger lambda for duplicate file insertion in S3 bucket.
  3. How to read cloudwatch logs in aws lambda using boto3?
  4. How to achieve cross account copy in AWS. Assume you have a file in account A and want to share that with account B?
  5. What is the difference between aws Glue Dynamic frames and spark dataframes?
  6. How data changes will reflect in aws Glue Catalog table? 
  7. Write a Pyspark program to read a csv file from S3, filter the data based on conditions and write it back to same S3 bucket?
  8. Explain Spark Architecture?
  9. What is the difference between logical plan and Physical plan in Pyspark?
  10. Write a python program to group anagrams? input: ['cat', 'tea', 'ate', 'tan', 'ant', 'cup', 'puc']
  11. Write a python program to rotate an array by K times?
  12. What is read and write consistency in S3?
  13. How to change memory in aws Glue?
  14. What is the main difference between resource allocation in aws Glue and Spark?
  15. What is the constrain in reading and writing data to same s3 path in glue?
  16. What should be the S3 folder structure if the data is partitioned at day level?
  17. Write a python program to read data from a gzip file?
  18. What are the common error handlings in current project?
  19. How to handle PII data in Pyspark?
  20. How to handle slow running jobs in Spark?
  21. How to calculate the resources for a glue job?
  22. What is the use of aws Athena?