EngineerPro - Amazon Interview Review

HR round (30 mins):
● General questions

Phone screen interview (Data Engineer - 75 mins):
● 1 LP (Leadership Principle)
  Talk about the most challenging project.
  Follow-up question: Did you ever need to extend the project deadline?
● 1 Python
  Find the number that appears most often in a list.
  Ex: [1,2,2,2,3,3] -> output: 2
● 1 SQL
  Find the product_id that was sold on the most consecutive days (consecutive days >= 3).
  Ex:
  product_id  sold_date
  1           1/1/2024
  2           4/1/2024
  2           5/1/2024
  2           6/1/2024
  2           7/1/2024
  2           8/1/2024
  2           9/1/2024
  3           13/1/2024
  3           14/1/2024
  3           15/1/2024
  3           16/1/2024
  -> output: 2
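The two phone screen coding questions can be sketched as follows. This is only one possible answer, not the one given in the interview. The table name `sales`, the helper names, and the choice of SQLite (run through Python's `sqlite3` module, which needs window function support, SQLite 3.25 or later) are assumptions made for illustration. The SQL uses the common "gaps and islands" idea: within one product, subtracting a row number from the date gives the same value for every day of an unbroken run.

```python
import sqlite3
from collections import Counter
from datetime import datetime


def most_frequent(numbers):
    """Return the value that occurs most often (ties go to the value seen first)."""
    # Raises IndexError on an empty list; a real answer should agree on that case.
    return Counter(numbers).most_common(1)[0][0]


# Hypothetical table: sales(product_id, sold_date), sold_date stored as ISO text.
LONGEST_STREAK_SQL = """
WITH grouped AS (
    SELECT product_id,
           julianday(sold_date)
             - ROW_NUMBER() OVER (PARTITION BY product_id ORDER BY sold_date) AS island
    FROM (SELECT DISTINCT product_id, sold_date FROM sales)
),
streaks AS (
    SELECT product_id, COUNT(*) AS streak_len
    FROM grouped
    GROUP BY product_id, island
)
SELECT product_id
FROM streaks
WHERE streak_len >= 3
ORDER BY streak_len DESC, product_id
LIMIT 1;
"""


def longest_streak_product(rows):
    """rows: (product_id, 'd/m/yyyy') pairs. Returns a product_id or None."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (product_id INTEGER, sold_date TEXT)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        # Convert d/m/yyyy to ISO dates so that julianday() and ORDER BY work.
        [(pid, datetime.strptime(day, "%d/%m/%Y").date().isoformat()) for pid, day in rows],
    )
    row = conn.execute(LONGEST_STREAK_SQL).fetchone()
    conn.close()
    return row[0] if row else None


# Sample data copied from the example above.
SAMPLE_SALES = (
    [(1, "1/1/2024")]
    + [(2, f"{d}/1/2024") for d in range(4, 10)]
    + [(3, f"{d}/1/2024") for d in range(13, 17)]
)

print(most_frequent([1, 2, 2, 2, 3, 3]))      # 2
print(longest_streak_product(SAMPLE_SALES))   # 2
```

Product 2 has a run of six days and product 3 a run of four, so product 2 wins. How to break ties between equally long runs is not stated in the question; the sketch simply returns the lowest product_id.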
Final loop interview (5 rounds, 1 hour per round):

Round 1 (Data Engineer):
● ETL workflow designing:
  How do you design a data warehouse? STAR and SNOWFLAKE schema (DIM FACT tables)
  How do you design data pipelines? Talk from the design to the implementation: from data extraction (API, FTP, ...) to data transformation (which tools/techniques to use)
  Talk about AWS S3 and Lambda functions
  Talk about the solution to handle concurrent requests
  Talk about error handling in the Lambda function
● Data Modeling: Design a data mart for the gym app amazfit
  Users sign up for a class
  Table to track user activity
  The data mart serves 2 metrics:
    Active session for a class: users are counted as an active session when they join the class for >= 15 mins (they can pause and start multiple times)
    Active user: users are counted as active when they join a class >= 3 days
  Write some queries based on the design

Round 2 (BIE Manager):
● 3 LP + follow-up questions
  A time you used customer feedback to drive improvement
  A time you realized you needed to have a deeper level of subject matter expertise to do your job well
  A time you didn’t know what to do next or how to solve a challenging problem

Round 3 (Data Engineer):
● 3 LP + follow-up questions
  Talk about a time you disagreed with the group but had to follow the group's decision
  Talk about a time you had a great solution but the manager did not support it
  What is the most innovative thing you’ve done and why is it innovative?
● 1 Python
  Calculate the sum of a list, excluding some numbers
  Ex: [1,2,3,4,5,13], remove_number [13] -> result = 15
● SQL
  Difference between all JOIN types (LEFT, RIGHT, INNER, OUTER)
  Difference between UNION and UNION ALL (when to use each in the pipeline)
  Difference between DELETE and TRUNCATE (when to use each in the pipeline)
  Problem: Write a query to find the second-level manager of each employee
  Ex:
  emp_id  emp_name  manager_id
  1       A         None
  2       B         1
  3       C         2
  Result:
  emp_name  second_manager_name
  C         A

Round 4 (Data Engineer Manager):
● 3 LP + follow-up questions:
  A time you took on something significant outside your responsibility
  Describe the most significant improvement project you’ve led
  A time you received tough or critical feedback
● Performance Optimization
  If a query takes a long time to return results, what do you do to optimize performance?
● SQL
  Given a SQL query, try to find an issue in that query (don’t remember the query)

Round 5 (Data Engineer):
● 1 LP + follow-up questions
  Talk about a project with a tight deadline
● SQL
  Explain Data Modeling
  Explain logical and physical data models
  Explain the database surrogate key
  Explain SCD, all types, and follow-up questions for Type 1, 2
  Design a simple data model for SCD types 1 and 2 and give an example
● Data Modeling
  Case 1: design a data model for this flow: vendor -> data warehouse
  Case 2: design a data model for an e-commerce system: users buy products on the website and the company ships products to the user's address. Metric to calculate: What is the most sold product for a specific month?
  Case 3: design a data model for the return flow: if users don’t like the product, they can return it.
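For the Round 3 Python and SQL problems (sum of a list excluding some numbers, and the second-level manager query), a minimal sketch might look like the following. It is one possible answer rather than the expected one. The table name `employees`, the function names, and the use of SQLite through Python's `sqlite3` module are assumptions for illustration; the column names follow the example in the notes. The query joins the table to itself twice: once to reach the direct manager, and once more to reach that manager's manager.

```python
import sqlite3


def sum_except(numbers, remove_numbers):
    """Sum the list while skipping every value listed in remove_numbers."""
    excluded = set(remove_numbers)  # set lookup keeps the filter O(1) per item
    return sum(n for n in numbers if n not in excluded)


# Hypothetical table: employees(emp_id, emp_name, manager_id).
# The inner joins drop employees that have no second-level manager.
SECOND_MANAGER_SQL = """
SELECT e.emp_name, m2.emp_name AS second_manager_name
FROM employees AS e
JOIN employees AS m1 ON m1.emp_id = e.manager_id
JOIN employees AS m2 ON m2.emp_id = m1.manager_id
ORDER BY e.emp_id;
"""


def second_level_managers(rows):
    """rows: (emp_id, emp_name, manager_id) tuples; manager_id may be None."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE employees ("
        "emp_id INTEGER PRIMARY KEY, emp_name TEXT, manager_id INTEGER)"
    )
    conn.executemany("INSERT INTO employees VALUES (?, ?, ?)", rows)
    result = conn.execute(SECOND_MANAGER_SQL).fetchall()
    conn.close()
    return result


# Sample data copied from the Round 3 examples.
EMPLOYEES = [(1, "A", None), (2, "B", 1), (3, "C", 2)]

print(sum_except([1, 2, 3, 4, 5, 13], [13]))  # 15
print(second_level_managers(EMPLOYEES))       # [('C', 'A')]
```

If employees without a second-level manager should still appear in the result, the two joins can be changed to LEFT JOIN, which returns NULL for `second_manager_name` in those rows.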
