AWS Data Engineer Roadmap

Total Course Duration: 66 Hours
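
As a rough planning aid, the 66-hour total can be turned into a calendar estimate. The sketch below assumes a fixed number of study hours per day and a fixed number of study days per week. Both parameters and the function name `estimate_completion` are illustrative and are not taken from the course site.

```python
import math
from datetime import date, timedelta

TOTAL_COURSE_HOURS = 66  # total stated in this roadmap


def estimate_completion(start: date, hours_per_day: float, days_per_week: int = 7):
    """Return (study_days, calendar_weeks, expected_date) for a given pace.

    hours_per_day and days_per_week are assumptions supplied by the learner.
    """
    if hours_per_day <= 0 or not 1 <= days_per_week <= 7:
        raise ValueError("hours_per_day must be > 0 and days_per_week in 1..7")
    # Number of study sessions needed to cover the full course.
    study_days = math.ceil(TOTAL_COURSE_HOURS / hours_per_day)
    # Stretch study days over the calendar when not studying every day.
    calendar_days = math.ceil(study_days * 7 / days_per_week)
    weeks = round(calendar_days / 7, 1)
    return study_days, weeks, start + timedelta(days=calendar_days)
```

For example, `estimate_completion(date(2025, 1, 1), hours_per_day=2)` returns `(33, 4.7, date(2025, 2, 3))`: 33 study days, or a little under five weeks.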

Level 1
Learn Python Basics To Advanced Level

Knowing Python is not enough. What really matters is knowing how to structure your code so others can read, maintain, and scale it. This course focuses exactly on that mindset.

⏱ 9.0 Hours

Level 2
AWS Lambda ETL Project

Python AWS Lambda ETL project, PyCharm & GitHub integration, project structure, code optimization, Pytest & pre-commit hooks, AWS Lambda Layers, environment variables, hands-on ETL assignment

⏱ 10.0 Hours
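
To give a sense of what a Lambda ETL step with environment variables can look like, here is a minimal sketch of a handler that reads its configuration from the environment. The variable name `TARGET_PREFIX`, the event shape, and the `transform` helper are hypothetical; the course's actual project structure may differ.

```python
import json
import os


def transform(record: dict) -> dict:
    """Hypothetical transform step: lowercase keys and strip string values."""
    return {
        key.strip().lower(): (value.strip() if isinstance(value, str) else value)
        for key, value in record.items()
    }


def lambda_handler(event, context):
    """Entry point using the standard (event, context) Lambda signature."""
    # Configuration comes from the environment rather than being hard-coded.
    prefix = os.environ.get("TARGET_PREFIX", "processed/")  # assumed variable name
    records = event.get("records", [])  # assumed event shape
    rows = [transform(record) for record in records]
    return {
        "statusCode": 200,
        "body": json.dumps({"target_prefix": prefix, "rows": rows}),
    }
```

Because the handler is a plain function, it can be called directly from Pytest with a dictionary event and `None` as the context, without deploying anything.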

Level 3
AWS Lambda ETL Orchestration Project

ETL orchestration project, database setup, sprint board task planning, cloud-based project setup and testing, input data validation using Pydantic, ORM class design, ETL orchestration abstract design, end-to-end ETL flow explanation

⏱ 8.0 Hours

Level 4
AWS API Gateway Webservice Project

AWS API Gateway web service project, API fundamentals and deployment, API project setup, code optimization, ORM integration, user CRUD endpoints, create and update APIs, API YAML configuration for frontend integration

⏱ 8.0 Hours

Level 5
ReactJS Application Integration

ReactJS application integration (Project 4), ReactJS introduction, user management delete request, user management create and update requests

⏱ 3.0 Hours

Level 6
DevOps Deployment Process

DevOps deployment process, application deployment strategy, manual ReactJS deployment, GitHub Actions CI/CD workflow, automated build and deployment pipeline

⏱ 4.0 Hours

Level 7
PySpark Project Development

PySpark big data project development, Pandas to PySpark transition, PySpark fundamentals and installation, web scraping with PySpark, Parquet file concepts, PySpark with AWS Athena, PySpark deployment, environment variables handling, dependency and module management

⏱ 12.0 Hours

Level 8
Redshift Database Integration

AWS Redshift database fundamentals, Redshift cluster overview, loading data into Redshift from local systems, basic Redshift data integration workflows

⏱ 2.0 Hours

Level 9
ETL Live Student Full Project

Live ETL student project walkthrough, real-time project explanation, data migration from RDS to S3, incremental data movement, S3 to Redshift ETL job, Redshift data ingestion workflow

⏱ 7.0 Hours
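
Incremental data movement is commonly implemented with a high-water mark: each run copies only the rows changed since the previous run. The sketch below illustrates that idea on in-memory rows. The `updated_at` column and the function name `extract_incremental` are assumptions, and the course project may use a different mechanism.

```python
from datetime import datetime


def extract_incremental(rows, last_watermark):
    """Return the rows changed after last_watermark, plus the new watermark.

    Each row is assumed to be a dict with a datetime under "updated_at".
    """
    new_rows = [row for row in rows if row["updated_at"] > last_watermark]
    # If nothing changed, keep the old watermark so the next run starts from it.
    new_watermark = max(
        (row["updated_at"] for row in new_rows), default=last_watermark
    )
    return new_rows, new_watermark
```

In a pipeline like the one this level describes, the returned watermark would be persisted between runs so that the next extraction from the source database picks up where the last one stopped.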

Level 10
Airflow Integration

Apache Airflow fundamentals, ETL data flow orchestration, Airflow project setup, scheduling and managing AWS Glue jobs

⏱ 3.0 Hours