Complete your Databricks User Groups profile!

Fill out a few details about yourself so the community can get to know you.
Get Certified: AWS Databricks Platform Architect Accreditation β€” Real-World Lakehouse Architecture & Hands-On Design

AWS Databricks Platform Architect Flashcard

πŸ” Identity & Access

Flashcard 1
Q: What enables Databricks to access AWS services securely?
A: IAM Role (Cross-account / Instance profile)

Flashcard 2
Q: What role is required for accessing external storage (S3)?
A: Self-assuming cross-account IAM role

Flashcard 3
Q: What is the trusted principal for external storage permissions?
A: Static Unity Catalog IAM role

Flashcard 4
Q: What privileges are required for query federation?
A: Metastore admin + Account admin

πŸ—‚οΈ Storage & Data Access

Flashcard 5
Q: Where are storage credentials created?
A: Workspace β†’ Data Explorer

Flashcard 6
Q: What are the minimum requirements to access external storage?
A: Storage credential + IAM role

Flashcard 7
Q: What is required before creating a metastore?
A: S3 bucket + IAM role

Flashcard 8
Q: What is the AWS workspace storage requirement?
A: One bucket for both DBFS and system data

🧱 Architecture & Networking

Flashcard 9
Q: What determines region in AWS Databricks deployment?
A: VPC

Flashcard 10
Q: What subnet configuration is required?
A: Two private subnets

Flashcard 11
Q: What registers networking in Databricks?
A: Network configuration

Flashcard 12
Q: How do clusters access AWS services?
A: Instance profile attached to cluster

πŸ” Security & Encryption

Flashcard 13
Q: What does encryption key configuration do?
A: Registers KMS key for Databricks

Flashcard 14
Q: Can encryption keys be rotated?
A: Yes

Flashcard 15
Q: What resources can encryption apply to?
A: Root bucket, system bucket, cluster disks

πŸ”„ Data Federation & Integration

Flashcard 16
Q: What is query federation?
A: Query external data without moving it

Flashcard 17
Q: Is query federation read/write or read-only?
A: Read-only

Flashcard 18
Q: What is required for AWS service integration?
A: Enable service + create IAM role

βš™οΈ Compute & Performance

Flashcard 19
Q: How do you reduce compute cost?
A: Autoscaling + right instance type + tagging

Flashcard 20
Q: What cluster type reduces idle cost?
A: Job clusters (ephemeral compute)

πŸ”— External Locations & Unity Catalog

Flashcard 21
Q: Steps to create external location?
A: Catalog β†’ Connection β†’ Storage credential

Flashcard 22
Q: What does Unity Catalog manage?
A: Data, access control, lineage, governance

Flashcard 23
Q: Relationship between metastores and workspaces?
A: One metastore can serve multiple workspaces

🌐 Data Platforms & Services

Flashcard 24
Q: Which AWS service supports foundation model tuning?
A: Amazon Bedrock

Flashcard 25
Q: Default port for Redshift federation?
A: 5439

0 comments