Cloud Data Platform Engineering with Azure Databricks

Type product
Tijdsduur

Cloud Data Platform Engineering with Azure Databricks

FreshMinds B.V.
Logo van FreshMinds B.V.
Opleiderscore: starstarstarstarstar 9,6 FreshMinds B.V. heeft een gemiddelde beoordeling van 9,6 (uit 7 ervaringen)

Tip: incompany training nodig? Vraag een offerte aan bij meerdere aanbieders!

10
Gemiddeld cijfer voor Cloud Data Platform Engineering with Azure Databricks
Gebaseerd op 1 ervaring Lees alle ervaringenchevron_right
starstarstarstarstar
Megan Graves
10
Cloud Data Platform Engineering with Azure Databricks

"Ontzettend goede en nuttige training. Ik heb weinig ervaring met Cloud Engineering maar ondanks dat was de training goed te volgen en Bram beantwoorde alle vragen die ik had. Als ik ergens niet uit kwam nam hij de tijd om met mij te gaan zitten. het is vooral heel fijn om de theorie meteen te kunnen toepassen, dit maakt het heel tastbaar. Ik heb nu een goede basis en begrijp de concepten die aangeleerd worden ook goed! Super tevreden!" - 13-12-2023 09:44

"Ontzettend goede en nuttige training. Ik heb weinig ervaring met Cloud Engineering maar ondanks dat was de training goed te volgen en Bram b… alles lezen - 13-12-2023 09:44

Beschrijving

DESCRIPTION

The Data Platform Engineering with Azure Databricks course is designed to provide participants with a comprehensive understanding of how to leverage Azure Databricks to build scalable and efficient data platforms. Throughout this course, participants will gain hands-on experience in designing, implementing, and managing data engineering workflows using Azure Databricks. From data ingestion and transformation to advanced analytics and machine learning, participants will explore the key features and best practices of Azure Databricks for building robust data platforms.

The course follows a hands-on approach, participants engage in live coding sessions and have access to all cours…

Lees de volledige beschrijving

Veelgestelde vragen

Er zijn nog geen veelgestelde vragen over dit product. Als je een vraag hebt, neem dan contact op met onze klantenservice.

Nog niet gevonden wat je zocht? Bekijk deze onderwerpen: Databricks, Cloud Computing, Microsoft Azure, Data Science en Machine learning.

DESCRIPTION

The Data Platform Engineering with Azure Databricks course is designed to provide participants with a comprehensive understanding of how to leverage Azure Databricks to build scalable and efficient data platforms. Throughout this course, participants will gain hands-on experience in designing, implementing, and managing data engineering workflows using Azure Databricks. From data ingestion and transformation to advanced analytics and machine learning, participants will explore the key features and best practices of Azure Databricks for building robust data platforms.

The course follows a hands-on approach, participants engage in live coding sessions and have access to all course materials upon completion of the course. The course is taught using the Microsoft Azure cloud platform and Databricks, the engineering principles are very much applicable to other cloud platforms and tooling.

TARGET AUDIENCE

This course is designed for professionals working with data daily who want to understand what it takes to build a data platform in the cloud. This course is perfect for all:

  • data engineers and data architects working on on-premises systems who want to learn about cloud platforms
  • software developers working in the data domain
  • BI developers and data scientists who want to learn more about the full stack of data systems.

Prerequisites:

- Proficiency in Python programming language and SQL  

- Basic understanding of data engineering principles and technologies

- Basic understanding of DevOps principles

TOPICS COVERED

Topic 1: Introduction to Azure Databricks

  • Overview of Azure Databricks and its role in data platform engineering
  • Understanding the architecture and components of Azure Databricks
  • Databricks dataplatform architecture, Bronze, Silver and Gold layers
  • Exploring the Azure Databricks workspace and notebooks

Topic 2: Data Ingestion

  • Orchestration, Azure Datafactory or Databricks
  • Overview of data ingestion techniques with Azure Databricks
  • Ingestion with Databricks or Azure Data Factory
  • Integrating Azure Databricks with data sources like Azure Storage, Azure Data Lake, and more
  • Configuring delta loads

Topic 3: Data Engineering Workflows

  • Designing and implementing end-to-end data engineering workflows with Azure Databricks
  • Building scalable ETL (Extract, Transform, Load) processes
  • A discussion about different data source characteristics
  • A discussion about the difference between data cleaning and applying business logic
  • Performing data preparation and transformation using Spark DataFrame API
  • Handling slowly changing dimensions
  • ACID transactions and the delta log

Topic 4: Scheduling and monitoring

  • Managing data pipelines and scheduling jobs in Azure Databricks
  • Monitor data pipelines using Azure Log Analytics and Logic apps
  • Strategies for resolving schema drift

Topic 5: Packaging your pipelines, Deployment and Integration

  • Deployment strategies for Azure Databricks workspaces
  • Integrating Azure Databricks with other Azure services like Azure Data Factory, Azure Synapse Analytics, etc.
  • Continuous integration and deployment (CI/CD) workflows with Azure DevOps and Azure Databricks
  • Unit tests for your solutions

Topic 6: Delivering analytics to the business

  • Dimensional modelling in Spark SQL
  • Leveraging slowly changing dimensions type 2 in your gold layer
  • Presenting your data using Power BI

Note: The course outline provided above is a general guideline and can be customized or expanded based on specific requirements or audience needs.

ABOUT THE TRAINER

Bram Durieux

Bram is a seasoned data professional and open-source software enthusiast with over 15 years of experience in the business as Cloud platform engineer, systems architect, data scientist and engineer and data systems specialist on platforms such as the Microsoft ecosystem and Oracle SQL servers. Bram is driven to make high performance ETL pipelines and to provide value to the business from leveraging data.

In his role as lead engineer, Bram has educated and guided starting data engineers and scientists in their journey to becoming a well-respected data professional.  

Extra information

This training can be given in Dutch or English, based on the language of the participants. 

10
Gemiddeld cijfer voor Cloud Data Platform Engineering with Azure Databricks
Gebaseerd op 1 ervaring
starstarstarstarstar
Megan Graves
10
Cloud Data Platform Engineering with Azure Databricks

"Ontzettend goede en nuttige training. Ik heb weinig ervaring met Cloud Engineering maar ondanks dat was de training goed te volgen en Bram beantwoorde alle vragen die ik had. Als ik ergens niet uit kwam nam hij de tijd om met mij te gaan zitten. het is vooral heel fijn om de theorie meteen te kunnen toepassen, dit maakt het heel tastbaar. Ik heb nu een goede basis en begrijp de concepten die aangeleerd worden ook goed! Super tevreden!" - 13-12-2023 09:44

"Ontzettend goede en nuttige training. Ik heb weinig ervaring met Cloud Engineering maar ondanks dat was de training goed te volgen en Bram b… alles lezen - 13-12-2023 09:44

Er zijn nog geen veelgestelde vragen over dit product. Als je een vraag hebt, neem dan contact op met onze klantenservice.