🎧 Connecting Azure Cosmos DB with Apache Kafka - Better Together ft. Ryan CrawCour

alice.richardson · 14 April 2021 14:03

There’s a new Streaming Audio episode - check it out!

When building solutions for customers in Microsoft Azure, it is not uncommon to come across customers who are deeply entrenched in the Apache Kafka® ecosystem and want to continue expanding within it. Thus, figuring out how to connect Azure first-party services to this ecosystem is of the utmost importance.

Ryan CrawCour is a Microsoft engineer who has been working on all things data and analytics for the past 10+ years, including building out services like Azure Cosmos DB, which is used by millions of people around the globe. More recently, Ryan has taken a customer-facing role where he gets to help customers build the best solutions possible using Microsoft Azure’s cloud platform and development tools.

In one case, Ryan helped a customer leverage their existing Kafka investments and persist event messages in a durable managed database system in Azure. They chose Azure Cosmos DB, a fully managed, distributed, modern NoSQL database service as their preferred database, but the question remained as to how they would feed events from their Kafka infrastructure into Azure Cosmos DB, as well as how they could get changes from their database system back into their Kafka topics.

Although integration is in his blood, Ryan confesses that he is relatively new to the world of Kafka and has learned to adjust to what he finds in his customers’ environments. Oftentimes this is Kafka, and for many good reasons, customers don’t want to change this core part of their solution infrastructure. This has led him to embrace Kafka and the ecosystem around it, enabling him to better serve customers.

He’s been closely tracking the development and progress of Kafka Connect. To him, it is the natural step from Kafka as a messaging infrastructure to Kafka as a key pillar in an integration scenario. Kafka Connect can be thought of as a piece of middleware that can be used to connect a variety of systems to Kafka in a bidirectional manner. This means getting data from Kafka into your downstream systems, often databases, and also taking changes that occur in these systems and publishing them back to Kafka where other systems can then react.

One day, a customer asked him how to connect Azure Cosmos DB to Kafka. There wasn’t a connector at the time, so he helped build two with the Confluent team: a sink connector, where data flows from Kafka topics into Azure Cosmos DB, as well as a source connector, where Azure Cosmos DB is the source of data pushing changes that occur in the database into Kafka topics.

EPISODE LINKS

Listen to the episode

Topic		Replies	Views
✍️ Integrating Azure and Confluent: Ingesting Data to Azure Cosmos DB through Kafka News and Blogs	0	3187	18 March 2021
How to run multiple cosmos source connectors for same Container with different transform configurations in Kafka connect Kafka Connect	3	3110	28 August 2022
Getting ready to deploy Kafka for POC Architecture and Design	5	4170	30 March 2021
Connector Kafka for Azure Cosmos DB Gremlin Source Kafka Connect	1	3199	9 July 2021
🎧 Intro to Kafka Connect: Core Components and Architecture ft. Robin Moffatt News and Blogs	0	3203	28 September 2021

🎧 Connecting Azure Cosmos DB with Apache Kafka - Better Together ft. Ryan CrawCour

Related topics