🎧 Using Apache Kafka as Cloud-Native Data System ft. Gwen Shapira

alice.richardson · 7 December 2021 08:10

There’s a new Streaming Audio episode - check it out!

What does cloud native mean, and what are some design considerations when implementing cloud-native data services? Gwen Shapira (Apache Kafka® Committer and Principal Engineer II, Confluent) addresses these questions in today’s episode. She shares what she has learned by discussing a series of technical papers published by her team, which explain what they’ve done to expand Kafka’s cloud-native capabilities on Confluent Cloud.

Gwen leads the Cloud-Native Kafka team, which focuses on developing new features to evolve Kafka to its next stage as a fully managed cloud data platform. Turning Kafka into a self-service platform is not entirely straightforward. However, Kafka’s early investment in elasticity, scalability, and multi-tenancy to run at company-wide scale served as the North Star in taking Kafka to its next stage: a fully managed cloud service where users just need to send us their workloads and everything else will magically work. Examining modern cloud-native data services, such as Aurora, Amazon S3, Snowflake, Amazon DynamoDB, and BigQuery, reveals seven capabilities that you can expect to see in modern cloud data systems:

Elasticity: Adapt to workload changes by scaling up and down with a click or through APIs; cloud-native Kafka removes the requirement to install REST Proxy in order to use Kafka APIs
Infinite scale: Kafka can scale elastically, with capacity planning handled by a behind-the-scenes process
Resiliency: Ensures high availability to minimize downtime and support disaster recovery
Multi-tenancy: Cloud-native infrastructure needs isolation of data, namespaces, and performance, which Kafka is designed to support
Pay per use: Pay for resources based on usage
Cost-effectiveness: Cloud deployment has notably lower costs than self-managed services and also shortens adoption time
Global: Connect to Kafka from around the globe and consume data locally

Building around these key requirements, a fully managed Kafka as a service provides an enhanced user experience that is scalable and flexible with reduced infrastructure management costs. Based on their experience building cloud-native Kafka, Gwen and her team published a four-part thesis that shares insights on user expectations for modern cloud data services as well as technical implementation considerations to help you develop your own cloud-native data system.

EPISODE LINKS

Listen to the episode

