KafkaWikiStream is a lightweight Spring Boot application designed to stream real-time edits from Wikimedia into Apache Kafka. It showcases a simple end-to-end pipeline that connects a public API to a Kafka topic for further processing or analysis.
- Pulls live update events from the Wikimedia recent changes stream.
- Produces those events into an Apache Kafka topic.
- Includes a Kafka consumer that reads and logs the streamed data.
- Java 8 or higher
- Spring Boot
- Apache Kafka
- Maven
- Start your Kafka server and Zookeeper.
- Build the project using Maven: mvn clean install
- Run the Producer and Consumer applications separately to stream and consume data.
- Producer/ β Connects to Wikimedia and sends data to Kafka.
- Consumer/ β Subscribes to Kafka topic and logs incoming updates.
- Real-time data ingestion
- Kafka practice for streaming architecture
- Learning project for Spring Boot with Kafka integration