Category: Apache Flink

Sending additional data to and from Flink cluster
In the first part of the article, I'll describe how to send configuration data to our Flink cluster. In the second part of the article, I will describe a non-trivial way of sending data back from a Flink cluster.

Four ways to optimize your Flink applications
Apache Flink provides many ways to tweak its execution. Here I'll show four different ways to improve the performance of your Flink applications.

Getting started with stream processing using Apache Flink
Apache Flink is taking the world of Big Data by storm. Learn how to start develop complex stream processing algorithms using it.

Getting started with batch processing using Apache Flink
Apache Flink is the next big thing in Big Data and has excellent support for both batch and stream processing. Learn how to start develop batch processing algorithms using it.

Apache Flink vs. Apache Spark
If you look at this image with a list of Big Data tools it may seem that all possible niches in this field are already occupied. With so much competition it should be very tough to come up with a...

Distributed graphs processing with Pregel
Graphs processing is an important part of data analysis in many domains. But graphs processing is tricky may be tricky since general purpose distributed computing tools are not suited for graphs processing. It is not surprising that an important advancement...

Graphs processing with Apache Flink
Graphs are everywhere. Internet, maps, and social networks to name just a few are all examples of massive graphs that contains vast amounts of useful information. Since the size of these networks is growing and processing them become more and...

Implementing Flink batch data connector
Apache Flink has a versatile set of connectors for externals data sources. It can read and write data from databases, local and distributed file systems. However, sometimes what Flink provides is not enough, and we need to read some uncommon...

Using Apache Flink with Java 8
JDK 8 has introduced a lot of long-anticipated features to Java language. Among those, the most notable was the introduction of lambda functions. They allowed adding new frameworks such as Java 8 Streams, as well as, new features to existing...

Calculating movies ratings distribution with Apache Flink
In this post, I would like to show how to implement a simple batch processing algorithm using Apache Flink. In the process, I'll show few tricks that you can use to improve the performance of your Flink applications.

Apache Flink: A New Landmark on the Big Data Landscape
In this article, I would like to introduce Apache Flink, describe what its main features are, and why is it different from other available solutions. I'll end the article with an example of a simple stream processing application using Flink.