Tudor Lapusan's Blog

Data serialization with Apache Avro, part 1

This article is the first post in a two-part series about data serialization with Avro in HDFS. It focuses on the benefits of associating a schema with your data, and introduces Avro and its main characteristics. The second article will focus on data serialization using Apache Avro, with practical examples in Java and MapReduce. HDFS is a very flexible distributed storage system that lets you store any kind of data. If you store your data in its raw

Read the full post