Tudor Lapusan's Blog


HDFS : The Hadoop Distributed Filesystem, part 2

Here we are with the second part of the HDFS article. If you didn't read the first part, you can find it here. In the first part, I wrote about the main HDFS concepts, such as blocks, datanodes, and the namenode. Now I will write about other HDFS characteristics, such as file operations, HDFS challenges, and its integration with other Big Data frameworks.

HDFS operations

HDFS offers a simple API which allows us to handle the data inside it. There are

Read the full post


HDFS : The Hadoop Distributed Filesystem, part 1

As we all know or have heard, the amount of data grows exponentially each year. Nowadays almost every person has a mobile phone, which is a data generator; many websites on the internet generate large volumes of logs with click events, user interactions, etc.; and in recent years the Internet of Things (IoT) has appeared, where each device contains sensors which can also generate massive amounts of information. So, as you may guess, there is a big

Read the full post