Saturday, 7 June 2014

What is Big Data

Over the last couple of years, Internet penetration in our lives has increased manifold. The explosion of smartphones, tablets and other gadgets has further increased our browsing time and our ability to be online on the go. Additionally, hardware prices keep falling. For these three reasons, humans are generating more data than ever. This data comes from social networking, web browsing and similar activity: posting on social networking sites, downloading and uploading pictures, videos and audio, tweeting, liking and disliking things, buying products online, running searches and so on.

Beyond the human footprint, a lot of data is generated by machines, for example stock exchange data, machine logs and web server logs.

Big data has three main features, known as the 3 Vs of big data:

Volume
The data is generated in huge volumes, to the tune of petabytes.

Velocity
Velocity is the speed at which the data is generated, and it is immensely quick. Just imagine how web clickstream data is generated: it is the data produced in response to every click you make while browsing.

Variety
Variety refers to the type or format of the data, which is diverse. We are talking about data in the form of XML files, images, audio, videos, posts, tweets etc. So data can be unstructured, semi-structured or structured.
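
To make the three formats a little more concrete, here is a minimal Python sketch. All the sample values (the user ids, the product rows, the post text) are made up for illustration and are not from any real dataset. It only shows how the same kind of activity might look as structured, semi-structured and unstructured data.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Structured: fixed columns, every row has the same fields (hypothetical sample).
structured = "user_id,product,price\n101,book,12.50\n102,pen,1.20\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing tags or keys, but fields can vary per record.
semi_xml = "<post><user>101</user><text>Hello</text></post>"
post = ET.fromstring(semi_xml)
semi_json = json.loads('{"user": 102, "likes": ["music", "films"]}')

# Unstructured: free text with no predefined schema (hypothetical sample).
unstructured = "Just bought a great book online, loving it!"
word_count = len(unstructured.split())

print(rows[0]["product"])        # field looked up by column name
print(post.find("text").text)    # field looked up by tag name
print(semi_json["likes"])        # field looked up by key
print(word_count)                # only crude measures are available without a schema
```

The structured rows can be queried by column straight away, the XML and JSON records carry their own field names, and the free text needs further processing before anything beyond simple counts can be extracted.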
