There is not a general consensus in the literature as to how to define big data, but a common framework to describe them are the so-called three Vs or the three dimensions of big data: volume (the quantity of stored big data), variety (the variety of documents that constitute big data), and velocity (the speed at which new data is being generated). Big data are characterized by high values of all these three dimensions. A fourth dimension has been added later, veracity, which refers to the biases, noise, and abnormality in data that may make any result of their analysis completely meaningless if not properly collected and treated. Big data should be distinguished from large data sets or “lots of data” because it is not the volume of data alone that makes a set of data “big,” but their inherent complexity. The advent of big data has been hailed by some scholars as a new era for research, based on a new paradigm of science. Anyhow, another part of the scientific community expresses perplexity about the effective extent of the data revolution pointing out a number of critical points. Anyhow, without a doubt, big data analytics is a trending topic offering a large assortment of analyses designed to drive evidence-based decision-making both for the private business/industrial and the public/social sectors. Nevertheless, it raises some privacy issues that call for the definition of new de-identification techniques and procedures to guarantee the individual’s privacy.

Big Data

di Bella, Enrico
2018-01-01

Abstract

There is not a general consensus in the literature as to how to define big data, but a common framework to describe them are the so-called three Vs or the three dimensions of big data: volume (the quantity of stored big data), variety (the variety of documents that constitute big data), and velocity (the speed at which new data is being generated). Big data are characterized by high values of all these three dimensions. A fourth dimension has been added later, veracity, which refers to the biases, noise, and abnormality in data that may make any result of their analysis completely meaningless if not properly collected and treated. Big data should be distinguished from large data sets or “lots of data” because it is not the volume of data alone that makes a set of data “big,” but their inherent complexity. The advent of big data has been hailed by some scholars as a new era for research, based on a new paradigm of science. Anyhow, another part of the scientific community expresses perplexity about the effective extent of the data revolution pointing out a number of critical points. Anyhow, without a doubt, big data analytics is a trending topic offering a large assortment of analyses designed to drive evidence-based decision-making both for the private business/industrial and the public/social sectors. Nevertheless, it raises some privacy issues that call for the definition of new de-identification techniques and procedures to guarantee the individual’s privacy.
File in questo prodotto:
File Dimensione Formato  
Wiley - Big Data.pdf

accesso chiuso

Tipologia: Documento in versione editoriale
Dimensione 105.65 kB
Formato Adobe PDF
105.65 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11567/919302
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact