Big Data is about the growing challenge that organizations face as they deal with large and fast-growing sources of data or information that also present a complex
range of analysis and use problems. These can include:
- Having a computing infrastructure that can ingest, validate, and analyze high volumes (size and/or rate) of data
- Assessing mixed data (structured and unstructured) from multiple sources
- Dealing with unpredictable content with no apparent schema or structure
- Enabling real-time or near-real-time collection, analysis, and answers
Big Data technologies describe a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of
data, by enabling high-velocity capture, discovery, and/or analysis.
The data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and
cell phone GPS signals to name a few. This data is big data.
Three V's of BigData
- Volume describes the amount of data generated by organizations or individuals.
- Variety describes structured and unstructured data ,such as text, sensor data, audio, video, click streams, log files and more.
- Velocity describes the frequency at which data is generated, captured and shared.