A brief introduction to Big Data
Big Data: Concepts, Applications, & Challenge
Any organization either public or private rely on accurate data analytic to take decisions. The utilization of big data is pivotal for driving organization extract value from a large amount of data. Big data is a method and technique to retrieve, collect, managed analyze a very huge volume of both structure and unstructured data that is difficult to process using traditional database which entail new technologies and technique to analyze them. This post covers the big data with the intentions to find out the concept, its applications and challenges from literature analysis as well as discussing and reviewing interpretations from these findings along with possible recommendations.
An Introduction
Big data is a term for large and complex sets of data in which traditional methods of processing data are insufficient. This essay will cover about the big data as a whole (i.e., the big picture), along with the intentions to find out the origins of the concept, its applications and challenges from findings resulted from cited sources and publications as well as discussing and reviewing interpretations from these findings along with possible recommendations. Big data need to be analyzed to gain its value either the trends, patterns or behavior anything related to the people or customers. Though data comes from the rapid growth of volume, yet it does rapidly and efficiently processing those data refers to velocity of data processing. Big data analytic leads to more precise analysis thus helps to bring more accurate decision-making and better performance. Big data are collected either through structured or unstructured data sources (online or offline). Unstructured data can come from social media (FB, Instagram, Twitter posts,). While, structured data sources can come from internal database of organization. In business, both sources are used to understand the patterns of the customers. Indeed, an organization nowdays relies the fact that any data could be analyzed and used to reveal patterns of their customers. In other words, big data will help the organization to understand the behavior of their customers and use it to win a competition.
Even though business organizations are still in early stage of perceiving big data as an asset, public agencies are still struggling with the issue of open data, whereas science and technology are exploring the potentials of big data and its innovation, yet general public are keep producing a huge amount of data in daily basis poses, challenges for all organizations. An organization faces the fact that the reality of big data that can affect their competitiveness. The aim of this study is to examine fundamental concept, applications, and challenges that are closely related to big data in organization.
Data Structures
We must first understand the new types of data structures. Traditionally, we have been focused towards structured and unstructured data. Structured data is that which is contained in relational databases and spreadsheets. Structured data conforms to a database model having a fixed structure of format of capturing data. Database tools and additional reporting and analyzing tools tools have been used to help analyze this data and creating meaningful reports.
Unstructured data doesn't have a pre-defined data model nor is it organized in a predefined manner. It is typically text heavy and may contain data such as dates, numbers, and facts and include untagged data representing photos and graphic images. Word processing documents, presentations, and PDF files are prime examples of unstructured data.
New data structures that have come up are semi-structured data and quasi-structured data.
Semi-structured data is not the raw data and is not stored in a conventional database system. It is structured data but is not organized in a rational model like a table or an object-based graph. Semi-structured data contains tags or markers to separate semantic elements and enforce hierarchies of records and fields within the data. The entities belonging to the same class may have different attributes even though they are grouped together irrespective of the attributes' order. Markup languages like XML, email, and EDI are forms of semistructured data. These support nested or hierarchical data simplifying the data models representing complex relationships between entities. These also support the lists of objects that simplify data models by avoiding messy translations of lists into a relational data model.





Comments
Post a Comment