MIS-101 (MMS-232): Topic 10: What is BIG DATA? Introduction, Types, Characteristics & Example

What is Data?

Data is the quantities, characters, or symbols on which a computer performs operations, and which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.

What is Big Data?

Big Data is also data, but at a huge size. The term describes a collection of data that is huge in volume and still growing exponentially with time. In short, such data is so large and complex that traditional data management tools cannot store or process it efficiently.

Examples Of Big Data

The following are some examples of Big Data:

The New York Stock Exchange generates about one terabyte of new trade data per day.

Social Media

Statistics show that 500+ terabytes of new data are ingested into the databases of the social media site Facebook every day. This data is mainly generated through photo and video uploads, message exchanges, comments, and so on.

A single jet engine can generate 10+ terabytes of data in 30 minutes of flight time. With many thousands of flights per day, data generation reaches many petabytes.
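The jet engine figure can be turned into a rough daily estimate. The sketch below is only a back-of-envelope calculation: the 10 terabytes per 30 minutes comes from the text, while the number of flights, the average flight duration, and the engines per aircraft are hypothetical values chosen for illustration, not measured data.

```python
TB_PER_HALF_HOUR = 10  # from the text: 10+ terabytes per 30 minutes of flight


def daily_engine_data_pb(flights_per_day, avg_flight_hours, engines_per_plane):
    """Estimate daily jet engine data in petabytes (1 PB = 1000 TB)."""
    tb_per_engine_hour = TB_PER_HALF_HOUR * 2
    total_tb = (
        tb_per_engine_hour * avg_flight_hours * engines_per_plane * flights_per_day
    )
    return total_tb / 1000


# Hypothetical inputs: 5,000 flights per day, 2 hours each, 2 engines per plane.
estimate_pb = daily_engine_data_pb(
    flights_per_day=5_000, avg_flight_hours=2, engines_per_plane=2
)
print(f"Estimated daily volume: {estimate_pb} PB")
```

Even with these modest assumed inputs, the result lands in the hundreds of petabytes per day, which is consistent with the "many petabytes" claim.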

Types Of Big Data

Big Data can be found in three forms:

Structured
Unstructured
Semi-structured

Structured

Any data that can be stored, accessed, and processed in a fixed format is termed 'structured' data. Over time, computer science has developed increasingly successful techniques for working with this kind of data (where the format is well known in advance) and for deriving value from it. However, issues now arise when the size of such data grows to a huge extent, with typical sizes in the range of multiple zettabytes.

Do you know? 10²¹ bytes equal 1 zettabyte. In other words, one billion terabytes make up one zettabyte.

Looking at these figures one can easily understand why the name Big Data is given and imagine the challenges involved in its storage and processing.
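To make the units concrete, here is a small conversion sketch using decimal (powers of ten) units, which is the convention the zettabyte figure above follows. The `UNITS` table and the `convert` helper are names introduced here for illustration only.

```python
# Decimal storage units expressed in bytes.
UNITS = {
    "KB": 10**3,
    "MB": 10**6,
    "GB": 10**9,
    "TB": 10**12,
    "PB": 10**15,
    "EB": 10**18,
    "ZB": 10**21,
}


def convert(value, from_unit, to_unit):
    """Convert a quantity between two decimal storage units."""
    return value * UNITS[from_unit] / UNITS[to_unit]


# One zettabyte expressed in terabytes: one billion.
print(convert(1, "ZB", "TB"))
# Facebook's 500 terabytes per day expressed in petabytes.
print(convert(500, "TB", "PB"))
```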

Do you know? Data stored in a relational database management system is one example of 'structured' data.

Examples Of Structured Data

An 'Employee' table in a database is an example of Structured Data

Employee_ID	Employee_Name	Gender	Department	Salary_In_lacs
2365	Rajesh Kulkarni	Male	Finance	650000
3398	Pratibha Joshi	Female	Admin	650000
7465	Shushil Roy	Male	Admin	500000
7500	Shubhojit Das	Male	Finance	500000
7699	Priya Sane	Female	Finance	550000
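As a sketch of what "fixed format" means in practice, the table above can be loaded into a relational database and queried. The example uses Python's built-in `sqlite3` module with an in-memory database. The lower-case column names and the `average_salary_by_department` helper are choices made for this illustration, and the salary figures are copied from the table as they appear, without interpreting their unit.

```python
import sqlite3

# Rows copied from the 'Employee' table above.
EMPLOYEES = [
    (2365, "Rajesh Kulkarni", "Male", "Finance", 650000),
    (3398, "Pratibha Joshi", "Female", "Admin", 650000),
    (7465, "Shushil Roy", "Male", "Admin", 500000),
    (7500, "Shubhojit Das", "Male", "Finance", 500000),
    (7699, "Priya Sane", "Female", "Finance", 550000),
]


def build_employee_db():
    """Create an in-memory database with a fixed schema and load the rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """CREATE TABLE employee (
               employee_id   INTEGER PRIMARY KEY,
               employee_name TEXT    NOT NULL,
               gender        TEXT    NOT NULL,
               department    TEXT    NOT NULL,
               salary        INTEGER NOT NULL
           )"""
    )
    conn.executemany("INSERT INTO employee VALUES (?, ?, ?, ?, ?)", EMPLOYEES)
    conn.commit()
    return conn


def average_salary_by_department(conn):
    """Return {department: average salary} using a plain SQL aggregate."""
    rows = conn.execute(
        "SELECT department, AVG(salary) FROM employee "
        "GROUP BY department ORDER BY department"
    ).fetchall()
    return dict(rows)


conn = build_employee_db()
averages = average_salary_by_department(conn)
print(averages)
```

Because every row follows the same schema, a single declarative query is enough to aggregate the data. This is the property that unstructured data lacks.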

Unstructured

Any data whose form or structure is unknown is classified as unstructured data. In addition to its huge size, unstructured data poses multiple challenges when it comes to processing it to derive value. A typical example of unstructured data is a heterogeneous data source containing a combination of simple text files, images, videos, etc. Nowadays organizations have a wealth of data available to them but, unfortunately, do not know how to derive value from it because the data is in its raw, unstructured form.

Examples Of Unstructured Data

The output returned by 'Google Search'

Semi-structured

Semi-structured data can contain both forms of data. It looks structured in form, but it is not actually defined by, for example, a table definition in a relational DBMS. An example of semi-structured data is data represented in an XML file.

Examples Of Semi-structured Data

Personal data stored in an XML file:

<rec><name>Prashant Rao</name><sex>Male</sex><age>35</age></rec>
<rec><name>Seema R.</name><sex>Female</sex><age>41</age></rec>
<rec><name>Satish Mane</name><sex>Male</sex><age>29</age></rec>
<rec><name>Subrato Roy</name><sex>Male</sex><age>26</age></rec>
<rec><name>Jeremiah J.</name><sex>Male</sex><age>35</age></rec>
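The records above carry their own tags but no table definition, so a program has to discover the fields while parsing. The following sketch reads them with Python's standard `xml.etree.ElementTree` module. Because the fragment has no single root element, it is wrapped in a hypothetical `<people>` root before parsing. That wrapper and the `parse_records` helper are assumptions of this example, not part of the original data.

```python
import xml.etree.ElementTree as ET

# The XML records exactly as listed above.
RECORDS = """\
<rec><name>Prashant Rao</name><sex>Male</sex><age>35</age></rec>
<rec><name>Seema R.</name><sex>Female</sex><age>41</age></rec>
<rec><name>Satish Mane</name><sex>Male</sex><age>29</age></rec>
<rec><name>Subrato Roy</name><sex>Male</sex><age>26</age></rec>
<rec><name>Jeremiah J.</name><sex>Male</sex><age>35</age></rec>
"""


def parse_records(fragment):
    """Parse a run of <rec> elements into a list of dictionaries."""
    # A well-formed XML document needs one root, so add a wrapper element.
    root = ET.fromstring(f"<people>{fragment}</people>")
    people = []
    for rec in root.findall("rec"):
        people.append(
            {
                "name": rec.findtext("name"),
                "sex": rec.findtext("sex"),
                "age": int(rec.findtext("age")),
            }
        )
    return people


people = parse_records(RECORDS)
average_age = sum(p["age"] for p in people) / len(people)
print(len(people), "records, average age", average_age)
```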

Data Growth over the years

Please note that web application data, which is unstructured, consists of log files, transaction history files, etc. OLTP systems are built to work with structured data, in which data is stored in relations (tables).

Characteristics Of Big Data

1. Volume:

When we talk about Big Data, volume is probably the very first criterion to consider. The volume of the data determines whether it should be considered 'big' or not. Usually, data is considered Big Data from a volume perspective only when its volume goes beyond gigabytes. What does the measurement signify here? It could be terabytes, petabytes, or exabytes. These volume figures are based on data surveys of different organizations. For example, IDC estimated that by 2020, B2B and B2C business transactions on the internet would reach 450 billion per day.

This is also the purpose of distinguishing data of such enormous size, as Big Data, from traditional structured data. In addition, RDBMSs, or traditional database systems, cannot process or handle this data efficiently because of the extended query time, cost, reliability concerns, and so on.

2. Velocity:

Stream analytics is a popular term today; it refers to processing high-speed data with dedicated tools. But do you know which characteristic of big data stream analytics is associated with? No doubt, it is the velocity of data. Here velocity means the speed at which data is generated and how frequently it is delivered and analyzed.

The amount of data generated today is massive. Most importantly, it needs real-time processing for analysis. For example, Google alone handles more than 40k search queries per second. Hence, we can imagine how fast the processing must be to get insights from the data.
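One common way to measure velocity on a live stream is a sliding-window counter that keeps only the most recent events. The sketch below is a minimal, single-threaded illustration. The `SlidingWindowCounter` class, its method names, and the simulated stream of one event per millisecond are assumptions made for this example. It also converts the 40k queries per second figure into a daily total.

```python
from collections import deque


class SlidingWindowCounter:
    """Track the event rate over the most recent `window_ms` milliseconds."""

    def __init__(self, window_ms):
        self.window_ms = window_ms
        self._timestamps = deque()

    def record(self, timestamp_ms):
        """Register one event; timestamps must arrive in non-decreasing order."""
        self._timestamps.append(timestamp_ms)
        self._evict(timestamp_ms)

    def _evict(self, now_ms):
        # Keep only events inside the half-open window (now - window, now].
        cutoff = now_ms - self.window_ms
        while self._timestamps and self._timestamps[0] <= cutoff:
            self._timestamps.popleft()

    def events_per_second(self, now_ms):
        self._evict(now_ms)
        return len(self._timestamps) * 1000 / self.window_ms


# Simulated stream: one event per millisecond for five seconds.
counter = SlidingWindowCounter(window_ms=1000)
for t in range(1, 5001):
    counter.record(t)
rate = counter.events_per_second(now_ms=5000)
print(rate, "events per second")

# 40,000 queries per second sustained over a day (86,400 seconds).
QUERIES_PER_DAY = 40_000 * 86_400
print(QUERIES_PER_DAY, "queries per day")
```

Evicting old timestamps on every call keeps memory bounded by the window size rather than by the full history of the stream.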

3. Variety:

Big data deals with any data format: structured, unstructured, semi-structured, or even very complex structured data. Storing and processing unformatted data through an RDBMS is therefore not easy. However, such unstructured data can provide valuable insights that we rarely get from structured data. Besides, a variety of data also means a variety of data sources, so this characteristic of big data also provides information about the data sources.

4. Veracity:

Not all data that arrives for processing is valuable. So, unless the data is cleansed correctly, it is not wise to store or process all of it, especially when the volume is so massive. This is where the veracity dimension of big data comes in. This characteristic also helps to determine whether the data comes from a reliable source and whether it is the right fit for the analytic model.
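A cleansing step of the kind described here might look like the sketch below. The validation rules (a non-empty name, an integer age between 0 and 120) and the sample records are hypothetical and chosen only to illustrate the idea. Real pipelines define such rules per data source.

```python
def is_valid(record):
    """Check that a record has a usable name and a plausible age."""
    name = record.get("name")
    age = record.get("age")
    if not isinstance(name, str) or not name.strip():
        return False
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(age, bool) or not isinstance(age, int):
        return False
    return 0 <= age <= 120


def cleanse(records):
    """Drop invalid records and duplicates, keeping the first occurrence."""
    seen = set()
    clean = []
    for record in records:
        if not is_valid(record):
            continue
        key = (record["name"].strip().lower(), record["age"])
        if key in seen:
            continue
        seen.add(key)
        clean.append({"name": record["name"].strip(), "age": record["age"]})
    return clean


# Made-up dirty input for illustration.
raw = [
    {"name": "Seema R.", "age": 41},
    {"name": "  seema r. ", "age": 41},  # duplicate after normalization
    {"name": "", "age": 30},             # missing name
    {"name": "Satish Mane", "age": -5},  # impossible age
    {"name": "Subrato Roy"},             # missing age
    {"name": "Jeremiah J.", "age": 35},
]
cleaned = cleanse(raw)
print(cleaned)
```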

5. Variability:

In Big Data analysis, data inconsistency is a common scenario because the data comes from different sources and contains different data types. Hence, anomaly and outlier detection are essential for getting meaningful data out of that enormous amount of data. So, variability is considered one of the characteristics of big data.
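Outlier detection can be sketched with a simple z-score test: a value is flagged when it lies more than a chosen number of standard deviations from the mean. This is only one of many possible techniques. The `find_outliers` helper, the threshold of 2.0, and the sample readings are assumptions made for this illustration.

```python
from statistics import mean, pstdev


def find_outliers(values, threshold=2.0):
    """Return the values whose z-score magnitude exceeds `threshold`."""
    if len(values) < 2:
        return []
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


# Made-up sensor readings with one obviously inconsistent value.
readings = [10, 12, 11, 13, 12, 11, 10, 95]
outliers = find_outliers(readings)
print(outliers)
```

The z-score is easy to compute but is itself distorted by extreme values, so median-based measures are often preferred on heavily skewed data.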

6. Value:

The primary interest in big data is probably its business value. Perhaps this is the most crucial characteristic of big data, because unless you get business insights out of it, the other characteristics of big data have no meaning.

Source:
https://www.guru99.com/what-is-big-data.html
https://www.motivaction.nl/en/news/blog/big-data-the-6-vs-you-need-to-look-at-for-important-insights
