Question - 1
Data in ___________ bytes size is called Big Data.
Answer- C
Question - 2
Transaction data of the bank is?
Structured data
Unstructured data
Both A and B
None of these
Show Answer
Solutions
Answer- A
Question - 3
The overall percentage of the world's total data that has been created just within the past two years is?
Answer- C
Question - 4
By 2027, the volume of data produced digitally will reach:
Answer- C
Question - 5
For drawing insights for business what are needed?
Collecting the data
Storing the data
Analyszing the data
All of the above
Show Answer
Solutions
Answer- D
Question - 6
____________is an open-source framework for storing data and running application on clusters of commodity hardware.
HDFS
Hadoop
MapReduce
Cloud
Show Answer
Solutions
Answer- B
Question - 7
Tweets stored in a flat file
A collection of image files in a directory
An extract of rows from a database table stored in a CSV formatted file
All of the above
Show Answer
Solutions
Answer- D
Question - 8
Multiple schemas
Multiple formats and types of data
Multiple Data Models
None of these
Show Answer
Solutions
Answer- B
Question - 9
Big data analysis does the following except:
Collects data
Spreads data
Organizes data
Analyzes data
Show Answer
Solutions
Answer- B
Question - 10
The new source of big data that will trigger a Big Data revolution in the years to come is:
Business transactions
Social media
Transactional data and sensor data
RDBMS
Show Answer
Solutions
Answer- C
Question - 11
The unit of data that flows through a Flume agent is:
Answer- C
Question - 12
The word 'Big Data' was coined in the year:
Answer- D
Question - 13
The feature of big data that refers to the quality of the stored data is ______.
Variety
Volume
Variability
Veracity
Show Answer
Solutions
Answer- D
Question - 14
Input to the __________ is the sorted output of the mappers.
Reducer
Mapper
Shuffle
None of these
Show Answer
Solutions
Answer- A
Question - 15
A ________ serves as the master and there is only one NameNode per cluster.
Data node
Name node
Data block
Replication
Show Answer
Solutions
Answer- B
Question - 16
Apache Kafka is an open-source platform that was created by?
LinkedIn
Facebook
Google
IBM
Show Answer
Solutions
Answer- A
Question - 17
What are the main components of Big Data?
MapReduce
HDFS
YARN
All of the above
Show Answer
Solutions
Answer- D
Question - 18
HDFS works in a __________ fashion.
master-worker
master-slave
worker/slave
All of the above
Show Answer
Solutions
Answer- A
Question - 19
HDFS works in a __________ fashion.
HDFS is not suitable for scenarios requiring multiple/simultaneous writes to the same file
HDFS is suitable for storing data related to applications requiring low latency data access
HDFS is suitable for storing data related to applications requiring high latency data access
None of these
Show Answer
Solutions
Answer- A
Question - 20
HDFS provides a command line interface called __________ used to interact with HDFS.
HDFS Shell
FB Shell
DFSA Shell
None
Show Answer
Solutions
Answer- B
Practice more set questions