APACHE FLUME QUIZ DESCRIPTION

Apache Flume was introduced in

  • 2015
  • 2016
  • 2017
  • 2018

What is true about Apache Flume?

  • Apache Flume is a reliable and distributed system for collecting, aggregating and moving massive quantities of log data.
  • It has a simple yet flexible architecture based on streaming data flows.
  • Apache Flume is used to collect log data present in log files from web servers and aggregate it into HDFS for analysis.
  • All of the above

What are the uses of Flume?

  • Collecting impressions from custom apps for an ad network
  • Collecting readings from network devices in order to monitor their performance
  • Flume is designed to preserve reliability, scalability, manageability, and extensibility while serving a maximum number of clients with higher QoS
  • All of the above

A number of ____________ source adapters give you the granular control to grab a specific file.

  • Multimedia file
  • Text file
  • Image file
  • None of the mentioned

List the various types of cluster managers in Spark.

  • Standalone
  • Apache Mesos
  • YARN
  • All of the above

What are possible types of channel selectors?

  • Default channel selectors
  • Multiplexing channel selectors
  • Both A and B
  • None
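
For readers who want to see how the two selector kinds named above differ, here is a minimal Python sketch. It assumes the Flume user guide's description that the default selector is replicating (every event goes to all channels of the source) and that a multiplexing selector routes on one event header. The function names and the dict-based event are hypothetical illustrations, not Flume's Java API.

```python
def replicating_select(event, channels):
    """Replicating (the default): every event goes to all configured channels."""
    return list(channels)


def multiplexing_select(event, header, mapping, default):
    """Multiplexing: choose channels from the value of a single event header.

    mapping is a hypothetical dict of header value -> list of channel names;
    default is the list used when the header is missing or unmapped.
    """
    value = event.get("headers", {}).get(header)
    return list(mapping.get(value, default))
```

With a mapping such as `{"CZ": ["c1"]}` on a header named `state`, an event carrying `state=CZ` would go to `c1` only, while an event without that header would fall back to the default channels.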

This gathering of data can be?

  • scheduled
  • event-driven
  • user-defined
  • Both A and B

Point out the wrong statement.

  • Version 1.4.0 is the fourth Flume release as an Apache top-level project
  • Apache Flume 1.5.2 is a security and maintenance release that disables SSLv3 on all components in Flume that support SSL/TLS
  • Flume is backwards-compatible with previous versions of the Flume 1.x codeline
  • None of the mentioned

What are the important steps in the configuration?

  • Every sink must have exactly one channel
  • Every component must have a specific type
  • Both A and B
  • None

Point out the wrong statement.

  • Version 1.4.0 is the fourth Flume release as an Apache top-level project
  • Apache Flume 1.5.2 is a security and maintenance release that disables SSLv3 on all components in Flume that support SSL/TLS
  • Flume is backwards-compatible with previous versions of the Flume 1.x codeline
  • Flume deploys as one or more agents, each contained within its own instance of chunks

____ is used when you want the sink to be the input source for another operation.

  • Collector Tier Event
  • Agent Tier Event
  • Basic
  • All of the mentioned

_____ was created to allow you to flow data from a source into your Hadoop environment.

  • Impala
  • Oozie
  • Flume
  • All of the mentioned

___________ is where you would land a flow (or possibly multiple flows joined together) into an HDFS-formatted file system.

  • Collector Tier Event
  • Agent Tier Event
  • Basic
  • All of the mentioned

What are the advantages of using Flume?

  • Using Apache Flume, we can store data in any of the centralized stores (HBase, HDFS)
  • When the rate of incoming data exceeds the rate at which data can be written to the destination, Flume acts as a mediator between data producers and the centralized stores and provides a steady flow of data between them
  • Both A and B
  • None
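
The mediator role described in the second option can be pictured as a bounded buffer between a bursty producer and a slower store. The Python sketch below is a simplified illustration under that assumption. `BufferChannel`, `put` and `take` are hypothetical names and do not mirror Flume's actual channel API.

```python
from collections import deque


class BufferChannel:
    """Hypothetical bounded in-memory buffer between a producer and a store."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.queue = deque()

    def put(self, event):
        # Refuse the event when the buffer is full so the producer can retry later.
        if len(self.queue) >= self.capacity:
            return False
        self.queue.append(event)
        return True

    def take(self, batch_size):
        # Hand the consumer at most batch_size events, oldest first.
        count = min(batch_size, len(self.queue))
        return [self.queue.popleft() for _ in range(count)]
```

A producer that bursts faster than the store can write fills the buffer and is told to back off, while the consumer keeps draining fixed-size batches at its own pace.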

Apache Flume is written in

  • JavaScript
  • Java
  • Assembly Language
  • None

Name a few commonly used Spark ecosystem components.

  • Spark SQL (Shark)
  • Spark Streaming
  • GraphX
  • All of the above

Point out the correct statement.

  • Flume is a distributed, reliable, and available service
  • Version 1.5.2 is the eighth Flume release as an Apache top-level project
  • Flume 1.5.2 is production-ready software for integration with Hadoop
  • All of the above

Apache Flume 1.3.0 is the fourth release under the auspices of Apache of the so-called ____ codeline.

  • NF
  • NE
  • NG
  • NP

Flume deploys as one or more agents, each contained within its own instance of ___

  • JVM
  • Channels
  • Chunks
  • None of the mentioned

What are the important steps in the configuration?

  • Every source must have at least one channel
  • Every sink must have exactly one channel
  • Every component must have a specific type
  • All of the above
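
The three rules listed in the options above can be expressed as checks on an agent's properties file. The Python sketch below assumes the property naming used in the Flume user guide (`<agent>.sources`, `<agent>.sources.<name>.channels`, `<agent>.sinks.<name>.channel`, `<component>.type`). The helpers `parse_properties` and `check_agent` are hypothetical and written only for illustration. Flume does not ship them.

```python
def parse_properties(text):
    """Parse simple 'key = value' lines, skipping blanks and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def check_agent(props, agent):
    """Return a list of rule violations for one agent (empty list means OK)."""
    problems = []
    names = {kind: props.get(f"{agent}.{kind}", "").split()
             for kind in ("sources", "channels", "sinks")}
    # Rule: every component must have a specific type.
    for kind, components in names.items():
        for comp in components:
            if not props.get(f"{agent}.{kind}.{comp}.type"):
                problems.append(f"{kind[:-1]} {comp} has no type")
    # Rule: every source must have at least one channel.
    for src in names["sources"]:
        if not props.get(f"{agent}.sources.{src}.channels", "").split():
            problems.append(f"source {src} has no channel")
    # Rule: every sink must have exactly one channel.
    for sink in names["sinks"]:
        if len(props.get(f"{agent}.sinks.{sink}.channel", "").split()) != 1:
            problems.append(f"sink {sink} must have exactly one channel")
    return problems


# Example agent: netcat source -> memory channel -> logger sink.
EXAMPLE = """
a1.sources = r1
a1.channels = c1
a1.sinks = k1
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444
a1.sources.r1.channels = c1
a1.channels.c1.type = memory
a1.sinks.k1.type = logger
a1.sinks.k1.channel = c1
"""
```

Calling `check_agent(parse_properties(EXAMPLE), "a1")` on this example reports no violations. Deleting the `a1.sinks.k1.channel` line would make the sink rule fail.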

Flume can also be used to transport event data including, but not limited to, network traffic data, data generated by social media websites, and email messages.

  • TRUE
  • FALSE
  • Can be true or false
  • Cannot say

____ sink can be a text file, the console display, a simple HDFS path, or a null bucket where the data is simply deleted.

  • Collector Tier Event
  • Agent Tier Event
  • Basic
  • None of the mentioned

What are the tools used in big data?

  • Flume
  • Mahout
  • Sqoop
  • All of the above

What are the core components of Flume?

  • Channel
  • Agent
  • Client
  • All of these