MINISTRY OF HIGHER EDUCATION AND SCIENTIFIC RESEARCH
UNIVERSITY OF SOUSSE
Ecole Nationale d’Ingénieurs de Sousse
EXAM SHEET
Sheet no.:
Total number of sheets:
Invigilators:
Last name: ............................................................
First name: ....................................................... Secret identifier:
CIN no.: ......:......:......:......:......:......:......:…… :
Exam: Big Data
Do not write here
Specialty: ..IA2, GT2........ Session: main Academic year: 20-21 Group: .............................
---------------------------------------------------------------------------------------------------------------------------------------------------------
True or False? Creating users through the Ambari UI will also create the user on the HDFS.
A. True
B. False
True or False? You can use curl to issue commands to Ambari.
A. True
B. False
Apache Spark can run on which two of the following cluster managers? Select the TWO answers that apply.
A. Nomad
B. Linux Cluster Manager
C. oneSIS
D. Apache Mesos
E. Hadoop YARN
Hadoop 2 consists of which three open-source sub-projects maintained by the Apache Software Foundation?
Select the THREE answers that apply.
A. Cloudbreak
B. HDFS
C. MapReduce
D. Hive
E. Big SQL
F. YARN
If a Hadoop node goes down, which Ambari component will notify the Administrator?
A. Ambari Alert Framework
B. Ambari Metrics System
C. Ambari Wizard
D. REST API
Which component of the Apache Ambari architecture integrates with an organization's LDAP or Active
Directory service?
A. REST API
B. Authorization Provider
C. Postgres RDBMS
D. Ambari Alert Framework
What is an example of a key-value type of NoSQL datastore?
A. Sesame
B. MongoDB
C. Neo4j
D. Redis
Apache Spark provides a single, unifying platform for which three of the following types of operations? Select
the THREE answers that apply.
A. graph operations
B. batch processing
C. record locking
D. machine learning
E. ACID transactions
F. transaction processing
What are two ways the command-line parameters for a Sqoop invocation can be simplified?
A. Use the --import-command line argument.
B. Run Sqoop using the vi editor.
C. Place the commands in a file.
D. Include the --options-file command line argument.
Which statement is true about the Combiner phase of the MapReduce architecture?
A. It aggregates all input data before it goes through the Map phase.
B. It reduces the amount of data that is sent to the Reducer task nodes.
C. It determines the size and distribution of data split in the Map phase.
D. It is performed after the Reducer phase to produce the final output.
What command is used to list the "magic" commands in Jupyter?
A. %dirmagic
B. %list-all-magic
C. %lsmagic
D. %list-magic
Why might a data scientist need a particular kind of GPU (graphics processing unit)?
A. To collect video for use in streaming data applications.
B. To perform certain data transformations quickly.
C. To display a simple bar chart of data on the screen.
D. To input commands to a data science notebook.
What does the user interface for Jupyter look like to a user?
A. Common desktop app.
B. App in web browser.
C. Database interface.
D. Linux SSH session.
Using the Java SQL Shell, which command will connect to a database called mybigdata?
A. ./jsqsh mybigdata
B. ./java mybigdata
C. ./jsqsh go mybigdata
D. ./java tables
You need to enable impersonation. Which two properties in the bigsql-conf.xml file need to be set to true?
Select the TWO answers that apply.
A. DB2COMPOPT
B. bigsql.alltables.io.doAs
C. DB2_ATS_ENABLE
D. bigsql.impersonation.create.table.grant.public
E. $BIGSQL_HOME/conf
Which directory permissions need to be set to allow all users to create their own schema?
A. 666
B. 700
C. 777
D. 755
You are creating a new table and need to store it in Parquet format. Which partial SQL statement would create the
table in Parquet format?
A. STORED AS parquet
B. CREATE AS parquetfile
C. STORED AS parquetfile
D. CREATE AS parquet
Which definition best describes RCAC?
A. It grants or revokes certain directory privileges.
B. It limits access by using views and stored procedures.
C. It grants or revokes certain user privileges.
D. It limits the rows or columns returned based on certain criteria.
Which two commands would you use to give or remove certain privileges to/from a user?
A. SELECT
B. INSERT
C. REVOKE
D. LOAD
E. GRANT
When connecting to an external database in a federation, you need to use the correct database driver and
protocol. What is this federation component called in Big SQL?
A. Wrapper
B. Data source
C. Nickname
D. User mapping
You need to determine the permission setting for a new schema directory. Which tool would you use?
A. GRANT
B. umask
C. HDFS
D. Kerberos
Where does the unstructured data of a project reside in Watson Studio?
A. Wrapper
B. Tables
C. Database
D. Object Storage
Which type of cell can be used to document and comment on a process in a Jupyter notebook?
A. Kernel
B. Markdown
C. Output
D. Code
Before you create a Jupyter notebook in Watson Studio, which two items are necessary?
A. Project
B. URL
C. Spark Instance
D. File
E. Scala
Select the storage type best suited to working with unstructured, infrequently modified, and remotely
accessed data.
A. DB2 Storage
B. File Storage
C. Object Storage
D. Block Storage
Which machine learning approach detects patterns and relationships between data without using labeled data?
A. Supervised Learning
B. Reinforcement Learning
C. Semi-supervised Learning
D. Unsupervised Learning
Which tool would you use to create a connection to your Db2 Big SQL database?
A. Db2 Big SQL console
B. Scheduler
C. Jupyter
D. Ambari
What is a "magic" command used for in Jupyter?
A. Running common statistical analyses.
B. Parsing and loading data into a notebook.
C. Extending the core language with shortcuts.
D. Autoconfiguring data connections using a registry.
Which feature makes Apache Spark much easier to use than MapReduce?
A. Suitable for transaction processing.
B. APIs for Scala, Python, C++, and .NET.
C. Applications run in-memory.
D. Libraries that support SQL queries.
What are three IBM value-add components to the Hortonworks Data Platform (HDP)?
A. Big SQL
B. Big Data
C. Big YARN
D. Big Replicate
E. Big Index
F. Big Match
Which Big SQL feature allows users to join a Hadoop data set to data in external databases?
A. Grant/Revoke privileges
B. Impersonation
C. Integration
D. Fluid query
End.