现在注册

登录

忘记密码

忘记密码? 请输入您的电子邮件地址. 您将收到一个链接,将创建通过电子邮件新密码.

添加后

你必须登录后添加 .

添加问题

您必须登录才能提问.

登录

现在注册

欢迎Scholarsark.com! 您的注册将授予您访问使用该平台的更多功能. 你可以问问题, 做出贡献或提供答案, 查看其他用户以及更多的个人资料. 现在注册!

LinkedIn 技能评估答案和问题 — Hadoop

Hadoop is a popular framework for processing large-scale data sets using distributed computing. Many companies use Hadoop to store, 管理和分析他们的数据, 他们需要能够使用这项技术的熟练专业人员. If you want to prove your Hadoop skills and get certified by LinkedIn, you need to pass the LinkedIn skill assessment test 对于 Hadoop. This test consists of multiple-choice questions that cover various topics related to Hadoop, such as its architecture, 组件, 命令, 配置, 和更多.

To help you prepare for this test, I have compiled a list of questions and answers that you may encounter in the exam. These questions and answers are based on my own experience and research, and they are not official or endorsed by LinkedIn. 然而, they can give you an idea of what to expect and how to answer the questions correctly. 这里有 问题答案 为了 LinkedIn skill assessment test for Hadoop.

第一季度. Partitioner controls the partitioning of what data?

  • final keys
  • final values
  • intermediate keys
  • intermediate values

Q2. SQL Windowing functions are implemented in Hive using which keywords?

  • UNION DISTINCT, RANK
  • 超过, RANK
  • 超过, EXCEPT
  • UNION DISTINCT, RANK

Q3. Rather than adding a Secondary Sort to a slow Reduce job, it is Hadoop best practice to perform which optimization?

  • Add a partitioned shuffle to the Map job.
  • Add a partitioned shuffle to the Reduce job.
  • Break the Reduce job into multiple, chained Reduce jobs.
  • Break the Reduce job into multiple, chained Map jobs.

第四季度. Hadoop Auth enforces authentication on protected resources. Once authentication has been established, it sets what type of authenticating cookie?

  • encrypted HTTP
  • unsigned HTTP
  • compressed HTTP
  • signed HTTP

Q5. MapReduce jobs can be written in which language?

  • Java or Python
  • SQL only
  • SQL or Java
  • Python or SQL

Q6. To perform local aggregation of the intermediate outputs, MapReduce users can optionally specify which object?

  • Reducer
  • Combiner
  • Mapper
  • 柜台

Q7. To verify job status, look for the value ___ 在里面 ___.

  • SUCCEEDED; syslog
  • SUCCEEDED; stdout
  • DONE; syslog
  • DONE; stdout

Q8. Which line of code implements a Reducer method in MapReduce 2.0?

  • public void reduce(Text key, Iterator values, Context context){...}
  • public static void reduce(Text key, IntWritable[] 值, Context context){...}
  • public static void reduce(Text key, Iterator values, Context context){...}
  • public void reduce(Text key, IntWritable[] 值, Context context){...}

Q9. To get the total number of mapped input records in a map job task, you should review the value of which counter?

  • FileInputFormatCounter
  • FileSystemCounter
  • JobCounter
  • TaskCounter (NOT SURE)

辅酶Q10. Hadoop Core supports which CAP capabilities?

  • 一个, P
  • C, 一个
  • C, P
  • C, 一个, P

Q11. What are the primary phases of a Reducer?

  • 结合, 地图, and reduce
  • shuffle, 种类, and reduce
  • reduce, 种类, and combine
  • 地图, 种类, and combine

Q12. To set up Hadoop workflow with synchronization of data between jobs that process tasks both on disk and in memory, 使用 ___ 服务, 这是 ___.

  • Oozie; 开源
  • Oozie; commercial software
  • Zookeeper; commercial software
  • Zookeeper; 开源

Q13. For high availability, which type of multiple nodes should you use?

  • 数据
  • 名称
  • 记忆
  • worker

Q14. DataNode supports which type of drives?

  • hot swappable
  • cold swappable
  • warm swappable
  • non-swappable

Q15. Which method is used to implement Spark jobs?

  • on disk of all workers
  • on disk of the master node
  • in memory of the master node
  • in memory of all workers

Q16. In a MapReduce job, where does the map() function run?

  • on the reducer nodes of the cluster
  • on the data nodes of the cluster (NOT SURE)
  • on the master node of the cluster
  • on every node of the cluster

Q17. To reference a master file for lookups during Mapping, what type of cache should be used?

  • distributed cache
  • local cache
  • partitioned cache
  • cluster cache

Q18. Skip bad records provides an option where a certain set of bad input records can be skipped when processing what type of data?

  • cache inputs
  • reducer inputs
  • intermediate values
  • map inputs

Q19. Which command imports data to Hadoop from a MySQL database?

  • spark importconnect jdbc:mysql://mysql.example.com/sparkusername sparkwarehouse-dir user/hue/oozie/deployments/spark
  • sqoop importconnect jdbc:mysql://mysql.example.com/sqoopusername sqoopwarehouse-dir user/hue/oozie/deployments/sqoop
  • sqoop importconnect jdbc:mysql://mysql.example.com/sqoopusername sqooppassword sqoopwarehouse-dir user/hue/oozie/deployments/sqoop
  • spark importconnect jdbc:mysql://mysql.example.com/sparkusername sparkpassword sparkwarehouse-dir user/hue/oozie/deployments/spark

Q20. In what form is Reducer output presented?

  • compressed (NOT SURE)
  • sorted
  • not sorted
  • encrypted

Q21. Which library should be used to unit test MapReduce code?

  • JUnit
  • XUnit
  • MRUnit
  • HadoopUnit

Q22. If you started the NameNode, then which kind of user must you be?

  • hadoop-user
  • super-user
  • node-user
  • admin-user

Q23. State _ between the JVMs in a MapReduce job

Q24. To create a MapReduce job, what should be coded first?

  • a static job() 方法
  • a Job class and instance (NOT SURE)
  • a job() 方法
  • a static Job class

Q25. To connect Hadoop to AWS S3, which client should you use?

  • S3A
  • S3N
  • S3
  • the EMR S3

Q26. HBase works with which type of schema enforcement?

  • schema on write
  • no schema
  • external schema
  • schema on read

Q27. HDFS files are of what type?

  • read-write
  • read-only
  • write-only
  • append-only

Q28. A distributed cache file path can originate from what location?

  • hdfs or top
  • HTTP
  • hdfs or http
  • hdfs

Q29. Which library should you use to perform ETL-type MapReduce jobs?

  • 蜂巢
  • Pig
  • Impala
  • Mahout

Q30. What is the output of the Reducer?

  • a relational table
  • an update to the input file
  • 一个, combined list
  • a set of <钥匙, 值> 对

map function processes a certain key-value pair and emits a certain number of key-value pairs and the Reduce function processes values grouped by the same key and emits another set of key-value pairs as output.

Q31. To optimize a Mapper, what should you perform first?

  • Override the default Partitioner.
  • Skip bad records.
  • Break up Mappers that do more than one task into multiple Mappers.
  • Combine Mappers that do one task into large Mappers.

Q32. When implemented on a public cloud, with what does Hadoop processing interact?

  • files in object storage
  • graph data in graph databases
  • relational data in managed RDBMS systems
  • JSON data in NoSQL databases

Q33. In the Hadoop system, what administrative mode is used for maintenance?

  • data mode
  • safe mode
  • single-user mode
  • pseudo-distributed mode

Q34. In what format does RecordWriter write an output file?

  • <钥匙, 值> 对
  • 钥匙
  • <值, 钥匙> 对

Q35. To what does the Mapper map input key/value pairs?

  • an average of keys for values
  • a sum of keys for values
  • a set of intermediate key/value pairs
  • a set of final key/value pairs

Q36. Which Hive query returns the first 1,000 值?

  • SELECT…WHERE value = 1000
  • SELECT … LIMIT 1000
  • SELECT TOP 1000 ...
  • SELECT MAX 1000…

Q37. To implement high availability, how many instances of the master node should you configure?

Q38. Hadoop 2.x and later implement which service as the resource coordinator?

  • 州长
  • JobManager
  • JobTracker
  • YARN

Q39. In MapReduce, _ have _

  • 任务; 工作
  • 工作; 活动
  • 工作; 任务
  • 活动; 任务

Q40. What type of software is Hadoop Common?

  • 数据库
  • distributed computing framework
  • 操作系统
  • productivity tool

Q41. If no reduction is desired, you should set the numbers of _ tasks to zero.

  • combiner
  • reduce
  • mapper
  • 中间

Q42. MapReduce applications use which of these classes to report their statistics?

  • mapper
  • reducer
  • combiner
  • counter

Q43. _ is the query language, and _ is storage for NoSQL on Hadoop.

  • 高密度文件系统; HQL
  • HQL; HBase
  • 高密度文件系统; 查询语句
  • 查询语句; HBase

Q44. MapReduce 1.0 _ YARN.

  • does not include
  • is the same thing as
  • 包括
  • replaces

Q45. Which type of Hadoop node executes file system namespace operations like opening, closing, and renaming files and directories?

  • ControllerNode
  • DataNode
  • MetadataNode
  • NameNode

Q46. HQL queries produce which job types?

  • Impala
  • MapReduce
  • 火花
  • Pig

Q47. Suppose you are trying to finish a Pig script that converts text in the input string to uppercase. What code is needed on line 2 下面?

1 data = LOAD ‘/user/hue/pig/examples/data/midsummer.txt’… 2

  • 如 (文本:CHAR[]); upper_case = FOREACH data GENERATE org.apache.pig.piggybank.evaluation.string.UPPER(文本);
  • 如 (文本:CHARARRAY); upper_case = FOREACH data GENERATE org.apache.pig.piggybank.evaluation.string.UPPER(文本);
  • 如 (文本:CHAR[]); upper_case = FOREACH data org.apache.pig.piggybank.evaluation.string.UPPER(文本);
  • 如 (文本:CHARARRAY); upper_case = FOREACH data org.apache.pig.piggybank.evaluation.string.UPPER(文本);

Q48. In a MapReduce job, which phase runs after the Map phase completes?

  • Combiner
  • Reducer
  • Map2
  • Shuffle and Sort

Q49. Where would you configure the size of a block in a Hadoop environment?

  • dfs.block.size in hdfs-site.xmls
  • orc.write.variable.length.blocks in hive-default.xml
  • mapreduce.job.ubertask.maxbytes in mapred-site.xml
  • hdfs.block.size in hdfs-site.xml

辅酶50. Hadoop systems are _ RDBMS systems.

  • replacements for
  • not used with
  • substitutes for
  • additions for

Q51. Which object can be used to distribute jars or libraries for use in MapReduce tasks?

  • distributed cache
  • library manager
  • lookup store
  • registry

Q52. To view the execution details of an Impala query plan, which function would you use?

  • explain
  • query action
  • detail
  • query plan

Q53. Which feature is used to roll back a corrupted HDFS instance to a previously known good point in time?

  • partitioning
  • 快照
  • 复制
  • 高可用性

参考

Q54. Hadoop Common is written in which language?

  • C ++
  • C
  • Haskell
  • Java的

Q55. Which file system does Hadoop use for storage?

  • NAS
  • FAT
  • 高密度文件系统
  • NFS

Q56. What kind of storage and processing does Hadoop support?

  • encrypted
  • verified
  • 分散式
  • remote

Q57. Hadoop Common consists of which components?

  • Spark and YARN
  • HDFS and MapReduce
  • HDFS and S3
  • Spark and MapReduce

Q58. Most Apache Hadoop committerswork is done at which commercial company?

  • Cloudera
  • 微软
  • 谷歌
  • 亚马逊

Q59. To get information about Reducer job runs, which object should be added?

  • Reporter
  • IntReadable
  • IntWritable
  • 作家

Q60. After changing the default block size and restarting the cluster, to which data does the new size apply?

  • all data
  • no data
  • existing data
  • new data

Q61. Which statement should you add to improve the performance of the following query?

SELECT
  c.id,
  c.name,
  c.email_preferences.categories.surveys
FROM customers c;
  • 通过...分组
  • 筛选
  • SUB-SELECT
  • SORT

Q62. What custom object should you implement to reduce IO in MapReduce?

  • Comparator
  • Mapper
  • Combiner
  • Reducer

Q63. You can optimize Hive queries using which method?

  • secondary indices
  • summary statistics
  • column-based statistics
  • a primary key index

Q64. If you are processing a single action on each input, what type of job should you create?

  • partition-only
  • map-only
  • reduce-only
  • combine-only

Q65. The simplest possible MapReduce job optimization is to perform which of these actions?

  • Add more master nodes.
  • Implement optimized InputSplits.
  • Add more DataNodes.
  • Implement a custom Mapper.

Q66. When you implement a custom Writable, you must also define which of these object?

  • a sort policy
  • a combiner policy
  • a compression policy
  • a filter policy

Q67. To copy a file into the Hadoop file system, what command should you use?

  • hadoop fs -copy
  • hadoop fs -copy
  • hadoop fs -copyFromLocal
  • hadoop fs -copyFromLocal

Q68. Delete a Hive _ table and you will delete the table _.

  • managed; metadata
  • external; data and metadata
  • external; metadata
  • managed; 数据

Q69. To see how Hive executed a JOIN operation, use the _ statement and look for the _ value.

  • EXPLAIN; JOIN Operator
  • QUERY; MAP JOIN Operator
  • EXPLAIN; MAP JOIN Operator
  • QUERY; JOIN Operator

Q70. Pig operates in mainly how many nodes?

  • Five

Q71. After loading data, _ and then run a(n) _ query for interactive queries.

  • invalidate metadata; Impala
  • validate metadata; Impala
  • invalidate metadata; 蜂巢
  • validate metadata; 蜂巢

Q72. In Hadoop MapReduce job code, what must be static?

  • 配置
  • Mapper and Reducer
  • Mapper
  • Reducer

参考

Q73. In Hadoop simple mode, which object determines the identity of a client process?

  • Kerberos ticket
  • kubernetes token
  • guest operating system
  • host operating system

参考

Q74. Which is not a valid input format for a MapReduce job?

  • FileReader
  • CompositeInputFormat
  • RecordReader
  • TextInputFormat

参考

Q75. If you see org.apache.hadoop.mapred, which version of MapReduce are you working with?

  • 1.X
  • 0.X
  • 2.X
  • 3.X

参考

作者

  • 海伦·贝西

    你好, I'm Helena, 一位热衷于在教育领域发布有洞察力内容的博客作者. 我相信教育是个人和社会发展的关键, 我想与所有年龄和背景的学习者分享我的知识和经验. 在我的博客上, 您会找到有关学习策略等主题的文章, 在线教育, 职业指导, 和更多. 我也欢迎读者的反馈和建议, 所以请随时发表评论或联系我. 我希望您喜欢阅读我的博客并发现它有用且鼓舞人心.

    查看所有帖子

关于 海伦·贝西

你好, I'm Helena, 一位热衷于在教育领域发布有洞察力内容的博客作者. 我相信教育是个人和社会发展的关键, 我想与所有年龄和背景的学习者分享我的知识和经验. 在我的博客上, 您会找到有关学习策略等主题的文章, 在线教育, 职业指导, 和更多. 我也欢迎读者的反馈和建议, 所以请随时发表评论或联系我. 我希望您喜欢阅读我的博客并发现它有用且鼓舞人心.

发表评论