Huawei Answers
Part 1
Part 2
PART 4
Device,edge : true
An ai processor : true
21. Which of the following is the main function of HUAWEI CLOUD GeoGenius?
22. Which of the following is not a feature of the MindSpore core architecture?
B. Automatic parallelism
C. Automatic deployment
A. Automatic differentiation
D. Automatic tuning
23. Which one of the following actions can the function tf.squeeze be used for in TensorFlow 2.0?
C. Tensor concatenation
A. Element-wise addition
D. Dimensionality reduction
24. Which of the following statements is false about the ReLU function?
A. The ReLU function is not differentiable at x = 0 and a derivative is forcibly defined at this point.
D. Compared with Sigmoid and tanh, the convergence of the ReLU function is slow.
C. The surface defined at the zero point of the ReLU function is not smooth enough in some
regression problems.
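The ReLU behavior that question 24 refers to can be sketched in plain Python. The names `relu` and `relu_grad` are illustrative, and assigning the derivative at x = 0 the value 0 is one common convention, assumed here.

```python
def relu(x: float) -> float:
    """ReLU: pass positive inputs through, clamp everything else to zero."""
    return x if x > 0 else 0.0


def relu_grad(x: float) -> float:
    """Derivative of ReLU.

    ReLU is not differentiable at x = 0, so a value has to be assigned
    there by convention (0 in this sketch, which is an assumption).
    """
    return 1.0 if x > 0 else 0.0


for x in (-2.0, 0.0, 3.0):
    print(x, relu(x), relu_grad(x))
```

In the negative half interval the derivative is 0, and in the positive half interval it is 1.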
A. MindIR
C. MindArmour
B. MindSpore Serving
D. MindInsight
26. Which of the following functions is used in HUAWEI CLOUD General Text OCR experiments?
B. ocr_client.ocr_service_base64
D. ocr.request_ocr_service_base128
A. ocr_client.request_ocr_service_base64
C. ocr_client.request_base64
D. Code complexity
C. Prediction rate
B. Explainability
A. Generalization capability
28. Dirty data refers to data with quality problems. Which of the following statements is false about
the data quality?
C. Unprocessed: Data for which feature engineering has not been performed.
A. (44,200,200,6)
B. (44,100,100,3)
C. (40,100,100,6)
D. (44,100,100,6)
C. Voiceprint recognition
A. Speech recognition
B. Text to speech
31. Which of the following is the type of labels predicted by ensemble learning algorithms?
C. Discrete
B. Continuous
32. Which operation is not a step in the network definition process during application development?
D. Model compression
B. Weight initialization
C. Scenarios
A. Data
D. Computing power
B. Algorithms
34. Intelligent quality inspection is based on the cloud-edge-device synergy of deep learning
algorithms. Which of the following operations is performed on edge devices?
C. Model delivery
A. Model training
B. Data labeling
D. Onsite inference
35. A vendor wants to provide an intelligent EMR system for a hospital. Which of the following
technologies is involved in the system?
B. Object detection
36. Which of the following statements is false about gradient descent algorithms?
B. When there are too many samples and GPUs are not used for parallel computing, the convergence
process of the global gradient descent is time-consuming.
A. The global gradient descent is more stable than the stochastic gradient descent (SGD).
C. When GPUs are not used for parallel computing, the mini-batch gradient descent (MBGD) takes
less time than the SGD to complete an epoch.
D. Each time the global descent updates its weight, all training samples need to be calculated.
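The three gradient descent variants named in question 36 differ only in how many samples feed each weight update. The following is a minimal sketch, assuming a one-parameter model y = w * x with a mean squared error loss. The function name `gradient_descent` and the toy data are illustrative.

```python
import random


def gradient_descent(xs, ys, batch_size, lr=0.05, epochs=200, seed=0):
    """Fit y = w * x by minimizing mean squared error.

    batch_size == len(xs) -> global (batch) gradient descent
    batch_size == 1       -> stochastic gradient descent (SGD)
    anything in between   -> mini-batch gradient descent (MBGD)
    Returns the learned weight and the number of weight updates performed.
    """
    rng = random.Random(seed)
    w, updates = 0.0, 0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Gradient of the mean squared error over the current batch.
            grad = sum(2 * (w * xs[i] - ys[i]) * xs[i] for i in batch) / len(batch)
            w -= lr * grad
            updates += 1
    return w, updates


xs = [0.5, 1.0, 1.5, 2.0]
ys = [2 * x for x in xs]  # noiseless toy data, true weight is 2
for name, size in (("global", 4), ("mini-batch", 2), ("SGD", 1)):
    w, updates = gradient_descent(xs, ys, batch_size=size)
    print(f"{name:10s} w={w:.4f} updates={updates}")
```

With four samples, one epoch costs one update for the global variant, two for the mini-batch variant with batch size 2, and four for SGD.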
A. Model generalization capability: the extent to which a learned model can be applied to new
samples, also called robustness.
B. Error: difference between the prediction of a learned model on a sample and the actual result of
the sample. Errors can be classified into training errors and generalization errors.
38. In the Da Vinci Architecture, which of the following computation data types is supported by the
vector unit?
C. FP32
B. BFloat16
D. BFloat32
A. INT16
D. The learning rate of the momentum optimizer does not need to be manually set.
B. Unlike the RMSprop optimizer, the Adagrad optimizer does not automatically update the learning
rate.
C. The SGD and momentum optimizers use the same learning rate for each iteration.
A. The RMSprop optimizer inherits the advantages of the Adam optimizer. The learning rate is
automatically updated.
22. The ReLU function is often used in deep learning and neural networks. Which of the following
statements is true about this function?
C. The key step to building a decision tree involves splitting it based on feature attributes.
B. The C4.5 algorithm uses the information gain ratio to select feature attributes.
A. Except the root node, all nodes in a decision tree are called leaf nodes.
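The information gain ratio mentioned for C4.5 can be illustrated with a short sketch. This is a plain-Python illustration for categorical features, not a full C4.5 implementation. The function names `entropy` and `gain_ratio` are illustrative.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of categorical values."""
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in Counter(values).values())


def gain_ratio(feature_values, labels):
    """Information gain of splitting on a feature, divided by its split information."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the labels after splitting on the feature.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - conditional
    split_info = entropy(feature_values)
    return info_gain / split_info if split_info > 0 else 0.0


labels = [0, 0, 1, 1]
print(gain_ratio(["a", "a", "b", "b"], labels))  # feature separates the classes
print(gain_ratio(["a", "b", "a", "b"], labels))  # feature carries no information
```

A tree builder would evaluate this ratio for every candidate feature and split on the one with the highest value.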
24. Which of the following functions is used in HUAWEI CLOUD General Text OCR experiments?
B. ocr_client.ocr_service_base64
D. ocr.request_ocr_service_base128
A. ocr_client.request_ocr_service_base64
C. ocr_client.request_base64
Congratulations! Correct Answer: A
B. Variance refers to the difference between the prediction of a learned model on a sample and the
actual result of the sample.
D. Bias refers to errors you get when you run the model on new samples.
C. During model testing, errors can be classified into training errors and sample variances.
A. The goal of machine learning is for a trained model to perform well on new samples, not just on
samples used for training.
26. Which of the following statements is true about variance and bias?
D. A model with high bias and variance performs poorly and will not be used.
A. A model with low bias but high variance has high robustness.
B. A model with high bias but low variance has high precision.
27. Which of the following is the correct shape of tensor [[0,1], [2,3], [4,5]]?
C. (1,6)
A. (2,3)
D. (6,1)
B. (3,2)
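The tensor shape questions can be checked by walking the nesting levels of the literal. The helper `shape` below is a hypothetical plain-Python function that assumes a regular, non-empty nested list. It is not a TensorFlow or MindSpore API.

```python
def shape(tensor):
    """Return the shape of a regularly nested, non-empty list as a tuple."""
    dims = []
    while isinstance(tensor, list):
        dims.append(len(tensor))  # size of the current axis
        tensor = tensor[0]        # descend one nesting level
    return tuple(dims)


print(shape([[0, 1], [2, 3], [4, 5]]))                # (3, 2)
print(shape([[[0, 1], [2, 3]], [[4, 5], [6, 7]]]))    # (2, 2, 2)
print(len(shape([[[2, 3]]])))                         # 3 (number of dimensions)
```

The outermost list gives the first axis, so three rows of two elements yield (3, 2) rather than (2, 3).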
28. HUAWEI CLOUD Traffic Intelligent Twins (or TrafficGo) is a comprehensive urban traffic
management solution. Which of the following functions is provided by this solution?
C. LSTM is suitable for processing events with long interval and delay in the time sequence.
D. Distributed
A. Single-device
C. Device-cloud synergy
B. Multi-device
31. A vendor wants to provide an intelligent EMR system for a hospital. Which of the following
technologies is involved in the system?
B. Object detection
B. No-manual labeling
C. Patient care
A. Genome
B. Smart surgery
34. A generative adversarial network is like a game system. The generator produces fake samples,
and the discriminator tries to distinguish real data from the data created by the generator. What is
the ideal result?
C. The discriminator distinguishes real data from the data created by the generator.
B. The discriminator cannot distinguish real data from the data created by the generator.
D. Develop a high-precision discriminator and a generator that cannot fool the discriminator.
35. Which of the following is NOT a main function of HUAWEI CLOUD GeoGenius?
C. Weather forecast
36. When the v1 compatibility package of TensorFlow 2.0 is used to inherit the Tensorflow 1.x code,
the eager operation needs to be disabled. Which of the following commands is used to disable it?
C. tf.uneager_execution()
B. tf.no_eager_execution()
A. tf.disable_eager_run()
D. tf.disable_eager_execution()
B. BFloat16
A. INT16
D. BFloat32
C. FP32
38. Which of the following statements is false about the running process of the MindArmour
subsystem?
B. Fuzzing execution: Generate trusted test data randomly based on the model coverage and
configuration policies.
A. Configuration policies: Define test policies based on threat vectors and trustworthiness
certification requirements and select appropriate test data generation methods.
21. Which of the following is the reasoning method of the production system that draws conclusions
through a rule library?
B. Backward
D. Random
C. Bidirectional
A. Forward
D. Select
A. Mul
C. Callback
B. ControlDepend
B. K in K-means
24. HUAWEI CLOUD Traffic Intelligent Twins (or TrafficGo) is a comprehensive urban traffic
management solution. Which of the following functions is provided by this solution?
B. Weight initialization
D. Model compression
26. Which of the following products can be used by a company that wants to provide intelligent
customer service?
A. RDB
D. GES
C. OBS
B. Conversational Bot
27. A generative adversarial network is like a game system. The generator produces fake samples,
and the discriminator tries to distinguish real data from the data created by the generator. What is
the ideal result?
D. Develop a high-precision discriminator and a generator that cannot fool the discriminator.
B. The discriminator cannot distinguish real data from the data created by the generator.
C. The discriminator distinguishes real data from the data created by the generator.
B. Text to speech
C. Voiceprint recognition
A. Speech recognition
A. Single-device
D. Distributed
B. Multi-device
C. Device-cloud synergy
30. Which of the following statements is true about the loss functions typically used in deep
learning?
B. The quadratic cost function reflects the gap between the target output and the actual output.
A. Quadratic cost functions are usually used for classification, while cross-entropy cost functions are
used for clustering.
C. The quadratic cost function reflects the gap between two probability distributions.
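The two loss functions in question 30 measure different things, which a small sketch can make concrete. The function names are illustrative, and the quadratic cost uses one common form (half the mean of squared differences), which is an assumption since conventions vary.

```python
import math


def quadratic_cost(target, output):
    """Half the mean squared difference between target and actual output."""
    return sum((t - o) ** 2 for t, o in zip(target, output)) / (2 * len(target))


def cross_entropy(p, q):
    """Cross-entropy between a true distribution p and a predicted distribution q."""
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q) if pi > 0)


target = [1.0, 0.0]
print(quadratic_cost(target, [0.8, 0.2]))  # gap between target and actual output
print(cross_entropy(target, [0.8, 0.2]))   # gap between two probability distributions
```

The quadratic cost compares output values directly, while cross-entropy treats both arguments as probability distributions.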
C. Data augmentation
B. Network integration
32. Which of the following statements is true about variance and bias?
A. A model with low bias but high variance has high robustness.
D. A model with high bias and variance performs poorly and will not be used.
B. A model with high bias but low variance has high precision.
B. Machines begin to see, listen, understand, make judgments, and take simple actions.
35. Which of the following is the correct shape of tensor [[[0,1],[2,3]], [[4,5],[6,7]]]?
A. [3,3,2]
C. [2,3,4]
B. [3,2,4]
D. [2,2,2]
36. Which of the following statements is true about the ReLU function?
D. The gradient is always 1, and the vanishing gradient problem can be perfectly solved.
C. Weather forecast
38. When the v1 compatibility package of TensorFlow 2.0 is used to inherit the TensorFlow 1.x code,
the eager operation needs to be disabled. Which of the following commands is used to disable it?
A. tf.disable_eager_run()
B. tf.no_eager_execution()
C. tf.uneager_execution()
D. tf.disable_eager_execution()
21. Which one of the following actions can the function tf.squeeze be used for in TensorFlow 2.0?
C. Tensor concatenation
D. Dimensionality reduction
A. Element-wise addition
22. Which of the following products can be used by a company that wants to provide intelligent
customer service?
D. GES
C. OBS
A. RDB
B. Conversational Bot
B. Machines begin to see, listen, understand, make judgments, and take simple actions.
24. Which operation is not a step in the network definition process during application development?
D. Model compression
B. Weight initialization
26. Which of the following is the type of labels predicted by ensemble learning algorithms?
C. Discrete
B. Continuous
27. Which of the following HUAWEI CLOUD services is NOT a basic AI platform service?
B. Huawei HiLens
A. ModelArts
28. An image recognition experiment uses 42,510 training images. There are more than 200
categories, each of which contains only 10 images. There are also categories with only 20 to 50
images. Which of the following data problems does this phenomenon fit into?
B. Data imbalance
C. Data loss
A. Data augmentation
D. Data fitting
C. The surface defined at the zero point of the ReLU function is not smooth enough in some
regression problems.
D. Compared with Sigmoid and tanh, the convergence of the ReLU function is slow.
A. The ReLU function is not differentiable at x = 0 and a derivative is forcibly defined at this point.
D. isinstance()
A. asnumpy()
B. size()
C. dim()
31. In the Da Vinci Architecture, which of the following computation data types is supported by the
vector unit?
A. INT16
D. BFloat32
C. FP32
B. BFloat16
32. Which of the following statements is false about the running process of the MindArmour
subsystem?
A. Configuration policies: Define test policies based on threat vectors and trustworthiness
certification requirements and select appropriate test data generation methods.
B. Fuzzing execution: Generate trusted test data randomly based on the model coverage and
configuration policies.
Congratulations! Correct Answer: B
33. Which of the following indicators cannot be used to evaluate a model?
D. Code complexity
C. Prediction rate
A. Generalization capability
B. Explainability
A. Model generalization capability: the extent to which a learned model can be applied to new
samples, also called robustness.
B. Error: difference between the prediction of a learned model on a sample and the actual result of
the sample. Errors can be classified into training errors and generalization errors.
Congratulations! Correct Answer: D
B. GPU parallel computing provides faster computing if there are more network parameters.
Congratulations! Correct Answer: A
36. The ReLU function is often used in deep learning and neural networks. Which of the following
statements is true about this function?
A. [[1,2,3]]
C. [[1,2,3,1,2,3,1,2,3]]
38. Which of the following statements is true about variance and bias?
D. A model with high bias and variance performs poorly and will not be used.
A. A model with low bias but high variance has high robustness.
B. A model with high bias but low variance has high precision.
21. Which of the following is a lightweight and high-performance service module that helps
MindSpore developers efficiently deploy online inference services in production environments?
D. MindInsight
C. MindArmour
B. MindSpore Serving
A. MindIR
B. Error: difference between the prediction of a learned model on a sample and the actual result of
the sample. Errors can be classified into training errors and generalization errors.
A. Model generalization capability: the extent to which a learned model can be applied to new
samples, also called robustness.
23. Which of the following statements is false about gradient descent algorithms?
D. Each time the global descent updates its weight, all training samples need to be calculated.
C. When GPUs are not used for parallel computing, the mini-batch gradient descent (MBGD) takes
less time than the SGD to complete an epoch.
B. When there are too many samples and GPUs are not used for parallel computing, the convergence
process of the global gradient descent is time-consuming.
A. The global gradient descent is more stable than the stochastic gradient descent (SGD).
24. An image recognition experiment uses 42,510 training images. There are more than 200
categories, each of which contains only 10 images. There are also categories with only 20 to 50
images. Which of the following data problems does this phenomenon fit into?
B. Data imbalance
C. Data loss
D. Data fitting
A. Data augmentation
25. "Knowledge representation is the unique method of representing knowledge using a set of
symbols in a structure that can be understood by computers." Which of the following is true about
this statement?
C. This statement is correct. The knowledge representation can support expert systems.
D. This statement is false. Knowledge representation cannot be used for expert rules or fuzzy
inference.
26. Which of the following is the correct shape of tensor [[[0, 1],[2,3]], [[4,5],[6,7]]]?
A. [3,3,2]
B. [3,2,4]
C. [2,3,4]
D. [2,2,2]
B. GPU parallel computing provides faster computing if there are more network parameters.
29. Which of the following is not a feature of the MindSpore core architecture?
A. Automatic differentiation
B. Automatic parallelism
C. Automatic deployment
D. Automatic tuning
30. Which of the following is a math operator in MindSpore?
A. Mul
B. ControlDepend
C. Callback
D. Select
31. Which of the following statements is true about the loss functions typically used in deep
learning?
C. The quadratic cost function reflects the gap between two probability distributions.
A. Quadratic cost functions are usually used for classification, while cross-entropy cost functions are
used for clustering.
B. The quadratic cost function reflects the gap between the target output and the actual output.
C. During model testing, errors can be classified into training errors and sample variances.
D. Bias refers to errors you get when you run the model on new samples.
B. Variance refers to the difference between the prediction of a learned model on a sample and the
actual result of the sample.
A. The goal of machine learning is for a trained model to perform well on new samples, not just on
samples used for training.
33. "Deep learning is a complex, comprehensive discipline. It includes AI and machine learning."
Which of the following is true about this statement?
C. This statement is incorrect. Machine learning has nothing to do with deep learning.
B. The C4.5 algorithm uses the information gain ratio to select feature attributes.
A. Except the root node, all nodes in a decision tree are called leaf nodes.
C. The key step to building a decision tree involves splitting it based on feature attributes.
35. Which of the following is the main function of HUAWEI CLOUD GeoGenius?
36. Which of the following statements is false about the running process of the MindArmour
subsystem?
B. Fuzzing execution: Generate trusted test data randomly based on the model coverage and
configuration policies.
A. Configuration policies: Define test policies based on threat vectors and trustworthiness
certification requirements and select appropriate test data generation methods.
MindSpore?
B. Operator overloading
D. Just-in-time compilation
D. abs()
B. astype()
C. switch()
A. dtype()
40. Which of the following are part of the Huawei full-stack AI solution?
B. openEuler
A. TBE
C. AscendCL
D. Ascend
41. Which of the following are the features of the Speech Interaction Service (SIS) provided by
HUAWEI CLOUD EI?
C. Low requirements
B. High usability
42. Which of the following statements are true about MindSpore and Huawei all-scenario solutions?
C. Distributed training of ultra-large models and ultra-large datasets requires only data parallelism.
A. The full-scenario deployment solution includes model generation and efficient execution.
B. Transformer
D. BERT
A. ResNet
A. Threshold theory
C. Behavioral theory
D. Logic theory
B. Activation functions
D. Neuron weight
D. Easy to scale
A. Easy to use
48. Which of the following statements are true about linear regression?
D. Due to algorithm complexity, linear regression cannot use the gradient descent method to
calculate the weight parameter if the loss function reaches the minimum value.
B. The loss function of linear regression can be obtained using the normal distribution function and
maximum likelihood estimation (MLE).
A. The error of linear regression is affected by many independent factors. According to the central
limit theorem (CLT), the error follows normal distribution.
49. Which of the following statements are true about ensemble learning?
C. A batch of features are randomly selected for the subtree training in a random forest.
B. Keras is a neural network development package used to build CNN sequential models. It cannot be
used to build other neural networks.
A. Like TensorFlow, Keras is a multi-layer neural network development package. However, Keras has
simpler syntax and is easier to use.
C. The neural network model built using Keras must be compiled before data can be input into it for
training.
51. Which of the following functions are supported by the HiAI Engine platform?
A. Form recognition
C. Video summarization
B. Keyword extraction
D. WithGradCell
C. cos
B. TFRecord
A. MAELoss
53. Which of the following are common ensemble learning algorithms in machine learning?
B. GBDT
A. Random forest
C. Polynomial regression
D. Adaboost
54. Which of the following statements are false about common optimizers?
B. When the momentum optimizer is used, parameters are updated by using the same learning rate,
but momentum coefficients keep changing with each iteration.
A. One of the advantages of Adagrad optimizers is that the parameter update operation does not end
too early.
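The optimizer statements in question 54 can be explored with a small sketch of the standard update rules. These are textbook forms written in plain Python, not any framework's implementation, and the names `adagrad_effective_lr` and `momentum_update` are illustrative.

```python
import math


def adagrad_effective_lr(grads, lr=0.1, eps=1e-8):
    """Effective step size of Adagrad for one parameter over a gradient sequence.

    Adagrad divides the base learning rate by the square root of the
    accumulated squared gradients, so the step size never grows.
    """
    accum, steps = 0.0, []
    for g in grads:
        accum += g * g
        steps.append(lr / (math.sqrt(accum) + eps))
    return steps


def momentum_update(velocity, grad, lr=0.1, beta=0.9):
    """One momentum step: the learning rate lr and coefficient beta stay fixed."""
    return beta * velocity - lr * grad


print(adagrad_effective_lr([1.0] * 5))  # shrinks with every iteration

v = 0.0
for _ in range(3):
    v = momentum_update(v, grad=1.0)
    print(v)  # velocity builds up while lr and beta are unchanged
```

Because the accumulated sum only grows, Adagrad's steps can become very small and effectively stall parameter updates. In the momentum rule, both the learning rate and the momentum coefficient are constants set in advance.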
PART 2
C. Behavioral theory
D. Logic theory
B. The theory of evolution
A. Threshold theory
40. Which of the following statements are true about decision trees?
D. Building a decision tree means selecting and measuring feature attributes and determining their
topology.
B. Each non-leaf node in a decision tree represents a test on a feature attribute; each branch
represents the output of the feature attribute within a value range; and each leaf node holds a class
label.
C. Except the root node, all nodes in a decision tree are called leaf nodes.
C. Inception
B. Transformer
D. BERT
A. ResNet
42. Which of the following technologies may be involved in room service robots?
B. Speech recognition
D. Object detection
A. Path planning
C. Sentiment analysis
43. Which of the following methods provided by TensorFlow 2.0 cannot be used to check whether an
object is a tensor?
C. device
A. is_tensor
D. tftypes
B. isinstance
A. Form recognition
B. Keyword extraction
C. Video summarization
45. Which of the following operations are involved when TensorFlow is used for model training?
46. Which of the following statements are true about grid search based on hyperparameter tuning?
C. Grid search works well when there are relatively few hyperparameters.
A. Grid search exhaustively searches for all possible hyperparameter combinations to form a
hyperparameter value grid.
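Exhaustive grid search as described in question 46 can be sketched with `itertools.product`. The function `grid_search`, the toy scoring function, and the hyperparameter values are all hypothetical. A real run would train and validate a model for each combination.

```python
from itertools import product


def grid_search(param_grid, score_fn):
    """Evaluate every combination in param_grid and return the best one."""
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = score_fn(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_score(lr, batch_size):
    """Stand-in for a validation score, peaking at lr=0.01 and batch_size=32."""
    return -((lr - 0.01) ** 2) - ((batch_size - 32) ** 2) * 1e-6


grid = {"lr": [0.001, 0.01, 0.1], "batch_size": [16, 32, 64]}
best, score = grid_search(grid, toy_score)
print(best)
print(len(list(product(*grid.values()))), "combinations evaluated")
```

The number of combinations is the product of the list lengths, which is why the approach is practical only when there are relatively few hyperparameters.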
B. RTX3080
C. Kunpeng 920
D. Ascend 910
A. Ascend 310
48. Which of the following statements are false about the functions of the pooling layers?
49. Which of the following statements are true about linear regression?
D. Due to algorithm complexity, linear regression cannot use the gradient descent method to
calculate the weight parameter if the loss function reaches the minimum value.
A. The error of linear regression is affected by many independent factors. According to the central
limit theorem (CLT), the error follows normal distribution.
B. The loss function of linear regression can be obtained using the normal distribution function and
maximum likelihood estimation (MLE).
50. Which of the following are common application scenarios of HUAWEI CLOUD OCR?
51. Which of the following statements are true about activation functions?
C. There are many activation functions. You need to select one based on the actual situation.
B. If we do not use an activation function, the output signals will be just a simple linear function.
A. The existence of the activation function introduces linearity into the network.
D. Activation functions play an important role in learning and understanding complex and nonlinear
functions of neural network models.
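The claim that a network without activation functions stays linear can be checked numerically. The sketch below uses scalar "layers" for brevity. The helper names `linear` and `compose` and the weights are illustrative assumptions.

```python
def linear(w, b):
    """A one-dimensional linear layer x -> w * x + b."""
    return lambda x: w * x + b


def compose(f, g):
    """Apply f first, then g."""
    return lambda x: g(f(x))


def relu(x):
    return max(0.0, x)


layer1, layer2 = linear(2.0, 1.0), linear(-3.0, 0.5)

# Two stacked linear layers collapse into one: w = w2 * w1, b = w2 * b1 + b2.
stacked = compose(layer1, layer2)
collapsed = linear(-3.0 * 2.0, -3.0 * 1.0 + 0.5)

# Inserting ReLU between the layers breaks that equivalence.
nonlinear = compose(compose(layer1, relu), layer2)

for x in (-2.0, -1.0, 0.0, 1.0, 2.0):
    print(x, stacked(x), collapsed(x), nonlinear(x))
```

The first two output columns always agree, while the third deviates for inputs where ReLU clamps the intermediate value to zero.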
52. Which of the following statements are true about the MindSpore components?
C. nn: neural network cells in MindSpore that define loss functions and optimizers.
A. communication: processes data flow communication between the CPU and memory.
D. train: relates to training model and model quantization module.
53. Backed by HUAWEI CLOUD's accumulated knowledge and expertise in AI, big data, and other
cutting-edge technologies, GeoGenius offers a one-stop AI development cloud platform for remote
sensing data processing, mining, and management. Which of the following are the main functions of
GeoGenius?
A. Weather forecasting
B. Afforestation
B. GANs are a type of framework. They train the generator and discriminator through an adversarial
process.
D. The input of the discriminator is mainly sample data provided by the generator.
PART 3
39. Which of the following functions are supported by the HiAI Engine platform?
C. Video summarization
B. Keyword extraction
A. Form recognition
40. Which of the following statements are true about grid search based on hyperparameter tuning?
C. Grid search works well when there are relatively few hyperparameters.
D. Grid search suits neural networks well.
A. Grid search exhaustively searches for all possible hyperparameter combinations to form a
hyperparameter value grid.
41. Which of the following statements are true about the MindSpore components?
C. nn: neural network cells in MindSpore that define loss functions and optimizers.
A. communication: processes data flow communication between the CPU and memory.
B. Team labeling
43. Backed by HUAWEI CLOUD's accumulated knowledge and expertise in AI, big data, and other
cutting-edge technologies, GeoGenius offers a one-stop AI development cloud platform for remote
sensing data processing, mining, and management. Which of the following are the main functions of
GeoGenius?
B. Afforestation
A. Weather forecasting
44. Which of the following format conversion operations are not performed by the storage control
unit of the Da Vinci Architecture?
D. Img2Col
45. Which of the following are common ensemble learning algorithms in machine learning?
C. Polynomial regression
B. GBDT
A. Random forest
D. Adaboost
46. Which of the following products can be equipped with Ascend 310 Processors?
A. Ascend 310
B. RTX3080
C. Kunpeng 920
D. Ascend 910
A. Learning rate, iteration count, and batch size in a training neural network
49. Which of the following statements are true about hidden layers?
50. Which of the following statements are false about the functions of the pooling layers?
C. Pooling reduces the size of the input data at the next layer and the number of parameters, but
increases the computation amount.
C. TensorFlow 2.0 uses the dynamic graph mechanism by default, with a higher running efficiency
than the static graph mechanism.
52. Which of the following statements are true about decision trees?
D. Building a decision tree means selecting and measuring feature attributes and determining their
topology.
C. Except the root node, all nodes in a decision tree are called leaf nodes.
B. Each non-leaf node in a decision tree represents a test on a feature attribute; each branch
represents the output of the feature attribute within a value range; and each leaf node holds a class
label.
53. Which of the following statements are false about the Gated Recurrent Unit (GRU)?
A. Unlike long short-term memory (LSTM), GRU merges the cell state and hidden state.
C. GRU combines the forget and update gates into a single input gate.
54. Which of the following are steps of the Back Propagation Through Time (BPTT) algorithm?
PART 4
39. Which of the following products can be equipped with Ascend 310 Processors?
40. Which of the following statements are true about regression analysis?
A. Regression analysis is a statistical analysis method used to determine the quantitative relationship
between two or more variables.
D. Linear regression with an absolute loss (L2 regularization) is called Lasso regression.
A. Learning rate, iteration count, and batch size in a training neural network
B. Afforestation
A. Weather forecasting
43. Which of the following statements are true about common activation functions in deep learning?
A. The sigmoid function is monotonic, continuous, and easy to derive. Its output is bounded, making
the network converge better.
D. During training of a deep neural network, the sigmoid, tanh, and softsign functions cannot prevent
the vanishing gradient problem.
C. The tanh function is symmetric with respect to the origin, and the mean of its output is closer to 0.
A. Easy to use
D. Easy to scale
B. Activation functions
D. Neuron weight
46. Which of the following are the features of the Speech Interaction Service (SIS) provided by
HUAWEI CLOUD EI?
C. Low requirements
B. High usability
A. Ascend 310
B. RTX3080
D. Ascend 910
48. Which of the following format conversion operations are not performed by the storage control
unit of the Da Vinci Architecture?
D. Img2Col
49. Which of the following statements are true about decision trees?
B. Each non-leaf node in a decision tree represents a test on a feature attribute; each branch
represents the output of the feature attribute within a value range; and each leaf node holds a class
label.
D. Building a decision tree means selecting and measuring feature attributes and determining their
topology.
C. Except the root node, all nodes in a decision tree are called leaf nodes.
50. Which of the following statements are true about the three main schools of AI?
D. Symbolism states that AI focuses on behavior control, adaptive computation, and evolutionary
computation.
51. Which of the following statements are true about activation functions?
B. If we do not use an activation function, the output signals will be just a simple linear function.
D. Activation functions play an important role in learning and understanding complex and nonlinear
functions of neural network models.
A. The existence of the activation function introduces linearity into the network.
C. There are many activation functions. You need to select one based on the actual situation.
52. Which of the following are steps of the Back Propagation Through Time (BPTT) algorithm?
53. Which of the following operations are involved when TensorFlow is used for model training?
B. Team labeling
PART 5
39. Which of the following are supported by data management on ModelArts?
B. Team labeling
Incorrect! Correct Answer: ABC
40. Which of the following statements are false about the universal engine of the Ascend 310
software stack?
B. astype()
D. abs()
A. dtype()
C. switch()
Congratulations! Correct Answer: ABD
42. Which of the following statements are true about MindSpore and Huawei all-scenario solutions?
C. Distributed training of ultra-large models and ultra-large datasets requires only data parallelism.
A. The full-scenario deployment solution includes model generation and efficient execution.
Congratulations! Correct Answer: ABD
43. Which of the following format conversion operations are not performed by the storage control
unit of the Da Vinci Architecture?
D. Img2Col
Congratulations! Correct Answer: BC
A. Threshold theory
D. Logic theory
Congratulations! Correct Answer: ABD
45. Which of the following development modes are supported by the ModelArts training platform?
46. Which of the following statements are false about the combination of the model bias and
variance?
Congratulations! Correct Answer: ABD
47. Which of the following operations are involved when TensorFlow is used for model training?
Congratulations! Correct Answer: ABD
48. Which of the following statements are false about common optimizers?
B. When the momentum optimizer is used, parameters are updated by using the same learning rate,
but momentum coefficients keep changing with each iteration.
A. One of the advantages of Adagrad optimizers is that the parameter update operation does not end
too early.
Congratulations! Correct Answer: AB
49. Which of the following statements are true about support vector machines (SVMs)?
C. The learning algorithm of SVMs is the optimal algorithm for concave quadratic programming.
D. Based on the structural risk minimization (SRM), SVMs build an optimal hyperplane in the feature
space so that the learner can be globally optimized.
A. SVMs are binary classification models. Their basic model is the linear classifier with the largest
interval defined in the feature space.
B. SVMs also have a kernel trick, which enables them to perform as a nonlinear classifier.
Congratulations! Correct Answer: ABCD
50. Which of the following are common application scenarios of HUAWEI CLOUD OCR?
Congratulations! Correct Answer: BCD
51. Which of the following statements are true about the MindSpore components?
C. nn: neural network cells in MindSpore that define loss functions and optimizers.
A. communication: processes data flow communication between the CPU and memory.
Congratulations! Correct Answer: BC
52. Which of the following statements are true about gradient descent?
C. Mini-batch gradient descent (MBGD) is a balance between BGD and SGD, and is the optimal choice
for all datasets.
B. Batch gradient descent (BGD) is the most unstable method and consumes too many compute
resources.
A. Stochastic gradient descent (SGD) randomly chooses samples for each training job, causing
instability. As a result, the loss function fluctuates or even encounters reverse displacements during
the process of dropping to the minimum value.
Congratulations! Correct Answer: ACD
53. Which of the following functions are supported by the HiAI Engine platform?
C. Video summarization
B. Keyword extraction
A. Form recognition
Congratulations! Correct Answer: ABCD
54. Which of the following statements are true about ensemble learning?
C. A batch of features are randomly selected for the subtree training in a random forest.
Congratulations! Correct Answer: BC
BLANK FILLING
The derivative of the ReLU activation function in the negative half interval is fixed at (). (Fill in the
blank with a number.)
Tensor [[[2,3]]] is a/an ()-dimensional tensor. (Fill in the blank with a number.)
As the neural network increasingly deepens, two issues may be encountered during network
training: one is (), and the other is gradient vanishing/exploding.
Generally, a model with a () volume has a higher precision, and a model with a () volume has a higher
efficiency. (Fill in using "large" or "small")
The Naive Bayes algorithm needs to obtain the () probability of the dataset.
PRIOR
(1) MindCompiler
PART 3
As the neural network increasingly deepens, two issues may be encountered during network
training: one is (), and the other is gradient vanishing/exploding.
Linear regression using a loss function with the L()-norm regular term is also called ridge regression.
2
Assume that there are 10,000 data pieces in a cancer data sample, of which 100 pieces are from
cancer patients, and the other 9900 pieces are normal. If a classification model predicts that 9000
out of the 9900 pieces are normal, and 90 out of the 100 pieces are from cancer patients, the
accuracy rate of the model is ()%.
90.9
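The 90.9 figure follows from the definition of accuracy as correct predictions divided by all predictions. The sketch below reproduces the arithmetic. The function name `accuracy` and the variable names are illustrative.

```python
def accuracy(true_positives, true_negatives, total):
    """Share of all samples that the model classified correctly."""
    return (true_positives + true_negatives) / total


total = 10_000           # all data pieces
true_positives = 90      # cancer patients correctly predicted (out of 100)
true_negatives = 9_000   # normal pieces correctly predicted (out of 9900)

acc = accuracy(true_positives, true_negatives, total)
print(f"{acc:.1%}")  # 90.9%
```

Note that the score is dominated by the 9900 normal pieces, so it says little about how well the 100 cancer cases are detected.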
The Atlas 200 DK is designed to run on the Ascend () AI Processor. (Fill in using Arabic numerals.)
(1) 310
Generally, a model with a () volume has a higher precision, and a model with a () volume has a higher
efficiency. (Fill in using "large" or "small")
(1) BPTT
PART 4
Tensor [[[2,3]]] is a/an ()-dimensional tensor. (Fill in the blank with a number.)
In TensorFlow 2.0, if tf.keras.layers.RNN is used to process timing information and you want to obtain
the output status at each moment, set () to True.
(1):2
(1) bue
(1) dutx
PART 5
The Speech Interaction Service (SIS) on HUAWEI CLOUD provides text recognition through open ().
(Fill in the blank with the abbreviation.)
(1) APIs
Generally, a model with a () volume has a higher precision, and a model with a () volume has a higher
efficiency. (Fill in using "large" or "small".)
(1) large
(2) small
(1) 15
(2) 15
(3) 256