A company uses a long short-term memory (LSTM) model to evaluate the risk factors of a particular energy sector. The model reviews multi-page text documents to analyze each sentence of the text and categorize it as either a potential risk or no risk. The model is not performing well, even though the Data Scientist has experimented with many different network structures and tuned the corresponding hyperparameters. Which approach will provide the MAXIMUM performance boost?
A) Initialize the words by term frequency-inverse document frequency (TF-IDF) vectors pretrained on a large collection of news articles related to the energy sector.
B) Use gated recurrent units (GRUs) instead of LSTM and run the training process until the validation loss stops decreasing.
C) Reduce the learning rate and run the training process until the training loss stops decreasing.
D) Initialize the words by word2vec embeddings pretrained on a large collection of news articles related to the energy sector.
Correct Answer:
Verified
Q73: A credit card company wants to build
Q74: A Machine Learning Specialist previously trained a
Q75: A trucking company is collecting live image
Q76: A Data Scientist needs to analyze employment
Q77: A Machine Learning Specialist is training a
Q79: A Data Scientist is training a multilayer
Q80: A Machine Learning Specialist is preparing data
Q81: A data scientist has developed a machine
Q82: A data scientist needs to identify fraudulent
Q83: A logistics company needs a forecast model
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents