In a normal distribution, approximately what percentage of cases, to the nearest whole number, falls within 4 standard deviations of the mean?
1. 68%
2. 100%
3. 99%
4. 75%
Answer: 2 (100%)
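The arithmetic behind this answer can be checked with a short sketch. It assumes only the standard identity P(|Z| <= k) = erf(k / sqrt(2)) for a standard normal variable Z; the function name `fraction_within` is a hypothetical helper chosen for illustration.

```python
import math


def fraction_within(k: float) -> float:
    """Fraction of a normal distribution within k standard deviations of the mean."""
    # For a standard normal Z, P(|Z| <= k) = erf(k / sqrt(2)).
    return math.erf(k / math.sqrt(2))


# Print the coverage for 1 to 4 standard deviations as percentages.
for k in (1, 2, 3, 4):
    print(f"within {k} sd: {100 * fraction_within(k):.4f}%")
```

For k = 4 the fraction is so close to 1 that it rounds to 100% at whole-number precision, while k = 1 gives the familiar 68%.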
In the Latent Dirichlet Allocation model for text classification purposes, what do the alpha and beta hyperparameters represent?
A) Alpha: number of topics within documents; beta: number of terms within topics
B) Alpha: density of terms generated within topics; beta: density of topics generated within terms
C) Alpha: number of topics within documents; beta: number of terms within topics
D) Alpha: density of topics generated within documents; beta: density of terms generated within topics
Answer: D
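To see what "density" means here, the following is a minimal sketch rather than an LDA implementation. It assumes a symmetric Dirichlet prior and draws topic mixtures for a document at two alpha values; `sample_dirichlet` and `mean_largest_share` are hypothetical helper names. The same reasoning applies to beta, with terms within a topic in place of topics within a document.

```python
import random


def sample_dirichlet(alpha: float, k: int, rng: random.Random) -> list[float]:
    """Draw one sample from a symmetric Dirichlet(alpha) over k categories."""
    # Independent Gamma(alpha, 1) draws, normalized to sum to 1, are Dirichlet distributed.
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(k)]
    total = sum(draws)
    return [d / total for d in draws]


def mean_largest_share(alpha: float, k: int = 5, n: int = 2000, seed: int = 0) -> float:
    """Average weight of the dominant topic across n sampled documents."""
    rng = random.Random(seed)
    return sum(max(sample_dirichlet(alpha, k, rng)) for _ in range(n)) / n


# Low alpha: each document is dominated by a few topics (sparse mixture).
# High alpha: each document spreads its weight over many topics (dense mixture).
print("alpha=0.1 :", round(mean_largest_share(0.1), 3))
print("alpha=10.0:", round(mean_largest_share(10.0), 3))
```

A smaller alpha concentrates each document on fewer topics, so the dominant topic's average share is larger than with a large alpha.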
Which of the following techniques can be used for keyword normalization, the process of converting a keyword into its base form?
1. Lemmatization
2. Levenshtein
3. Stemming
4. Soundex
A) 1 and 2
B) 2 and 4
C) 1 and 3
D) 1, 2 and 3
E) 2, 3 and 4
F) 1, 2, 3 and 4
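The listed techniques differ in what they return, which the sketch below tries to make concrete. It is illustrative only: `toy_stem` is a hypothetical suffix stripper (real stemmers such as Porter's use many more rules), and `levenshtein` is the standard dynamic-programming edit distance. A stemmer maps a word to a base form, whereas Levenshtein returns a distance between two strings.

```python
def toy_stem(word: str) -> str:
    """Hypothetical suffix-stripping stemmer, for illustration only."""
    for suffix in ("ing", "ed", "es", "s"):
        # Strip the first matching suffix, keeping at least three characters.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions and substitutions to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


# Stemming returns a base form for each word.
print([toy_stem(w) for w in ("playing", "played", "plays")])
# Levenshtein returns a number describing how far apart two strings are.
print(levenshtein("kitten", "sitting"))
```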
What are the different categories of the sequence learning process?
What is batch statistical learning?
What is the difference between heuristics for rule learning and heuristics for decision trees?
N-grams are defined as combinations of N keywords together. How many bi-grams can be generated from the given sentence: "Analytics Vidhya is a great source to learn data science"?
A) 7
B) 8
C) 9
D) 10
E) 11
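The count can be worked out mechanically. The sketch below assumes whitespace tokenization and contiguous n-grams; `ngrams` is a hypothetical helper name. A sentence of T tokens yields T - N + 1 contiguous N-grams.

```python
def ngrams(text: str, n: int) -> list[tuple[str, ...]]:
    """Return all contiguous n-grams of whitespace-separated tokens."""
    tokens = text.split()
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


sentence = "Analytics Vidhya is a great source to learn data science"
bigrams = ngrams(sentence, 2)

# The sentence has 10 tokens, so it yields 10 - 2 + 1 bi-grams.
print(len(sentence.split()), len(bigrams))
for pair in bigrams:
    print(" ".join(pair))
```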
True or False: The Word2Vec model is a machine learning model used to create vector notations of text objects. Word2Vec contains multiple deep neural networks.
A) TRUE
B) FALSE
You have created a document-term matrix of the data, treating every tweet as one document. Which of the following is correct with regard to the document-term matrix?
1. Removal of stopwords from the data will affect the dimensionality of the data
2. Normalization of words in the data will reduce the dimensionality of the data
3. Converting all the words to lowercase will not affect the dimensionality of the data
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 2 and 3
F) 1, 2 and 3
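The number of columns in a document-term matrix equals the vocabulary size, so each statement can be tested by counting distinct terms. The sketch below is a toy check under stated assumptions: the three tweets, the stopword set, and the `LEMMAS` lookup table are all invented for illustration, and `vocabulary` is a hypothetical helper.

```python
# Hypothetical normalization table standing in for a real lemmatizer or stemmer.
LEMMAS = {"plays": "play", "played": "play", "Cats": "cat", "cats": "cat"}


def vocabulary(docs, lowercase=False, stopwords=frozenset(), normalize=None):
    """Collect the distinct terms, i.e. the columns of a document-term matrix."""
    vocab = set()
    for doc in docs:
        for token in doc.split():
            if lowercase:
                token = token.lower()
            if token.lower() in stopwords:
                continue  # dropped terms never become columns
            if normalize is not None:
                token = normalize(token)
            vocab.add(token)
    return vocab


tweets = ["The cat plays", "the Cats played", "A dog plays"]

print("raw        :", len(vocabulary(tweets)))
print("stopwords  :", len(vocabulary(tweets, stopwords=frozenset({"the", "a"}))))
print("normalized :", len(vocabulary(tweets, normalize=lambda t: LEMMAS.get(t, t))))
print("lowercased :", len(vocabulary(tweets, lowercase=True)))
```

On this toy data every preprocessing step shrinks the vocabulary, including lowercasing, because "The" and "the" otherwise occupy separate columns.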
What are the possible features of a text corpus?
1. Count of word in a document
2. Boolean feature: presence of word in a document
3. Vector notation of word
4. Part of Speech Tag
5. Basic Dependency Grammar
6. Entire document as a feature
A) 1
B) 1 and 2
C) 1, 2 and 3
D) 1, 2, 3 and 4
E) 1, 2, 3, 4 and 5
F) 1, 2, 3, 4, 5 and 6
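The first two feature types in the list are simple enough to sketch directly. The example assumes lowercased, whitespace-split tokens; `count_features` and `boolean_features` are hypothetical helper names, and the sample document is invented.

```python
from collections import Counter


def count_features(doc: str) -> Counter:
    """Count feature: how many times each word occurs in the document."""
    return Counter(doc.lower().split())


def boolean_features(doc: str, vocab: list[str]) -> dict[str, bool]:
    """Boolean feature: whether each vocabulary word is present in the document."""
    tokens = set(doc.lower().split())
    return {word: word in tokens for word in vocab}


doc = "Data science is data driven"
print(count_features(doc))
print(boolean_features(doc, ["data", "text"]))
```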
Retrieval-based models and generative models are the two popular techniques used for building chatbots. Which of the following is an example of a retrieval model and a generative model, respectively?
A) Dictionary-based learning and Word2Vec model
B) Rule-based learning and Sequence to Sequence model
C) Word2Vec and Sentence to Vector model
D) Recurrent neural network and convolutional neural network