Python for Data Science NPTEL | Week 4
Session: JAN-APR 2024/ JULY-DEC 2023
Course name: Python For Data Science
Course Link: Click Here
These are NPTEL Python for Data Science Assignment 4 Answers
Q1. Which of the following are regression problems? Assume that appropriate data is given. Predicting the house price. Predicting whether it will rain or not on a given day. Predicting the maximum temperature on a given day. Predicting the sales of the ice-creams.
Answer: a, c, d
Q2. Which of the followings are binary classification problems? Predicting whether a patient is diagnosed with cancer or not. Predicting whether a team will win a tournament or not. Predicting the price of a second-hand car. Classify web text into one of the following categories: Sports, Entertainment, or Technology.
Answer: a, b
Q3. If a linear regression model achieves zero training error, can we say that all the data points lie on a hyperplane in the (d+1)-dimensional space? Here, d is the number of features. Yes No
Answer: Yes
Q4. Which of the following machine learning techniques would NOT be appropriate to solve the problem given in the problem statement? kNN Random Forest Logistic Regression Linear regression
Answer: Linear regression
Q5. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix? True Positive = 29, True Negative = 94 True Positive = 94, True Negative = 29 False Positive = 5, True Negative = 94 None of the above
Answer: a, c
Q6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data? 60 – 79 90 – 95 30 – 59 80 – 89
Answer: 90 – 95
Q7. How are categorical variables preprocessed before model building? Standardization Dummy variables Correlation None of the above
Answer: Dummy variables
Q8. A multiple linear regression model is built on the Global Happiness Index dataset ‘GHI_Report.csv’. What is the RMSE of the baseline model? 2.00 0.50 1.06 0.75
Answer: 1.06
Q9. A regression model with the following function y=60+5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. What is the predicted difference in rainfall? 156 mm 15.6 mm -156 mm None of the above
Answer: 156 mm
Q10. X nd Y are two variables that have a strong linear relationship. Which of the following statements are incorrect? There cannot be a negative relationship between the two variables. The relationship between the two variables is purely causal. One variable may or may not cause a change in the other variable. The variables can be positively or negatively correlated with each other.
More Weeks of Python for Data Science: Click here
More Nptel Courses: Click here
Session: JAN-APR 2023
Course Name: Python for Data Science
Q1. Which of the following are regression problems? Assume that appropriate data is given. a. Predicting the house price. b. Predicting whether it will rain or not on a given day. c. Predicting the maximum temperature on a given day. d. Predicting the sales of the ice-creams.
Q2. Which of the followings are binary classification problems? a. Predicting whether a patient is diagnosed with cancer or not. b. Predicting whether a team will win a tournament or not. c. Predicting the price of a second-hand car. d. Classify web text into one of the following categories: Sports, Entertainment, or Technology.
Q3. If a linear regression model achieves zero training error, can we say that all the data points lie on a hyperplane in the (d+1)-dimensional space? Here, d is the number of features. a. Yes b. No
Answer: a. Yes
Read the information given below and answer the questions from 4 to 6: Data Description: An automotive service chain is launching its new grand service station this weekend.They offer to service a wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day. As an inaugural offer, they claim to freely check all cars that arrive on their launch day, and report whether they need servicing or not! Unexpectedly, they get 450 cars. The servicemen will not work longer than the working hours, but the data analysts have to!
Can you save the day for the new service station? How can a data scientist save the day for them? He has been given a data set, ‘ServiceTrain.csv’ that contains some attributes of the car that can be easily measured and a conclusion that if a service is needed or not. Now for the cars they cannot check in detail, they measure those attributes and store them in ‘ ServiceTest.csv ’ Problem Statement: Use machine learning techniques to identify whether the cars require service or not Read the given datasets ‘ ServiceTrain.csv ’ and ‘ ServiceTest.csv ’ as train data and test data respectively and import all the required packages for analysis.
Q4. Which of the following machine learning techniques would NOT be appropriate to solve the problem given in the problem statement? a. kNN b. Random Forest c. Logistic Regression d. Linear regression
Answer: d. Linear regression
Prepare the data by following the steps given below, and answer questions 6 and 7.
- Encode categorical variable, Service – Yes as 1 and No as 0 for both the train and test datasets.
- Split the set of independent features and the dependent feature on both the train and test datasets.
- Set random_state for the instance of the logistic regression class as 0.
Q5. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix? a. True Positive = 29, True Negative = 94 b. True Positive = 94, True Negative = 29 c. False Positive = 5, True Negative = 94 d. None of the above
Q6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data? a. 60 – 79 b. 90 – 95 c. 30 – 59 d. 80 – 89
Answer: b. 90 – 95
Q7. How are categorical variables preprocessed before model building? a. Standardization b. Dummy variables c. Correlation d. None of the above
Answer: b. Dummy variables
The Global Happiness Index report contains the Happiness Score data with multiple features (namely the Economy, Family, Health, and Freedom) that could affect the target variable value. Prepare the data by following the steps given below, and answer question 8
- Split the set of independent features and the dependent feature on the given dataset
- Create training and testing data from the set of independent features and dependent feature by splitting the original data in the ratio 3:1 respectively, and set the value for random_state of the training/test split method’s instance as 1
Q8. A multiple linear regression model is built on the Global Happiness Index dataset “GHI Report.csv”. What is the RMSE of the baseline model? a. 2.00 b. 0.50 c. 1.06 d. 0.75
Answer: c. 1.06
Q9. A regression model with the following function y = 60 + 5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. What is the predicted difference in rainfall? a. 156 mm b. 15.6 mm c. -156 mm d. None of the above
Answer: a. 156 mm
Q10. X and Y are two variables that have a strong linear relationship. Which of the following statements are incorrect? a. There cannot be a negative relationship between the two variables. b. The relationship between the two variables is purely causal. c. One variable may or may not cause a change in the other variable. d. The variables can be positively or negatively correlated with each other.
More Weeks of Python for Data Science NPTEL: Click here
More NPTEL courses: https://progiez.com/nptel
Session: JULY-DEC 2022
Course name: Python for Data Science
Link to Enroll: Click Here
Q1. The power consumption of an individual house in a residential complex has been recorded for the previous year. This data is analysed to predict the power consumption for the next year. Under which type of machine learning problem does this fall under? a. Classification b. Regression c. Reinforcement Learning d. None of the above
Answer: b. Regression
Q2. A dataset contains data collected by the Tamil Nadu Pollution Control Board on environmental conditions (154 variables) from one of their monitoring stations. This data is further analyzed to understand the most significant factors that affect the Air Quality Index. The predictive algorithm that can be used in this situation is __________. a. Logistic Regression b. Simple Linear Regression c. Multiple Linear Regression d. None of the above
Answer: c. Multiple Linear Regression
Q3. A regression model with the following function y = 60 + 5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. What is the predicted difference in rainfall? a. 156 mm b. 15.6 mm c. -156 mm d. None of the above
5. The plot shown below denotes the percentage distribution of the target column values within the train_data dataframe. Which of the following options are correct?
a. Yes > 20, No > 60 b. No > 70, Yes > 20 c. Yes > 30, No > 70 d. Yes > 70, No > 30
Answer: b. No > 70, Yes > 20
Q6. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix? a. True Positive = 29, True Negative = 94 b. True Positive = 94, True Negative = 29 c. False Positive = 5, True Negative = 94 d. None of the above
Answer: b. True Positive = 94, True Negative = 29
Q7. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data? a. 60 – 79 b. 90 – 95 c. 30 – 59 d. 80 – 89
Answer: b. 90 – 95
Q8. How are categorical variables preprocessed before model building? a. Standardization b. Dummy variables c. Correlation d. None of the above
Q9. A multiple linear regression model is built on the Global Happiness Index dataset “GHI_Report.csv”. What is the RMSE of the baseline model? a. 2.00 b. 0.50 c. 1.06 d. 0.75
10. X and Y are two variables that have a strong linear relationship. Which of the following statements are incorrect? a. There cannot be a negative relationship between the two variables. b. The relationship between the two variables is purely causal. c. One variable may or may not cause a change in the other variable. d. The variables can be positively or negatively correlated with each other.
Python for Data Science NPTEL All weeks: https://progies.in/answers/nptel/python-for-data-science
More NPTEL course answers: https://progies.in/answers/nptel
NPTEL Python for Data Science Assignment 4 Answers 2023
NPTEL Python for Data Science Assignment 4 Answers 2023:- All the Answers provided below to help the students as a reference, You must submit your assignment at your own knowledge.
NPTEL Python For Data Science Week 4 Assignment Answer 2023
1. Which of the following are regression problems? Assume that appropriate data is given.
- Predicting the house price.
- Predicting w h ether it will rain or not on a given day.
- Predicting the maximum temperature on a g iven day.
- Predicting the sales of the ice-creams.
2. Which of the followings are binary classification problems?
- Predicting whether a patient is diagnosed with cancer or not.
- Predicting whether a team will win a tournament or not.
- Predicting the price of a second-hand car.
- Classify web text into one of the follow in g categories: Sports, Entertainment, or Technology.
3. If a linear regression model achieves zero training error, can we say that all the data points lie on a hyperplane in the (d+1)-dimensional space? Here, d is the nu m ber of features.
Read the information given below and answer the questions from 4 to 6: Data Description: An automotive service chain is launching its new grand service station this weekend. They offer to service a wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day. As an inaugural offer, they claim to freely check all cars that arrive on their launch day, and report whether they need servicing or not!
Unexpectedly, they get 450 cars. The servicemen will not work longer than the working hours, but the data analysts have to!
Can you save the day for the new service station?
How can a data scientist save the day for them?
He has been given a data set, ‘ ServiceTrain.csv ’ that contains some attributes of the car that can be easily measured and a conclusion that if a service is needed or not.
Now for the cars they cannot check in detail, they measure those attributes and store them in ‘ ServiceTest.csv ’
Problem Statement:
Use machine learning techniques to identify whether the cars require service or not
Read the given datasets ‘ ServiceTrain.csv ’ and ‘ ServiceTest.csv ’ as train data and test data respectively and import all the required packages for analysis.
4. Which of the following machine learning techniques would NOT be ap p ropriate to solve the problem given in the problem statement?
- Random Forest
- Logisti c Regression
- Linear regression
5. After applying logistic regression, what is/are the correct observat ion s from the resultant confusion matrix?
- True Positive = 29, True Negative = 94
- True Positive = 94, Tr u e Negative = 29
- False Positive = 5, True Negative = 94
- None of the above
Prepare the data by following th e steps given below, and answer questions 6 and 7.
- Encode categorical variable, Service – Yes as 1 and No as 0 for both the train and test datasets.
- Split the set of independent features and the dependent feature on both the train and test datasets.
- Set random_state for the instance of the logistic regression class as 0.
6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test dat a ?
- 60 – 79
- 90 – 9 5
7. How are categorical variables preprocessed before m odel building?
- Standardization
- Dummy var i ables
- Correlation
The Global Happiness Index report contains the Happiness Score data w i th multiple features (namely the Economy, Family, Health, and Freedom) that could affect the target variable value.
Prepare the data by following the steps g iven below, and answer question 8
- Split the set of independent features and the dependent feature on the given dataset
- Create training and testing data from the set of independent features and dependent feature by splitting the original data in the ratio 3:1 respectively, and set the value for random_state of the training/test split method’s instance as 1
8. A multiple linear regression model is built on the Global Happiness Index dat a set ‘GHI_Report.csv’. What is the RMSE of the baseline model?
9. A regression model with the following function y=60+5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. Wh a t is the predicted difference in rainfall?
10. X and Y are two variables that have a strong linear relationship. Whi c h of the following statements are incorrect?
- There cannot be a negative relationship between the two variables.
- The relationship between the two variables is purely causal.
- One variable may or may not cause a change in the other variable.
- The variables can be positively or negativel y correlated with each other
About Python For Data Science
The course aims at equipping participants to be able to use python programming for solving data science problems. CRITERIA TO GET A CERTIFICATE Average assignment score = 25% of the average of the best 3 assignments out of the total 4 assignments given in the course. Exam score = 75% of the proctored certification exam score out of 100 Final score = Average assignment score + Exam score YOU WILL BE ELIGIBLE FOR A CERTIFICATE ONLY IF AVERAGE ASSIGNMENT SCORE >=10/25 AND EXAM SCORE >= 30/75. If one of the 2 criteria is not met, you will not get the certificate even if the Final score >= 40/100.
NPTEL Python for Data Science Assignment 4 Answers July 2022
1. The power consumption of an individual house in a residential complex has been recorded for the previous year. This data is analysed to predict the power consumption for the next year . Under which type of machine learning problem does this fall under? a. Classification b. Regression c. Reinforcement Learning d. None of the above
2. A dataset contains data collected by the Tamil Nadu Pollution Control Board on environmental conditions (154 variables) from one of their monitoring stations. This data is further analyzed to understand the most significant factors that affect the Air Quality Index. The predictive algorithm that can be used in this situation is _______ _ ___. a. Logistic Regression b. Simple Linear Regression c. Multiple Linear Regression d . None of the above
Answers will be Uploaded Shortly and it will be Notified on Telegram, So JOIN NOW
3. A regression model with the following function y = 60 + 5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week . What is the predicted difference in rainfall? a. 156 mm b. 15.6 mm c. -156 mm d. None of the above
4. Which of the following machine learning techniques would NOT be appropriate to solve the problem given in the problem statement? a. kNN b. Random Forest c . Logistic Regression d. Linear regression
5. The plot shown below denotes the percentage distribution of the target column values within the train_data dataframe. Which of the following options are correct?
6. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix? a. True Positive = 29, True Negative = 94 b. True Positive = 94, True Negative = 29 c. False Positive = 5 , True Negative = 94 d. None of the above
👇 For Week 04 Assignment Answers 👇
7. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data? a. 60 – 79 b. 90 – 95 c. 30 – 59 d. 80 – 89
8. How are categorical variables preprocessed before model building? a. Standardization b. Dummy variables c. Correlation d. None of the above
9. A multiple linear regression model is built on the Global Happiness Index dataset “GHI_Report.csv”. What is the RMSE of the baseline model? a. 2.00 b. 0.50 c. 1.06 d. 0.75
10. X and Y are two variables that have a strong linear relationship. Which of the following statements are incorrect? a. There cannot be a negative relationship between the two variables. b. The relationship between the two variables is purely causal. c. One variable may or may not cause a change in the other variable . d. The variables can be positively or negatively correlated with each other.
For More NPTEL Answers:- CLICK HERE Join Our Telegram:- CLICK HERE
NPTEL Python for Data Science Assignment 4 Answers Jan 2022
Q1. How many unique values are present in the Sbal feature; also, what is the most frequent value within Sbal?
(A) 5, Rs. >= 10,000 (B) 4, Rs. < 1000 (C) 5, Rs. < 1000 (D) 4, ‘1000 <= Rs. < 5,000’
Answer:- (C) 5, Rs. < 1000
The answers will be Uploaded Shortly and it will be notified on Telegram. So Join Now
Q2. Find the average age of those customers who have a credit history [Chist] wherein the dues are not paid earlier.
(A) 35.54 (B) 38.44 (C) 33.00 (D) None of the above
Answer:- (B) 38.44
Q3. A Logistic Regression model is built in which none of the features used are standardized. The train to test proportion is 75:25 and the random state is set to 1. The accuracy of the model is ________.
(A) Less than 50% (B) Between 50% and 60% (C) Greater than 70% (D) None of the above
Answer:- (C) Greater than 70%
Q4. Import StandardScaler() from the sklearn.preprocessing package to standardize the features. Use the same train-test proportion and the random state should be set to 1. After standardizing the logistic regression model, by what percentage has the misclassified samples changed?
(A) 11.11% (B) 3.7% (C) 20% (D) 39.2%
Answer:- (C) 20%
Q5. When KNN classification is applied on the same standardized data at the optimal value for k nearest neighbours, the accuracy achieved is ______.
(A) 64% (B) 78% (C) 76.4% (D) None of the above
Answer:- (A) 64%
Q6. A multiple linear regression model is built on the Global Happiness Index dataset “ GHI_Report.csv ”. What is the rmse of the baseline model?
(A) 1.99 (B) 0.85 (C) 1.06 (D) 0.33
Answer:- (C) 1.06
Q7. From the multiple linear regression model built on the GHI index, we get an R-squared value of _______ on the test data subset.
(A) 55.63 (B) 45.81 (C) 75.59 (D) 81.46
Answer:- (D) 81.46
Q8. Which of the following statement/s about Linear Regression is / are true?
(A) Linear Regression assumes that there exists a linear relationship between the independent variable and dependent variable. (B) The error terms are assumed to be independent and normally distributed. (C) The percentage of variation in the dependent variable as explained by the independent variable/variables is expressed by R-squared value. (D) Residuals are the product of the predicted value and the actual observed value.
Answer:- (A), (B), (C)
Q9. Which of the following statements is inaccurate about Logistic Regression?
(A) Logistic Regression doesn’t require a linear relationship between the dependent and independent variables. (B) The value of the logistic function being a probability will range between 0 and 1. (C) Cost function of Logistic Regression is also called as the Log Loss function. (D) The dependent variable can be of both numerical or categorical type just like the independent variables.
Answer:- (C) Cost function of Logistic Regression is also called as the Log Loss function.
Q10. In a KNN model, by which means do we handle categorical variables?
(A) Standardization (B) Dummy variables (C) Correlation (D) None of the above
Answer:- (B) Dummy variables
Disclaimer :- We do not claim 100% surety of solutions, these solutions are based on our sole expertise, and by using posting these answers we are simply looking to help students as a reference, so we urge do your assignment on your own.
For More NPTEL Answers:- CLICK HERE
Join Our Telegram:- CLICK HERE
NPTEL Python for Data Science Assignment 4 Answers 2022:- All the Answers provided below to help the students as a reference, You must submit your assignment at your own knowledge.
Leave a Comment Cancel reply
You must be logged in to post a comment.
DBC Itanagar
All India News
{Week 4} NPTEL Python For Data Science Assignment Answer 2023
NPTEL Python For Data Science Assignment Answer
NPTEL Python For Data Science Week 4 Assignment Answer 2023
1. Which of the following are regression problems? Assume that appropriate data is given.
- Predicting the house price.
- Predicting w h ether it will rain or not on a given day.
- Predicting the maximum temperature on a g iven day.
- Predicting the sales of the ice-creams.
2. Which of the followings are binary classification problems?
- Predicting whether a patient is diagnosed with cancer or not.
- Predicting whether a team will win a tournament or not.
- Predicting the price of a second-hand car.
- Classify web text into one of the follow in g categories: Sports, Entertainment, or Technology.
3. If a linear regression model achieves zero training error, can we say that all the data points lie on a hyperplane in the (d+1)-dimensional space? Here, d is the nu m ber of features.
Read the information given below and answer the questions from 4 to 6: Data Description: An automotive service chain is launching its new grand service station this weekend. They offer to service a wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day. As an inaugural offer, they claim to freely check all cars that arrive on their launch day, and report whether they need servicing or not!
Unexpectedly, they get 450 cars. The servicemen will not work longer than the working hours, but the data analysts have to!
Can you save the day for the new service station?
How can a data scientist save the day for them?
He has been given a data set, ‘ ServiceTrain.csv ’ that contains some attributes of the car that can be easily measured and a conclusion that if a service is needed or not.
Now for the cars they cannot check in detail, they measure those attributes and store them in ‘ ServiceTest.csv ’
Problem Statement:
Use machine learning techniques to identify whether the cars require service or not
Read the given datasets ‘ ServiceTrain.csv ’ and ‘ ServiceTest.csv ’ as train data and test data respectively and import all the required packages for analysis.
4. Which of the following machine learning techniques would NOT be ap p ropriate to solve the problem given in the problem statement?
- Random Forest
- Logisti c Regression
- Linear regression
5. After applying logistic regression, what is/are the correct observat ion s from the resultant confusion matrix?
- True Positive = 29, True Negative = 94
- True Positive = 94, Tr u e Negative = 29
- False Positive = 5, True Negative = 94
- None of the above
Prepare the data by following th e steps given below, and answer questions 6 and 7.
- Encode categorical variable, Service – Yes as 1 and No as 0 for both the train and test datasets.
- Split the set of independent features and the dependent feature on both the train and test datasets.
- Set random_state for the instance of the logistic regression class as 0.
6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test dat a ?
- 60 – 79
- 90 – 9 5
7. How are categorical variables preprocessed before m odel building?
- Standardization
- Dummy var i ables
- Correlation
The Global Happiness Index report contains the Happiness Score data w i th multiple features (namely the Economy, Family, Health, and Freedom) that could affect the target variable value.
Prepare the data by following the steps g iven below, and answer question 8
- Split the set of independent features and the dependent feature on the given dataset
- Create training and testing data from the set of independent features and dependent feature by splitting the original data in the ratio 3:1 respectively, and set the value for random_state of the training/test split method’s instance as 1
8. A multiple linear regression model is built on the Global Happiness Index dat a set ‘GHI_Report.csv’. What is the RMSE of the baseline model?
9. A regression model with the following function y=60+5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. Wh a t is the predicted difference in rainfall?
10. X and Y are two variables that have a strong linear relationship. Whi c h of the following statements are incorrect?
- There cannot be a negative relationship between the two variables.
- The relationship between the two variables is purely causal.
- One variable may or may not cause a change in the other variable.
- The variables can be positively or negativel y correlated with each other
NPTEL Python For Data Science Week 3 Assignment Answer 2023
1. Which of the following is the correct approach to fill missing values in case of categorical variable?
2. Of the following set of statements, which of them can be used to extract the column Type as a separate dataframe?
- df_cars[[‘Type’]]
- df_cars.iloc[[:, 1]
- df_cars.loc[:, [‘Type’]]
3. The method df_cars.describe() will give description of which of the following column?
- Price (in lakhs)
- All of the above
4. Which pandas function is used to stack the dataframes vertically?
- pd.concat()
5. Which of the following are libraries in Python?
6. Which of the following variable have null values?
- Review Date
7. Which of the following countries have maximum locations of cocoa manufacturing companies?
8. After checking the data summary, which feature requires a data conversion considering the data values held?
- Review date
- Bean origin
9. What is the maximum rating of chocolates?
- [bool, int, float, float, str]
- [str, int, float, float, str]
- [bool, int, float, int, str]
- [bool, int, int, float, str]
NPTEL Python For Data Science Week 2 Assignment Answer 2023
1. Which of the following object does not support ind e xing?
- dict i onary
2. Given a NumPy array, arr = np.array([[[1, 2, 3], [4, 5, 6], [7, 8, 9]]]), what is the output of the command, print(arr[0][1])?
- [[1 2 3] [4 5 6] [7 8 9]
3. What is the output of the following code?
- [2, 3, 4, 5]
- [1, 2, 3, 4]
- Will throw an error: Set objects are no t iterable.
5. Which of the following code gives output My friend’s house is in Chennai?
6. Let t1=(1,2, “tuple”,4) and t2=(5,6,7). Which of the follo w ing will not give any error after the execution?
- t1.append(5)
- x=t2[t1[1]]
- t3=(t1 , t2)
- t3=(list(t1), list(t2))
7. Let d={1:“Pyhton”,2:[1,2,3]}. Which among the fol l owing will not give the error after the execution?
- d[2].append(4)
- d.update({‘one’ : 22})
8. Wh i ch of the following data type is immutable?
9. student = {‘name’: ‘Jane’, ‘age’: 25 , ‘courses’: [‘Math’, ‘Statistics’]} Which among the following will return {‘name’: ‘Jane’, ‘age’: 26, ‘courses’: [‘Math’ , ‘Statistics’], ‘phone’: ‘123-456’}?
- student.update({‘age’ : 26})
- student.update({‘age’ : 26, ‘phone’: ‘123-456’})
- student[‘phone’] = ‘123-456’
[‘M’, ‘A’, ‘H’, ‘E’, ‘S’, ‘H’] [‘m’, ‘a’, ‘h’, ‘e’, ‘s’ , ‘h’] [‘M’, ‘a’, ‘h’, ‘e’, ‘s’, ‘h’] [‘m’, ‘A’, ‘H’, ‘E’, ‘S’, ‘H’]
NPTEL Python For Data Science Week 1 Assignment Answer 2023
- Error: Invalid operation, unsupported operator ‘*’ used between ‘int’ and ‘str’
- Code will throw an error.
4. Which of the following variable names are INVALID in Python?
- variable_ 1
5. While naming the variable, use of any special character other than unders c ore(_) ill throw which type of error?
- Syntax error
- Value er r or
- Index error
6. Let x = “Mayur”. Which of the following commands converts the ‘x’ to float datatype?
- str(float,x)
- x.flo a t()
- Cannot convert a string to float data type
7. Which Python library is commonly used for data wrangling and manipulation?
9. Given two variables, j = 6 and g = 3.3. If both normal division and floor division operators were used to divide j by g, what would be the data type of the value obtained from the operations?
- float, float
Latest News
Nptel soft skill development week 1 assignment answers 2024.
CBSE Schools : सीबीएसई बोर्ड की बड़ी कार्रवाई ! बिहार – झारखंड के 36 स्कूलों की मान्यता रद्द, देखें पूरी लिस्ट.
Bank Holidays 2024 : जनवरी में 16 दिन बंद रहेंगे बैंक, ब्रांच जाने से पहले चेक कर लें छुट्टियों की लिस्ट.
32 के तगड़े माइलेज से Creta की वाट लगा रही Maruti की शानदार कार, बेहतरीन फीचर्स के साथ इतनी है कीमत
Indian Oil Corporation Limited 1603 Recruitment इंडियन ऑयल कॉरपोरेशन लिमिटेड अप्रेंटिसशिप 1603 पदों पर भर्ती
Sign in to your account
Username or Email Address
Remember Me
[Week 1-4] NPTEL Python For Data Science Assignment Answers 2023
NPTEL Python For Data Science Assignment Answers
Table of Contents
NPTEL Python For Data Science Week 4 Assignment Answer 2023
1. Which of the following are regression problems? Assume that appropriate data is given.
- Predicting the house price.
- Predicting w h ether it will rain or not on a given day.
- Predicting the maximum temperature on a g iven day.
- Predicting the sales of the ice-creams.
2. Which of the followings are binary classification problems?
- Predicting whether a patient is diagnosed with cancer or not.
- Predicting whether a team will win a tournament or not.
- Predicting the price of a second-hand car.
- Classify web text into one of the follow in g categories: Sports, Entertainment, or Technology.
3. If a linear regression model achieves zero training error, can we say that all the data points lie on a hyperplane in the (d+1)-dimensional space? Here, d is the nu m ber of features.
Read the information given below and answer the questions from 4 to 6: Data Description: An automotive service chain is launching its new grand service station this weekend. They offer to service a wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day. As an inaugural offer, they claim to freely check all cars that arrive on their launch day, and report whether they need servicing or not!
Unexpectedly, they get 450 cars. The servicemen will not work longer than the working hours, but the data analysts have to!
Can you save the day for the new service station?
How can a data scientist save the day for them?
He has been given a data set, ‘ ServiceTrain.csv ’ that contains some attributes of the car that can be easily measured and a conclusion that if a service is needed or not.
Now for the cars they cannot check in detail, they measure those attributes and store them in ‘ ServiceTest.csv ’
Problem Statement:
Use machine learning techniques to identify whether the cars require service or not
Read the given datasets ‘ ServiceTrain.csv ’ and ‘ ServiceTest.csv ’ as train data and test data respectively and import all the required packages for analysis.
4. Which of the following machine learning techniques would NOT be ap p ropriate to solve the problem given in the problem statement?
- Random Forest
- Logisti c Regression
- Linear regression
5. After applying logistic regression, what is/are the correct observat ion s from the resultant confusion matrix?
- True Positive = 29, True Negative = 94
- True Positive = 94, Tr u e Negative = 29
- False Positive = 5, True Negative = 94
- None of the above
Prepare the data by following th e steps given below, and answer questions 6 and 7.
- Encode categorical variable, Service – Yes as 1 and No as 0 for both the train and test datasets.
- Split the set of independent features and the dependent feature on both the train and test datasets.
- Set random_state for the instance of the logistic regression class as 0.
6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test dat a ?
- 60 – 79
- 90 – 9 5
7. How are categorical variables preprocessed before m odel building?
- Standardization
- Dummy var i ables
- Correlation
The Global Happiness Index report contains the Happiness Score data w i th multiple features (namely the Economy, Family, Health, and Freedom) that could affect the target variable value.
Prepare the data by following the steps g iven below, and answer question 8
- Split the set of independent features and the dependent feature on the given dataset
- Create training and testing data from the set of independent features and dependent feature by splitting the original data in the ratio 3:1 respectively, and set the value for random_state of the training/test split method’s instance as 1
8. A multiple linear regression model is built on the Global Happiness Index dat a set ‘GHI_Report.csv’. What is the RMSE of the baseline model?
9. A regression model with the following function y=60+5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. Wh a t is the predicted difference in rainfall?
10. X and Y are two variables that have a strong linear relationship. Whi c h of the following statements are incorrect?
- There cannot be a negative relationship between the two variables.
- The relationship between the two variables is purely causal.
- One variable may or may not cause a change in the other variable.
- The variables can be positively or negativel y correlated with each other
NPTEL Python For Data Science Week 3 Assignment Answer 2023
1. Which of the following is the correct approach to fill missing values in case of categorical variable?
2. Of the following set of statements, which of them can be used to extract the column Type as a separate dataframe?
- df_cars[[‘Type’]]
- df_cars.iloc[[:, 1]
- df_cars.loc[:, [‘Type’]]
3. The method df_cars.describe() will give description of which of the following column?
- Price (in lakhs)
- All of the above
4. Which pandas function is used to stack the dataframes vertically?
- pd.concat()
5. Which of the following are libraries in Python?
6. Which of the following variable have null values?
- Review Date
7. Which of the following countries have maximum locations of cocoa manufacturing companies?
8. After checking the data summary, which feature requires a data conversion considering the data values held?
- Review date
- Bean origin
9. What is the maximum rating of chocolates?
- [bool, int, float, float, str]
- [str, int, float, float, str]
- [bool, int, float, int, str]
- [bool, int, int, float, str]
NPTEL Python For Data Science Week 2 Assignment Answer 2023
1. Which of the following object does not support ind e xing?
- dict i onary
2. Given a NumPy array, arr = np.array([[[1, 2, 3], [4, 5, 6], [7, 8, 9]]]), what is the output of the command, print(arr[0][1])?
- [[1 2 3] [4 5 6] [7 8 9]
3. What is the output of the following code?
- [2, 3, 4, 5]
- [1, 2, 3, 4]
- Will throw an error: Set objects are no t iterable.
5. Which of the following code gives output My friend’s house is in Chennai?
6. Let t1=(1,2, “tuple”,4) and t2=(5,6,7). Which of the follo w ing will not give any error after the execution?
- t1.append(5)
- x=t2[t1[1]]
- t3=(t1 , t2)
- t3=(list(t1), list(t2))
7. Let d={1:“Pyhton”,2:[1,2,3]}. Which among the fol l owing will not give the error after the execution?
- d[2].append(4)
- d.update({‘one’ : 22})
8. Wh i ch of the following data type is immutable?
9. student = {‘name’: ‘Jane’, ‘age’: 25 , ‘courses’: [‘Math’, ‘Statistics’]} Which among the following will return {‘name’: ‘Jane’, ‘age’: 26, ‘courses’: [‘Math’ , ‘Statistics’], ‘phone’: ‘123-456’}?
- student.update({‘age’ : 26})
- student.update({‘age’ : 26, ‘phone’: ‘123-456’})
- student[‘phone’] = ‘123-456’
[‘M’, ‘A’, ‘H’, ‘E’, ‘S’, ‘H’] [‘m’, ‘a’, ‘h’, ‘e’, ‘s’ , ‘h’] [‘M’, ‘a’, ‘h’, ‘e’, ‘s’, ‘h’] [‘m’, ‘A’, ‘H’, ‘E’, ‘S’, ‘H’]
NPTEL Python For Data Science Week 1 Assignment Answer 2023
- Error: Invalid operation, unsupported operator ‘*’ used between ‘int’ and ‘str’
- Code will throw an error.
4. Which of the following variable names are INVALID in Python?
- variable_ 1
5. While naming the variable, use of any special character other than unders c ore(_) ill throw which type of error?
- Syntax error
- Value er r or
- Index error
6. Let x = “Mayur”. Which of the following commands converts the ‘x’ to float datatype?
- str(float,x)
- x.flo a t()
- Cannot convert a string to float data type
7. Which Python library is commonly used for data wrangling and manipulation?
9. Given two variables, j = 6 and g = 3.3. If both normal division and floor division operators were used to divide j by g, what would be the data type of the value obtained from the operations?
- float, float
Share your love
Related posts, [week 1] nptel introduction to database systems assignment answers 2024.
[Week 1-12] NPTEL Introduction To Machine Learning Assignment Answer 2023
[Week 1, 2] NPTEL Business And Sustainable Development Assignment Answers 2023
[week 1] nptel compiler design assignment answers 2024.
[Week 1-11] NPTEL Soft Skills Assignment Answers 2023
[week 1] nptel introduction to industry 4.0 and industrial internet of things assignment answers 2024, leave a comment cancel reply.
Your email address will not be published. Required fields are marked *
Save my name, email, and website in this browser for the next time I comment.
Course Name: Python for Data Science
- About Course
- Certificate Type
- Toppers list
- Registration
Course abstract
The course aims at equipping participants to be able to use python programming for solving data science problems
Course Instructor
Prof. Ragunathan Rengasamy
Teaching assistant(s), course duration : jul-aug 2021, view course, syllabus, enrollment : 20-may-2021 to 02-aug-2021, exam registration : 17-jun-2021 to 20-aug-2021, exam date : 26-sep-2021, course statistics will be published shortly, certificate eligible, certified category count, successfully completed, participation.
Category : Successfully Completed
Category : Elite
Category : Silver
Category : Gold
Final score calculation logic.
- Assignment Score = Average of best 3 out of 4 assignments. Exam Score = 50% of Certification Exam Score out of 100
- Final Score(Score on Certificate)= 50% of Exam Score +25%Unproctored programming exam score+ 25% of Assignment Score Note: We have taken best assignment score and unproctored Score from July 2020, Jan 2021 and July 2021 course
SAPTARSHI SARKAR 95%
INDIAN INSTITUTE OF TECHNOLOGY,KHARAGPUR
RISHABH LALLA 94%
Indian Institute of Technology, Hyderabad
MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE
ANURADHA SANAPALA 92%
DAISY WADHWA 91%
Guru Nanak Khalsa College for Women, Model Town, Ludhiana
RANJAN KUMAR 91%
Aryabhatta College
ANIRBAN KOPTY 91%
IIT Kharagpur
SREEVENU CHINTAPALLI 91%
SAISUBRAMANIAN B 90%
INDIAN INSTITUTE OF TECHNOLOGY,MADRAS
NIRANJANA M 90%
ALPHO ANN JOSE 90%
M S BHARGAVI 90%
BANGALORE INSTITUTE OF TECHNOLOGY
LAKSHIT PANDE 90%
Kirori Mal College
TEJAS LIPARE 90%
PUNE INSTITUTE OF COMPUTER TECHNOLOGY
PRIYANKA PATEL 90%
Powergrid Corporation of India Limited
ADITYA NANDAKUMAR 90%
Ramco Systems
VENKAT SAI KONDURU 90%
University College of Engineering, Kakinada
MOHIT KUMAR 90%
AMAN KUMAR 88%
BHAGALPUR COLLEGE OF ENGINEERING, BHAGALPUR
HITEN PRAKASH KOTHARI 88%
Society for Applied Microwave Electronics Engineering and Research
DHANUSH RAJ 88%
NMAM INSTITUTE OF TECHNOLOGY
KUMAR RAMANAND 88%
KARTHICK SRIVATSAN 88%
ROHAN KAR 88%
ASHISH V PANCHAL 88%
FR. CONCEICAO RODRIGUES COLLEGE OF ENGINEERING
K NARMADHA 87%
MADRAS INSTITUTE OF TECHNOLOGY
M NIKITHA 87%
KESHAV MEMORIAL INSTITUTE OF TECHNOLOGY (KMIT)
DANIEL PRAMOD AARONS 87%
HARI PRIYA ABBURI 87%
INDIAN INSTITUTE OF TECHNOLOGY,JODHPUR
ANURADHA K 87%
SASTRA DEEMED TO BE UNIVERSITY
BARIDULA UMAMAHESHWAR REDDY 87%
VARDHAMAN COLLEGE OF ENGINEERING
TARNNUM GUPTA 87%
YASHAS REDDY K 87%
NEW HORIZON COLLEGE OF ENGINEERING
MEENA A 87%
SRI KRISHNA COLLEGE OF ENGINEERING AND TECHNOLOGY
KANAK PANDIT 86%
THAKUR COLLEGE OF ENGINEERING AND TECHNOLOGY
JYOTIRMOY DEY 86%
Indian Institute of Space Science and Technology
AMIT KUMAR CHAMPATIRAY 86%
PRASENJIT KUMAR MUDI 86%
B.P. PODDAR INSTITUTE OF MANAGEMENT AND TECHNOLOGY
ASHUTOSH THITE 86%
NITHYA KALYANI P 86%
Indian Overseas Bank
SAMRIDDHI DWIVEDI 86%
VIT UNIVERSITY-VELLORE
PRAGATI 86%
Byjus Future School
P PRAMOD 86%
LOYOLA ACADEMY
KOUSHIKA PADMANABAN 86%
SRI SIVASUBRAMANIYA NADAR COLLEGE OF ENGINEERING
DHAIRYA HANS 85%
MORADABAD INSTITUTE OF TECHNOLOGY
C PRIYANKA 85%
Efficient Light Source Technologies Pvt Ltd
RAM PRASATH S T 85%
RISHABH SHUKLA 85%
B N M INSTITUTE OF TECHNOLOGY
MRITHULLA PADMANABHAN 85%
College of Engineering, Guindy, Anna University
S SUHAASINI 85%
ST.JOSEPH COLLEGE OF ENGINEERING, CHENNAI
AMRUTA CHANDAVALE 85%
JAYANTA CHATTERJEE 85%
MOL-IT India Pvt Ltd.
HARSHIT AWASTHI 85%
SAHIL GIRHEPUJE 85%
WINIL C W 85%
TOC H INSTITUTE OF SCIENCE & TECHNOLOGY
ANISETTI SIDHARTHA 85%
SATHYABAMA INSTITUTE OF SCIENCE AND TECHNOLOGY
ABHAY H KASHYAP 85%
R V COLLEGE OF ENGINEERING
PRIYANKA SHARMA 85%
B DHANALAKSHMI 85%
B.S. ABDUR RAHMAN CRESCENT INSTITUTE OF SCIENCE AND TECHNOLOGY
IMRAN SHARIEFF MOHAMMED 85%
NIMAL P 85%
MALLIKARJUNA DODDAMANE 85%
ISHITA ASHOK 85%
Kannur University Campus, Mangattuparamba
LAKSHMI R 85%
CHARANYA MANI ADITYA 85%
TEENA KB 85%
EAST POINT COLLEGE OF ENGINEERING AND TECHNOLOGY
VIJAYABHASKAR V 85%
R.M.K.ENGINEERING COLLEGE
ASHUTOSH DASH 85%
RASHMIN PRAJAPATI 85%
SARDAR VALLABHBHAI PATEL INSTITUTE OF TECHNOLOGY
SABRIGIRIRAJ M 84%
SVS COLLEGE OF ENGINEERING
SADU MADHUVAMSHI REDDY 84%
KAKATIYA INSTITUTE OF TECHNOLOGY & SCIENCE
TARUSHI KHANNA 84%
AJAY KUMAR GARG ENGINEERING COLLEGE
AZIZ MUJEEBUR RAHMAN 84%
Tata Consultancy Services
KISHEN SANIL 84%
Matrimony.com Ltd
CHIRAKSHA SHETTY 84%
POOJASREE S J 84%
Anna University regional campus, Madurai
B SANTHANAKRISHNAN 84%
VIT UNIVERSITY CHENNAI
ADARSH DOMAKONDA 84%
ENID FERNANDES 84%
DON BOSCO COLLEGE OF ENGINEERING
G SATHISH 84%
VERIZON DATA SERVICES INDIA LIMITED
PRATIK AMAN 84%
BIDITA SAMANTA 84%
KALINGA INSTITUTE OF INDUSTRIAL TECHNOLOGY
MRIGANK MUKHERJEE 84%
Sikkim Manipal Institute of Technology
SUGITHA S 84%
Agricultural College and Research Institute,Tamilnadu Agricultural University
SUYOG CHOUDHARI 84%
Society of Applied Microwave Electronics Engineering and Research, Mumbai
YASH SHRIVASTAVA 84%
Shri Govindram Seksaria Institute of Technology and Science, Indore
RAMAN KUMAR 84%
NAVROOP KAUR 83%
SHOUGAIJAM DIBYALAXMI DEVI 83%
Fidelity Investments
DONIPARTHI VENKATA SAI RAJ KIRAN 83%
University College of Engineering Kakinada(Autonomous), JNTUK
MD HAMID HUSSAIN 83%
GAUTAM BUDDHA UNIVERSITY
SHIVENDRA SHRINET 83%
MOHAMMED TOUHEED PATEL 83%
CHAITANYA BHARATHI INSTITUTE OF TECHNOLOGY,HYDERABAD
AJAY MUKUND S 83%
Sri Sathya Sai Institute of Higher Learning
BHAVYESHKUMAR KARSAN SOLANKI 83%
Softnautics LLP
KANAK AGRAWAL 83%
SHRI GOVINDRAM SEKSARIA INSTITUTE OF TECHNOLOGY AND SCIENCES
AKASH SIVAKUMAR 83%
SUBHAJIT KAR 83%
WEST BENGAL STATE UNIVERSITY
J BHAVANA 83%
Central University Of Karnataka
FATEMA SINGAPURWALA 83%
SIVAKUMAR M 83%
Volvo CE India Pvt Ltd
SIVA SHANKAR S 83%
Agricultural College and Research Institute,Tamil Nadu Agricultural University
DEEPANKAR DE 83%
Tata Medical And Diagnostics
SNEHA CHITTOOR SHANKAR 83%
RUDRA SIKRI 83%
VORA DARSHIL ASHISHBHAI 83%
LEELABEN DASHRATHBHAI RAMDAS PATEL (LDRP) INSTITUTE OF TECHNOLOGY AND RESEARCH
AGITHA ACHYUTH KUMAR 83%
CENTRAL UNIVERSITY OF RAJASTHAN
BOLLOJU PRANAY KUMAR 83%
C NITHYA 83%
SHRAVYA V SHET 83%
DHRUV LIMBANI 83%
S.R.M. INSTITUTE OF SCIENCE AND TECHNOLOGY
BALACHANDER 83%
Alcatel Lucent
ESWER S 83%
KARAN SAINI 83%
SAMPATHKUMAR GURUNATHAN 83%
JKTechnosoft
THOTA RAJESHWAR RAO 83%
SIKKIM MANIPAL INSTITUTE OF TECHNOLOGY
AKANSHA SHARMA 83%
HARSH PATEL 83%
Dhirubhai Ambani Institute of Information and Technology (DAIICT)
Enrollment Statistics
Total enrollment: 44182, registration statistics, total registration : 3509, assignment statistics, score distribution graph - legend, assignment score: distribution of average scores garnered by students per assignment., exam score : distribution of the final exam score of students., final score : distribution of the combined score of assignments and final exam, based on the score logic..
Python for data science was very useful for my studies and also it is easy to understand
Python for Data Science | NPTEL | Week 4 Answers
This set of MCQ(multiple choice questions) focuses on the Python for Data Science NPTEL Week 4 Answers
You should practice these questions to improve fundamentals of Data Science needed for various interviews (like company interview, campus interview, walk-in interview), entrance exams, placements and other competitive exams. All the questions in this particular section are based on only “ Python for Data Science NPTEL Week 4 Answers “.
Course layout
Week 1 : Basics of Python Spyder Week 2: Sequence data types & associated operations Week 3: Data frames Week 4: Case study
NOTE: You can check your answer immediately by clicking show answer button. Moreover, this set of “Python for Data Science NPTEL Week 4 Answers” contains 10 questions.
Now, start attempting the quiz.
Python for Data Science NPTEL Week 4 Answers
Q1. Which of the following are regression problems? Assume that appropriate data is given.
a) Predicting the house price. b) Predicting whether it will rain or not on a given day. c) Predicting the maximum temperature on a given day. d) Predicting the sales of the ice-creams.
Answer: a), c), d)
Q2 . Which of the following are binary classification problems?
a) Predicting whether a patient is diagnosed with cancer or not. b) Predicting whether a team will win a tournament or not. c) Predicting the price of a second-hand car. d) Classify web text into one of the following categories: Sports, Entertainment, or Technology.
Answer: a), b)
Q3. If a linear regression model achieves zero training error, can we say that all the data points lie on a hyperplane in the (d+1)-dimensional space? Here, d is the number of features.
a) Yes b) No
Answer: a) Yes
Q4. Which of the following machine learning techniques would NOT be appropriate to solve the problem given in the problem statement?
a) kNN b) Random Forest c) Logistic Regression d) Linear regression
Answer: d) Linear regression
Q5. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix?
a) True Positive = 29, True Negative = 94 b) True Positive = 94, True Negative = 29 c) False Positive = 5, True Negative = 94 d) None of the above
Answer: a), c)
Q6. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data?
a) 60 -79 b) 90 – 95 c) 30 – 59 d) 80 – 89
Answer: b) 90 – 95
Q7. How are categorical variables preprocessed before model building?
a) Standardization b) Dummy variables c) Correlation d) None of the above
Answer: b) Dummy variables
Q8. A multiple linear regression model is built on the Global Happiness Indes dataset ‘GHI_Report.csv’. What is the RMSE of the baseline model?
a) 2.00 b) 0.50 c) 1.06 d) 0.75
Answer: c) 1.06
Q9. A regression model with the following function y = 60 + 5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. What is the predicted difference in rainfall?
a) 156 mm b) 15.6 mm c) -156 mm d) None of the above
Answer: a) 156 mm
Q10. X and Y are two variables that have a strong linear relationship. Which of the following statements are incorrect?
a) There cannot be a negative relationship between the two variables. b) The relationship between the two variables is purely causal. c) One variable may or may not cause a change in the other variable. d) The variables can be positively or negatively correlated with each other.
Q1. How many unique values are present in the Sbal feature; also, what is the most frequent value within Sbal?
a) 5, Rs. >= 10,000 b) 4, Rs. < 1000 c) 5, Rs. < 1000 d) 4, ‘1000 <= Rs. < 5,000’
Q2. Find the average age of those customers who have a credit history [Chist] wherein the dues are not paid earlier
a) 35.54 b) 38.44 c) 33.00 d) None of the above
Q3. A Logistic Regression model is built in which none of the features used are standardized. The train to test proportion is 75:25 and the random state is set to 1. The accuracy of the model is ________.
a) Less than 50% b) Between 50% and 60% c) Greater than 70% d) None of the above
Q4. Import StandardScaler() from the sklearn.preprocessing package to standardize the features. Use the same train-test proportion and the random state should be set to 1. After standardizing the logistic regression model, by what percentage has the misclassified samples changed?
a) 11.11% b) 3.7% c) 20% d) 39.2%
Q5. When KNN classification is applied on the same standardized data at the optimal value for k nearest neighbours, the accuracy achieved is ______.
a) 64% b) 78% c) 76.4% d) None of the above
Q6. A multiple linear regression model is built on the Global Happiness Index dataset “GHI_Report.csv”. What is the rmse of the baseline model?
a) 1.99 b) 1.06 c) 0.85 d) 0.33
Q7. From the multiple linear regression model built on the GHI index, we get an R-squared value of _______ on the test data subset.
a) 55.63 b) 45.81 c) 75.59 d) 81.46
Q8. Which of the following statement/s about Linear Regression is / are true?
a) Linear Regression assumes that there exists a linear relationship between the independent variable and dependent variable. b) The error terms are assumed to be independent and normally distributed. c) The percentage of variation in the dependent variable as explained by the independent variable/variables is expressed by R-squared value. d) Residuals are the product of the predicted value and the actual observed value.
Answer: a), b), c)
Q9. Which of the following statements is inaccurate about Logistic Regression?
a) Logistic Regression doesn’t require a linear relationship between the dependent and independent variables. b) The value of the logistic function being a probability will range between 0 and 1. c) Cost function of Logistic Regression is also called as the Log Loss function. d) The dependent variable can be of both numerical or categorical type just like the independent variables.
Q10. In a KNN model, by which means do we handle categorical variables?
<< Prev- Python for Data Science Week 3 Assignment Solutions
Programming in Java NPTEL week 1 quiz answers
Nptel – Deep Learning assignment solutions
For discussion about any question, join the below comment section. And get the solution of your query. Also, try to share your thoughts about the topics covered in this particular quiz.
Related Posts
Operating system fundamentals | nptel | week 0 assignment 0 solution, nptel operating system fundamentals week 1 assignment solutions, nptel operating system fundamentals week 10 answers, nptel operating system fundamentals week 2 assignment solutions, nptel operating system fundamentals week 3 assignment solutions, nptel operating system fundamentals week 4 assignment solutions, leave a comment cancel reply.
Your email address will not be published. Required fields are marked *
Save my name, email, and website in this browser for the next time I comment.
- 1st Central Law Reviews: Expert Legal Analysis & Insights
- Amazon Quiz
- Flipkart Quiz
- Play & Win 50,000 Coins
- Privacy Policy
NPTEL Python for Data Science Assignment 4 Answers 2022
- by QuizXp Team
- August 24, 2022 August 24, 2022
NPTEL Python for Data Science Assignment 4 Answers :- Hello students in this article we are going to share NPTEL Python for Data Science assignment week 4 answers. All the Answers provided below to help the students as a reference, You must submit your assignment at your own knowledge.
Below you can find NPTEL Python for Data Science Assignment 4 Answers
NPTEL Python for Data Science Assignment 4 Answers 2022:-
Q1. The power consumption of an individual house in a residential complex has been recorded for the previous year. This data is analysed to predict the power consumption for the next year. Under which type of machine learning problem does this fall under?
a. Classification b. Regression c. Reinforcement Learning d. None of the above
Answer : b. Regression
Q2. A dataset contains data collected by the Tamil Nadu Pollution Control Board on environmental conditions (154 variables) from one of their monitoring stations. This data is further analyzed to understand the most significant factors that affect the Air Quality Index. The predictive algorithm that can be used in this situation is ___________.
Answer : c. Multiple Linear Regression
Q3. A regression model with the following function y = 60 + 5.2x was built to understand the impact of humidity (x) on rainfall (y). The humidity this week is 30 more than the previous week. What is the predicted difference in rainfall?
Answer: a. 156 mm
Q4. Which of the following machine learning techniques would NOT be appropriate to solve the problem given in the problem statement?
Answer: d. Linear regression
Q5. The plot shown below denotes the percentage distribution of the target column values within the train_data dataframe. Which of the following options are correct?
Answer: b. No > 70, Yes > 20
Q6. After applying logistic regression, what is/are the correct observations from the resultant confusion matrix?
Answer: b. True Positive = 94, True Negative = 29
Q7. The logistic regression model built between the input and output variables is checked for its prediction accuracy of the test data. What is the accuracy range (in %) of the predictions made over test data?
Answer: b. 90 – 95
Q8. How are categorical variables preprocessed before model building?
Answer: b. Dummy variables
Q9. A multiple linear regression model is built on the Global Happiness Index dataset “GHI_Report.csv”. What is the RMSE of the baseline model?
Answer: c. 1.06
Q10. X and Y are two variables that have a strong linear relationship. Which of the following statements are incorrect?
Answer: a. There cannot be a negative relationship between the two variables
c. One variable may or may not cause a change in the other variable.
For More NPTEL Answers:- CLICK HERE
Disclaimer: We do not claim 100% surety of answers, these answers are based on our sole knowledge, and by posting these answers we are just trying to help students, so we urge do your assignment on your own.
if you have any suggestions then comment below or contact us at [email protected]
If you found this article Interesting and helpful, don’t forget to share it with your friends to get this information.NPTEL Python for Data Science Assignment 4 Answers 2022
Search code, repositories, users, issues, pull requests...
Provide feedback.
We read every piece of feedback, and take your input very seriously.
Saved searches
Use saved searches to filter your results more quickly.
To see all available qualifiers, see our documentation .
nptel-solutions
Here are 47 public repositories matching this topic..., kishanrajput23 / nptel-the-joy-of-computing-using-python.
Study materials related to this course.
- Updated Oct 27, 2023
kishanrajput23 / NPTEL-Programming-In-java
- Updated Apr 14, 2022
kadeep47 / NPTEL-Getting-Started-With-Competitive-Programming
[Aug - Oct 2023] Solutions for NPTEL Course Getting started with competitive programming weekly assignment.
- Updated Sep 6, 2023
omunite215 / NPTEL-Programming-in-Java-Ultimate-Guide
I am sharing my journey of studying a course on Programming in Java taught by Prof.Debasis Samanta Sir IIT Kharagpur
- Updated Dec 4, 2023
Md-Awaf / NPTEL-Course-Getting-started-with-Competitive-Programming
Solutions for NPTEL Course Getting started with competitive programming weekly assignment.
- Updated Apr 20, 2023
rvutd / NPTEL-Joy-of-Computing-2020
Programming Assignment Solutions
- Updated May 5, 2020
guru-shreyansh / NPTEL-Programming-in-Java
The sole intention behind this repository is to help the beginners in Java with the course contents.
- Updated Aug 1, 2021
gunjanmimo / NPTEL-The-Joy-of-Computing-using-Python
- Updated Jan 26, 2020
AdishiSood / The-Joy-of-Computing-using-Python
- Updated Apr 28, 2021
avinashyadav16 / The-Joy-of-Computing-Using-Pyhton
12 Weeks long NPTEL Elective MOOC Course's codes, assignments and solutions.
- Updated Oct 30, 2023
- Jupyter Notebook
NPTEL-Course / Programming-Data-Structures-And-Algorithms-Using-Python
Nptel Course Solutions : Programming, Data Structures And Algorithms Using Python
- Updated Nov 30, 2020
tdishant / NPTEL-Joy-of-Computing-Using-Python
Python code from week-3 to week-12 for the NPTEL course The Joy of Computing using Python
- Updated Oct 26, 2021
TarunSehgal27 / NPTEL-JAVA-2020
this is a repo about the java program headed by Debasis Samantha during 2020
- Updated Apr 23, 2020
gxuxhxm / NPTEL-The-Joy-of-Computing-using-Python
NPTEL-The-Joy-of-Computing-using-Python with NOTES and Weekly quizes Answers
- Updated Dec 31, 2023
NPTEL-Course / Google-Cloud-Computing-Foundations
Nptel Course Solution : Google Cloud Computing Foundations
- Updated Nov 19, 2020
Anmol-PROgrammar / SWAYAM-Programming_In_Java-NPTEL
This site contains the weekly( i.e. 1-9) questions and their solution of NPTEL-SWAYAM course "Programming in Java".
- Updated Aug 19, 2021
lonebots / python-programming-joc-nptel
Python programming repository for NPTEL joy of computing course
- Updated Dec 21, 2020
Gangaduraie / Introduction-to-IoT-and-Industry-4.0
NPTEL- Introduction to IoT and Industry 4.0
- Updated Jan 27, 2024
CGreenP / NPTEL-Programming-Assignment-Road-Trips-and-Museums
NPTEL Programming Assignment: Road Trips and Museums.
- Updated Mar 3, 2024
CGreenP / NPTEL-Programming-Assignment-T1-Q2
NPTEL Programming Assignment: T1-Q2
- Updated Mar 7, 2024
Improve this page
Add a description, image, and links to the nptel-solutions topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the nptel-solutions topic, visit your repo's landing page and select "manage topics."
Python for Data Science
Note: This exam date is subjected to change based on seat availability. You can check final exam date on your hall ticket.
Page Visits
Course layout.
- Reading files
- Exploratory data analysis
- Data preparation and preprocessing
- Scatter plot
- if-else family
- for loop with if break
- Predicting price of pre-owned cars
- Classifying personal income
Books and references
Instructor bio.
Prof. Ragunathan Rengasamy
Course certificate.
DOWNLOAD APP
SWAYAM SUPPORT
Please choose the SWAYAM National Coordinator for support. * :
- AWS Cloud Foundation
- _Joy Of Computing
- _Data Science For Engineers
- Internships
Nptel data Science for Engineers Assignment 4 Answers 2023| Data Science for Engineers | Study2night
NPTEL Data Science for Engineers Assignment 4 Answers 2023? In this article we will discuss about the answers for Week 4 assignment of Data science for Engineers. All these answers are make it as reference.I am confident in providing these answers.
Also Read: NPTEL Data Science for Engineers Assignment 3 Answers 2023
About Nptel
National programme on Technology enhanced learning ( NPTEL ) , is a very best online platform to do internships. In this Platform, it is providing free internship to students and professionals in India. It is initiated taken by Ministry of Human Resource Development (MHRD) in the year 2003. It is collabarated with 7 IIT's and IIT sc . It is providing high quality education and cost effective education to students and the professionals across the country. It is providing free resource of learning, assignments and quizes to solve.
NPTEL Data Science for Engineers Assignment 4 Answers 2023
Last Date: 22-02-2023
You can find the answers for Data Science for Engineers Assignment 4 Answers 2023 below
Q1. Let f(x)=x^3+6x^2−3x−5. Select the correct options from the given options below :
a. −2+√5 will give the maximum for f(x).
b. −2+√5 will give the minimum for f(x).
c. The stationary points of f(x) are −2+√5 and −2+√5
d. The stationary points of f(x) are -4 and 0
Answer: [ c , b ]
Reason : f '' (x) at −2+√5 is > 0. Hence, it is called as local minima.
To find the stationary points of f(x), we need to find the values of x where f'(x) = 0.
f(x) = x^3 + 6x^2 - 3x - 5
f'(x) = 3x^2 + 12x - 3
Consider f'(x) = 0, we get: 3x^2 + 12x - 3 = 0
Dividing on both sides by 3: x^2 + 4x - 1 = 0
Using the quadratic formula, we get:
x= -b ± sqrt(4*a*c - 2*a)
x = (-4 ± sqrt(16 + 4))/2 = (-4 ± sqrt(20))/2
x = -2 ± sqrt(5)
So the stationary points of f(x) are -2 + sqrt(5) and -2 - sqrt(5).
To determine whether each point is a maximum or minimum , we need to look at the sign of f''(x) at each point.
f''(x) = 6x + 12
At x = -2 + sqrt(5), we have,
f''(-2 + sqrt(5)) = 6(-2 + sqrt(5)) + 12 = -6 + 6sqrt(5) > 0
So -2 + sqrt(5) is a local minimum.
At x = -2 - sqrt(5), we have,
f''(-2 - sqrt(5)) = 6(-2 - sqrt(5)) + 12 = -6 - 6sqrt(5) < 0
So -2 - sqrt(5) is a local maximum.
From the following information given below, answer the below questions Q2 and Q3.
Consider the following Optimization problem.
maxxϵRf(x) , where f(x)=x^4+7x^3+5x^2−17^x+3
Let x∗ be the maximizer of f(x)
Q2. What is the second order sufficient condition for x∗ to be the maximize of the function f(x)?
a. 4x^3 + 21x^2 + 10x - 17 = 0
b. 12x^2 + 42x + 10 = 0
c. 12x^2 + 42x + 10 > 0
d. 12x^2 + 42x + 10 < 0
Answer: [ c] 12x^2 + 42x + 10 > 0
Q3. Find the value of x*
Answer: [ c ] -1.43
Q4. Let f(x) = 2Sin x , 0≤x≤2π . Select the correct the options from the given following .
a. π/2 is the global maximum of f(x).
b. π is the global minimum of f(x).
c. 3π/2 is the global maximum of f(x).
d. 3π/2 is the global maximum of f(x).
Answer: [ a, d ]
Using the following information answer the following questions Q5, Q6, Q7,Q8.
Let f(x) = 2x1^2 + 3x1x2 + 3x2^2 + x1 + 3x2
Q5. Find the gradient of f(x).
Conclusion:
One Comment Please !
IMAGES
VIDEO
COMMENTS
#pythonfordatascience #nptel #swayam #python #datascience Python for Data Science All week Assignment Solution - https://www.youtube.com/playlist?list=PL__28...
Answer: d. Linear regression. These are NPTEL Python for Data Science Assignment 4 Answers. Prepare the data by following the steps given below, and answer questions 6 and 7. Encode categorical variable, Service - Yes as 1 and No as 0 for both the train and test datasets.
The course aims at equipping participants to be able to use python programming for solving data science problems.INTENDED AUDIENCE : Final Year Undergraduate...
Python for Data Science Week 4: Assignment 4 Solutions || 2023#nptel #nptel2023
September 2023 NPTEL Exams - Hall Tickets Released !! ... The questions will be on the computer and the answers will have to be entered on the computer; type of questions may include multiple choice questions, fill in the blanks, essay-type answers, etc. ... Python For Data Science : Assignment 4 is live now!!
Answer :- For Answer Click Here. Prepare the data by following th e steps given below, and answer questions 6 and 7. Encode categorical variable, Service - Yes as 1 and No as 0 for both the train and test datasets. Split the set of independent features and the dependent feature on both the train and test datasets.
Python for Data Science. By Prof. Ragunathan Rengasamy | IIT Madras. Learners enrolled: 49366. ABOUT THE COURSE : The course aims at equipping participants to be able to use python programming for solving data science problems. INTENDED AUDIENCE : Final Year Undergraduates. PRE-REQUISITES : Knowledge of basic data science algorithms.
Answer :- For Answer Click Here. Prepare the data by following th e steps given below, and answer questions 6 and 7. Encode categorical variable, Service - Yes as 1 and No as 0 for both the train and test datasets. Split the set of independent features and the dependent feature on both the train and test datasets.
Answer :- a. Read the information given below and answer the questions from 4 to 6: Data Description: An automotive service chain is launching its new grand service station this weekend. They offer to service a wide variety of cars. The current capacity of the station is to check 315 cars thoroughly per day.
There will be a live interactive session where a Course team member will explain some sample problems, how they are solved - that will help you solve the weekly assignments. We invite you to join the session and get your doubts cleared and learn better. Date: August 7, 2022 - Sunday. Time: 04.00 PM - 05.00 PM.
🔊NPTEL Python for Data Science Week4 Assignment Solution 2023 | GATE NPTEL⛳ABOUT THE COURSE :The course aims at equipping participants to be able to use pyt...
Solutions and exemplary problems coded while attending a 4 weeks course in data science using Python offered by Indian Institute of Technology Madras, India. Python for Data Science. By Prof. Ragunathan Rengasamy | IIT Madras. Course Begin: July 25, 2022. Course Exam (Programming Test): September 16, 2022 (Duration of the session will be 3 hrs ...
Prof. Ragunathan Rengasamy is a Professor at the Department of Chemical Engineering and a core member of the recently established Robert Bosch Center for Data Science and AI (RBC-DSAI) at IIT Madras. He is also a co-Founder and Director of Gyan Data Pvt. Ltd. (GDPL, identified as one of the top 10 start-ups to watch out for in 2018 by Analytics ...
Course layout. Week 1: Basics of Python Spyder. Week 2: Sequence data types & associated operations. Week 3: Data frames. Week 4: Case study. NOTE: You can check your answer immediately by clicking show answer button. Moreover, this set of "Python for Data Science NPTEL Week 4 Answers" contains 10 questions. Now, start attempting the quiz.
NPTEL Python for Data Science Assignment 4 Answers 2022:-. Q1. The power consumption of an individual house in a residential complex has been recorded for the previous year. This data is analysed to predict the power consumption for the next year. Under which type of machine learning problem does this fall under?
🔊NPTEL Python for Data Science 2023 | https://techiestalk.in/⛳ABOUT THE COURSE :The course aims at equipping participants to be able to use python programmi...
Full course on python for data science from NPTEL along with notes - oojas/Python-for-data-science-NPTEL-
Add this topic to your repo. To associate your repository with the nptel-solutions topic, visit your repo's landing page and select "manage topics." GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects.
The questions will be on the computer and the answers will have to be entered on the computer; type of questions may include multiple choice questions, fill in the blanks, essay-type answers, etc. ... Python for Data Science : Assignment 4 is live now!! ... Here is the much-awaited announcement on registering for the Jan 2023 NPTEL course ...
NPTEL Python for data science 4 week course assignments 2023 7:42 pm python for data science unit week answer submitted. nptel. Skip to document. University; High School; Books; Discovery. Sign in. ... 7 8 4 6 Yes, the answer is correct. Score: 1 Accepted Answers: 7.
The course is free to enroll and learn from. But if you want a certificate, you have to register and write the proctored exam conducted by us in person at any of the designated exam centres. The exam is optional for a fee of Rs 1000/- (Rupees one thousand only).Date and Time of Exams:26 September 2021Morning session 9am to 12 noon; Afternoon ...
NPTEL Data Science for Engineers Assignment 4 Answers 2023 Last Date: 22-02-2023. You can find the answers for Data Science for Engineers Assignment 4 Answers 2023 below. Q1. Let f(x)=x^3+6x^2−3x−5. Select the correct options from the given options below : a. −2+√5 will give the maximum for f(x). b. −2+√5 will give the minimum for f(x).
Python for Data Science|| WEEK-4 Quiz assignment Answers 2023||NPTEL||#SKumarEduThese are solutions regarding submission of NPTEL " PYTHON FOR DATA SCIENCE "...