Our Services

Get 15% Discount on your First Order

check the attachements Please read the instructions and questions carefully in ” Assignment_4_2023_Fall.pdf” file and use “Auto.csv” to

March 26, 2024

check the attachements

Please read the instructions and questions carefully in ” Assignment_4_2023_Fall.pdf” file and use “Auto.csv” to finish the assignment. You should submit both 1) an R code ; 2) A PDF report with answers through the link “Submit Assignment 4 Here”

Guidelines:

· Use only R for this assignment

· Submit both R code and Report on findings

· Work is to be done individually for this assignment

Fitting a Classification Tree

1.
This problem involves the OJ data set which is part of the ISLR package (
Hint: the first three lines of codes should be: library (tree), library (ISLR), attach (OJ)).

1.1 Create a training set containing a random sample of 800 observations, and a test set containing the remaining observations. Take a screenshot of your code. (Hint: set.seed (2), train=sample())

1.2 Fit a tree to
the training data, with
Purchase as the response and the other variables as predictors. Use the summary( ) function to produce summary statistics about the tree. Take a screenshot of the summary statistics. How many terminal nodes does the tree have? What is the training misclassification error rate?

1.3 Plot the tree and take a screenshot of the tree (Hint: plot() and text())

1.4 Predict the response on the test data, and produce a confusion matrix comparing the test labels to the predicted test labels. What is the accuracy rate?

1.5 Apply the cv.tree() function to the training set in order to determine the optimal tree size. (Use set.seed(7)). Print the results (Hint: the results should contain the size, k, method etc).

1.6 Produce a plot with tree size (i.e. size) on the x-axis and cross-validated classification error rate (i.e. dev) on the y-axis.

1.7 Which tree size corresponds to the lowest cross-validated classification error rate (i.e. dev)?

1.8 Produce a pruned tree corresponding to the optimal tree size obtained using cross-validation. Take a screenshot of a pruned tree. What is the accuracy rate for the pruned tree? Is it improved compared to the accuracy rate in (1.4)?

1.9 If cross-validation does not lead to selection of a pruned tree (i.e. the accuracy rate produced in (1.8) is lower than the one in (1.4)), then create a pruned tree with five terminal nodes. What is the accuracy rate now?

Fitting a Regression Tree

2.
In the lab, a classification tree was applied to the Carseats data set after converting Sales into a qualitative response variable. Now we will seek to predict Sales using regression trees and related approaches, treating the response as a quantitative variable.

2.1 Using the validation-set approach to split the data set into a training set and a test set (Hint:
use set.seed(2); validation-set approach: half of the observations are selected as the training dataset while half of observations are treated as the test dataset). Take a screenshot of your code.

2.2 Fit a regression tree to the training set.

a) Use summary () to print out the results. How many terminal nodes do you get? What is RMD (Residual Mean Deviance)?

b) Plot the tree and take a screenshot of the tree;

c) What test MSE do you obtain?

2.3 Use cross-validation in order to determine the optimal level of tree complexity (use set.seed(2)).

a) Produce a plot with tree size on the x-axis and cross-validated classification error rate on the y-axis.

b) What is the optimal level of tree complexity?

c) Using the optimal level of tree size to prune the tree, does pruning the tree improve the test MSE?

2.4 Use the bagging approach in order to analyze this data. Take a screenshot of the results. What test MSE do you obtain? (Hint: use set.seed (1);
mtry=10 since we have 10 predictors in Carseats dataset and we use all of the predictors in the bagging approach).

2.5 Use random forests to analyze this data.

a) What test MSE do you obtain? (Hint: use set.seed(1);
mtry=10/3 since we usually use 1/3 of the predictors when building a random forest of regression trees)

b) Use the importance() function to determine which variables are most important. Take a screenshot of your results.

c) Plots of these importance measures can be produced using the varImpPlot() function. Take a screenshot of your output.

d) So which variables are most important?

What to submit:

1. R code.

2. Report.

Should include all the code to accomplish the tasks.

Clear and concise comments to indicate what part of the assignment each code chunk pertains to.

Code should be easily readable.

Filename should be in the format of: LastnameFirstname_A4.R

Take screenshots of your outputs in R Studio and answer all the questions. Submit in PDF format.

Answers questions clearly and concisely.

Includes appropriate plots. Make sure the plots are properly labeled.

The assignment will be graded on the correctness of the answers, comprehensiveness of the analysis, clarity of results’ presentation and neatness of the report.

>Computer Science homework help

Share This Post

Order a Similar Paper and get 15% Discount on your First Order

Related Questions

please read instructions Phase 3 is about results, and this part of the paper will be based on hypothetical analysis. Since we will not be

please read instructions Phase 3 is about results, and this part of the paper will be based on hypothetical analysis. Since we will not be implementing the process, the results described will be based on whatever the students want the research results to be. You will need to provide results

Participative budgeting can be very frustrating for accountants. The dilemma is that in order for this plan to be effective, then the responsible manager must

Participative budgeting can be very frustrating for accountants. The dilemma is that in order for this plan to be effective, then the responsible manager must own the “numbers”. However, most operating managers have little knowledge of how to create a budget. This week’s competency check involves both operating and financial

Case Briefing Legal issue What is being appealed? What is the legal issue? Holding What is the decision of the appellate court? Did they reverse the

Case Briefing Legal issue What is being appealed? What is the legal issue? Holding What is the decision of the appellate court? Did they reverse the lower court decision or affirm it? Reasoning What is the reasoning behind the holding of the majority opinion? Are there any dissents? Concurring opinions?

In this assignment, you will a compare, contrast, and evaluate new media to a time before you were born. Remember new media by its definition has some form of

In this assignment, you will a compare, contrast, and evaluate new media to a time before you were born. Remember new media by its definition has some form of electronic communication. 1. Starting with your birthdate, go back at least 20 years. My birthday is september 1st 2002 2. Compare

my name is Esteban tankou Fill out the Hierarchy of Information table with information from the reading: Irvin’s “What is Academic Writing.” This is part of

my name is Esteban tankou Fill out the Hierarchy of Information table with information from the reading: Irvin’s “What is Academic Writing.” This is part of your engagement grade.

56 y/o Caucasian male presents to the primary care clinic with complaints of dizziness and nausea x 4 days. The patient reports he has not been able to get out

56 y/o Caucasian male presents to the primary care clinic with complaints of dizziness and nausea x 4 days. The patient reports he has not been able to get out of bed since the symptoms started. The patient reports symptoms are worse when he tries to get out of bed

How to focus on couple’s experience with rules and roles This assignment will be submitted to Turnitin.

How to focus on couple’s experience with rules and roles This assignment will be submitted to Turnitin™. Instructions For this assignment, you will apply an EFT lens to examine a case scenario where the rules have changed in a relationship. You will consider how EFT could be integrated into a

Module 5 Discussion Ethical Dilemma Describe a situation of ethical dilemma that

Module 5 Discussion Ethical Dilemma Describe a situation of ethical dilemma that you have experienced in practice and how it was resolved. Submission Instructions: · Your initial post should be no more than 520 words, formatted and cited in current APA style with support from at least 3

You are required to create a thread in response to 1 of the 4-5 provided prompts (choose only 1) for each forum.

Here is an AI written paper. Can you rewrite it please so that it is not AI detected at all. There are two essays just find the best one that is detailed or

Here is an AI written paper. Can you rewrite it please so that it is not AI detected at all. There are two essays just find the best one that is detailed or combine them.

Instructions: To complete Lab 1, you will be required to compose a comprehensive report based on survey research. The report will mainly focus on two

Instructions: To complete Lab 1, you will be required to compose a comprehensive report based on survey research. The report will mainly focus on two significant aspects: a thorough literature review and descriptive statistics. A good literature review will have a detailed assessment of existing research that is pertinent to our

Introduction

Introduction I. Start with an attention-grabber: Will you begin with a quotation, personal story, humor, or fact? II. Listener relevance: tell us why the topic matters III. Speaker Credibility: personal authority on the topic or why did you choose this topic? IV. Preview points you plan to discuss in the

Adversarial justice systems start with what law is broken, who broke it, and how to punish the offender. Restorative justice policies ask what harm was do

Adversarial justice systems start with what law is broken, who broke it, and how to punish the offender. Restorative justice policies ask what harm was done, how to repair the harm, and who’s responsible for repairing the harm. How do we incorporate restorative justice policies into resolving the issues of

see attachment Module 7: Discussion Forum 1: Non-Parametric Tests I This space has been created for you to share

see attachment Module 7: Discussion Forum 1: Non-Parametric Tests I This space has been created for you to share the link of a newspaper or blog article that discusses the importance for companies of using non-parametric tests, for example U Mann-Whitney testing with your classmates. 1. Write a summary about

Week 7: Due on Feb 25, 2024 11:59 PM This week’s assignment involves writing a Python program to collect all the data of a road trip and calculate each person’s

Week 7: Due on Feb 25, 2024 11:59 PM This week’s assignment involves writing a Python program to collect all the data of a road trip and calculate each person’s share of the cost. Prompt the user for

Module 3: Proposal Revision and Formatting the Paper Overview (Read First) This module will require intense focus on your proposal. It is important to pay

Module 3: Proposal Revision and Formatting the Paper Overview (Read First) This module will require intense focus on your proposal. It is important to pay close attention to the guidelines set forth by the Graduate School and your respective department within the School of Education. Please review the APA guidelines

Unit 6: Global Health Group Project Assignment Overview: In this assignment

Unit 6: Global Health Group Project Assignment Overview: In this assignment, you will be assigned to work in small groups to research the health of two different countries. Your group will select one underdeveloped country (least developed country) and one developing country (middle-income country) to contrast and compare. This determination

Our Services

check the attachements Please read the instructions and questions carefully in ” Assignment_4_2023_Fall.pdf” file and use “Auto.csv” to

Share This Post

Related Questions

Use Our 6 Free Tools