代做ETW1001 Introduction to Statistical Analysis代做留学生SQL语言

- 首页 >> Web

ETW1001 Introduction to Statistical Analysis

Group Assignment

Semester: Oct Intake, 2025

Due: Monday, 14 January 2025, 11:55 p.m.

The unit learning objectives of this assignment are:

Assess the relevance and usefulness of predictive modelling to address business and economic challenges.

Communicate statistical results to stakeholders effectively to propose business and economic solutions as a team.

This assignment is worth 30% of your final mark for this unit. The total number of marks for this assignment is 80.

INSTRUCTIONS

1.     Make sure that you regularly make back-up copies of your work. Computer, disk,  or  cloud  problems  will not be  accepted  as  valid  reasons  for  late submissions or requests for extensions.

2.     Students  should  pay  particular  emphasis  on the narration,  and how the results are presented and interpreted. Students should endeavour to ensure that the report is complete and well-composed. Poor presentation, poor command of English writing and/or failure to comply with instructions may result in a mark penalty.

3.    Your answer to the questions should be no more than 15 pages (inclusive of graphs and tables). Any part of the report beyond the 15-page limit will be struck out and not marked.

(a)  Use default format, paragraph, and margin settings.

(b)   Font size: 12

(c)   At least 1.15 line spacing between lines.

(d)   Reference list is not counted in the 15-page limit.

(e)   Penalties may apply if the assignment does not conform to the formatting guide- lines.

(g)   All  workings  and  relevant  Excel  output  must  be  clearly  shown  where appropriate as marks will be awarded for workings. Make sure all your workings are included in an Excel file with proper labels. All tables and visualisations must be included in the written report. The presentation of output must be in reasonable size and readable.

4.     Students must uphold academic integrity at all times. Any students caught for cheating, plagiarizing or permitting others to plagiarize their work will be  reported  to  the  Responsible  Officer  for  academic  misconduct  in accordance  to  the   Student  Academic  Misconduct  Procedure.   Severe penalties may apply resulting from the investigation.

5.     Generative AI tools are restricted for certain functions in this assessment  task. In this assessment, you can use generative artificial intelligence (AI) in order to conduct research pertaining to the assessment task only. Any use of generative AI must be appropriately acknowledged (see Learn HQ).

6.     All submissions will be via Moodle by 14th January,2025 [before 11.55pm]

(a)   Please type your report in Microsoft Word, save it as a PDF file, and submit the PDF document. Additionally, you must submit the accompanying Excel document. In total, you are required to submit two files:

1.  The PDF file (containing all relevant answers). Written report (Format: .pdf) [Should have the name and student ID for each member]

2.  The Excel file. Excel workbook (Format: .xlsx). Important:

All answers must be included in the PDF file. Only the PDF file will be graded. Any answers found exclusively in the Excel file will not be marked.

(b)   You will also be required to put your assignment through a Turnitin report. The similarity index should not be more than 20%. Note that this is only a  rough guideline we understand that some common usage of phrases and sentences may contribute to the similarity index. Students should not be worried for this particular instance.

Problem Scenario

Presume that you are a real estate agent working for an international property firm. Your task is to investigate the variables that are relevant in determining house selling prices. The firm has access to a large dataset, and you have selected a sample of 1,250 properties for your analysis.

As a property agent, your primary role is to identify and analyze the significant factors that influence house selling prices. By understanding these variables, you can provide valuable insights to clients, assist in strategic pricing decisions, and support the firmin staying competitive in the real estate.

Data

Download the data file “HousePrice” from Moodle

The key dependent variables are as follows:

The file contains the following variable:

Selling Price: Selling house price in $. [Dependent Variable]

Land Value : Land area value in $.

Building Value  : Total building value in $.

Basement: basement room in square feet.

Baths: Number of bathrooms.

[Note: Most bathrooms contain a toilet and sink as well as a bathtub and shower]

Toilets: Number of toilets.

[Note: In most houses, the toilet is located within the bathroom. However, in newer homes, it is increasingly common to find toilets situated in separate spaces, such as a powder room or a dedicated hall area, which contain only a toilet and a sink, without a bathtub or shower.]

Fireplaces: Number of fireplaces in a house.

Beds: Number of bedrooms in a house.

Rooms: Rooms without bed such as power room, TV room etc.

AC:  Indicator variable for house being air-conditioned (1 = air-condition, 0 = otherwise).

Age: age of the house.

Your group is required to use a subset of the survey data to answer the following questions.   Specifically,   your   sample   should   consist   of   250   consecutive observations, starting from the observation whose ID matches the last three digits of any group member’s  student number. For example, if a group member’s student ID is 20275749, group should start with observation 749 and include observations up to 998.

Question 1 [Total 40 marks]

a)     Construct  an  appropriate chart to illustrate the relationship between the dependent variable on the land value, building value, age of the house, toilets and air condition. Describe the relationship suggested by the charts in part (a). [10 marks]

b)    Run a Simple Linear Regression (SLR) with the dependent variable on the land value, building value, age of the house, toilets and air condition. The  summary output of the SLR should be shown. [5 marks]

c)     Report the estimated equation from part (b). Label each of the models as Model 1, Model 2, Model 3, Model 4, and Model 5. [5 marks]

d)    Interpret the estimated values of the regression coefficients for Model 1 and Model 2 only. [Hint: required to interpret intercepts and slopes] [4 marks]

e)     Obtain a 95% confidence interval for the slope coefficient in Model 1 [You required to show the calculation for the confidence interval. Answer direct from the excel output will not be awarded any marks. You are required to show the working] Interpret your results. [4 marks]

f)     What is the value of the coefficient of determination for Model 1? Interpret this value. [2 marks]

g)    Test the null hypothesis that land value is not a significant predictor of the selling house prices at a 5% level of significance against the alternative that are significant. Use the critical value approach. Carefully show all steps. [4 marks]

h)    Using  a  p-value  approach  at  a  5%  level  of  significance,  test  the  null hypothesis that building value not a significant predictor of the selling house prices at a 5% level of significance against the alternative that it has a significant positive effect. Carefully show all the steps. [2 marks]

i)     Predict the selling price of a house if the building value is $70,000. [1 mark]

j)     Predict the selling price of a house if the building value is $156,000. [1 mark]

k)    Explain whether the predictions in (i) and (j) are reliable. [2 marks]

Question 2 [Total 28 marks]

a)     Run a Multiple Linear Regression (MLR) with a house selling price as the dependent variable with ALL the independent variables. Name this model as Model 6. Summary output of the MLR should be shown. [3 marks]

b)    Formulate  an  appropriate  Multiple  Linear  Regression  estimated  model [Model 6] that predicts the selling house price. [4  marks]

c)     Write down the estimated Multiple Linear Regression equation based on Model 6. [3 marks]

d)     Interpret the estimated coefficient of the land value and air-condition using Model 6. [4 marks]

e)     What is the expected sign for age of the house? Explain your reasoning. [2 marks]

f)     Using  a  p-value  approach  at  a  5%  level  of  significance,  test  the  null hypothesis that age of the house is not a significant predictor of the selling house prices at a 5% level of significance against the alternative that it has a significant negative effect. Carefully show all the steps. [4 marks]

g)    Without doing any calculation which variables contribute significantly to the prediction of house selling price? Why? [2 marks]

h)    By removing all the insignificant variables from Model 6 and then form a Multiple Linear Regression with significant variables. [Name it as Model 7]. Summary output should be shown. [2 marks]

i)     Using  an  appropriate  method  compare Model  6 and 7. Which model is better? Explain. [2 marks]

j)     Using  Model  7,  predict the house  selling price by  substituting the  fifth observation from your sample. [2 marks]

Question 3 [10 marks]

Based  on  your  analysis  above,  write  a  concise  report  summarizing  the  key findings for your firm.

Your report should highlight the significant variables influencing house selling prices  and  explain  how  these  factors  can  guide  strategic  pricing  decisions. Emphasize the practical implications of the results, such as how the insights can help  the  firm  optimize  pricing  strategies  for  the  house  sellers  and  remain competitive in the real estate market. Additionally, discuss how these findings can  be  used  to  identify  trends,  improve  client  recommendations,  practical implications of the results,  enhance the firm's overall market positioning and recommendations. [Your report should be less than 250 words]

Your report should have following scopes:

o   An introduction to the topic

o    Key findings or analysis

o   A conclusion or summary

o   References if any [10 marks]

Formatting [2 marks] The overall report should provide a concise and consistent format with a clear label for each figure. Remember, you are representing your organization to present this study so prepare your report that will be detailed and suitable for readers.




站长地图