**Assignment Task**

**Task A:**

A data analyst has been commissioned by the state government to examine the relationship between the number of hours worked per week and income earned per year among restaurant waiters. A random sample of 220 waiters from different age groups, working in restaurants across different suburbs of a large city, was selected to collect data on these two variables. The data are stored in the attached .xls file.

First, the data analyst categorised the data into five age group categories and four suburb categories. She further calculated the frequencies for each category and presented the information in the tables below.

**1.** Use the data provided in the tables above to answer the following questions.

**(a)** The data analyst is interested in comparing the number of waiters in each age category. Which chart would you recommend the analyst use? Explain your reason for selecting this chart.

**(b)** Use Excel to construct the chart you selected in part (a). Display the chart. Then briefly describe what you observe about the number of waiters in each age category.

**(c)** The data analyst is interested in comparing the proportion of waiters in each suburb category. Which chart would you recommend the analyst use? Explain your reason for selecting this chart.

**(d)** Use Excel to construct the chart you selected in part (c). Display the chart. Then briefly describe what you observe about the proportion of waiters in each suburb category.

**2.** Second, the data analyst would like to use an appropriate graphical descriptive technique to summarise the data on each of the two variables: hours worked per week and income earned per year. Refer to the attached .xls file.

**(a)** The data analyst believes that 9 class intervals would be best for constructing a histogram of each of the two variables. Explain how the data analyst could have decided on 9 as the number of class intervals.
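One common way to arrive at a class count is Sturges' rule, k = 1 + 3.3 log10(n), rounded up. The brief does not say which rule the analyst used, so the sketch below is only an assumption about the intended approach; the function name `sturges_classes` is hypothetical.

```python
import math


def sturges_classes(n: int) -> int:
    """Class count suggested by Sturges' rule, 1 + 3.3 * log10(n), rounded up."""
    if n < 1:
        raise ValueError("sample size must be at least 1")
    return math.ceil(1 + 3.3 * math.log10(n))


# The sample size of 220 comes from the task description.
k = sturges_classes(220)
print(k)  # 1 + 3.3 * log10(220) is about 8.73, which rounds up to 9
```

With the sample of 220 waiters the rule gives 9, which agrees with the analyst's choice.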

**(b)** The data analyst suggests using the class intervals below:

- 10 < X>
- 20 < X>

**Explain how the data analyst could have decided on the width of the above class intervals.**
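A typical approach to class width is to divide the range (largest value minus smallest value) by the number of classes and round up to a convenient number. The sketch below assumes that approach. The smallest and largest values are illustrative placeholders, not figures from the data file, and the helper names are hypothetical.

```python
import math


def class_width(smallest, largest, k, round_to=1):
    """Approximate width (largest - smallest) / k, rounded up to a multiple of round_to."""
    raw = (largest - smallest) / k
    return math.ceil(raw / round_to) * round_to


def bin_upper_limits(start, width, k):
    """Upper class limits, the form Excel's histogram tool expects as BIN values."""
    return [start + width * i for i in range(1, k + 1)]


# Illustrative values only: assume hours range from 12 to 54 and k = 9.
width = class_width(12, 54, 9, round_to=5)
bins = bin_upper_limits(10, width, 9)
print(width, bins)
```

Rounding the raw width up, rather than to the nearest value, keeps the 9 classes wide enough to cover the largest observation once a convenient starting point at or below the smallest value is chosen.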

**(c)** Draw a histogram for each of the two variables, using the appropriate BIN values from part (b). Comment on the shape of the two distributions.

**3.** The data analyst is also interested in obtaining numerical descriptive measures to further summarise the data on each of the two variables: hours worked per week and income earned per year.

**(a)** Present two numerical summary reports: one for the hours worked variable and one for the income earned variable. Ensure that each report includes the mean, median, mode, range, variance, standard deviation, smallest value, largest value and the three quartiles.
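The task expects these reports to come from Excel, but the same measures can be cross-checked with Python's standard `statistics` module. The sketch below is a minimal illustration under several assumptions: the sample variance and standard deviation (n - 1 divisor) are used, quartiles use the `inclusive` method on the assumption that it mirrors Excel's `QUARTILE.INC`, and the data are made-up values rather than the contents of the .xls file.

```python
import statistics


def summary_report(values):
    """Return the descriptive measures listed in part (a) for one variable."""
    data = sorted(values)
    # "inclusive" interpolates between observed values across the full range.
    q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "mode": statistics.mode(data),  # first mode if there are several
        "range": data[-1] - data[0],
        "variance": statistics.variance(data),  # sample variance
        "std_dev": statistics.stdev(data),  # sample standard deviation
        "smallest": data[0],
        "largest": data[-1],
        "Q1": q1,
        "Q2": q2,
        "Q3": q3,
    }


# Hypothetical hours-worked values, for illustration only.
hypothetical_hours = [20, 25, 25, 30, 35, 40, 45]
report = summary_report(hypothetical_hours)
for name, value in report.items():
    print(f"{name:>9}: {value:.2f}")
```

The same function can be called once per variable to produce the two reports.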

**(b)** Give a brief interpretation of the reported mean value for the hours worked variable.

**(c)** Give a brief interpretation of the reported third quartile value for the income earned variable.

**(d)** Calculate and present the correlation coefficient for the linear relationship between the two variables. Give an interpretation of the calculated value.
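The correlation coefficient for a linear relationship is usually Pearson's r, the sum of cross-deviations divided by the square root of the product of the two sums of squared deviations. The sketch below computes it directly, assuming Pearson's r is the intended measure. The function name `pearson_r` and the sample values are hypothetical and are not taken from the data file.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical hours per week and income per year (in thousands).
hours = [20, 25, 30, 35, 40]
income = [30, 34, 41, 45, 52]
print(round(pearson_r(hours, income), 3))
```

A value near +1 indicates a strong positive linear relationship, a value near -1 a strong negative one, and a value near 0 little or no linear relationship.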

**Task B:**

“In the digital economy the world has seen an exponential increase in the amount of data generated per second. When strategically managed and analysed, data transform into useful information for business decision making.”

You are tasked with selecting and using real data from any relevant source. The data may come from any area of interest to you (e.g. economics, finance, accounting, marketing, management, sports, tourism). You may also use data on the Covid-19 pandemic if you like. Copy and paste the data you select before presenting your report addressing items (a) to (d) below.

The emphasis is on demonstrating skills in data visualisation and descriptive data analysis using the graphical and numerical techniques taught in modules six to eight of the course. Drawing on the graphical and numerical outputs obtained, present a report of no more than 600 words. In your report, please ensure that you cover the following items:

**(a)** Identification of the type of data you have.

**(b)** Discussion of why you chose the graphical technique(s) used for your data.

**(c)** Presentation and analysis of the relevant graphical outputs and the numerical measures summary report.

**(d)** Discussion of the important information you extract from the graphical outputs and the numerical measures summary report. Specifically, the discussion should include recommendations that propose innovative business solutions for decision makers who may benefit from the data.

