You are to analyse data from the file plALL2017.xlsx, using univari… Question You are to analyse data from the file plALL2017.xlsx, using univariate statistics (i.e., one variable at a time). The file contains information I have assembled about nearly 2700 postcode districts in the UK (e.g., AB12, CF13, BS2, YO8, etc). There are enough variables (columns) for you to develop your own focus. Note that it is important to develop a perspective; doing so will inevitably mean you ignore many of the variables. Do not try to analyse all the data from all perspectives! It really does help if the assignment seems to be about something, or a connected set of issues. It also helps if the assignment seems to be going somewhere. There are a number of ways to get this sense of momentum going. For instance, you may present a finding, then try to rule out competing interpretations. Or you may show that an unexpected interpretation is really what is going on. Or you may present a series of linked findings that make the same kind of point even more forcefully. All of the data applies to England and Wales, but only some of the data is available for Scotland. You must therefore decide whether it is relevant to include Scotland or not. You may also may need to take a view whether to analyse postcode districts with very few habitations. There may be something strange about them. One such postcode I have already removed for you is PE35 (Sandringham), one of the places Her Majesty The Queen lives. Other oddities are postcodes consisting of out-of-town shopping malls, car sales showrooms (where licensed cars may greatly exceed people), universities, factories, GCHQ, DVLA, and so on. Start with a brief paragraph setting the scene for what your perspective is. This can already start momentum by setting up expectations of what is to follow. In the main body, be sure to consider using (1) informed descriptive statistics and their interpretation; (2) graphical exploration of the data; (3) formal testing of hypotheses. Develop hypotheses on common sense grounds (no need to go to a background literature), and remember that unless a strong case can be made for a direction in what you expect, the default setting must be to use 2-tailed testing. In theory, this “strong case” should be made before you ever look at the data. Please lay out hypotheses as follows: H1: The more computer lab classes a student attends, the better their assignment mark. H0: Students who attend more computer lab classes do not get better assignment marks. This may be tested via correlation, so that more algebraically: H1: R(lab, mark) > 0. H0: R(lab, mark) ≤ 0. Or it may be tested another way. There are often several ways to test a hypothesis. But don’t just do all possible analyses to cover your back, hoping that one of them is the right one; or to show me you can do everything. It is better that you choose Read more…